What to do about "blocked by meta-robots"?

GPN

The crawl report tells me "Notices are interesting facts about your pages we found while crawling". One of these interesting facts is that my blog archives are "blocked by meta robots".

Articles are not blocked, just the archives.

What is a "meta" robot?

I think it's just normal (since each article need only be crawled once) but want a second opinion. Should I care about this?

AlanBleiweiss

Meta robots refers to the <meta name="robots"> tag in the head section of the page. You usually see this when a blog is set up with an SEO plugin such as All In One SEO, where you can manually set which content is blocked. It's common to block archives, tags, and other sections, on the theory that allowing these to be crawled could either cause duplicate content issues or drain link value from the primary category navigation.
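If you want to see what the tag on your archive pages actually says, a minimal Python sketch like the one below can pull the directives out of a page's HTML. The sample page and its "noindex, follow" value are assumptions for illustration only; the value your plugin writes may be different.

```python
from html.parser import HTMLParser


class MetaRobotsParser(HTMLParser):
    """Collects the directives from any <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Attribute names arrive lowercased; values may be None if empty.
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            self.directives += [
                part.strip().lower()
                for part in content.split(",")
                if part.strip()
            ]


def meta_robots_directives(html):
    """Return the list of meta robots directives found in an HTML string."""
    parser = MetaRobotsParser()
    parser.feed(html)
    return parser.directives


# Hypothetical archive page; the content value is an assumed example.
ARCHIVE_HTML = """<html><head>
<title>Archive: June</title>
<meta name="robots" content="noindex, follow">
</head><body></body></html>"""

print(meta_robots_directives(ARCHIVE_HTML))
```

An empty list means the page carries no meta robots tag, which is what you would expect to see on your articles.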

RyanKent

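In general, there are two ways you can keep search engines away from your content.

You can add a Disallow entry to your robots.txt file.
You can add a robots meta tag to your pages.

With the meta tag (typically a noindex value), what you are saying is "please do not list this content in your search engine". A Disallow entry is slightly different: it asks crawlers not to fetch the content at all, so a disallowed URL can still end up listed if other pages link to it.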
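As a rough illustration of the robots.txt route, here is a minimal Python sketch using the standard library's urllib.robotparser. The /archives/ path, the example.com URLs, and the is_crawlable helper are all hypothetical; the meta tag string is included only to show the alternative side by side.

```python
from urllib.robotparser import RobotFileParser

# Option 1: a hypothetical robots.txt that disallows an assumed /archives/ path.
ROBOTS_TXT = """\
User-agent: *
Disallow: /archives/
"""

# Option 2: the alternative, a tag placed in the <head> of each page to block.
META_TAG = '<meta name="robots" content="noindex">'


def is_crawlable(robots_txt, url, user_agent="*"):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules.can_fetch(user_agent, url)


# Archive URLs fall under the Disallow rule; article URLs do not.
print(is_crawlable(ROBOTS_TXT, "https://example.com/archives/2011/06/"))
print(is_crawlable(ROBOTS_TXT, "https://example.com/my-article/"))
```

Note that this only checks the robots.txt side. A page blocked by a meta tag will still show up as crawlable here, because the tag can only be seen once the page has been fetched.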

In general, you would not want to block your archives. There certainly can be specific cases where you only want the public to see your most current content, in which case you can block them.
