Indexed, though blocked by robots.txt: Need to bother?

vtmoz

Hi,

We have intentionally blocked some of the website files which were indexed for years. Now we receive a message "Indexed, though blocked by robots.txt" in GSC. We can ignore as per my knowledge? Are any actions required about this? We thought of blocking them with meta tags but these are PDF files.

Thanks

Gaston Riera

Hi there!

What Google is telling you is that you are indexing URLs that you probably are not wanting to be indexed, or the other way around, that important pages are being blocked but indexed for other reasons.

If I might ask, why did you blocked through robots.txt those files?
There most 2 answers are:
1- Wanted to remove those from search results. If this is your case, you've solved only a part of the problem. What you should have done is (previously allowing robots to crawl those urls) apply noindex rules (keep in mind that can be set up in the HTTP header, as long as not html files cant have meta robots tag), then after a sufficient time block them in robots.txt.
_2- Optimize how GoogleBot (crawiling) time. _Being this case, then you've done it correctly and there is nothing to worry.

Hope this help.
Best luck.
GR

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Indexed, though blocked by robots.txt: Need to bother?

Browse Questions

Explore more categories

Related Questions

Google Search Console Not Indexing Pages

Non-indexed or indexed top hierarchy pages get high PageRank at Google?

Sizable decrease in amount of pages indexed, however no drop in clicks, impressions, or ranking.

Need to be reindexed quickly - SERP is showing a 404

How To Index Backlinks Easily?

What is the appropriate Robot.txt to unblock if Google cannot get all the resources from my homepage?

Google indexing my website's Search Results pages. Should I block this?

Website moving up and down SERPs alongside others in 'blocks'.