Can I Disallow Faceted Nav URLs - Robots.txt

tylerfraser

I have been disallowing /*? So I know that works without affecting crawling. I am wondering if I can disallow the faceted nav urls.

So disallow: /category.html/? /category2.html/? /category3.html/*?

To prevent the price faceted url from being cached:

/category.html?price=1%2C1000
and
/category.html?price=1%2C1000&product_material=88

Thanks!

AlanMosley

If you can no-index , follow all but the default, then you will send link juice to the pages but it will return the link juice because it is follow, but they will not index because they are no-index.

If you use robots, then it can not read the page to follow the links.

Francisco_Meza

Hey Tyler! haven't seen you on SEOmoz in a while. Hope you are good!

Check to see if this would make sense for you. GWT > Site Configuration > URL Perameters. It says "Only use this feature if you feel confident about how parameters work for your site. Telling Googlebot to exclude URLs with certain parameters could result in large numbers of your pages disappearing from our index."

tylerfraser

If I can, then I disallow hundreds of pages that are duplicate content and should not be crawled.

If I don't then I send link juice to urls that I don't want seen.

This is a good answer though, thanks. Any other thoughts?

AlanMosley

You can, but then you have links passing link juice to non followed pages. it would be better if you used canonical. even better would be to add no-index, follow meta tag when non canonical page is displayed, but this requres some codeing.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Can I Disallow Faceted Nav URLs - Robots.txt

Browse Questions

Explore more categories

Related Questions

WP URL issue - Concatenated URLs (LOTS of them)

Should I block Map pages with robots.txt?

Blocking Affiliate Links via robots.txt

Long URL

Blocked by meta-robots but there is no robots file

Help needed with robots.txt regarding wordpress!

HTML url extension

Search Engine blocked by robots.txt