I want to block search bots from crawling all of my website's pages except for the homepage. Is this rule correct?
-
User-agent: *
Disallow: /*
-
-
Thanks Matt! I will surely test this one.
-
Thanks David! Will try this one.
-
Use this:
User-agent: Googlebot
Noindex: /

User-agent: Googlebot
Disallow: /

User-agent: *
Disallow: /

This is what I use to block our dev sites from being indexed and we've had no issues.
-
Actually, there are two regex-style wildcard characters that robots.txt rules can handle: the asterisk (*) and the dollar sign ($).
You should test this one. I think it will work (about 95% sure; tested quickly in WMT):
User-agent: *
Disallow: /
Allow: /$
-
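To sanity-check how a wildcard-aware crawler would interpret that ruleset, here is a minimal Python sketch of Google-style matching: the longest matching rule wins, with ties going to Allow. This is an illustration of the documented matching behavior, not Google's actual code, so treat the results as a quick check rather than a guarantee:

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Check a robots.txt path pattern against a URL path,
    honoring the two wildcards: '*' (any sequence of characters)
    and '$' (end of URL)."""
    regex = ""
    for ch in pattern:
        if ch == "*":
            regex += ".*"
        elif ch == "$":
            regex += "$"
        else:
            regex += re.escape(ch)
    return re.match(regex, path) is not None

def is_allowed(path: str, allows, disallows) -> bool:
    """The longest matching rule wins; ties go to Allow."""
    best_allow = max((len(p) for p in allows if rule_matches(p, path)), default=-1)
    best_disallow = max((len(p) for p in disallows if rule_matches(p, path)), default=-1)
    return best_allow >= best_disallow

# The suggested ruleset: block everything, allow only the homepage.
allows = ["/$"]
disallows = ["/"]

print(is_allowed("/", allows, disallows))           # True  (homepage crawlable)
print(is_allowed("/blog/post", allows, disallows))  # False (inner page blocked)
```

Because "Allow: /$" (length 2) is a longer match for "/" than "Disallow: /" (length 1), the homepage stays crawlable while every other path falls through to the Disallow.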
I don't think that will work. Robots.txt doesn't handle full regular expressions. You would have to explicitly list all of the folders and files to be certain that nothing is indexed unless you want it to be found.
This is kind of an odd question. I haven't thought about something like this in a while. I usually want everything but a couple folders indexed. : ) I found something that may be a little more help. Try reading this.
If you're working with extensions, you can use **Disallow: /*.html$** (or .php, or what have you). That may get you closer to a solution.
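As a sketch, an extension-based ruleset along those lines might look like this (the extensions here are hypothetical; swap in whatever your site actually serves):

```
User-agent: *
Disallow: /*.html$
Disallow: /*.php$
```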
Definitely test this with a crawler that obeys robots.txt.
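One quick way to test without spinning up a full crawler is Python's built-in robots.txt parser. One caveat worth verifying for your Python version: urllib.robotparser implements the original exclusion spec and may not honor the * and $ wildcard extensions, so wildcard rules should also be checked against Google's own robots.txt tester. A minimal sketch with a plain prefix rule:

```python
from urllib import robotparser

# Parse a ruleset directly instead of fetching it over HTTP.
rp = robotparser.RobotFileParser()
rp.parse([
    "User-agent: *",
    "Disallow: /private/",
])

print(rp.can_fetch("*", "https://example.com/private/page"))  # False
print(rp.can_fetch("*", "https://example.com/index.html"))    # True
```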