Robots.txt file - How to block thousands of pages when you don't have a folder path

Unity

Hello.
Just wondering if anyone has come across this and can tell me if it worked or not.

Goal:
To block review pages

Challenge:
The URLs aren't constructed using folders, they look like this:
www.website.com/default.aspx?z=review&PG1234
www.website.com/default.aspx?z=review&PG1235
www.website.com/default.aspx?z=review&PG1236

So the first part of the URL is the same (i.e. /default.aspx?z=review) and the unique part comes immediately after it, not as a folder. Google's recommendations only show examples of how to block 'folder directories' and 'individual pages'.

Question:
If I add the following to the robots.txt file, will it block all review pages?

User-agent: *
Disallow: /default.aspx?z=review

Much thanks,
Davinia

Klarke

Also remember that blocking in robots.txt doesn't prevent Google from indexing those URLs. If the URLs are already indexed, or if they are linked to (either internally or externally), they may still appear in the index with limited snippet information. If so, you'll need to add a noindex meta tag to those pages. Bear in mind that Google has to be able to crawl a page to see that tag, so the tag won't be picked up on URLs that are still blocked in robots.txt.
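
If you want to confirm the tag has actually made it onto a review page, something like the rough Python sketch below could work. It only looks for a robots meta tag in HTML you hand to it. The names RobotsMetaParser and has_noindex are made up for this example, and it does not check the X-Robots-Tag HTTP header or any bot-specific meta names.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives found in <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        # Only <meta> tags are of interest; attribute names arrive lowercased.
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "robots":
            return
        content = attr.get("content") or ""
        # The content value is a comma-separated list, e.g. "noindex, follow".
        for directive in content.split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.append(directive)


def has_noindex(html: str) -> bool:
    """Return True if the HTML carries a robots meta tag that includes noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


if __name__ == "__main__":
    sample = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(has_noindex(sample))
```

You would feed it the HTML source of one of the review URLs, fetched however you normally fetch pages.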

Unity

An * added to the end! Great, thank you!

Klarke

http://support.google.com/webmasters/bin/answer.py?hl=en&answer=156449

Head down to the pattern matching section.

I think

User-agent: *
Disallow: /default.aspx?z=review*

should do the trick though.
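
If you want to sanity-check a rule before putting it live, here is a rough Python sketch of the pattern matching as I understand it from Google's documentation: a rule matches from the start of the path, * stands for any run of characters, and a trailing $ pins the rule to the end of the URL. The function name rule_matches is my own invention, and this is not a full robots.txt parser (no Allow/Disallow precedence, no user-agent groups).

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path plus query string."""
    # A trailing '$' anchors the rule to the end of the URL.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Treat every character literally, except '*', which matches any run of characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start only, so an unanchored rule is a prefix match.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    urls = [
        "/default.aspx?z=review&PG1234",
        "/default.aspx?z=review&PG1235",
        "/default.aspx?z=contact",
    ]
    for rule in ("/default.aspx?z=review", "/default.aspx?z=review*"):
        for url in urls:
            print(rule, url, rule_matches(rule, url))
```

Under this reading the rule behaves the same with or without the trailing *, because matching is by prefix anyway, so either version should catch the review URLs.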
