Rel canonical and duplicate subdomains

94501

Hi,

I'm working with a site that has multiple subdomains of entirely duplicate content. The production site that visitors see is (as a made-up illustrative example):

123abc456.edu

Then there are subdomains that different developers use to work on their own changes before those changes are pushed to production:

Larry.123abc456.edu

Moe.123abc456.edu

Curly.123abc456.edu

Google ends up indexing these duplicate subdomains, which is of course not good.

If we add a canonical tag to the head section of the production page (and therefore to all of the duplicate subdomains), will that cause some kind of problem, given that the production page would then have a canonical tag pointing to itself? Is it okay to have a canonical tag on a page pointing to that same page?

To complete the example...

In this example, where our production page is 123abc456.edu, our canonical tag on all pages (this page and therefore the duplicate subdomains) would be a rel="canonical" link tag pointing to 123abc456.edu.

Is that going to be okay and fix this without causing some new problem of a canonical tag pointing to the page it's on?
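To make the setup concrete, here is a minimal Python sketch of the idea in the question: whichever host serves a page, the emitted canonical tag targets the production host. The `canonical_tag` helper, the `PRODUCTION_HOST` constant, the `https` scheme, and the choice to drop query strings are all assumptions made for illustration, not details of the actual site.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit

# Assumed production hostname, taken from the made-up example above.
PRODUCTION_HOST = "123abc456.edu"


def canonical_tag(request_url: str, scheme: str = "https") -> str:
    """Build a rel=canonical tag that always targets the production host."""
    parts = urlsplit(request_url)
    # Keep the path, drop query and fragment, and swap in the production host.
    canonical = urlunsplit((scheme, PRODUCTION_HOST, parts.path or "/", "", ""))
    return f'<link rel="canonical" href="{escape(canonical, quote=True)}" />'
```

With this sketch, a page requested as Moe.123abc456.edu/about and the same page requested on production would both carry a tag pointing at 123abc456.edu/about.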

Thanks!

94501

Hi Bob,

That's an excellent question. I'll have to look into it and confirm. More later. Thanks!

bobjones

Is the subdomain data stored on the server as directories?

So, for example, is the Moe.123abc456.edu data stored in a folder like 123abc456.edu/Moe?

If so, you can simply have one robots.txt on your root domain, blocking those directories:

User-agent: *
Disallow: /Moe/
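A rule like this can be sanity-checked before it goes live. The sketch below uses Python's standard `urllib.robotparser` to test which URLs the rule blocks. The `is_allowed` helper, the `User-agent: *` line, and the example URLs are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content for the root domain, based on the rule above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Moe/
"""


def is_allowed(url: str, robots_txt: str = ROBOTS_TXT, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules let `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)
```

Crawlers read robots.txt separately for each hostname, so this rule only covers URLs requested as 123abc456.edu/Moe/... and would not apply to requests made to Moe.123abc456.edu itself.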

94501

Well, Bob, it looks like you're right! I guess it will for sure see all the pages in

Moe.123abc456.edu

as the ones to remove and not

123abc456.edu

Also, how do we keep that robots.txt file from being pushed to production when the developer working on that branch completes his work and pushes it live?

I must confess, it still feels a little like bomb disposal.
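One possible way to defuse the "pushed to production" worry is to generate robots.txt from the requested hostname rather than storing a different file in each branch, so the same code is safe on every host. This is only a sketch under assumptions: the `robots_txt_for` function and the `PRODUCTION_HOSTS` set are hypothetical, and how the response is wired into the web server is left out.

```python
# Assumed set of hostnames that should stay crawlable.
PRODUCTION_HOSTS = {"123abc456.edu"}


def robots_txt_for(host: str) -> str:
    """Return robots.txt content based on the Host header of the request."""
    # Strip an optional port and normalize case before comparing.
    hostname = host.split(":")[0].strip().lower()
    if hostname in PRODUCTION_HOSTS:
        # An empty Disallow value permits crawling of everything.
        return "User-agent: *\nDisallow:\n"
    # Any other host (Larry, Moe, Curly, ...) blocks all crawlers.
    return "User-agent: *\nDisallow: /\n"
```

Because the decision depends on the hostname at request time, merging a developer branch into production cannot carry a blocking rule along with it.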

bobjones

This should be exactly what you need: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1663427

94501

Hi Bob,

Thanks for the suggestion/question. I'm thinking about that, but wouldn't adding a robots "do not crawl" rule for pages that are already indexed be a little like closing the barn door after the horses have left? Do you think it would un-index the already crawled subdomains? Thanks!

bobjones

Assuming that you do not need the development environments indexed in Google, why not simply block all crawlers on those subdomains?
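Besides robots.txt, another way to keep development hosts out of the index is to send an `X-Robots-Tag: noindex` response header on every non-production host. The sketch below shows only the decision logic. The `crawler_headers` function and its `production_host` parameter are hypothetical names, and attaching the headers to real responses depends on the server or framework in use.

```python
from typing import Dict


def crawler_headers(host: str, production_host: str = "123abc456.edu") -> Dict[str, str]:
    """Return extra response headers that tell crawlers not to index dev hosts."""
    # Strip an optional port and normalize case before comparing.
    hostname = host.split(":")[0].strip().lower()
    if hostname == production_host:
        return {}  # production stays indexable
    return {"X-Robots-Tag": "noindex, nofollow"}
```

A crawler has to be able to fetch a page in order to see this header, so it would not take effect on URLs that robots.txt already blocks.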
