Accidental Noindex/Mis-Canonicalisation - Please help!
-
Hi everybody,
I was hoping somebody might be able to help as this is an issue my team and I have never come across before.
A client of ours recently migrated to a new site design. 301 redirects were properly implemented and the transition was fairly smooth.
However, we realised soon after that a sub-section of pages had either one or both of the following errors:
- They featured a canonical tag pointing to the wrong page
- They featured the 'meta noindex' tag
After realising this, both the canonicals and the noindex tags were immediately removed. However, Google crawled the site while these were in place and the pages subsequently dropped out of Google's index.
We re-submitted the affected pages to Google's index and used WMT to 'Fetch' the pages as Google. We have also since 'allowed' the pages in the robots.txt file as an extra measure.
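For anyone wanting to double-check the robots.txt side, here is a rough sketch of that kind of check using Python's standard `urllib.robotparser`. The robots.txt content and the example.com URLs are made up for illustration, and the standard-library parser only approximates how Googlebot reads the file, so treat the result as a sanity check rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt content (an assumption, not the real file).
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

# Hypothetical URLs standing in for the affected landing pages.
for page in (
    "http://www.example.com/landing-page.html",
    "http://www.example.com/private/page.html",
):
    status = "allowed" if is_crawlable(SAMPLE_ROBOTS, page) else "blocked"
    print(page, status)
```

A page that is blocked here could not be re-crawled, so Google would never see that the noindex tag has been removed.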
We found that the pages which had only the noindex tag were re-indexed immediately, while the pages which had both the noindex tag and the wrong canonical are still not being re-indexed.
Can anyone think of a reason why this might be the case? One of the pages which featured both tags was one of our most important organic landing pages, so we're eager to resolve this.
Any help or advice would be appreciated.
Thanks!
-
I'm not sure how helpful it is, in the sense of being good news, but I did something like this to one of my sites on purpose once, and wrote it up:
http://www.seomoz.org/blog/catastrophic-canonicalization
A couple of tips:
(1) I think what Oleg is saying, and I agree with it, is that if Page A had a canonical to Page B, then instead of just removing the canonical tag, you should put in a canonical tag pointing from Page A to Page A. Sometimes the self-referencing canonical will help override the old/bad canonical.
(2) Fetch is a good bet, but I'd also re-submit an XML sitemap with just the "bad" URLs. It's not a cure-all, but it can help nudge Google.
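If there are more than a handful of pages, tip (1) can be spot-checked with a small script. What follows is only a minimal sketch using Python's standard `html.parser`; the class and function names and the example.com URLs are my own inventions for illustration, and fetching the live HTML is left out. It flags any page that still carries a noindex directive or whose canonical does not point back at itself.

```python
from html.parser import HTMLParser


class IndexingTagParser(HTMLParser):
    """Collects the rel=canonical href and any robots noindex directive."""

    def __init__(self):
        super().__init__()
        self.canonical = None
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        # Attribute values can be None (e.g. <link rel>), so normalise to "".
        attr = {name: (value or "") for name, value in attrs}
        if tag == "link" and "canonical" in attr.get("rel", "").lower().split():
            self.canonical = attr.get("href")
        elif tag == "meta" and attr.get("name", "").lower() in ("robots", "googlebot"):
            if "noindex" in attr.get("content", "").lower():
                self.noindex = True


def audit_page(url, html):
    """Return a list of indexing problems found in html for the page at url."""
    parser = IndexingTagParser()
    parser.feed(html)
    problems = []
    if parser.noindex:
        problems.append("noindex present")
    if parser.canonical is None:
        problems.append("no canonical tag")
    elif parser.canonical != url:
        # Exact string comparison: trailing slashes or http/https
        # differences are NOT normalised in this sketch.
        problems.append(f"canonical points to {parser.canonical}")
    return problems


# Hypothetical Page A that still has both problems.
page_a = "http://www.example.com/page-a.html"
html = (
    '<head><meta name="robots" content="noindex, follow">'
    '<link rel="canonical" href="http://www.example.com/page-b.html"></head>'
)
print(audit_page(page_a, html))
```

An empty list means the page has a self-referencing canonical and no noindex, which is the state you want Google to find on its next crawl.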
Unfortunately, it really can take time to sort out. Make sure your internal links are correct as well. You could temporarily build new internal links (list a few resources on your home-page, for example) to push link-juice temporarily. You could also post the proper URLs on Twitter/FB, etc., to kick them a bit. Of course, that only works for a few pages, not for hundreds.
-
Yes, it may just be a waiting game, as Oleg mentioned. But to help speed up the process, you could link to some of those pages from a higher-level page (like the homepage or a department landing page). Don't spam though: no more than 100 links on a page (including navigation, footer, etc.).
I'd also recommend having an XML sitemap with all the URLs of your website on it. You'll need to upload this to Google Webmaster Tools as well.
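If your CMS doesn't produce a sitemap for you, a basic one is easy to generate. This is a minimal sketch using Python's standard `xml.etree.ElementTree` and the sitemaps.org 0.9 namespace; the `build_sitemap` name and the example.com URLs are placeholders I made up. The same function works whether you pass in every URL on the site or only the handful of affected ones.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML document (as a str) listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        url_el = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" in the URL text.
        ET.SubElement(url_el, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


# Hypothetical URLs standing in for the pages you want re-crawled.
affected = [
    "http://www.example.com/landing-page.html",
    "http://www.example.com/category/widgets.html",
]
print(build_sitemap(affected))
```

Save the output as a UTF-8 file (for example sitemap.xml) before uploading it.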
When they do get re-indexed, keep an eye on how they have been indexed: look at which keywords bring up each page in the SERPs (Raven Tools is an easy way to track keywords and see which URL comes up). If you find that 'odd' pages are being indexed for a certain keyword search, you should do some link building specific to that keyword, pointing to the page/URL you want ranked.
Good luck!
Davinia
-
Hi Oleg,
Thanks for your response. Unfortunately the canonical URL was another of our main organic landing pages so a redirect wouldn't be appropriate in this situation.
I agree that it's just a matter of time but it's frustrating that Google has crawled the site since we updated the pages and still hasn't re-indexed the page in question.
-
Can you set a canonical/redirect on the page that was incorrect pointing back to the correct page?
i.e. if page1.html had a wrong canonical pointing to pgae1.html, change the canonical on pgae1.html to point to page1.html.
Overall, I think it's just a matter of time before Google is able to recrawl and fix itself... it's odd that canonical + noindex is slower than just noindex. Do whatever you can to get Google to recrawl the pages.