Duplicate Page Errors
-
Hey guys,
I'm wondering if anyone can help... Here is my issue...
Our website:
http://www.cryopak.com
It's built on Concrete 5 CMSI'm noticing a ton of duplicate page errors (9530 to be exact). I'm looking at the issues and it looks like it is being caused by the CMS. For instance the home page seems to be duplicating..
http://www.cryopak.com/en/
http://www.cryopak.com/en/?DepartmentId=67
http://www.cryopak.com/en/?DepartmentId=25
http://www.cryopak.com/en/?DepartmentId=4
http://www.cryopak.com/en/?DepartmentId=66Do you think this is an issue? Is their anyway to fix this issue? It seems to be happening on every page.
Thanks
Jim
-
Thanks everyone for the help. This should def. help clean up some of the problems that I've been having with the website.
-
I ran a crawl with Xenu (similar to what Donna did with Screaming Frog), and came across some deep page that may be causing this problem. For example, on this page...
...the last link to "phase change material" goes to:
http://www.cryopak.com/product_line/default.aspx?DepartmentId=67
...which then redirects to...
http://www.cryopak.com/en/?DepartmentId=67
It seems like multiple pages share that template, so one canonical tag might clean up a lot. I'd have to understand the site structure a lot better to advise, though. Google doesn't seem to be indexing these URLs, so they probably aren't a huge problem, but they could be diluting your ranking power. It's worth cleaning them up.
-
James,
I did a scan of your site. Your problem appears to have several sources. Do you know how to use the screamingfrog scan utility? It's free for sites with less than 500 pages. When I ran a scan on your site, looking only at the html pages, i came up with 283.
- You have search result pages indexed that shouldn't be. They'll look like duplicates to Google.
- You have product pages that contain a lot of the same content, for example http://www.cryopak.com/en/cold-chain-packaging/pre-qualified-shipping-containers/timesaver-2-8-c-series/timesaver24-24-hour-pre-qualified-shipper/ and http://www.cryopak.com/en/cold-chain-packaging/pre-qualified-shipping-containers/timesaver-2-8-c-series/timesaver48-48-hour-pre-qualified-shipper/ (24-24-hour vs 48-48-hour).
- You have different pages with the exact same title tag.
- You have some pages that are identical with one extra character in the URL e.g. http://www.cryopak.com/en/about and http://www.cryopak.com/en/about/. See that extra slash at the end?
I suggest you run and scan and inventory to get a good idea of where your problems are.
I'm not seeing your http://www.cryopak.com/en/?DepartmentId=xx (where xx represents 67, 25, 4 and 66) in the scan results. They're not redirecting and I don't see a canonical tag in the source code so I don't know what to tell you about those.
If it helps, I can direct message you a CSV file with the results of my scan.
-
Okay.. well I don't see any duplicate page issues in webmaster tools but I only see them in the SEO Moz Craw Errors report. So if they aren't showing up in webmaster tools should I really worry about this???
I can't edit those pages individually because those pages don't exist they are just a product of the CMS system generating those URL strings with the numbers. So I don't think I can canonical tag those pages.
I guess I can group them together and do 301 redirects??
Yes.. http://cryopak.spydertrapdev.com/ is just a dev environment.
-
Hi James,
I suggest you canonical the duplicate pages rather than 301 redirect them. Using canonical tags instead of 301 redirects will allow you to preserve any incoming link equity from external links to those pages. With a 301 redirect, you'll lose that equity.
David may have run your site through Open Site Explorer (OSE) and seen that there's very few incoming links to the duplicate pages and therefore felt it unnecessary to canonicalize them. I see only 8 from the example you gave us above, but don’t want to assume that’s all there is, especially when you're saying you see duplicates on the site, If you have webmaster tools set up, you can get a more exhaustive list of incoming links there.
The other thing I noticed is that the incoming links to the sample pages are coming from a cryopak subdomain on another site. Here are the ones I can see using OSE.
|
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=25
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=4
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=66
http://cryopak.spydertrapdev.com/product_line/default.aspx?DepartmentId=67
|
I get an error when I try to look at spydertrapdev.com so can't tell if that's a development environment that's been set up for your site or what. These may not be links you want to maintain. You’ll have to decide.
Good luck.
Donna
-
There are two ways to fix this.
First is to redirect all the pages to the proper home page, using a 301. Duplicate pages are bad for seo. Google likes to see one set of content, for each URL. See the webmaster tools article on duplicate content here.
Second is to go into webmaster tools, and set the true URL for this page, using the "URL parameters" function. This way, you can set the proper version of the page, so Google knows what to index. Be very careful when doing this, as you can mess up the way Google sees your site. There is a video on the link, I would watch it, and do a bit of reading first.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Blog archive pages are meta noindexed but still flagged as duplicate
Hi all. I know there several threads related to noindexing blog archives and category pages, so if this has already been answered, please direct me to that post. My blog archive pages have preview text from the posts. Each time I post a blog, the last post on any given archive page shifts to the first spot on the next archive page. Moz seems to report these as new duplicate content issues each week. I have my archive pages set to meta noindex, so can I feel good about continuing to ignore these duplicate content issues, or is there something else I should be doing to prevent penalties? TIA!
Technical SEO | | mkupfer1 -
How to explain "No Return Tags" Error from non-existing page?
In the Search Console of our Google Webmaster account we see 3 "no return tags" errors. The attached screenshot shows the detail of one of these errors. I know that annotations must be confirmed from the pages they are pointing to. If page A links to page B, page B must link back to page A, otherwise the annotations may not be interpreted correctly. However, the originating URL (/#!/public/tutorial/website/joomla) doesn't exist anymore. How could these errors still show up? Screenshot%202016-07-11%2017.36.27.png?dl=0
Technical SEO | | Maximuxxx0 -
404 Errors for Form Generated Pages - No index, no follow or 301 redirect
Hi there I wonder if someone can help me out and provide the best solution for a problem with form generated pages. I have blocked the search results pages from being indexed by using the 'no index' tag, and I wondered if I should take this approach for the following pages. I have seen a huge increase in 404 errors since the new site structure and forms being filled in. This is because every time a form is filled in, this generates a new page, which only Google Search Console is reporting as a 404. Whilst some 404's can be explained and resolved, I wondered what is best to prevent Google from crawling these pages, like this: mydomain.com/webapp/wcs/stores/servlet/TopCategoriesDisplay?langId=-1&storeId=90&catalogId=1008&homePage=Y Implement 301 redirect using rules, which will mean that all these pages will redirect to the homepage. Whilst in theory this will protect any linked to pages, it does not resolve this issue of why GSC is recording as 404's in the first place. Also could come across to Google as 100,000+ redirected links, which might look spammy. Place No index tag on these pages too, so they will not get picked up, in the same way the search result pages are not being indexed. Block in robots - this will prevent any 'result' pages being crawled, which will improve the crawl time currently being taken up. However, I'm not entirely sure if the block will be possible? I would need to block anything after the domain/webapp/wcs/stores/servlet/TopCategoriesDisplay?. Hopefully this is possible? The no index tag will take time to set up, as needs to be scheduled in with development team, but the robots.txt will be an quicker fix as this can be done in GSC. I really appreciate any feedback on this one. Many thanks
Technical SEO | | Ric_McHale0 -
Why are some pages now duplicate content?
It is probably a silly question, but all of a sudden, the following pages of one of my clients are reported as Duplicate content. I cannot understand why. They weren't before... http://www.ciaoitalia.nl/product/pizza-originale/mediterranea-halal
Technical SEO | | MarketingEnergy
http://www.ciaoitalia.nl/product/pizza-originale/gyros-halal
http://www.ciaoitalia.nl/product/pizza-originale/döner-halal
http://www.ciaoitalia.nl/product/pizza-originale/vegetariana
http://www.ciaoitalia.nl/product/pizza-originale/seizoen-pizza-estate
http://www.ciaoitalia.nl/product/pizza-originale/contadina
http://www.ciaoitalia.nl/product/pizza-originale/4-stagioni
http://www.ciaoitalia.nl/product/pizza-originale/shoarma Thanks for any help in the right direction 🙂 | |
| |
| |
| |
| |
| |
| |
| | <colgroup><col style="mso-width-source: userset; mso-width-alt: 17225; width: 353pt;" width="471"></colgroup>
| http://www.ciaoitalia.nl/product/pizza-originale/mediterranea-halal |
| http://www.ciaoitalia.nl/product/pizza-originale/gyros-halal |
| http://www.ciaoitalia.nl/product/pizza-originale/döner-halal |
| http://www.ciaoitalia.nl/product/pizza-originale/vegetariana |
| http://www.ciaoitalia.nl/product/pizza-originale/seizoen-pizza-estate |
| http://www.ciaoitalia.nl/product/pizza-originale/contadina |
| http://www.ciaoitalia.nl/product/pizza-originale/4-stagioni |
| http://www.ciaoitalia.nl/product/pizza-originale/shoarma |0 -
Can iFrames count as duplicate content on either page?
Hi All Basically what we are wanting to do is insert an iframe with some text on onto a lot of different pages on one website. Does google crawl the content that is in an iFrame? Thanks
Technical SEO | | cttgroup0 -
After I 301 redirect duplicate pages to my rel=canonical page, do I need to add any tags or code to the non canonical pages?
I have many duplicate pages. Some pages have 2-3 duplicates. Most of which have Uppercase and Lowercase paths (generated by Microsoft IIS). Does this implementation of 301 and rel=canonical suffice? Or is there more I could do to optimize the passing of duplicate page link juice to the canonical. THANK YOU!
Technical SEO | | PFTools0 -
Duplicate Page Content error but I can't see it
Hi All We're getting a lot of Duplicate Page Content errors but I can't match it up. For example this page: http://www.daytripfinder.co.uk/attractions/32-antique-cottage It is saying the on page properties as follows: Title DayTripFinder - Things to do reviewed by you - 7,000 attractions <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">Meta Description</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">Read Reviews, Browse Opening Hours and Prices. View Photos, Maps. 7,000 UK Visitor Attractions.</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">But this isn't the page title or meta description.
Technical SEO | | KateWaite85
</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">And it's showing five (many others) example pages that share it. Again the page titles and description are different.</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">http://www.daytripfinder.co.uk/attractions/mckinlay-theatre</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">http://www.daytripfinder.co.uk/attractions/bakers-dolphin</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">http://www.daytripfinder.co.uk/attractions/shipley-park-fishing</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">http://www.daytripfinder.co.uk/attractions/king-johns-lodge-and-gardens</dt> <dt style="color: #5e5e5e; font-family: Helvetica, Arial, sans-serif; font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; line-height: normal;">http://www.daytripfinder.co.uk/attractions/city-hall
</dt> Any ideas? Not sure if I'm missing something here! Thanks!0 -
Duplicate pages
Hi Can anyone tell me why SEO MOZ thinks these paes are duplicates when they're clearly not? Thanks very much Kate http://www.katetooncopywriter.com.au/how-to-be-a-freelance-copywriter/picture-1-58/ http://www.katetooncopywriter.com.au/portfolio/clients/other/ http://www.katetooncopywriter.com.au/portfolio/clients/travel/ http://www.katetooncopywriter.com.au/webservices/what-i-do/blog-copywriter/
Technical SEO | | ToonyWoony0