WordPress Duplicate Content Issues
-
Everyone knows that WordPress has some duplicate content issues with tags, archive pages, category pages etc...
My question is, how do you handle these issues?
Is the smart strategy to use robots meta and add no follow/ no index category pages, archive pages tag pages etc?
By doing this are you missing out on the additional internal links to your important pages from you category pages and tag pages?
I hope this makes sense.
Regards,
Bill
-
Hey Bill
I like to start with this standard setup (image/chart from my wordpress post on moz);
Pages, Posts, Categories - Index
Tags, Dated Archives, Subpages, Author Archives - noindex
You can check out the full post - I will be updating the Yoast Screenshots very soon!
-Dan
-
Thanks for article,
Now 2 years ahead, are there any important updates for preventing duplicate content/titles?
-
Most of the Plugins for wordpress use canonical urls.
-
Unless I'm missing something here, wouldn't it be easier to set the canonical tag for the main post? There are also plugins like SEO Ultimate that handle this automatically.
-
I posted this article I wrote the other day for someone asking a similar question.
With the Yoast SEO Plugin I no-index everything except Categories. You can see how I set mine up under section 3. Indexation.
Here is the original question that Sha submitted:
http://www.seomoz.org/q/what-is-with-wordpress-dupe-issues -
Bill-
There are several SEO plugs available for WP that will handle these issues. Yes, you are right that adding "noindex" will be beneficial on tag, category, and archive pages. The idea here is avoiding duplicate content issues. BTW, check out: Yoast SEO for Wordpress.
Here is how the values for the robots meta tag work:
- noindex will keep a page from being crawled
- nofollow will prevent a page's links from being followed
I agree with noindex'ing these pages; though I would argue that a nofollow is still worth leaving out. If these pages have any juice you want to allow this to flow to the other links on the page.
-
The WP on my blog is set up as follows (this is a blog that gets between four and ten short posts per day - about two to four sentences, each post linking to an article or other content on a topic-related website)
Homepage: Full text of the most recent 25 posts are displayed. Pagination pages are not indexed (blocked by robots.txt).
Post Pages: Full text is displayed and the title plus a few words of 20 related posts are displayed.
Category Pages: I have over 100 categories and each post is placed into at least two categories (one by location and one by topic). Some posts go into three or four categoreis - sometimes more. Each category page displays the full text of the most recent 25 posts. Categories do not have pagination pages (blocked by robots.txt).
All of the above pages are fully indexed and a long list of category pages appears in the left-side navigation. I don't use tag pages or archive pages. There is a lot of dupe content in this system but so far I am lucky that it does not cause a problem. The category pages pull a lot of organic search traffic.
In January of each year I delete all of the posts that are over a year old. Before doing that I identify those that are pulling reasonable traffic and either redirect them to a permanent page about same topic, write an article about that topic and redirect, or recycle that post. All the rest are redirected to the homepage of the blog.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Possible duplicate content issue with my Blog and archive pages . Any help greatly appreciated
Dear Mozzers, I have been looking at my news section on my eCommerce site and I think I may have a duplicate content issue and wanted some advice on whether I do and if so , how best I handle this. http://www.website.co.uk/news
Technical SEO | | PeteC12
http://www.website.co.uk/news/page:1
http://www.website.co.uk/news/page:2
http://www.website.co.uk/news/page:3
http://www.website.co.uk/news/limit:9999 (This is show all) I also have the ability of showing articles by month : http://www.website.co.uk/news/archive/2015/04 (April)
http://www.website.co.uk/news/archive/2015/03 (March)
http://www.website.co.uk/news/archive/2015/02 (Feb)
http://www.website.co.uk/news/archive/2015/01 (Jan) I am wondering if there's a duplicate issue here or not given that I also articles by month as well and if so how best I handle this.? I already do pagination on my news pages (page 1 , page 2) by using rel=next and rel=Prev but I don't have an canconical or anything as yet. I enclose a couple of links if this would help and would appreciate if someone could take a browse. I have a View All link on my homepage for for all news items - http://goo.gl/JPPIvQ I which have a different urls - March 2015 Articles - http://goo.gl/0O1wYD and April 2015 articles - http://goo.gl/GdW2oK On another note, These articles are also linked to from the relevant category landing pages on my website to help with SEO. I have not used H tags on the article links in my landing pages , just displaying the weblink back to the news article.I've done this to try and improve the PR and rankings of my landing pages. Just wondered if anyone has any comments as to whether thats a good or bad idea and whether I could improve it in any way - An example is here (scroll down the page to the pressure washing guides) - http://goo.gl/nnRE49 Thanks Pete0 -
Handling of Duplicate Content
I just recently signed and joined the moz.com system. During the initial report for our web site it shows we have lots of duplicate content. The web site is real estate based and we are loading IDX listings from other brokerages into our site. If though these listings look alike, they are not. Each has their own photos, description and addresses. So why are they appear as duplicates – I would assume that they are all too closely related. Lots for Sale primarily – and it looks like lazy agents have 4 or 5 lots and input the description the same. Unfortunately for us, part of the IDX agreement is that you cannot pick and choose which listings to load and you cannot change the content. You are either all in or you cannot use the system. How should one manage duplicate content like this? Or should we ignore it? Out of 1500+ listings on our web site it shows 40 of them are duplicates.
Technical SEO | | TIM_DOTCOM0 -
Duplicate Post Titles in WordPress with
Hi, First off, this is a WordPress specific question. we migrated a site into WordPress, with hundreds of long articles that are split into 3 or 4 pages each. Each of these articles was entered as an individual post, split into different pages using the tag. We're using the yoast seo plugin. The problem then is that each of the pages gets the same title and meta description. Has anyone a good solution to differentiate the pages? Thanks,
Technical SEO | | Andybod1 -
Duplicate Content Issue
SEOMOZ is giving me a number of duplicate content warnings related to pages that have an email a friend and/or email when back in stock versions of a page. I thought I had those blocked via my robots.txt file which contains the following... Disallow: /EmailaFriend.asp Disallow: /Email_Me_When_Back_In_Stock.asp I had thought that the robot.txt file would solve this issue. Anyone have any ideas?
Technical SEO | | WaterSkis.com0 -
Duplicate Content Issue
Very strange issue I noticed today. In my SEOMoz Campaigns I noticed thousands of Warnings and Errors! I noticed that any page on my website ending in .php can be duplicated by adding anything you want to the end of the url, which seems to be causing these issues. Ex: Normal URL - www.example.com/testing.php Duplicate URL - www.example.com/testing.php/helloworld The duplicate URL displays the page without the images, but all the text and information is present, duplicating the Normal page. I Also found that many of my PDFs seemed to be getting duplicated burried in directories after directories, which I never ever put in place. Ex: www.example.com/catalog/pdfs/testing.pdf/pdfs/another.pdf/pdfs/more.pdfs/pdfs/ ... when the pdfs are only located in a pdfs directory! I am very confused on how to fix this problem. Maybe with some sort of redirect?
Technical SEO | | hfranz0 -
404's and duplicate content.
I have real estate based websites that add new pages when new listings are added to the market and then deletes pages when the property is sold. My concern is that there are a significant amount of 404's created and the listing pages that are added are going to be the same as others in my market who use the same IDX provider. I can go with a different IDX provider that uses IFrame which doesn't create new pages but I used a IFrame before and my time on site was 3min w/ 2.5 pgs per visit and now it's 7.5 pg/visit with 6+min on the site. The new pages create new content daily so is fresh content and better on site metrics (with the 404's) better or less 404's, no dup content and shorter onsite metrics better? Any thoughts on this issue? Any advice would be appreciated
Technical SEO | | AnthonyLasVegas0 -
What are some of the negative effects of having duplicate content from other sites?
This could include republishing several articles from another site with permission.
Technical SEO | | Charlessipe0 -
Are RSS Feeds deemed duplicate content?
If a website content management system includes built-in feeds of different categories that the client can choose from, does that endanger them of having duplicate content if their categories are the same as another client's feed? These feeds appear on templated home page designs by default. Just trying to figure out how big of an issue these feeds are in terms of duplicate content across clients' sites. Should I be concerned? Obviously, there's other content on the home page besides the feed and have not really seen negative effects, but could it be impacting results?
Technical SEO | | KyleNeuberger0