Duplicate Content?
-
My site has been archiving our newsletters since 2001. It's been helpful because our site visitors can search a database for ideas from those newsletters. (There are hundreds of pages with similar titles: archive1-Jan2000, archive2-feb2000, archive3-mar2000, etc.)
But I see they are being marked as "similar content," even though the actual page content is not the same. Could this adversely affect SEO? And if so, how can I correct it?
Would a separate folder of archived pages with a "nofollow robot" solve this issue? And would my site visitors still be able to search within the site with a nofollow robot?
-
Cool. No worries
Stack Overflow has always been awesome in helping me with my IIS rules and such.
If you Google: site:stackoverflow.com apache redirect
You will see MANY examples of how to set up 301 redirects, including redirecting from non-www to www pages, etc.
Hope this helps.
Mike
-
Yes, on Google Webmaster Tools... sorry. And it's Apache.
thank u!
-
Google Analytics or Google Webmaster Tools? You will need to do that in Webmaster Tools.
It's a bummer that they are having issues with your 301 redirects. If you know whether your backend runs Apache, IIS, etc., you could post the code you are using in a new question, and hopefully someone in the SEOMoz community can help. Otherwise, there are Apache and IIS forums where you can post and find good examples to base your redirects on.
Good luck Sarah! I hope you get your site in shape and back on page 1!!!
Mike
-
Hi Mike,
Thank you. Changing all the titles is a huge task; there are hundreds and hundreds of pages. I think I'll put them in a folder and mark the page link to that folder with a nofollow. As to the canonicalization of the two names, I have marked one of them as the top one in Google Analytics. But I have a much greater problem than that. I have several domain names that are on the same server and that all point to the one domain (same files and folders). I have been attempting to get my server techs to do a 301 redirect so that only http://www.sundayschoolnetwork.com displays in a browser. However, every time they attempt to do it, part or all of my site stops working correctly.
-
You can go back and fix all of your old title tags, making them unique, like Newsletter Archive | Month Year | Sunday School Network, which will get rid of your errors and provide a better user experience. This approach will allow you to target specific keywords on each page for ranking in Google. When you have the same title across multiple pages, the assumption is that the content is either the same or very similar.
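With hundreds of pages, the retitling could probably be scripted rather than done by hand. Here is a rough Python sketch of the idea. It assumes the filenames follow the pattern seen in the crawl report (archive13_Apr10.html, i.e. a three-letter month and a two-digit year) and that every two-digit year means 20xx; the function name `unique_title` is just made up for illustration.

```python
import re

# Month abbreviations as they appear in the archive filenames (e.g. "Apr", "Feb").
MONTHS = {
    "jan": "January", "feb": "February", "mar": "March", "apr": "April",
    "may": "May", "jun": "June", "jul": "July", "aug": "August",
    "sep": "September", "oct": "October", "nov": "November", "dec": "December",
}

# Assumed filename pattern: archive<number>_<Mon><YY>.html
ARCHIVE_RE = re.compile(r"archive\d+_([A-Za-z]{3})(\d{2})\.html$")


def unique_title(filename, site_name="Sunday School Network"):
    """Build a per-page title from an archive filename; None if it does not match."""
    match = ARCHIVE_RE.search(filename)
    if match is None:
        return None
    month = MONTHS.get(match.group(1).lower())
    if month is None:
        return None
    year = 2000 + int(match.group(2))  # assumption: two-digit years are all 20xx
    return f"Newsletter Archive | {month} {year} | {site_name}"


if __name__ == "__main__":
    for name in ["archive13_Apr10.html", "archive13_Feb06.html"]:
        print(name, "->", unique_title(name))
```

The generated strings would still need to be written into each page's title tag, and files that do not match the assumed pattern come back as None so they can be handled by hand.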
I noticed you have a canonical issue, where you can access your site via http://sundayschoolnetwork.com as well as http://www.sundayschoolnetwork.com.
The issue with this is that you have 44 relatively important links from external websites pointing to the non-www version (http://sundayschoolnetwork.com), which means you are splitting your potential ranking power between two versions of the site instead of concentrating it on one. There are many ways you can fix this.
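One common fix is a 301 redirect from every non-canonical hostname to the www version. The real rule would live in your web server configuration, not in Python, but the decision it has to make can be sketched like this. The function name `redirect_target` is hypothetical, and the sketch assumes only the hostname should change while the path and query string are preserved.

```python
from urllib.parse import urlsplit, urlunsplit

# The one hostname that should be shown in the browser.
CANONICAL_HOST = "www.sundayschoolnetwork.com"


def redirect_target(url, canonical_host=CANONICAL_HOST):
    """Return the URL a 301 should point to, or None if the URL is already canonical."""
    parts = urlsplit(url)
    # urlsplit lowercases the hostname, so a direct comparison is enough.
    if parts.hostname is None or parts.hostname == canonical_host:
        return None
    # Keep scheme, path, query and fragment; swap only the host.
    return urlunsplit(
        (parts.scheme, canonical_host, parts.path, parts.query, parts.fragment)
    )


if __name__ == "__main__":
    print(redirect_target("http://sundayschoolnetwork.com/archive13_Apr10.html"))
    print(redirect_target("http://www.sundayschoolnetwork.com/"))
```

Returning None for requests that already use the canonical host is the part that keeps a rule like this from redirecting to itself.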
As for why you are not ranking as well, it could be the market became more competitive for the keywords you were originally using. It could be that your site content does not reflect the keywords you are targeting. It could be lots of things.
Like I said in my previous post, the nofollow tells crawlers not to follow the internal and external links on those pages; however, they will still get indexed. This means that you will still have duplicate titles appearing in results. The way to remove them from the results would be to use the noindex directive - which will eventually remove them from the index and you will not have competing title tags.
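To check which of these directives a page actually carries, you could scan its HTML for the robots meta tag. This is a minimal Python sketch using only the standard library. It assumes the directives are set through a `<meta name="robots" content="...">` tag in the page (not through an HTTP header), and the names `RobotsMetaParser` and `robots_directives` are made up for illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives from any <meta name="robots"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. a bare attribute), so normalise them.
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() != "robots":
            return
        for item in attr_map.get("content", "").split(","):
            item = item.strip().lower()
            if item:
                self.directives.add(item)


def robots_directives(html_text):
    """Return the set of robots meta directives found in html_text."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    return parser.directives


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(robots_directives(page))
```

A page that reports "nofollow" but not "noindex" matches the situation described above: its links are not followed, but the page itself can still be indexed.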
If you fix your title tags, you do not need to worry about the nofollow or noindex directives.
That is about all I can help with, without knowing any additional information.
The only other thing I can suggest is to read the SEOMoz Beginner's Guide to SEO, which will help a TON!
I hope that helps.
Mike
-
thank u. I'm gonna do that!
-
Hi Mike,
That was fast. I copied some of the report from the SEOmoz "Crawl Diagnostics" below. Some do have the same titles, which was an addition after many years. The early newsletters I didn't even title, so they have a "default title" of the URL.
I happened on SEOmoz because I am trying to figure out why, after so many years of being on the first or second page of Google search results, we are lucky to show up on page 10 or deeper, if at all.
So I'm trying out SEOmoz to see if this will help us get back on top!
Page title | URL | three numeric report columns
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive13_Apr10.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive13_Apr11.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive13_Apr12.html | 1 | 18 | 1
http://sundayschoolnetwork.com/archive13_Feb06.html | http://sundayschoolnetwork.com/archive13_Feb06.html | 1 | 18 | 1
http://sundayschoolnetwork.com/archive13_Feb07.html | http://sundayschoolnetwork.com/archive13_Feb07.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive14_Apr08.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive14_Apr09.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive14_Apr11.html | 1 | 18 | 1
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry! | http://sundayschoolnetwork.com/archive14_Apr12.html | 1 | 18 | 1
http://sundayschoolnetwork.com/archive14_Feb06.html
-
Hi Sarah,
If the titles are different and the page content is different, I do not understand why you should be getting any errors.
What tool are you using that is giving you the "similar content" message?
Your site visitors will still be able to search your site with nofollow in place, because nofollow is simply a directive telling search engines to not follow the internal and external links on your page.
The noindex directive tells Google to not index the content on the selected pages.
If you can provide me with the name of the tool you are receiving the "similar content" message from and/or provide me with your website address I could take a look into things further.
Long story short, if your titles are unique and your content is unique, you should not have to worry about duplicate content.
Hope this helps,
Mike
-
The best way to go is to put all your newsletters in one folder and disallow the folder in your robots.txt.
rel nofollow and robots.txt are only read by search engine crawlers such as Googlebot, so your visitors won't be affected and will be able to navigate and search the archives without problem.
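If you go this route, you can sanity-check the rule before publishing it. Here is a small Python sketch using the standard library's robots.txt parser. The folder name /archives/ is only an assumption for illustration (use whatever folder you actually create), and `crawler_may_fetch` is a made-up helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: block every crawler from the archive folder.
ROBOTS_TXT = """\
User-agent: *
Disallow: /archives/
"""


def crawler_may_fetch(url, user_agent="Googlebot", robots_txt=ROBOTS_TXT):
    """Return True if the given crawler is allowed to fetch url under robots_txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    base = "http://www.sundayschoolnetwork.com"
    print(crawler_may_fetch(base + "/archives/archive13_Apr10.html"))
    print(crawler_may_fetch(base + "/index.html"))
```

Keep in mind that a Disallow rule only asks crawlers not to fetch those pages. It does not change anything for visitors or for the site's own search.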