Resolving duplicate text issues with a duplicate image?
-
We are a listing site for programs overseas. Many of our listings are inherently the same content, because in many cases the same exact information applies. We have resolved duplicate content issues to some extent by making some of the content in these listings unique. However, for the rest of the content which is going to be the same for about 100 pages, we were wondering if its better to have an image in place instead of duplicate text content (this would basically be an image of the text in question). We know this is a problem, because this is inherently duplicate content as well (only its a duplicate image instead of duplicate text). However, what's the best solution to this problem, and is a duplicate image just asking for trouble, or might this actually be a good idea?
-
Google won't index image-embedded text on a webpage (currently only .pdf documents)
If you want a little more insurance, which you won't really need, use your handy robot.txt or rel="canonical"
As usual, keep your eyes forward:
"While search engines may not use OCR for indexing the content of web pages now, that doesn’t mean that they might not in the future, and there are some indications that the search engines are developing a much greater proficiency in the use of optical character recognition."
Here's that article, including some great references.
Good luck.
-
Could you point me to a valid reference on that OCR issue?
-
You should use rel=canonical tag on duplicate content pages. Google can read text embedded as an image through OCR algorithm. So duplicate image is not a good option. Moreover think how these images will increase the load time of the web pages.
-
To directly answer your question, there are a few ways you can present content in a manner that is not readily crawlable for search engines: flash, iframe and images.
As far as good ideas, I much prefer to offer real content which is unique to the given area. Let's say you are a US-based site offering programs for attending universities overseas. Add some content specific to each country's page to make it unique.
If you present Malaysia as a country, talk about their universities by name, awards they have won, landmarks and other items of interest such as their incredibly diverse forests. You can also provide testimonials from satisfied clients. Testimonials can help establish a lot of relevancy as clients will often mention specifics about where they are from "John from Miami, FL" and where they visited.
In short, you will achieve better results if you work within Google's system then by trying to work around it.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Content Duplication Issue On Content Publishing Site
I am running Free Article Posting site but I discover people are posting their content which already has been published on different sites before. What should i need to do in order to save my site from Google penalty.3 waiting for your kind help in this regard Thanks in advance
Content Development | | Mustansar0 -
Wordpress Blog Pages, Duplicate Title Tag
Anyone have any experience in fixing the duplicate Title tag on a Wordpress blog multiple pages Basically the title tag remains the same on the pages /Blog/ /Blog/Page/2/ /Blog/Page/3/ My good friend Yoast Plugin doesn't seem to of resolved this (Unless i have missed something?) I don't really see this to be effecting anything and wouldn't of through it would either, but it would be nice to not see the notification within Moz site crawls and campaigns etc, its more of a cosmetic problem Any solutions ? Thanks James
Content Development | | Antony_Towle0 -
Duplicate Content
I have a service based client that is interested in optimizing his website for all the services that he provides in all the locations that he provides them in. For example: Service 1, location 1 Service 1, location 2 Service 2, location 1 Service 2, location 2 He wants to essentially create an individual page for each of the above, but i'm concerned that he will be penalized for duplicate content. Each of the pages would have the keyword in the url, page title and within the main body of content. We would certainly alter the content somewhat, but not sure how much a difference this would make. Any thoughts or advice would be greatly appreciated.
Content Development | | embracedarrenhughes1 -
Translated text: should I use canonical link?
Hello everybody, I'm writing an article in Danish, which I have translated into English on a Danish blog. But I'm not sure if I have to use the canonical link from the English version to the Danish, or whether I should just publish both without using canonical link. What is your recommendation for this? Looking forward to hearing from you. Thanks & regards, Jonathan
Content Development | | JoLinda910 -
Can you use creative commons non-commercial images on a company blog?
Does anyone know if it is okay to use creative commons images on your company blog if they are under the Attribution-NonCommercial-NoDerivs 2.0 Generic license. Technically you are using it on a commercial site, but you are not directly making money from the image or selling it.
Content Development | | ProjectLabs0 -
Wordpress Duplicate Pages/ URL's - Help !
Hi guys, I have been running SEOMoz for just over a month and slowly cleaning up one of my Wordpress Blogs. While going through the crawl reports I have noticed that I have duplicate pages showing on the crawl. For example, the main post would be; www.xxxxx.com/blog/post-title Then I see another URL which would be; **www.xxxx.com/blog/page/59 ** When I click on either URL it goes back to the actual post title URL. What's with these page URL's ? Isn't these two URL's showing duplicate content to the search engines ? Any suggestions would be greatly appreciated.
Content Development | | dcc0 -
Blogger - Multiple partial duplicate content and canonical
In Blogger, have at least three pages produced for each post - main post, archive and tag - each has their own canonical tag - are these considered duplicate content by Google? Not sure the best way to handle this.
Content Development | | holdtheonion0 -
Why is this store getting hurt in SERPs when they removed duplicate content?
I work with an e-commerce client who got hit hard by Panda. They are very cautious, and want small-scale tests to prove each hypothesis before committing to larger changes. Recently, we reworked content on 30 product detail pages. Before, these product pages featured some original content mixed with some manufacturer content. The change we made was to remove the manufacturer content completely from the product page, leaving about 300 words of high-quality, original content--all of which was written by subject matter experts. I assumed that Google viewed this manufacturer text as duplicate content. However, when these 30 modified pages were compared to the control, they performed significantly worse. Question 1: Does any have any idea why these pages would perform worse than the control?
Content Development | | merch_zzounds
Question 2: Do you have any tips for convincing this client to try another test or get the buy-in to make the larger changes that--in theory--need to happen? FWIW, this client has about 10,000 product detail pages--the vast majority of which contain just manufacturer content. I appreciate your thoughts.0