Image Optimization & Duplicate Content Issues
-
Hello Everyone,
I have a new site that we're building which will incorporate some product thumbnail images cut and pasted from other sites and I would like some advice on how to properly manage those images on our site. Here's one sample scenario from the new website:
We're building furniture and the client has the option of selecting 50 plastic laminate finish options from the Formica company. We'll cut and paste those 50 thumbnails of the various plastic laminate finishes and incorporate them into our site. Rather than sending our website visitors over to the Formica site, we want them to stay put on our site, and select the finishes from our pages.
The borrowed thumbnail images will not represent the majority of the site's content, and we have plenty of our own images and original content. As it does not make sense for us to order 50 samples from Formica & photograph them ourselves, what is the best way to handle this issue?
Thanks in advance,
Scott
-
If you have permission to use their images, just get images from them, name them accurately, and give them accurate alt-text. Duplicate content has to do with your own content, in general. Since the point of naming images and alt-text is to help Google understand them, it's not a big issue if an image has the same alt-text as another or appears multiple times on the site (especially since they should all be coming from an images directory, no matter where they are on the website). Also, images are much more likely to be naturally reused than text, as licensing photos is a long-accepted practice.
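As a rough illustration of "name them accurately, and give them accurate alt-text", here is a minimal Python sketch that builds an image tag from a finish name. Everything specific in it is an assumption made for the example: the `/images/finishes` directory, the `formica-...-laminate.jpg` naming pattern, the helper names, and the "Natural Maple" finish are all hypothetical, not anything Formica or Google prescribes.

```python
import re
from html import escape


def slugify(name: str) -> str:
    """Lowercase the name and join its alphanumeric runs with hyphens."""
    return "-".join(re.findall(r"[a-z0-9]+", name.lower()))


def thumbnail_tag(finish_name: str, images_dir: str = "/images/finishes") -> str:
    """Build an <img> tag with a descriptive filename and alt text.

    The directory and filename pattern are hypothetical; adapt them
    to however the site actually stores its images.
    """
    src = f"{images_dir}/formica-{slugify(finish_name)}-laminate.jpg"
    alt = f"Formica {finish_name} plastic laminate finish sample"
    # escape() guards against characters like & or quotes in finish names.
    return f'<img src="{escape(src)}" alt="{escape(alt)}">'


# "Natural Maple" is a made-up finish name used only for illustration.
print(thumbnail_tag("Natural Maple"))
```

The same helper could be run over the full list of 50 finishes so that every thumbnail gets a consistent, descriptive filename and alt text from a single images directory.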
-
Google does "see" a lot more than just the alt text. To decide which keywords an image should rank for, it takes into account, among other things:
- The text surrounding the image (caption, article it illustrates, etc.)
- Which images it is similar to
- The filename of the image
- Text recognition
In this video Google shows how much it can "see" when it comes to images: http://youtu.be/t99BfDnBZcI
-
Arjen, Thanks for your reply.
You are correct that we're not looking to rank for images of Formica samples (or any of our other samples for that matter). In fact, we're just providing the sample images to help our clients better decide which one of our products to order. The sample tiles are just a means to an end.
Do you have any knowledge as to the extent to which Google can "see" an image the same way a human user sees an image? Does Google just rely on the alt text that you provide?
Thanks in advance,
Scott
-
Hello Keri,
Thanks for your reply. We do have an account with them and permission to use their images.
Do you have any opinions as to the best way to manage the images (i.e. title, alt text, etc.) so as not to run into any duplicate content issues? I'm not clear if Google has the ability to somehow scan the images themselves, or if they just rely on the alt text, titles, etc. that you provide along with the images. Any thoughts are appreciated.
Scott
-
I do not think using some images from another website will hurt your SEO. Logos on an 'our clients' page, news photography delivered through news agencies, icon sets and stock images are by definition used on more than one site. The fact that this form of 'duplicate content' is so omnipresent suggests that Google cannot devalue sites for using it.
If your goal is to rank high in image search for Formica in different colours, you should make sure to get your own high-res images. If this is not one of your primary SEO goals, you should not worry about using copied images.
My advice would be to focus on really good photography of the furniture you are building and not to worry too much about the thumbnails of Formica samples.
PS: I agree with KeriMorget. You should get permission to use the photos before using them on your site.
-
The first thing I would do would be to look at the copyright on the Formica site to see their policy on copying their content.