Amazon CloudFront CDN
-
Hi,
I'd like to increase website's speed with Amazon CloudFront CDN.
I created some CNAMEs and i've something like this:
- www.mydomain.com (my website)
- cdn1.mydomain.com
- cdn2..mydomain.com
- cdn3.mydomain.com
But i've a lot duplicate content now ! One per subdomain and one per content (gif, css, html, and so one).
Have you any feedback in order to not have SEO penalty ?
Does Google detects CDN ? Can I help him to understand my CDNs ?
Thanks,
Best regards,
Maxime
-
Hi Max,
As you know, SEOmoz uses a CDN (Content Delivery Network) to host our static content. This greatly improves the load time of our pages by distributing our content across a cloud network, and results in an improved experience for users.
If I understand your question correctly, you have set up a CDN and have created duplicate content issues.
To solve this, it's important to set up your CDN only to serve static content, like images, stylesheets and javascript. That is what a CDN is designed for. Do not duplicate your entire site - your HTML - as this will cause duplicate content issues.
If for some reason you need to replicate your entire HTML, then there are some steps you can take to mitigate the damage, although it's going to depend on your exact circumstances.
For example, you can set full URL canonical tags so that all your mapped CNAMES point to your primary URL.
To revert back to one copy of your HTML, you might want to put 301 redirects in place on the duplicated content (pointing to the original) before removing them from the CDN.
But even these aren't ideal solutions. It's best just to serve your static content, and only one version of your HTML.
-
I think he didn't reply.
He store data onto Amazon S3 and serves pictures from CDN (Amazon CloudFront). So he told me he hasn't duplicate content issues because he serves pictures.
But he tolds too "This isn't an issue for duplicate content, unlike if you were replicating your HTML".
When you use Amazon CloudFront without Amazon S3, but you use it with your webserver, Amazon CloudFront duplicates all content (pictures, pages, ...).
Onto your website, you'll only link pictures to CDN, for example http://cdn1.test.com/picture.jpg. But if GoogleBot opens http://cdn1.test.com/ it'll find all your html content !
So it'll be a duplicate issue I think, and I don't really know what is the best way to fix that (not use Amazon CloudFront without Amazon S3, Canonical, http headers, ...)
Thanks
-
Did the author's reply in the comment of the blog post answer your question, or do you still have this question?
-
Great post, but he didn't talk about duplicate content, only increasing speed.
-
Here's the YouMoz post that might help.
http://www.seomoz.org/ugc/improving-page-speed-with-amazon-web-services-a-beginners-guide
-
Tomorrow morning (Seattle time) I'll be posting a YouMoz blog post at http://www.seomoz.org/ugc that deals directly with setting up a CDN on Amazon. You can read through the steps given in the article and see if that answers your questions, and if not, you can ask a question in the comments.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is it possible to compete on keywords with Amazon?
Is it actually even possible to compete against Amazon to be #1 in Google SERPs against Amazon? If so - how? I run a boutique business selling a niche product, in 2008 - 2013 I was always #1 for my keywords.
Intermediate & Advanced SEO | | loginid
But since Amazon started the same type of products as well, I have now always been right under amazon results, who are at 1,2,3. Is it even possible to get to the #1 position any more? Thank you.0 -
Will using CDN Affect SEO?
I'm got a website with a slider and each of the 6 slides has a 5-second video background. The website is B2B and the user profile for the website is employees at Fortune 1000 companies in the United States using desktop computers to browse. The videos are highly optimized and we did testing using various browsers and bandwidth connections to determine the videos loaded fast enough on down to a 15mbit/s connection (which is pretty low by today's average U.S. business bandwidths.) We tried hosting the videos on Vimeo and YouTube but it caused issues in the timing of the slide show display. (I've not seen any other website do what we do the way we do it. Most sites have a single video background with a single text overlay on top.) The downside to this is that loading all those videos produces a lot of bandwidth usage for our server. The website is serving a niche service industry though so we're not exceeding our current limits. I'm wondering though might there be some benefit to hosting just the video files on a CDN? Obviously that would mean lest bandwidth usage for our server, and possibly quicker load times where the CDN server is closer to the user than our server. But are there benefits or downsides from an SEO perspective noting that I'm proposing only putting the videos on the CDN, not the entire web page.
Intermediate & Advanced SEO | | Consult19010 -
Homepage organization schema question: logo lives on amazon server, can I call that out on the structured data?
Basically, the homepage organization schema has called out the logo, but it lives on the amazon server. We're having issues with Google rendering the correct logo on the knowledge graph. The URL for the amazon asset looks something like this: <brandname>-assets.s3-us-west-2.amazonaws.com/<logo>.png</logo></brandname> Calling that out on the organization structured data for the logo is okay right?
Intermediate & Advanced SEO | | imjonny1230 -
Just moved to CDN and site dropped in Google
Hi there, I have been modifying a clients site for months now trying to get higher up in Google for the term "wedding dresses essex" on the website https://www.preciousmomentsbridalwear.co.uk/ It's always ranked around 7th / 8th place and we want to try and get it into 4/5th position ideally. I have optimised pages and then due to the site speed not being that great we moved it to MaxCDN this week which has made the site much faster, but now we have dropped to number 10 in Google and in danger of dropping out of the first page. I was hoping that making the site much faster for desktop and mobile would help not hinder! Any help would be appreciated! Simon
Intermediate & Advanced SEO | | Doublestruck0 -
Google not Indexing images on CDN.
My URL is: http://bit.ly/1H2TArH We have set up a CDN on our own domain: http://bit.ly/292GkZC We have an image sitemap: http://bit.ly/29ca5s3 The image sitemap uses the CDN URLs. We verified the CDN subdomain in GWT. The robots.txt does not restrict any of the photos: http://bit.ly/29eNSXv. We used to have a disallow to /thumb/ which had a 301 redirect to our CDN but we removed both the disallow in the robots.txt as well as the 301. Yet, GWT still reports none of our images on the CDN are indexed. The above screenshot is from the GWT of our main domain.The GWT from the CDN subdomain just shows 0. We did not submit a sitemap to the verified subdomain property because we already have a sitemap submitted to the property on the main domain name. While making a search of images indexed from our CDN, nothing comes up: http://bit.ly/293ZbC1While checking the GWT of the CDN subdomain, I have been getting crawling errors, mainly 500 level errors. Not that many in comparison to the number of images and traffic that we get on our website. Google is crawling, but it seems like it just doesn't index the pictures!? Can anyone help? I have followed all the information that I was able to find on the web but yet, our images on the CDN still can't seem to get indexed.
Intermediate & Advanced SEO | | alphonseha0 -
We used to speak of too many links from same C block as bad, have CDN's like CloudFlare made that concept irrelevant?
Over lunch with our head of development, we were discussing the way CloudFlare and other CDN's help prevent DDOS attacks, etc. and I began to wonder about the IP address vs. the reverse proxy IP address. Before we would look to see commonalities in the IP as a way that search engines would modify the value to given links and most link software showed this. For ahrefs, I know they still show common IPs using the C block as the reference point. I began to get curious about what was the real IP when our head of dev said, that is the IP from CloudFlare... So, I ran a site in ahrefs and we got an older site we had developed years ago that showed up as follows: Actos-lawsuit.org 104.28.13.57 and again as 104.28.12.57 (duplicate C block is first three sets of numbers are the same and obviously, this has a .12 and a .13 so not duplicate.) Then we looked at our host to see what was the IP shown there: 104.239.226.120. So, this really begs a question of is C Block data or even IP address data still relevant with regard to links? What do the search engines see when they look for IP address now? Yes, I have an opinion, but would love to hear yours first!
Intermediate & Advanced SEO | | RobertFisher0 -
CDN for SEO (or not)?
Does CDN impact on SEO or not? There seems conflicting ideas as to whether they impact positively or negatively, I realise that if the page loads quicker this is a good thing for SEO and usability of course. Does Google see CDN as just cheating and a get-around for not doing the work from the ground up and using good hosting etc? Do you have any direct experience? All constructive input much appreciated!
Intermediate & Advanced SEO | | seoman101 -
Mystery: Ranking in Amazon for a product page?
My client has a product on Amazon that has more reviews and better rankings. However, their competitor with less reviews and lower ratings are ranking #1 for our primary keyword in Google. Our product page doesn't even rank on Google, but I'm assuming Google doesn't want to display two results from Amazon. The only difference is they have 1 link pointed to the product page that has a small PA of 10 and DA of 15. Do you think this link could be the only thing making a difference? Should we start building more links to this product page in addition to their website? Any other tips to help our Amazon page rank?
Intermediate & Advanced SEO | | Stryde0