Scraper Advice
-
Hi all,
I know we all deal with scraped content issues. I have one I could use advice on. I found a site that is posting our blog content verbatim, including the links I added to the posts (which is good), and mentioning our blog home page in a right sidebar beside the content (also good). However, they aren't linking to the specific posts from their copied versions anywhere, and their pages' canonical tags point to their own versions, not mine.
It's not a very spammy site and has a decent domain authority (though significantly lower than our own). After discovering it, however, I did a long-tail search related to one of the posts and found their version was outranking the original. I know I can report this one via Webmaster Tools.
I wanted to get your opinion on whether asking them to add a link back to the original post on our site might be sufficient, or do I need to ask for that plus a canonical tag update? I know getting both is ideal, but the links and relationship could be valuable, so I want to leave this particular bridge intact if I can.
Just trying to decide if I take an "either/or" approach to my request when I mention those two action items, or if I need to be a little firmer and ask them to do both, and risk losing a potential outlet for future content?
Thanks,
Andrew
-
Don't second-guess yourself over that myth that scrapers can't hurt you. These guys are outranking you right now with your own content. That's proof enough to me that Kissmetrics needs to take notice and pull down false information. Also, this is another clear example of Google not knowing how poorly their system is working, and they know not that they know not.
Google would not be getting millions of DMCAs per week if they were right about this. I've sent them hundreds.
-
Normally, I'd take that harder approach as well. If this was a spammy site that was doing nothing but scraping, I'd definitely be going that route. I still might. I'm trying to see the best way to walk a fine line.
I think #2 on Kissmetrics' 3 Myths About Duplicate Content has me second-guessing myself. If it wasn't a somewhat decent site that has potential to help in terms of referral traffic, it would be a no-brainer.
As for the outranking issue, it's weird. For the main term we target, we are top 3 in the SERPs. Change it a little bit and they're ranking. That's the only instance I found when testing all posts (5 total).
Thanks for the feedback. I really appreciate it.
-
What you do on your first step will set the tone for how they treat you in the future. So, if you are too liberal now, it will be hard to rein them in and they could start grabbing everything that you own.
If this was my site being grabbed, I would be contacting them to take the content down and be prepared to follow up with an attorney who is already in place for this type of situation, being ready to submit DMCA notices to Google, AdSense, their host, and more.
If you feel that the relationship could be valuable and have a different philosophy than mine, then I would at minimum insist on the rel=canonical pointing back to the source of the content on my website - and I would require them to ask before they use anything in the future. The fact that they are outranking you with your own content should have you shaking in your boots over this potential relationship. You are making deals with Goliath.
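If it helps to verify where things stand before and after you contact them, here is a rough sketch of how the two action items (a canonical pointing at the source, and a link back to the specific post) could be checked on a copied page. It uses only Python's standard library. The URLs, the `LinkAudit` class and the `audit_copy` function are made up for illustration, and the trailing-slash comparison is a simplifying assumption, not a full URL normalization.

```python
from html.parser import HTMLParser


class LinkAudit(HTMLParser):
    """Collects the rel=canonical target and all anchor hrefs from one page."""

    def __init__(self):
        super().__init__()
        self.canonical = None
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rel = (attrs.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel:
            self.canonical = attrs.get("href")
        elif tag == "a" and attrs.get("href"):
            self.hrefs.append(attrs["href"])


def audit_copy(html, original_url):
    """Report whether a copied page credits the original post.

    Comparison ignores a trailing slash only (an assumption for this sketch).
    """
    parser = LinkAudit()
    parser.feed(html)
    target = original_url.rstrip("/")
    return {
        "canonical": parser.canonical,
        "canonical_points_to_original": (parser.canonical or "").rstrip("/") == target,
        "links_to_original": any(h.rstrip("/") == target for h in parser.hrefs),
    }


# Hypothetical copied page: canonical points at the copy, and the only
# link goes to the blog home page rather than the specific post.
sample = """
<html><head>
<link rel="canonical" href="https://copier.example.net/copied-post">
</head><body>
<a href="https://blog.example.com/">Blog home</a>
</body></html>
"""

result = audit_copy(sample, "https://blog.example.com/original-post")
print(result)
```

In practice you would fetch each copied page's HTML yourself and pass it in along with the URL of your original post. If both flags come back false, the page is in the state described in the original question.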