Scraper Advice
-
Hi all,
I know we all deal with scraped content issues. I have one I could use advice on. I found a site that is posting our blog content verbatim, including the links I added to the posts (which is good), and mentioning our blog home page in a right sidebar beside the content (also good). However, they aren't linking to the specific posts from their copied versions anywhere, and their pages' canonical tags point to their own versions, not mine.
It's not a very spammy site and has a decent domain authority (though significantly lower than our own). After discovering it, however, I did a long-tail search related to one of our posts and found their version was outranking the original. I know I can report this one via Webmaster Tools.
I wanted to get your opinion on whether asking them to add a link back to the original post on our site might be sufficient, or whether I need to ask for that plus a canonical tag update. I know getting both is ideal, but the links and relationship could be valuable, so I want to leave this particular bridge intact if I can.
Just trying to decide whether I take an "either/or" approach when I mention those two action items, or whether I need to be a little firmer, ask them to do both, and risk losing a potential outlet for future content.
Thanks,
Andrew
-
Don't second-guess yourself over that myth that scrapers can't hurt you. These guys are outranking you right now with your own content. That's proof enough to me that Kissmetrics needs to take notice and pull down false information. Also, this is another clear example of Google not knowing how poorly their system is working, and they know not that they know not.
Google would not be getting millions of DMCAs per week if they were right about this. I've sent them hundreds.
-
Normally, I'd take that harder approach as well. If this was a spammy site that was doing nothing but scraping, I'd definitely be going that route. I still might. I'm trying to see the best way to walk a fine line.
I think #2 on Kissmetrics' 3 Myths About Duplicate Content has me second-guessing myself. If it wasn't a somewhat decent site that has potential to help in terms of referral traffic, it would be a no-brainer.
For the outranking issue, it's weird. For the main term we target, we are top 3 in the SERPs. Change it a little bit and they're ranking, which is the only instance of that I found when testing all posts (5 total).
Thanks for the feedback. I really appreciate it.
-
What you do on your first step will set the tone for how they treat you in the future. So, if you are too liberal now, it will be hard to rein them in and they could start grabbing everything that you own.
If this was my site being grabbed, I would be contacting them to take the content down and be prepared to follow up with an attorney who is already in place for this type of situation, being ready to submit DMCA notices to Google, AdSense, their hosting provider and more.
If you feel that the relationship could be valuable and have a different philosophy than mine, then I would at minimum insist on the rel=canonical pointing back to the source of the content on my website - and I would require them to ask before they use anything in the future. The fact that they are outranking you with your own content should have you shaking in your boots over this potential relationship. You are making deals with Goliath.
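If you do go the rel=canonical route, it's worth checking afterwards that their copies really point back at your posts. Here is a rough Python sketch of that check using only the standard library. The URLs (copier.example, original.example) and the helper names are made up for illustration, and it works on HTML you have already saved or fetched yourself, so treat it as a starting point rather than a finished tool.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        if tag != "link" or self.canonical is not None:
            return
        attr = dict(attrs)
        # rel can hold several space-separated tokens, so split before checking.
        rel_tokens = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_tokens and attr.get("href"):
            self.canonical = attr["href"]


def find_canonical(html, page_url):
    """Return the absolute canonical URL declared in html, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    if finder.canonical is None:
        return None
    # Resolve relative hrefs against the URL of the page that declared them.
    return urljoin(page_url, finder.canonical)


def points_to_original(html, page_url, original_url):
    """True if the page's canonical URL matches original_url (ignoring a trailing slash)."""
    canonical = find_canonical(html, page_url)
    return canonical is not None and canonical.rstrip("/") == original_url.rstrip("/")


# Hypothetical copied page whose canonical tag still points at the copier's own URL.
copied_page = """
<html><head>
<link rel="canonical" href="https://copier.example/blog/our-post/">
</head><body>...</body></html>
"""

print(points_to_original(
    copied_page,
    "https://copier.example/blog/our-post/",
    "https://original.example/blog/our-post/",
))  # prints False
```

A result of False for a copied page means the canonical tag is missing or still points somewhere other than your original post, which is the situation described in the question.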