Tons of Crappy links in new OSE (Open Site Explorer)
-
I am starting to miss the old OSE. I've found that for a lot of the pages on our site, the new OSE is showing WAY more links and most of them are garbage nonsense links from China, Russia, and the rest of the internet Wild West.
For instance, in the old OSE, this page used to show 9 linking domains:
http://www.uncommongoods.com/gifts/by-recipient/gifts-for-him
It now shows 454 links. Some of the new links (about 5 of them) are legitimate. The other 400+ are garbage. Some are porn sites, most of them don't even open a web page, they just initiate some shady download. I've seen this for other sites as well (like Urban Outfitters) This is making it much harder for me to do backlink analysis on bc I have no clue how many "Normal" links they have. Is anyone else having this problem ? Any way to filter all this crap out ? See attached screenshot of the list of links I'm getting from OSE.
-
Ok thank you. I will email directly.
-
Hey Zack,
Sorry to hear you're still having problems - we've seen an improvement on most sites at this point. Would you want to send me info on the site you're searching and any filters you are using?
If you don't feel comfortable posting that info on this thread, feel free to email me directly: [email protected].
Thanks!
Carin
-
Hey Carin,
I just wanted to follow up on this...I'm still seeing these spammy binary files show up as links. Unfortunately it makes OSE quite useless for me in regards to exploring our own backlinks.
What is the status of this problem? Has there been any headway ? Why does our site have problems but most others don't?
Thanks!
-Zack
-
Hey Zack,
Thanks so much for understanding! We are doing everything we can to get the bug resolved. Binary files are the downloadable files you see as links - .pdf, .exe, .img, etc.
I'm really sorry, but we don't have a URL to the old OSE. I saw Steven's response as a workaround - is that possible or are there too many file types to filter out?
Our crawlers that provide the metrics to OSE are always crawling, but will take about a month for our fix to propagate through to all the pages we crawl. Once we have removed these links from our crawlers, then we'll have to process the metrics. This is why it's looking like late September for the fix to show up.
I really appreciate your patience and understanding, we're doing everything we can to fix it!!
Thanks,
Carin
-
Hey Carin-
Thank you so much for this in-depth response. Glad to hear that you guys are aware of it and trying to sort it out. Very interesting info...I'd never hear of "binary" links before but I hope you guys can figure out how to handle these. Seems like a tough task to tackle, just by looking at my CSV it looks like these come in several different forms and they could be hard to identify..I have a few questions:
1. Is there by chance a URL you could give me that points to the old OSE ?
2. How often does OSE crawl? Is it a constant process or are there scheduled crawls?
Thanks!!
-Zack
-
Hey Zack, I saw the ticket you filed was answered by Aaron, but I just wanted to follow up with you as well. We have made some really exciting changes to the crawler, but, unfortunately, there is a pretty obvious bug as well...
The reason for the “questionable” links coming from the Internet Wild West is due to the crawler reaching much deeper into sites where there are more download (i.e. binary) links. The first issue is the crawler is counting a binary file as a link, but the larger issue, is that the crawler doesn’t really know how to handle these types of files. This bug is causing some links to be improperly associated with certain domains. This is probably what you're seeing with all the crazy links from China and Russia which don't actually link to the site you're researching.
There are two steps to addressing this issue: changing how the crawler sees these file types and then fixing how the crawler handles these file types. We have made improvements to our algorithm so that we will be handle the majority of these files correctly, however, this update will need about a month to propagate. The fix for this issue probably won’t be seen for two more updates, meaning late September. Our improvements should catch most of the issues, but there still could be a few cases we haven't addressed. If this happens, don't hesitate to let us know; we love feedback since it helps us improve and make our index even better!
The next step is to fix how our crawlers handle binary file links and prevent them from being improperly associated with certain domains. We are in the process of working through that issue right now. We’re doing everything we can to resolve this bug as we know it is alarming to see these “questionable” links associated with your sites.I hope this helps and thanks so much for being patient :)Thanks,Carin
-
2 ways:
- Get as CSV and spend the time going through it
- Wait it out
-
OK cool good info, hope they fix it soon!! Any good ideas on how you can filter this crap[ out ?
-
Hello Zack,
That is an issue that they are working on, I know this because I already discussed this with one of their help desk people. Here is the page that describes the changes: http://www.seomoz.org/blog/brand-new-open-site-explorer-is-here
In addition to that, here is some additional information I can share with you:
you may see “questionable” links with weird file extensions. This is due to the crawler reaching much deeper into sites where there are more download links. We are looking into fixing this bug as soon as we can so these won’t be counted as links.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Links to Your Site: No Data Available in Google Search Console
The site I am working on did not have their site submitted to Google Search Console (formerly Google Webmaster Tools). I submitted the site and a sitemap that auto updates. Google is crawling the site daily (about 30 pages a day). Under Search Traffic > Links to Your Site it shows no data is availible. I thought it was because it was a newly submitted site, but it has been two months now. Moz seems to have the same issue. Moz does show inbound links, but their are some that we think should really help us that are not shown. For instance, the Dallas Morning News wrote this article. They have a high DA and PA. Also, iliveindallas.com has an article about us that is still on the front page. That was a few weeks ago but also does not show up on Moz or Google SC. We are trying to be selective about the links we are getting. That they are follow links from reputable sites. Worried that both Google and Moz are not showing them.
Moz Pro | | TapGoods1 -
When do the "just discovered" links on Open Site Explorer count?
I have been working hard to get follow backlinks but they have all been in the Just Discovered part of Open Site Explorer for a long time. So they don't count in my stats for Domain Authority and such. When do they move OUT of Just Discovered?
Moz Pro | | dealblogger0 -
How Old is OSE link data?
I ran an anchor text report for my client today, which shows that their site has some incoming comment spam links using totally unrelated phrases (pharma products). However, when looking for the live link, the linking page no longer contains the link to them. Maybe the webmasters removed these, but I can't track down a single one... how old is this data? thanks
Moz Pro | | JMagary0 -
Old data in OSE?
Hello, I know the Mozscape was recently updated, but it's been a week and my site data ( DA, number of backlinks) in Open Site Explorer is still the same. The seomoz toolbar and my pro tools show my DA has increased ( I don't know of another way to verify my backlink count, so I'm not sure on that front). Any idea of the reason behind the disparity? Thanks.
Moz Pro | | richje0 -
Open site Explorer Findings??
On OSE, I am comparing two websites(A and B) and one has a page authority higher than the other (35(A) vs 36 (B)), but all the linking metrics are in favor for website A except for the linking C block (A has 11 and B has 12). The website B is ranked in page 1 for couple competitive keywords whereas the website A don't show up in the search at all. Why is that? Does B have better content and is more kw focused?
Moz Pro | | Ideas-Money-Art0 -
Site Ranking Report
Hi guys, My site ranking report says that I've gone from being 1-20 for a variety of keywords in Google UK to not in the top 50. When I do a search myself I see that my site remains where it previously was (between 1-20). How reliable is the site ranking reporting on a weekly basis? Is it best to look at it monthly?
Moz Pro | | columbus0 -
Certain Domains no longer recognised by open site explorer
Afternoon everyone (well, it is for me), We've been tracking the linking root domains to our domain for around 6 months now, alongside tracking these domains we have also been engaging in linking building activities. Our initial activities worked quite well with linking domains rising from around 620 to 720 in 3 months. However, recently we have seen those numbers begin to fall away, in many cases it is because certain domains have stopped linking to us, have become no=follow sites or have been archived. But, in some cases we can see the link is still there, and is being registered by other tools such as yahoo or webmaster tools. My question is really, does anyone have a way of working out why a link, that was in the past being registered by open site explorer, is no longer registering and presumably no longer passing over juice to help with domain authority. What kind of signals should i be looking for to tackle a 'decaying' link? Looking forward to hear your thoughts!
Moz Pro | | NigelJ0 -
Why aren't DMOZ links showing up in Open Site Explorer?
I have a page that is listed in DMOZ.org, but when I run an OSE Link Analysis, that link doesn't show up. I'm currently doing the free trial for SEOMoz Pro and I'm concerned that other links might not be showing up as well. Has anyone had any similar experiences and/or do you know why DMOZ links specifically might not appear in the results?
Moz Pro | | JoyceScott0