Why doesn't Moz crawler follow robots.txt?
-
It is crawling the entire site, and there is stuff we do not want it to. Please advise.
-
Which I am ok with, but why am I getting duplicate content?
-
Yes, rel=canonical doesn't tell them which pages not to crawl, just which ones not to index.
-
It has been used correctly. The site is a Magento site and they have it built in. There are a lot of filters for products so it uses rel=canonical to tell Google which to index.
-
rel=canonical is not really a robots instruction. It is there to help with duplicate content: where you have the same or similar pages, you're telling search engines which page is the preferred one.
If you don't want pages crawled, you have to tell search engines in the robots.txt file.
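To see which preferred URL a page is actually declaring, you can pull the rel=canonical link out of its HTML. Below is a minimal sketch using only Python's standard library; the sample page and the example.com URL are made up for illustration, and this says nothing about how any particular crawler reads the tag.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> tag it sees."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr_map = {name: (value or "") for name, value in attrs}
        # rel can hold several space-separated tokens, so split before checking.
        rel_tokens = attr_map.get("rel", "").lower().split()
        if "canonical" in rel_tokens and attr_map.get("href"):
            self.canonicals.append(attr_map["href"])


def find_canonicals(html_text):
    """Return the list of canonical URLs declared in the given HTML."""
    finder = CanonicalFinder()
    finder.feed(html_text)
    return finder.canonicals


# Hypothetical filtered product page, for illustration only.
sample_html = """
<html><head>
<link rel="canonical" href="https://www.example.com/shoes">
</head><body>Filtered listing</body></html>
"""

print(find_canonicals(sample_html))  # ['https://www.example.com/shoes']
```

If a filtered URL returns an empty list, or a canonical that points at itself, the tag is not doing what you expect on that page.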
-
Hi There,
Rel=canonical tags tell robots which page, out of many, is the one to index.
For SEOs, canonicalization refers to individual web pages that can be loaded from multiple URLs. This is a problem because when multiple pages have the same content but different URLs, links that are intended to go to the same page get split up among multiple URLs. This means that the popularity of the pages gets split up. Unfortunately for web developers, this happens far too often because the default settings for web servers create this problem.
https://mza.seotoolninja.com/learn/seo/canonicalization
I feel you have not used it correctly. Check the above article and see if it helps.
Thanks,
Vijay
-
So I made a mistake: it isn't the robots.txt that is the issue. I am getting hit with a ton of duplicate content penalties, so I figured that was it. The problem is that I have pages with rel=canonical tags that it is ignoring. Does Roger not read those?
-
Hi
I have to agree with the above. Rogerbot does listen to the robots.txt file, unlike Bing: while they are getting better, Bing ignores the robots.txt file frequently.
I've analysed quite a few server logs over the years and Roger has always listened to the file. It's usually a mistake in the robots.txt file.
There is an option to test your robots.txt file in Google Search Console (GSC). While this tests whether Google will crawl the page, Roger usually has the same instructions as Google.
However, if you are still pretty certain that Roger is ignoring robots.txt, please DM your server logs and your website and I will take a look and analyse it for you (free, of course).
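A rough first pass over the logs can also be done locally. The sketch below is only an illustration: it assumes the access log is in the common "combined" format, that the crawler identifies itself with "rogerbot" somewhere in its user-agent string, and it uses Python's built-in robots.txt parser, which may not interpret every rule exactly as Rogerbot does. The log lines and user-agent strings are made up.

```python
import re
from urllib.robotparser import RobotFileParser

# Pulls the request path and user agent out of a combined-format log line
# (assumed format; adjust the pattern if your server logs differently).
LOG_LINE = re.compile(
    r'"(?:GET|HEAD|POST) (?P<path>\S+) [^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def disallowed_hits(log_lines, robots_text, bot_token="rogerbot"):
    """Return paths the bot requested even though robots.txt disallows them."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    hits = []
    for line in log_lines:
        match = LOG_LINE.search(line)
        if not match:
            continue  # not a request line we recognise
        if bot_token not in match.group("agent").lower():
            continue  # some other visitor
        path = match.group("path")
        if not parser.can_fetch(bot_token, path):
            hits.append(path)
    return hits


robots_txt = """\
User-agent: Rogerbot
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
"""

# Made-up log lines and user-agent strings, for illustration only.
sample_log = [
    '203.0.113.5 - - [10/Oct/2016:13:55:36 +0000] "GET /tmp/cache.html HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; rogerbot/1.0)"',
    '203.0.113.5 - - [10/Oct/2016:13:55:40 +0000] "GET /products/ HTTP/1.1" 200 2048 "-" "Mozilla/5.0 (compatible; rogerbot/1.0)"',
    '198.51.100.7 - - [10/Oct/2016:13:56:02 +0000] "GET /junk/old.html HTTP/1.1" 200 128 "-" "Mozilla/5.0 (compatible; examplebot/2.0)"',
]

print(disallowed_hits(sample_log, robots_txt))  # ['/tmp/cache.html']
```

An empty list suggests the crawler is respecting the file and the problem lies elsewhere; any paths that do show up are the ones worth looking at.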
Thanks
Andy
-
All major search engines, as well as Moz's crawler Rogerbot and the Internet Archive, respect robots.txt, the standard "robots exclusion protocol" for communicating with web crawlers and other web robots.
If you wish to exclude specific sections from all search engines, you can use the following sample code as a reference to block specific directories:
User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
However, if you want to block only Moz's Rogerbot from crawling specific sections of your website, you can use the following reference code:
User-agent: Rogerbot
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
I hope this helps. If you have specific questions, please feel free to respond and I will be happy to answer them.
Regards,
Vijay
-
Hi there! Moz's crawler, rogerbot, does follow robots.txt. When he's not following robots.txt, it's usually because the robots.txt file is formatted improperly. Learn more about formatting your file here: https://mza.seotoolninja.com/learn/seo/robotstxt
For more information on Roger, including how to block him, head here: https://mza.seotoolninja.com/help/guides/moz-procedures/what-is-rogerbot
And if you want to test your formatting, try the Robots Checker here: https://support.google.com/webmasters/answer/6062598
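If you'd rather run a quick check locally, Python's standard library ships a robots.txt parser. The sketch below is only an approximation: it shows how that parser reads the rules, which is not guaranteed to match rogerbot's own interpretation, and the example.com URLs are placeholders.

```python
from urllib.robotparser import RobotFileParser

# Example rules, in the same shape as a Rogerbot-specific block.
robots_txt = """\
User-agent: Rogerbot
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /junk/
"""


def is_allowed(robots_text, user_agent, url):
    """Return True if the parsed rules let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


# Placeholder URLs, for illustration only.
print(is_allowed(robots_txt, "rogerbot", "https://www.example.com/junk/page.html"))  # False
print(is_allowed(robots_txt, "rogerbot", "https://www.example.com/products/"))       # True
```

Paste in your own robots.txt and a few of the URLs that are being crawled unexpectedly. If the check comes back True for a URL you meant to block, the file itself is the first place to look.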
If you're still unable to determine why rogerbot is crawling your site, feel free to write in to [email protected]!