Moz "Crawl Diagnostics" doesn't respect robots.txt
-
Hello, I've just had a new website crawled by the Moz bot. It's come back with thousands of errors saying things like:
- Duplicate content
- Overly dynamic URLs
- Duplicate Page Titles
The duplicate content & URLs it's found are all blocked in the robots.txt so why am I seeing these errors?
Here's an example of some of the robots.txt that blocks things like dynamic URLs and directories (which Moz bot ignored):
Disallow: /?mode=
Disallow: /?limit=
Disallow: /?dir=
Disallow: /?p=*&
Disallow: /?SID=
Disallow: /reviews/
Disallow: /home/
Many thanks for any info on this issue.
-
Hi Si, has this issue been resolved?
-
Hey Si,
Thanks for writing in. We don't seem to be having an overarching issue with our crawler ignoring robots.txt files, so I did some research in Google Webmaster Tools. It looks like most crawlers require an asterisk in the disallow directive to recognize that all pages of a dynamic URL are being disallowed. The "Pattern Matching" section of this resource should give you more information about setting up the robots.txt with the correct disallow directives to block those pages: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=156449
If you add the asterisk to the disallow directive and you are still seeing these pages crawled, it would help if you sent an email with your campaign information to our support desk at [email protected] so our engineers can look into this more directly.
I hope this helps.
Chiaryn
-
If you have an "index,(no)follow" meta on those pages I think they will be crawled even though you have them blocked in robots.txt. So by adding "noindex" on those pages it might work as you want it to.
-
Is the / actually in the URL at that spot, or is your link more like http://www.example.com/abcd?p=147?
If you give a full example URL that includes one of your blocked dynamic URLs, we can take a better look. If your robots.txt is set up correctly, the crawler shouldn't find that stuff, but give us more info if you're able.
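To make the question above concrete, here is a minimal Python sketch of Google-style robots.txt pattern matching, where a rule is a prefix match against the path plus query string, `*` matches any run of characters, and a trailing `$` anchors the end. The helper names `rule_to_regex` and `is_blocked` are made up for illustration, and it is an assumption (not confirmed in this thread) that the Moz crawler matches rules the same way.

```python
import re
from urllib.parse import urlsplit


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one Disallow pattern into a regex (Google-style matching).

    '*' matches any run of characters, a trailing '$' anchors the end,
    and everything else is treated as a literal prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_blocked(url: str, disallow_rules: list[str]) -> bool:
    """Return True if any Disallow rule matches the URL's path + query."""
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    # re.match anchors at the start of the string, which gives prefix matching.
    return any(rule_to_regex(rule).match(target) for rule in disallow_rules)


url = "http://www.example.com/abcd?p=147"
print(is_blocked(url, ["/?p=*&"]))  # False: the rule only covers the site root
print(is_blocked(url, ["/*?p="]))   # True: the wildcard covers any path
```

Under these assumptions, a rule such as `Disallow: /?p=*&` only applies when the query string directly follows the root `/`, so a URL like the `/abcd?p=147` example would still be crawled unless the rule starts with `/*`.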