WEBMASTER console: increase in the number of URLs we were blocked from crawling due to authorization permission errors.

odmsoft

Hi guys, I received this warning in my webmaster console: "Google detected a significant increase in the number of URLs we were blocked from crawling due to authorization permission errors." So I went to the "Crawl Errors" section and found errors like these under the "Access denied" status:

?page_name=Cheap+Viagra+Gold+Online&id=471

?page_name=Cheapest+Viagra+Us+Licensed+Pharmacies&id=1603

and many happy URLs like these. Does anybody know what this is and where it comes from?

Thanks in advance!

odmsoft

Thank you Tom!

BlueprintMarketing

Hi

To remove any chance of infection (and I am not telling you that I am 100% sure the site is infected), you must be certain that the original infection was removed. If it was not, and links were created by a third party rather than by you, you are better off getting the site completely cleaned.

Use Sucuri.net to remove any chance of a hack.

Just type this into Google

?page_name=Cheap+Viagra+Gold+Online&id=471
?page_name=Cheapest+Viagra+Us+Licensed+Pharmacies&id=1603

http://www.pearsonified.com/2010/04/wordpress-pharma-hack.php

https://blog.sucuri.net/2010/07/understanding-and-cleaning-the-pharma-hack-on-wordpress.html

https://sitecheck.sucuri.net/results/www.davidandsonsjewelers.com/articles/author/carole/

I used deepcrawl.com to create the audit you referenced, and Screaming Frog SEO Spider to create the sitemap.

I hope that helps,

Tom

odmsoft

Hello Thomas,

I really appreciate your help! You said I can look at your site's structure. What is your site address?

Unfortunately, I still don't know what I need to do to remove the pharma hack from my site. If you can point me to the answer, I'll be very grateful.

Also, what tool did you use to generate this report: http://crawl.blueprintmarketing.com/projects/reports/215533?ro=75ad0c6e4afacc428b553d449dfd281f82ec2ad6 ?

And what tool did you use to create the XML sitemap?

Thanks

BlueprintMarketing

I found no sitemap after checking multiple common XML sitemap locations, and no redirects either. For example, /sitemap_index.xml might exist separately or redirect to /sitemap.xml, but I came up with nothing.

http://www.davidandsonsjewelers.com/sitemap.xml shows a 404
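The sitemap check described above can be reproduced without a crawler. This is only a minimal sketch using Python's standard library. The list of paths is an assumption (two common conventions, not an exhaustive list), and the helper names `candidate_sitemap_urls` and `probe` are made up for illustration.

```python
from urllib.error import HTTPError
from urllib.parse import urljoin
from urllib.request import Request, urlopen

# Common sitemap locations; these are conventions, not guarantees.
COMMON_SITEMAP_PATHS = ["/sitemap.xml", "/sitemap_index.xml"]


def candidate_sitemap_urls(base_url):
    """Return the sitemap URLs worth probing for a given site root."""
    return [urljoin(base_url, path) for path in COMMON_SITEMAP_PATHS]


def probe(url, timeout=10):
    """Return the final HTTP status for url, or None if the request failed."""
    request = Request(url, method="HEAD", headers={"User-Agent": "sitemap-probe"})
    try:
        with urlopen(request, timeout=timeout) as response:
            # urlopen follows redirects, so this is the status of the final URL.
            return response.status
    except HTTPError as error:
        return error.code  # e.g. 404 when nothing exists at that path
    except OSError:
        return None  # DNS failure, refused connection, timeout, ...


if __name__ == "__main__":
    for url in candidate_sitemap_urls("http://www.davidandsonsjewelers.com/"):
        print(url, probe(url))
```

Some servers reject HEAD requests, so a 405 here does not prove the sitemap is missing. A sitemap can also live at a custom path declared in robots.txt, which this sketch does not check.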

Tools:

deepcrawl.com https://varvy.com/mobile/ & https://varvy.com/tools/

http://i.imgur.com/fxGWwXI.png


BlueprintMarketing

To detect mobile issues:

https://varvy.com/mobile/ & https://varvy.com/tools/

BlueprintMarketing

If I were you, I would look at my site structure and make sure it was built that way for the right reasons.

If your traffic is all right, you really do not want to change the site that much. If you do change the site, change it slowly.

(A great example of this is how FireHost.com is becoming Armor.com.)

The tool I primarily used to find out whether or not you had a sitemap was deepcrawl.com.

To detect mobile issues:

https://varvy.com/mobile/ & https://varvy.com/tools/

See here: http://crawl.blueprintmarketing.com/projects/reports/215533?ro=75ad0c6e4afacc428b553d449dfd281f82ec2ad6

http://i.imgur.com/W7BDaq7.png

http://www.screamingfrog.co.uk/seo-spider/

http://i.imgur.com/LbCBmmW.png

I used Screaming Frog to create an XML sitemap for you here:

http://d.pr/f/15Ouy

I would definitely add an XML sitemap.

Sincerely,

Thomas


odmsoft

Also, are you saying that the mobile site is blocked? And how can you see that the site doesn't have an XML sitemap? What tool shows you all this info?

Thanks

odmsoft

Hi Thomas,

I really appreciate your help! Can you advise me on what I should do? I see all these reports, but I don't know how to clean the site.

Thank you!

BlueprintMarketing

The URLs you are showing are definitely pharma hack URLs. Keep in mind that there are certain things Sucuri is unable to detect, because it is a front-end tool and not the PHP tool that would be needed for the two-part WordPress and PHP version of your site.

Just type this into Google

?page_name=Cheap+Viagra+Gold+Online&id=471
?page_name=Cheapest+Viagra+Us+Licensed+Pharmacies&id=1603

http://www.pearsonified.com/2010/04/wordpress-pharma-hack.php

https://blog.sucuri.net/2010/07/understanding-and-cleaning-the-pharma-hack-on-wordpress.html

https://sitecheck.sucuri.net/results/www.davidandsonsjewelers.com/articles/author/carole/

https://majestic.com/reports/neighbourhood-checker?d=http%3A%2F%2Fwww.davidandsonsjewelers.com%2Farticles%2Fauthor%2Fcarole%2F&IndexDataSource=H

https://www.virustotal.com/en/ip-address/216.120.237.225/information/

http://dnsbl.inps.de/query.cgi?lang=en&ip=216.120.237.225&action=check&quick=0

BlueprintMarketing

and switch everything to WordPress

view-source:http://www.davidandsonsjewelers.com/

BlueprintMarketing

Some of your links are really not supposed to be there:

http://crawl.blueprintmarketing.com/report_grid/page/215533/all_pages/baf4035a9366e9c99559f1f51f42df1475a0c13f?tab=Internal Links Out

Here is your report. Please use the URL below to navigate the entire report.

Your URLs are relative for the most part; that should be fixed. You have a Java redirect that definitely needs to be fixed.

http://crawl.blueprintmarketing.com/projects/reports/215533?ro=75ad0c6e4afacc428b553d449dfd281f82ec2ad6

PDF & XML outline

https://varvy.com/

You have roughly 108 indexed URLs according to Google

https://marketing.grader.com/report/www.davidandsonsjewelers.com/overall

Unfortunately, you do not have an XML sitemap. I found that out in the first five minutes, but you can also check these things yourself using

https://mza.seotoolninja.com/researchtools/crawl-test

Upon a quick check with another tool, I found:

http://i.imgur.com/Y60WnIc.png

I love DeepCrawl; however, your site is not large, and you can learn a lot about it with

http://www.screamingfrog.co.uk/seo-spider/ free

I hope this helps. Without analytics and Webmaster Tools access, I obviously cannot give you a much better picture.

Tom


BlueprintMarketing

I will run the audit now. Sorry for the delay.

odmsoft

Sure,

Here it is http://www.davidandsonsjewelers.com/.

Thank you!

BlueprintMarketing

The best way to solve this problem is to use

https://deepcrawl.com

or the http://screamingfrog.co.uk SEO Spider.

If you give me the URL, I will do a quick check for you.

odmsoft

Thank you Thomas,

My site is clean according to Sucuri, though. I spoke to the owner of this website, and they said that they were hacked in the past and blocked those pages themselves. So is Google now detecting those pages again? Or what exactly is happening? Does anybody know?

Thanks

BlueprintMarketing

Remember that not every URL is in Google's index. That does not mean the backlink is not in:

https://mza.seotoolninja.com/researchtools/ose/

https://majestic.com

http://ahrefs.com

You should very quickly make sure that your website is not still full of malware, as it sounds like it is.

Use this tool to determine what has happened to your site and whether it is infected. It is free.

https://sitecheck.sucuri.net/

If it is hacked, as I believe it may be based on what you have described, I would then purchase the malware removal and web application firewall:

https://sucuri.net/website-antivirus/

If you would like a much more secure hosting environment, https://armor.com is the best.

Once you have removed your site from the blacklists and removed all the badware/malware, make sure to have Google crawl it in Webmaster Tools using Fetch as Google.

Your nightmare should be short-lived. Sorry to hear that your site was hacked; hopefully this will get you back on track quickly.

odmsoft

Hi Dirk,

In Webmaster Tools, if I click those links one by one, I can see the "Linked from" URLs. There are URLs like this:

http://schwagginwagon.com/?page_name=Buying+Tadalis+SX+Safely+No+Prescription+Tadalis+SX&id=1810

and there is also one URL coming from my own domain. I'm not sure what that means.

I went through every single URL in the Google index, but all of them are normal URLs. Nothing related to spam. Any ideas?

Thanks

DirkC

Try a search of the type viagra site:yourdomain.com and see if any suspicious pages are listed.

In the Crawl Errors section in Webmaster Tools you can also check where these URLs are coming from (external/internal links).

If your site is hacked - you can find more info here http://www.google.com/webmasters/hacked/ on what to do next.

rgds,

Dirk
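Dirk's two checks can be roughed out in a few lines of Python if the crawl-error list is long. This is a sketch only: the keyword list is an illustrative assumption taken from the URLs quoted in this thread, and the function names are hypothetical.

```python
from urllib.parse import parse_qs, urlsplit

# Illustrative keywords taken from the URLs in this thread; extend as needed.
SPAM_KEYWORDS = ("viagra", "tadalis")


def page_name(url):
    """Return the decoded page_name query parameter, or '' if it is absent."""
    values = parse_qs(urlsplit(url).query).get("page_name", [])
    return values[0] if values else ""


def looks_like_pharma_spam(url):
    """True if the page_name parameter contains one of the spam keywords."""
    name = page_name(url).lower()
    return any(keyword in name for keyword in SPAM_KEYWORDS)


def site_queries(domain, keywords=SPAM_KEYWORDS):
    """Build 'keyword site:domain' searches to paste into Google."""
    return [f"{keyword} site:{domain}" for keyword in keywords]


if __name__ == "__main__":
    crawl_errors = [
        "?page_name=Cheap+Viagra+Gold+Online&id=471",
        "?page_name=Cheapest+Viagra+Us+Licensed+Pharmacies&id=1603",
    ]
    for url in crawl_errors:
        print(looks_like_pharma_spam(url), page_name(url))
    for query in site_queries("yourdomain.com"):
        print(query)
```

A keyword match only flags a URL for a closer look; it does not prove the site is currently infected.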

odmsoft

Hello Dirk,

Thank you for the fast reply! I thought so too right away. All of these URLs are forbidden when I try to access them. This is the message from Google Webmaster Tools: "Googlebot couldn't crawl your URL because your server either requires authentication to access the page, or it is blocking Googlebot from accessing your site."

Any ideas? Thanks

DirkC

Hi

At first sight I would guess your site has been hacked. Do these URLs exist when you try them?

Dirk
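One way to answer the "do these URLs exist" question is to request a suspect URL and read the status code, once as a normal browser and once with Googlebot's user-agent string, since hacked sites sometimes respond differently to crawlers. The following is a rough sketch under assumptions: `http://www.example.com/` is a placeholder for your own domain, the browser user-agent is an arbitrary example, and the helper names are made up. Sending Googlebot's user-agent is not the same as being Googlebot, so treat the result as a hint only.

```python
from urllib.error import HTTPError
from urllib.request import Request, urlopen

# Googlebot's published user-agent string; the browser one is just an example.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64) ExampleBrowser/1.0"


def interpret(status):
    """Give a rough reading of an HTTP status for a suspect URL."""
    if status is None:
        return "no response (DNS or connection problem)"
    if status in (401, 403):
        return "access denied: the server is blocking the request"
    if status in (404, 410):
        return "not found: the page does not exist"
    if 200 <= status < 300:
        return "page exists: inspect its content for injected spam"
    return f"other status {status}: check manually"


def fetch_status(url, user_agent, timeout=10):
    """Return the HTTP status for url, or None if the request failed."""
    request = Request(url, headers={"User-Agent": user_agent})
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status
    except HTTPError as error:
        return error.code  # 4xx/5xx responses arrive as exceptions
    except OSError:
        return None  # DNS failure, refused connection, timeout, ...


if __name__ == "__main__":
    # Placeholder domain: replace with your own site root.
    suspect = "http://www.example.com/?page_name=Cheap+Viagra+Gold+Online&id=471"
    for label, agent in (("browser", BROWSER_UA), ("googlebot", GOOGLEBOT_UA)):
        print(label, interpret(fetch_status(suspect, agent)))
```

A 401 or 403 here would match the "Access denied" status reported in Webmaster Tools.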
