Long Url but makes no sense
-
Hi Just joined.
Crawl states that I am getting a lot of errors, looks like the spider is getting confused and looping back on itself ?
Is there a way to see where the crawl was formulated (ie where from) ?
It is generating urls like:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/auction/auction/auction/auction/auction/auction/Default.aspx from http://www.wickman.net.au/wineauction/wine_auction_alert.aspx
-
Welcome! We're happy to have you here, and glad to have helped you solve your problem.
-
Thank you. Thats exactly what I needed. Downloaded the csv and was able to find the referring URLS and track back to the offending page.. Searched my site and voila - silly me put a trailing slash after the file prefix in the sitemap.. must have really messed up the robot.
so I had:
<loc>http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/</loc>
Ooops.. I think I like this SEOMoz place already
-
Did launch a quick crawl on your site, these url were not found (but you have several broken links!). These strange URL usually come from bad robots, and may also come from the interpretation of javascript URL or redirects. GoogleBot gets really confused with them.
If you want to dig, check the servers logs.
-
Hi Martijn,
I did take a look at the source code, did not find that link anywhere, however that page is live:
Not sure if its a dynamically created or a static one...
Mark:
take a look at your physical directories on your server using your favorite FTP software, see if you can follow the same navigation and see if you have these folders and file
-
Hi Igor,
If you have a look at the page the URL is not found on this page.
-
Hi Mark,
What I would recommend, is open this page in your browser:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx
Then view the source page and try to search for the url:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/auction/auction/auction/auction/auction/auction/Default.aspx
Are you using a CMS or ecommerce platform for your site like wordpress or x-cart? If so it might be something wrong with the configuration which produces the pages and links automatically
-
Hi Mark,
Within you Crawl Diagnostics you're able to export your data to CSV (on the topright side of the overview). By doing this you can find the links to the page you think is incorrect.
Hope this helps!
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is using hyphens in a URL to separate words good practice?
Hi guys, I have a client who wants to use a hyphen to separate two words in the URL to make each work stand out. Is is good or bad practice to use a hyphen in a URL and will it affect rankings? Thanks!
On-Page Optimization | | StoryScout0 -
How can i block the below URLs
Google indexed plugins pages for my website. Please check below. How can stop them to be indexed on google.? http://www.ayurjeewan.com/wp-content/plugins/LayerSlider/static/skins/glass/ http://www.ayurjeewan.com/wp-content/plugins/LayerSlider/static/skins/borderlesslight3d/ http://www.ayurjeewan.com/wp-content/plugins/LayerSlider/static/skins/defaultskin/ My robots.txt file is - User-agent: * Disallow: /wp-admin/
On-Page Optimization | | MasonBaker0 -
Long list of companies spread out over several pages - duplicate content?
Hi all, I am currently working with a company formation agent. They have a list of every limited company spread over hundreds of pages. What do you guys think? Is there a need for Canonicals? The website is ranking pretty well but I want to make sure there aren't any problems in the future. Here are two pages as examples: http://www.formationsdirect.com/companysearchlist.aspx?start=MULLAGHBOY+CONSTRUCTION+LIMITED&next=1# http://www.formationsdirect.com/companysearchlist.aspx?start=%40a+company+limited&next=1# Also what about the actual company pages? See an example below http://www.formationsdirect.com/companysearchlist.aspx?name=AMNA+CONSTRUCTION+LTD&number=06630333#.U8PW6_ldX1s Thanks in advance Aaron
On-Page Optimization | | AaronGro0 -
Infinite Scrolling Long Lists and SEO
Just curious if anyone else has tried this. I have pages with words that link to definitions. I have A LOT of them on a page and I am starting the process of trying to either do pagination (which I cant stand) or even cooler infinite scrolling where the page loads more words as the user scrolls. Good Bad for SEO?
On-Page Optimization | | cbielich0 -
Errors in URL´s
SEOMOZ is showing quite a lot of URL Errors like this: http://trampoliny.net.pl/akcesoria/pokrowiec-basic?frontend=1825cb1eea3af8ee6ee2d96617d32ff6 All these URL´s use the parameter "?frontend=". In webmaster tools we told google not to index this parameter. Unfortunately at the moment we cannot set this parameter as "NOINDEX". We also dont want to use a robots.txt file. How to get rid of the URLS in Seomoz?
On-Page Optimization | | drgoodcat0 -
How many urls per page is to many
I know it used to be 100 urls per page, but recently Matt cutts has said that they can count a lot more now. I was wonder what you guys thought was how many was to many per page?
On-Page Optimization | | Gordian0 -
URL with two forward slashes //
We have a potential client with a URL structure in this fashion: http://www.site-url.com//cpage/page.html pretty strange, right? my question is: How bad are the 2 forward slashes // for SEO? How bad is it to have that extra layer of /cpage in the URL? this doesn't appear to serve any other purpose than making the URL longer than necessary.
On-Page Optimization | | Motava0 -
Title optimization best practices for clients with insanely long business names
How do others utilize keywords and preserve branding in the title tag for clients with a REALLY long name? Two examples. Example 1: Business name is 38 characters long in the following format: [Firstname] [initial] [Lastname] [Businesstype] Services 38 characters is workable, but the keywords for what he offers and this industry in general are long too. He abbreviates to his initials in the domain name - I don't love doing that as the acronym has a meaning of its own. (We unintentionally acquired at least one very amusing if useless backlink thanks to that.) Leaving off "Services" saves a few characters. Example 2: Business name totals 58 characters and references their two related lines of business. Similar to: Rogers Institute of Robotic Studies and RIRS Robot Repair
On-Page Optimization | | MaryAnneG
or (saves a few characters)
Rogers Institute of Robotic Studies and Robot Repair How would you handle that? Use the appropriate half of the name on pages related to that particular LOB? Only use the brand on some pages? Abbreviate more? I've been using their full name on the more "general" pages of the site and omitting it in favor of keywords on the more specific pages . Suggestions? Other ideas?1