Webmaster Tools Indexed pages vs. Sitemap?
-
Looking at Google Webmaster Tools and I'm noticing a few things, most sites I look at the number of indexed pages in the sitemaps report is usually less than 100% (i.e. something like 122 indexed out of 134 submitted or something) and the number of indexed pages in the indexed status report is usually higher. So for example, one site says over 1000 pages indexed in the indexed status report but the sitemap says something like 122 indexed.
My question: Is the sitemap report always a subset of the URLs submitted in the sitemap? Will the number of pages indexed there always be lower than or equal to the URLs referenced in the sitemap?
Also, if there is a big disparity between the sitemap submitted URLs and the indexed URLs (like 10x) is that concerning to anyone else?
-
Unfortunately not, the closest you'll get is selecting a long period of time in Analytics and then exporting all the pages that received organic search traffic. If you could then cross check them with your list of URLs on your site it could provide you with a small list. But I would still check them in Google to make sure they aren't indexed. As I said it's not the best way.
-
Is there a reliable way to determine which pages have not been indexed?
-
Great answer by Tom already, but I want to add that probably images and other types of content whom are mostly not by default included in sitemaps could also be among the indexed 'pages'.
-
There's no golden rule that your sitemap > indexed pages or vice versa.
If you have more URLs in your sitemap than you have indexed pages, you want to look at the pages not indexed to see why that is the case. It could be that those pages have duplicate and/or thin content, and so Google is ignoring them. A canonical tag might be instructing Google to ignore them. Or the pages might be off the site navigation and are more than 4 links/jumps away from the homepage or another page on the site, make them hard to find.
Conversely, if you had lots more pages indexed than in your sitemap, it could be a navigation or URL duplication problem. Check to see if any of the pages are duplicate versions caused by things like dynamic URLs generated through search on the site or the site navigation, for example. If those pages are the only physical pages that you have created and you know every single one has been submitted in a sitemap - and so any other indexed URLs would be unaccounted for, that may well be cause for concern, so check nothing is being indexed multiple times.
Just a couple of scenarios, but I hope it helps.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Will noindex pages still get link equity?
We think we get link equity from some large travel domains to white label versions of our main website. These pages are noindex because they're the same URLs and content as our main B2C website and have canonicals to the pages we want indexed. Question is, is there REALLY link equity to pages on our domain which have "noindex,nofollow" on them? Secondly we're looking to put all these white label pages on a separate structure, to better protect our main indexed pages from duplicate content risks. The best bet would be to put them on a sub folder rather than a subdomain, yes? That way, even though the pages are still noindex, we'd get link equity from these big domains to www.ourdomain.com/subfolder where we wouldn't to subdomain.ourdomain.com? Thank you!
Reporting & Analytics | | HTXSEO0 -
Deleted Rarely Visited Pages - Traffic Dropped (Big Time)
Hi folks: I'd appreciate any thoughts you might have on a problem I am having with organic traffic. One of our sites has about 500 pages/blog posts. We had about 200 pages that no one was visiting, or only one to ten people had visited in an entire year. As a result, we decided to experiment, and delete any page which had fewer than 5 visits in a year. This resulted in a deletion of about 90 pages.We did this on April 6 or 7 of this year. Two days later, we had a substantial drop in visits to the site. We had been getting about 300 sessions a day. Now, we are lucky to get that in a month. I know there was an algorithm update in late March, but our traffic dropped about two weeks after that, and a day or so after the deletion of the pages. There is a clear demarcation on analytics. I gave it a month, the traffic did not recover, so we decided to restore the pages. Traffic has not recovered and it has been about 3 months now. Does anyone have any thoughts on why we might have experienced such a drastic drop as well as what we might do to recover from it? Thanks very much
Reporting & Analytics | | jnfere0 -
Landing pages report - Meaning of clics metric
Hi there, I am looking at the landing pages report on Google Analytics, I see 4 columns: Impressiones Clics Average position CTR Regarding the clics metric, this shouldn't be equal to the sessions of organic traffic that you get? In Adwords, a clic is a session. What I see is that clics are not sessions and I am a bit surprised of this. Why are they different in this report? Thanks and regards Thanks and regards
Reporting & Analytics | | footd0 -
How do I set up goals in analytics to track the sales funnel when several pages of the steps required to checkout have the same url?
I have found this in Google Analytic but it makes no sense to me - is there are better resource which explains how you do this step by step ( or a good video perhaps?) Identical URLs Across Multiple Steps In some situations, the URL does not change across a sequence of activity. For example, a sign-up process might have the following URL path: Step 1 (Sign Up): www.example.com/sign_up.cgi Step 2 (Accept Agreement): www.example.com/sign_up.cgi Step 3 (Finish): www.example.com/sign_up.cgi To track visitors' progress through a funnel with the same URL for each step, modify the tracking code to create a virtual URL for each step in the sequence that you want to track. For details on how to use this in your tracking code, see Virtual Pageviews in the _Asynchronous Migration Examples_guide, which shows how to do this in all versions of the tracking code. The following example shows how you might fabricate 3 URLs using the asynchronous tracking code: _gaq.push(['_trackPageview', '/funnel_G1/step1.html']); _gaq.push(['_trackPageview', '/funnel_G1/step2.html']); _gaq.push(['_trackPageview', '/funnel_G1/step3.html']); You would then define your funnel and goal URLs using the ones you created in the tracking code modifications.
Reporting & Analytics | | pookiepro0 -
Google WebMasters Tool - Preferred Domain
I just added Google Analytics to my wordpress site with Google Analytics by YOAST. I then added Google WebMaster tools through via verify through google analytics account. I then tried to set a preferred domain. I chose the non www. version; however, google wanted me to verify ownership of both versions in order to set a preferred domain. I then added the www. version of my domain. I was able to set the non-www. version to my preferred domain. Now, there are two example.com's in my webmaster tools. I have 10 sites. I intend to replicate this process on all of my sites. Do I have to leave the non-preferred version of my sites in the google webmaster? Can I delete it after I have set my preferred version? If I delete the non-preferred version will it delete my setting on the preferred version because it is now no longer verified (saved)?
Reporting & Analytics | | JML11791 -
How to find out which URLs are NOT indexed on a site
Is there a way to easily find out which URLs on a store-type site are NOT being indexed in Google? For example, if my sitemap information in Google Webmaster tools shows I have 7342 URLs in my sitemap and 5699 of those indexed, how do I find out what the 1643 non-indexed URLS are? Thanks for any help!
Reporting & Analytics | | GregWalt0 -
Strange pages visited in analytics
Hi Guys I am finding this in one of my sites analytics allot /tr-url/en-ru.ru/ followed directly by the site in questions domain name and page of interest....like this.. /tr-url/en-ru.ru/www.example.com/blue-widget/ and example.com being the analytics of the site in question. Are these pages that have been translated? like by yandex or something in that line....I cannot seem to find anything on this anywhere. What is this and should i be concerned?? thanks for the help cheers
Reporting & Analytics | | nomad-2023230