Site: Query Question
-
Hi All,
Question around the site: query you can execute on Google for example. Now I know it has lots of inaccuracies, but I like to keep a high level sight of it over time.
I was using it to also try and get a high level view of how many product pages were indexed vs. the total number of pages.
What is interesting is when I do a site: query for say www.newark.com I get ~748,000 results returned.
When I do a query for www.newark.com "/dp/" I get ~845,000 results returned.
Either I am doing something stupid or these numbers are completely backwards?
Any thoughts?
Thanks,
Ben
-
Barry Schwartz posted some great information about this in November of 2010, quoting a couple of different Google sources. In short, more specific queries can cause Google to dig deeper and give more accurate estimates.
-
Yup. get rid of parameter laden urls and its easy enough. If they hang around the index for a few months before disappearing thats no big deal, as long as you have done the right thing it will work out fine
Also your not interested in the chaff, just the bits you want to make sure are indexed. So make sure thise are in sensibly titled sitemaps and its fine (used this on sites with 50 million and 100 million product pages. It gets a bit more complex at that number, but the underlying principle is the same)
-
But then on a big site (talking 4m+ products) its usually the case that you have URL's indexed that wouldn't be generated in a sitemap because they include additional parameters.
Ideally of course you rid the index of parameter filled URL's but its pretty tough to do that.
-
Best bet is to make sure all your urls are in your sitemap and then you get an exact count.
Ive found it handy to use multiple sitempas for each subfolder i.e. /news/ or /profiles/ to be able to quickly see exactly what % of urls are indexed from each section of my site. This is super helpful in finding errors in a specific section or when you are working on indexing of a certain type of page
S
-
What I've found the reason for this comes down to how the Google system works. Case in point, a client site I have with 25,000 actual pages. They have mass duplicate content issues. When I do a generic site: with the domain, Google shows 50-60,000 pages. If I do an inurl: with a specific URL param, I either get 500,000 or over a million.
Though that's not your exact situation, it can help explain what's happening.
Essentially, if you do a normal site: Google will try its best to provide the content within the site that it shows the world based on "most relevant" content. When you do a refined check, it's naturally going to look for the content that really is most relevant - closest match to that actual parameter.
So if you're seeing more results with the refined process, it means that on any given day, at any given time, when someone does a general search, the Google system will filter out a lot of content that isn't seen as highly valuable for that particular search. So all those extra pages that come up in your refined check - many of them are most likely then evaluated as less than highly valuable / high quality or relevant to most searches.
Even if many are great pages, their system has multiple algorithms that have to be run to assign value. What you are seeing is those processes struggling to sort it all out.
-
about 839,000 results.
-
Different data center perhaps - what about if you add in the "dp" query to the string?
-
I actually see 'about 897,000 results' for the search 'site:www.newark.com'.
-
Thanks Adrian,
I understand those areas of inaccuracy, but I didn't expect to see a refined search produce more results than the original search. That just seems a little bizarre to me, which is why I was wondering if there was a clear explanation or if I was executing my query incorrectly.
Ben
-
This is an expected 'oddity' of the site: operator. Here is a video of Matt Cutts explaining the imprecise nature of the site: operator.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Two long established sites with similar audiences, what do we do?
Hi guys, We operate two long established and reasonably well ranking sites — our company website which was built on a keyword domain: market-stalls.co.uk (approx 15 years online) and our online store which was established several years later on a different domain: tradersupplies.co.uk (approx 9 years online). (At the bottom of this post I've attached real world traffic and turnover figures that demonstrate the issue we're facing) The problem is... The above sites target very similar audiences and keywords and both rank fairly well but I know are likely competing against eachother We're a small company (8-10 employees) and we (or rather, I) don't have the time or resources to blog, build back links, manage opseo and all the social channels etc for both sites. I'm struggling to cope with one. The question is... Do we abandon the original company site (market-stalls.co.uk) in favour of pooling all our resource in to improving rankings for our online store (tradersupplies.co.uk). All our social media presence relates to tradersupplies.co.uk. We don't have any social channels for market-stalls.co.uk. Ironically, the only blog we have is established on market-stalls.co.uk — set up a couple of years ago in the hope to pull ourselves back up the rankings — but it hasn't been updated in over a year due to time restraints. Or do we attempt to keep both sites operational, despite a lack of resource? That would likely include a fairly sizeable overhaul of market-stalls.co.uk to bring it up to date with modern design standards, establishing social media channels for market-stalls.co.uk, creating a blog on tradersupplies.co.uk, and regularly updating two blogs and two sets of social media channels with unique content. Sounds like a pretty huge job right!? Obviously, had we been setting up our business in 2017 and having read the many community posts on the subject of multiple websites, we wouldn't be splitting our time between two websites and would be focussing solely on building one highly ranking site. But unfortunately we're not in this position and we're in a quandary because we don't know whether or not we should let our original, highly ranking company site drop off the radar in favour of focussing on building traffic to our online store. This situation arose out of a decision to establish our online store on a different domain to our company website. Back in 2007 I rebuilt market-stalls.co.uk and spent a lot of time optimising it. The site blew up and we were ranking very well for all kinds of keywords related to market stalls In 2009 we opened our online store tradersupplies.co.uk which sells all of the products advertised on market-stalls.co.uk and then some By using "buy now" buttons on market-stalls.co.uk which redirected to tradersupplies.co.uk, our original site was driving a large amount of traffic and sales to tradersupplies.co.uk. At it's peak it was driving almost £6,000 GBP a month in sales. This has since dropped to around a third/quarter of this total. As the business grew we began to run short of time to maintain market-stalls.co.uk and it has inevitably slipped down the rankings This has also had a direct impact on the referral traffic and resulting sales on tradersupplies.co.uk. I've attached below the analytics which show the drop in referral traffic to tradersupplies.co.uk and the drop off in sales. I have a feeling I know the answer to this debacle but I'm keen to hear the opinions of those that may have found themselves in this position before! UPDATE: I've just had a call with our Magento developer halfway through writing this post ... he has suggested we transfer all content from market-stalls.co.uk over to CMS pages on our Magento powered online store, and create 301 redirects. Apparently this will carry the weight of market-stalls.co.uk over to tradersupplies.co.uk. Does anyone have any thoughts on this? turnover.jpg
Reporting & Analytics | | tinselworm0 -
Adding a Query String to a Static URL is that good or bad?
I just went through this huge process to shorten my URL structure and remove all dynamic strings. Now my analytics team wants to add query strings to track clicks from the homepage. Is this going to destroy my clean url structure by appending a query string to the end of the URL structure.
Reporting & Analytics | | rpaiva0 -
Has anyone seen their GWT Search Queries data resolve?
My GWT data seems to be have resolved through 9/28, but still looking low for the week starting 9/29 through present.
Reporting & Analytics | | EmpireToday0 -
What is the impact of a panda refresh on a Pandalized site?
When a panda refresh hits and you have a pandalized site, If the site were to de-pandalized, would you see traffic back to pre-panda levels right away? Or any type of movement right away?
Reporting & Analytics | | jessefriedman0 -
Any thoughts on why Nextag and MonsterMarketPlace are linking to our site?
I'm looking in WMT at the crawl errors and I noticed that our website has gotten a lot of Not Found crawl errors that seem strange. A lot of these not found pages are Display URLs that I use in PPC advertising, but not actual redirects (i.e. explorica.com/EducationalTrips). When I looked at how these links were being found, the inbound links were coming from Nextag.com and monstermarketplace.com, two sites that our company has never had a relationship with. We're an educational travel company, so we'd have no reason to. When I followed the links, it looks like it's coming from their "Sponsored Links," but these aren't Google or Bing Ads. We don't even advertise on the content network. Example link: http://www.monstermarketplace.com/starters-and-alternators/alternator-motorola-style-12v-51a-10376 (the ads do rotate so my site might not appear when you check it out). Anyone ever had experience with this type of issue?
Reporting & Analytics | | Explorica0 -
How would you measure the SEO success of new site launch?
It has been 12 months, and it is time for some serious SEO reality check up. I think we have done some really nice things (social integration, on page optimization etc) but we honestly could do a million time better on some other elements (anchor, text, link building etc...). Would love to hear from the community what would be the top 10 criteria you would use to judge the quality of the SEO work done for a new site during is first 12 months. PS: we are a very content rich over 1,500 new articles/post in our niche with 12 months - our site is migraine.com Thanks
Reporting & Analytics | | OlivierChateau0 -
How to Refesh site comapign?
How to Refesh site comapign? its displaying 3 days old data. now fixed some contents. unable to test it. kindly guide me for howto refresh the report?
Reporting & Analytics | | peanut20100 -
Overall site traffic - 3 quick questions
3 things : 1. Does Google factor in overall site traffic in rankings? So for 2 sites, all other things being pretty much equal, the one with higher traffic will be listed higher? 2. Does this logically imply that sites with lower traffic overall face an uphill struggle to be ranked highly??? 3. As an extension to this, would it be true to say that by increasing site traffic, say with Google Adwords or other online or offiline or whatever advertising, that might help get higher SEO rankings??? Thanks so much for your responses. This forum is great!
Reporting & Analytics | | inhouseninja0