I have more pages in my site map being blocked by the robot file than I have being allowed to be crawled. Is Google going to hate me for this?
-
Using some rules to block all pages which start with "copy-of" on my website because people have a bad habit of duplicating new product listings to create our refurbished, surplus etc. listings for those products. To avoid Google seeing these as duplicate pages I've blocked them in the robot file, but of course they are still automatically generated in our sitemap. How bad is this?
-
When you say "people," are you saying your own web team duplicates content to make their job easier? Or am I missing something?...
If that's the case, you really should create unique URL's with unique page titles, product info, etc. That's the correct way to avoid getting hit for duplicate content - don't create it. It seems like what you're doing now is more of a band-aid solution to the problem.
I'd consider that even though creating unique content in situations like this can seem daunting and/or be more expensive, there's probably huge long-term gains to made if you do it right.
-
It is not bad, just not best practices because Google will still index the URL's if they are mentioned on other pages. Just to quote them:
"While Google won't crawl or index the content of pages blocked by robots.txt, we may still index the URLs if we find them on other pages on the web. As a result, the URL of the page and, potentially, other publicly available information..."
What I would do instead is either use rel="canonical" or 301 redirects. I hope that helps.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
There is a copy of our website that is ranking. How can I let Google know our website is the authentic site?
I just found another copy of my old website and have no way to take it down. Unfortunately, it's ranking so he didn't place it as a nofollow. (My boss hired someone to redevelop our website before I came on board and never finished the project). So, could this be hurting us? I tried to look to see if we were being penalized and couldn't find that we were. Also, ever since we migrated to a new domain name, our ranking is tumbling. I've redirected properly and tested to make sure they're resolving correctly and they are. I have no idea what is going on. We've virtually lost all ranking. Any help would be much appreciated.
On-Page Optimization | | npuffer790 -
Can Robots.txt on Root Domain override a Robots.txt on a Sub Domain?
We currently have beta sites on sub-domains of our own domain. We have had issues where people forget to change the Robots.txt and these non-relevant beta sites get indexed by search engines (nightmare). We are going to move all of these beta sites to a new domain that we disallow all in the root of the domain. If we put fully configured Robots.txt on these sub-domains (that are ready to go live and open for crawling by the search engines) is there a way for the Robots.txt in the root domain to override the Robots.txt in these sub-domains? Apologies if this is unclear. I know we can handle this relatively easy by changing the Robots.txt in the sub-domain on going live but due to a few instances where people have forgotten I want to reduce the chance of human error! Cheers, Dave.
On-Page Optimization | | davelane.verve0 -
New site pages are indexed but not ranking for anything
I just built this site for a client http://primedraftarchitecture.com. It went live 3 weeks ago and the pages are getting indexed as per Webmaster Tools. But I'm not seeing it rank for anything. We're adding blog articles regularly and used Moz Local for local links and have been building links in other local directories (probably about 15 so far). Usually I get some rankings, although very low, after just a week or two for new sites. Does anyone see anything glaring that may be causing a problem?
On-Page Optimization | | DonaldS1 -
Crawl errors
Hi I have the following errors on my site and was wondering would it help improve my ranking to fix : Missing Meta Description Tag 137Duplicate Page Title 17Title Element is Too Long 6Temporary Redirect 3
On-Page Optimization | | WallerD0 -
Image heavy pages: Google friendly fonts / seo text etc
Hi Google friendly fonts - are these in wide use now, do they work ? If you have image heavy site do they work just as well as using what we used to call 'seo text'. I have heard that 'seo text' not really used anymore or at least rebranded to 'helpful, informative paragraph or two of body copy about the page with a couple of the pages target keywords in it'. I take it if fonts in image not google friendly then should still ask dev for some space to fit in a para or two of some proper body copy, with couple of pages target kw in it ? Also looking like if i succeed in this request will be below the fold, how hard should i fight for it to be above the fold ? cheers dan
On-Page Optimization | | Dan-Lawrence0 -
Page Not Indexed
Hi Guys I wrote and published an article last night on my site but it is yet to be indexed. This is strange as articles are usually indexed pretty quickly. Could you have a quick look and see what the problem is? http://www.rankmytri.com/tomtom-running-and-triathlon-watch/ Also all my Blog posts (in the blog section of the site) are not indexed as well (and I dont think they have been for a while) yet I dont have any messages from Google in my webmaster tools. Thoughts? Thanks in advance Ross
On-Page Optimization | | ross88guy0 -
On-Page Report Card: Whats up with the TITLE of the page?
Started to fix the SEO issues on a customers website. When I run a "On-Page Report Card" It says that the title of the webpage:
On-Page Optimization | | maklarlabbet
www.visbymaklarna.se/visbymaklarna.html Is "visbymäklarna - Ditt förstahandsval på gotland." But if I check in the source code of the webbrowser the title should be:
name="title" content="Vi är mäklarna på Gotland som sätter människan i första rummet" /> (Actually this is with special encoding for the swedish characters. The title in coded text is: "Vi är mäklarna på Gotland som sätter människan i första rummet") Anyway the title of the webpage source code and the title of what SEOmoz reports is different. Why is that?0 -
One Page Website vs. Multipage Site, if you want to target one specific Keyword only.
Hello! suppose I want to start a website about, let's say spray adhesives. My aim is to rank on the first page for the keyword "spray adhesive". I don't care about my ranking on more specific keywords like "Tesa spray adhesive" or "3M spray adhesive". My ranking for more general keywords like "glue" is unimportant, too. So I thought about creating a single-page website, that writes about spray adhesives, the pros & cons of every manufacturer, and shows the best discounts for spray adhesives. Each section can be accessed through a top-navigation, that links via anchors to the individual sections. The page will be updated every day On the other hand, i could create a blog and write an article for every specific spray adhesive. So I would have a home page that lists the latest articles for every product, with titles like "3M spray adhesive CreativeMount", "3M spray adhesive SprayMount", "Tesa Spray adhesive" ... I will write one article every day What do you think would be the better strategy? Is there a risk to create competing articles for the keyword "spray adhesive" and thus rank lower if I go with the blog strategy? On the other hand, does google rate singe-page websites lower, because google thinks those websites are less valuable than websites with many pages for the same topic? Thank you ver much for you help in advance!
On-Page Optimization | | MGMT0