News Errors In Google Search Console
-
Years ago a site I'm working on was publishing news as one form of content on the site.
Since then, has stopped publishing news, but still has a q&a forum, blogs, articles... all kinds of stuff.
Now, it triggers "News Errors" in GWT under crawl errors. These errors are
"Article disproportionately short"
"Article fragmented" on some q&a forum pages
"Article too long" on some longer q&a forum pages
"No sentences found"
Since there are thousands of these forum pages and it's problem seems to be a news critique, I'm wondering what I should do about it. It seems to be holding these non-news pages to a news standard:
https://support.google.com/news/publisher/answer/40787?hl=en
For instance, is there a way and would it be a good idea to get the hell out of Google News, since we don't publish news anymore? Would there be possible negatives worth considering?
What's baffling is, these are not designated news urls. The ones we used to have were /news/title-of-the-story per...
https://support.google.com/news/publisher/answer/2481373?hl=en&ref_topic=2481296
Or, does this really not matter and I should just blow it off as a problem.
The weird thing is that we recently went from http to https and The Google News interface still has us as http and gives the option to add https, which I am reluctant to do sine we aren't really in the news business anymore.
What do you think I should do?
Thanks!
-
Update: 5 months later, the problem has long since gone away.
-
Thanks for the answers Matthew & Martijn. So, I'm going to go with what I have in place... 301s to the main forums page to pick up/recycle the incoming links to that section and removing the /news category from where Google News looks for news, but not leave the Google News program altogether right now.
Thanks, again!
-
I'd opt for 301s if there is any link equity on those pages worth pushing to the forum pages. If not, I'd robots.txt them out to save on wasted crawl.
-
HI Matthew,
Thanks for the insight. On your, "I think the important part is that Google doesn't suddenly crawl a bunch of 404s that they think are news pages of some importance" would you robots.txt out the old category they looked at (/news) or leave it and figure out via the mass of 301s and the removal from the url structure we told them to crawl, so that Google figures it out on it's own?
-
Hi there,
It wouldn't be a problem to get out of Google News, but by the same logic, it wouldn't hurt just to leave it alone and let them figure out you're not publishing news anymore. It won't affect your web search rankings, since you're not targeting traffic from the news onebox or news.google.com.
These old pages redirect to forum pages? I think the important part is that Google doesn't suddenly crawl a bunch of 404s that they think are news pages of some importance.
-
Possible, hard to guess what your current set-up looks like. What you could do alternatively is set up a robots.txt with Disallow statement that are only targeting the Google News bot instead of just the general Google Bot.
-
Hi Martijn,
We still have the old /news url still in Google News. I don't think we've ever submitted a news site map.
Is there any downside to deleting the url structure they're currently looking at, which does forward to our forum? Or, would it be better to just get out of Google News altogether, since we don't really have or publish news anymore? Is there a downside to that?
To summarize... delete the url they look to as news or get out of Google News altogether... what do you think?
Thanks!
-
Are you still submitting a Google News Sitemap to Google Search Console? Because usually that's the biggest reason where these errors are coming from because Google is picking up these kind of 'new' pages/articles as news content and then seeing it doesn't match with their guidelines.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How does Google handle fractions in titles?
Which is better practice, using 1/2" or ½"? The keyword research suggests people search for "1 2" with the space being the "/". How does Google handle fractions? Would ½ be the same as 1/2?
Intermediate & Advanced SEO | | Choice2 -
I'm noticing that URL that were once indexed by Google are suddenly getting dropped without any error messages in Webmasters Tools, has anyone seen issues like this before?
I'm noticing that URLs that were once indexed by Google are suddenly getting dropped without any error messages in Webmasters Tools, has anyone seen issues like this before? Here's an example:
Intermediate & Advanced SEO | | nystromandy
http://www.thefader.com/2017/01/11/the-carter-documentary-lil-wayne-black-lives-matter0 -
Ranking on google search
Hello Mozzers Moz On page grader shows A grade for the particular URL,but my page was not ranking on top 100 Google search. Any help is appreciated ,Thanks
Intermediate & Advanced SEO | | sobanadevi0 -
Incorrect URL shown in Google search results
Can anyone offer any advice on how Google might get the url which it displays in search results wrong? It currently appears for all pages as: <cite>www.domainname.com › Register › Login</cite> When the real url is nothing like this. It should be: www.domainname.com/product-type/product-name. This could obviously affect clickthroughs. Google has indexed around 3,000 urls on the site and they are all like this. There are links at the top of the page on the website itself which look like this: Register » Login » which presumably could be affecting it? Thanks in advance for any advice or help!
Intermediate & Advanced SEO | | Wagada0 -
Competitors Showing in Branded Search w/in Google FR
Hi All, When searching for our brand in Google France, I noticed that some of our major competitors show up beneath our Knowledge Graph listing. My managers and I are wondering if this is something Google just does as associated search or if there's a way that we can work around it. Thanks and please see attached image 🙂 2KYBeiT
Intermediate & Advanced SEO | | CSawatzky0 -
Robot.txt error
I currently have this under my robot txt file: User-agent: *
Intermediate & Advanced SEO | | Rubix
Disallow: /authenticated/
Disallow: /css/
Disallow: /images/
Disallow: /js/
Disallow: /PayPal/
Disallow: /Reporting/
Disallow: /RegistrationComplete.aspx WebMatrix 2.0 On webmaster > Health Check > Blocked URL I copy and paste above code then click on Test, everything looks ok but then logout and log back in then I see below code under Blocked URL: User-agent: * Disallow: / WebMatrix 2.0 Currently, Google doesn't index my domain and i don't understand why this happening. Any ideas? Thanks Seda0 -
Error 403
Hi SEOmoz community, Today, I checked the google webmaster tool of one of my clients, and ithere are 18 403 errors, I was wondering on how to fix those since it is the first time I come across these errors? How can I avoid that in the future? Thank you,
Intermediate & Advanced SEO | | Ideas-Money-Art0 -
Google +1 and Yslow
After adding Google's +1 script and call to our site (loading asynchronously), we noticed Yslow is giving us a D for not having expire headers for the following scripts: https://apis.google.com/js/plusone.js
Intermediate & Advanced SEO | | GKLA
https://www.google-analytics.com/ga.js
https://lh4.googleusercontent.com... 1. Is their a workaround for this issue, so expire headers are added to to plusone and GA script? Or, are we being to nit-picky about this issue?0