Google Search Console Coverage Error Fix for Hexo Blog
Introduction
In Google Search Console, I have been observing lots of “Submitted URL marked ‘noindex’” coverage errors for my Hexo blog website for a long period of time.
Recently, I realized that the submitted URLs are the ones that I submitted via sitemap, and the sitemap contains the URLs to pages that has <meta name="robots" content="noindex"> which explicitly tells the search engine crawler not to index the page.
This kind of error, in my opinion, should not be confusing to the search engine. I have submitted the same sitemap to Microsoft Bing but it did not throw me error on this. Anyhow, seeing these errors is annoying, let’s fix them.
Google Search Console Coverage Error Fix for Hexo Blog
Having noindex for the web pages that you don’t want the search to crawl is important. So we will not touch it. What we could do is to remove those web pages from the sitemap. In fact, the web pages that have noindex are not important at all.
My Hexo blog website has been using hexo-generator-sitemap to generate the sitemap. But I did not study how to configure the generation previously.
It turns out that the all the noindex errors come from the tags and the categories web pages, and hexo-generator-sitemap already allows the user to disable them in the sitemap.
We could add the following code in the Hexo blog website _config.yml file and we are done.
1 | # https://github.com/hexojs/hexo-generator-sitemap |
After resubmitting the new sitemap to Google Search Console and clicking “run validation”, we will see that the related errors have gone after a few days once the validation is complete.
Google Search Console Coverage Error Fix for Hexo Blog
https://leimao.github.io/blog/Hexo-Blog-Google-Search-Console-Coverage-Error/