Yandex Webmaster Tools lets webmasters analyze the pages that are not included in the Yandex search index. The tool helps you understand the following:
- Which pages were not indexed, along with their HTTP error codes
- Which internal and external pages link to each error page
- How to set the level of importance for each error code
This article explains how to use this tool step by step.
Accessing Excluded Pages Option
To use this tool, you must first add and verify your site in Yandex Webmaster Tools.
Log in to your account and select the site you want to analyze. Go to the “Excluded Pages” option under the “Indexing” section. Yandex shows the pages from your site that YandexBot did not index, grouped into categories.
Excluded pages are classified under the following three categories:
HTTP Error Pages:
All pages that returned an HTTP error during crawling are shown here along with the error code. For example, “Page Not Found” pages are shown with the error code 404.
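To make the grouping by error code concrete, here is a minimal Python sketch that sorts a list of crawled URLs into buckets by HTTP error status. The sample URLs and the helper names `group_error_pages` and `describe` are hypothetical and are not part of Yandex Webmaster Tools; the sketch only imitates the kind of report described above.

```python
from collections import defaultdict
from http import HTTPStatus


def group_error_pages(crawl_results):
    """Group URLs by HTTP error status (400 and above), skipping other responses."""
    grouped = defaultdict(list)
    for url, status in crawl_results:
        if status >= 400:
            grouped[status].append(url)
    return dict(grouped)


def describe(status):
    """Return the standard reason phrase for a status code, if Python knows it."""
    try:
        return HTTPStatus(status).phrase
    except ValueError:
        return "Unknown status"


# Hypothetical crawl results: (URL, status code returned to the crawler).
sample = [
    ("https://example.com/", 200),
    ("https://example.com/old-page", 404),
    ("https://example.com/private", 403),
    ("https://example.com/missing", 404),
]

errors = group_error_pages(sample)
for status in sorted(errors):
    print(status, describe(status), errors[status])
```

Pages that responded with 200 are left out, so only the URLs that would need attention remain in the result.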
Blocked By Robots.txt:
URLs blocked by the robots.txt file are shown under this category.
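If you want to predict which URLs may land in this category, the following sketch uses Python's standard `urllib.robotparser` module to test URLs against a set of robots.txt rules. The rules, the URLs, and the `blocked_urls` helper are made up for illustration, and Python's parser only approximates how YandexBot itself interprets robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used only for this example.
ROBOTS_TXT = """\
User-agent: Yandex
Disallow: /admin/
Disallow: /tmp/

User-agent: *
Disallow: /private/
"""


def blocked_urls(robots_txt, user_agent, urls):
    """Return the URLs that the given user agent may not fetch under these rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


urls = [
    "https://example.com/admin/login",
    "https://example.com/private/notes",
    "https://example.com/blog/post",
]

print(blocked_urls(ROBOTS_TXT, "YandexBot", urls))
print(blocked_urls(ROBOTS_TXT, "OtherBot", urls))
```

In this sketch the group addressed to Yandex takes precedence for YandexBot, so the `/private/` rule from the `*` group applies only to other crawlers.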
Not Supported Pages:
Page formats that YandexBot does not support are shown here. XML files, including your Sitemap, appear here with an invalid file format; you do not need to take any action for XML files shown in this category.
Setting Level of Importance for Error Codes
YandexBot cannot tell whether a page is intentionally blocked by the site owner or is returning an error only temporarily. For this reason, Yandex lets you set the level of importance for indexing for each error code. Click the “Settings” link to open the error code page, then click the “eye” icon to reduce the importance of any error code.
The option can also be accessed through the “Settings” tab of your Webmaster Tools account.
HTTP Error Status
A typical error in the HTTP status section is the “Page Not Found” error with the 404 status code. Click the link to see a detailed list of the URLs that returned a 404 error code when YandexBot crawled them.
You can see a graphical representation as well as each individual URL with a 404 error, along with its last crawl date. The “Links to page” column shows the internal or external pages linking to the error page, if applicable. Click the box icon in the “Links to page” column to see all the links pointing to the error URL.
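The idea behind the “Links to page” column can be sketched in a few lines of Python: scan the HTML of known pages and report which ones contain a link to the error URL. The page contents and the names `LinkCollector` and `pages_linking_to` are hypothetical; this is an illustration of the concept, not how Yandex builds its report.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute link targets from the <a href="..."> tags of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative links against the URL of the page itself.
                    self.links.add(urljoin(self.base_url, value))


def pages_linking_to(error_url, pages):
    """Return the URLs of pages whose HTML links to error_url, sorted."""
    sources = []
    for page_url, html in pages.items():
        collector = LinkCollector(page_url)
        collector.feed(html)
        if error_url in collector.links:
            sources.append(page_url)
    return sorted(sources)


# Hypothetical site content: page URL -> HTML source.
pages = {
    "https://example.com/": '<a href="/old-page">Old</a> <a href="/about">About</a>',
    "https://example.com/about": '<a href="/">Home</a>',
    "https://example.com/blog/": '<a href="../old-page">Old</a>',
}

print(pages_linking_to("https://example.com/old-page", pages))
```

Once you know which pages link to a 404 URL, you can update or remove those links, or redirect the missing URL.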
To troubleshoot excluded pages, you need to understand the response YandexBot received during crawling. Yandex offers a tool called “Server Response Check” that lets webmasters fetch a page as YandexBot and check the response received.
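As a rough local approximation of such a check, the sketch below requests a page with a YandexBot-style User-Agent header and returns the HTTP status code. The User-Agent string is an assumption about what YandexBot sends, and the function names are hypothetical. A request from your own machine does not come from Yandex's network, so servers that filter by IP address may respond differently than they do to the real crawler.

```python
import urllib.error
import urllib.request

# Assumed YandexBot-style User-Agent string, used only for illustration.
BOT_USER_AGENT = "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"


def build_bot_request(url):
    """Build a GET request that identifies itself with the bot User-Agent."""
    return urllib.request.Request(url, headers={"User-Agent": BOT_USER_AGENT})


def check_response(url, timeout=10):
    """Fetch the URL and return the HTTP status code the server responds with.

    urlopen follows redirects, so a redirected URL reports the final status.
    """
    request = build_bot_request(url)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx and 5xx responses are raised as exceptions; report their code.
        return error.code


# Example usage (performs a real network request, so it is left commented out):
# print(check_response("https://example.com/old-page"))
```

Comparing this status code with the one listed under Excluded Pages can show whether an error is still present or was only temporary.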