{"id":3850,"date":"2021-05-09T17:10:04","date_gmt":"2021-05-09T17:10:04","guid":{"rendered":"https:\/\/vendlab.com\/?p=3850"},"modified":"2022-01-01T20:19:00","modified_gmt":"2022-01-01T20:19:00","slug":"search-engine-optimisation-seo-website-crawling","status":"publish","type":"post","link":"https:\/\/vendlab.com\/search-engine-optimisation-seo-website-crawling\/","title":{"rendered":"Search Engine Optimisation (SEO): Website Crawling"},"content":{"rendered":"
As we have discussed previously, producing search engine results involves several distinct processes, and many factors must be considered to build and maintain a high performing website.<\/p>
Before your website can appear in the search engine results, it must appear in the search engine’s index. The first task of optimising your site should be to establish if any issues prevent your site from being fully indexed. Many sites are constructed in a way that prevents search engines from indexing some or all the pages on the site.<\/p>
It is essential to establish the proportion of your site’s pages that are indexed. Do this by comparing the number of pages on your website (as measured by your content management system) with the number appearing in the search engines index. To find the number of pages from your website which Google has indexed, consult the Google Search Console (see below). To get an estimate to the proportion of indexed pages, divide the number of indexed pages by the total number of pages.<\/p>
It takes time for a search engine to refresh its index, and so if your site is large or often changes, 100% inclusion is unlikely. However, if the number of pages indexed is way below the expect this should be investigated (see below). If your site is not appearing at all, then there could be a few reasons for this:<\/p>
Google has created a suite of tools called Google Search Console (GSC) with the mission of ‘helping you measure your site’s search traffic and performance, fix issues and make your site shine in Google Search results. These tools supplied include:<\/p>
Submitting a Google sitemap is crucial to improve your indexing and be a priority soon after a site launch. These will be generated automatically from all decent CMS and eCommerce systems and can be quickly submitted to GSC.<\/p>
If your site is not being indexed or has a low percentage of indexed pages, one of the following problems may be present:<\/p>
There are some good reasons why you would not want a search engine to index sections of your site, for example, admin and checkout pages and duplicate content.<\/p>
Robots.txt files can be found in the root directory of each website (e.g., vendlab.com\/robots.txt) and tells Google which parts of your site it should and should not crawl. A problem with the robots.txt may cause Google to stop crawling your site.<\/p>
Your content could be inaccessible to search engine crawlers for several reasons:<\/p>