How Google Crawl, Index and Serve Pages

3


People use Google to search information on internet but how does Google find information for you?  When you perform a search on Google, Google programs performs a lookup in their indexes for most relevant information and show these relevant information back to you. Google uses these three key process to deliver search results to you

Crawling

Crawling is the process by which Googlebot discovers new and updated web pages or websites. Google uses large computing power to crawl billions of web pages. Googlebot uses algorithmic process to determine which web sites to crawl, how often to crawl and how many pages have to crawl.

Google’s crawl process begins with a list of web page URLs, generated from previous crawl processes, and augmented with Sitemap data provided by webmasters. As Googlebot visits each of these websites it detects links on each page and adds them to its list of pages to crawl. New sites, changes to existing sites, and dead links are noted and used to update the Google index.

Google doesn’t accept payment to crawl a site more frequently, and we keep the search side of our business separate from our revenue-generating AdWords service.

Indexing

To process billions of pages crawled by Googlebot Google uses indexing algorithms to organize crawled content. Then Google process information included in key content tags  for example Tile tag or alternative tags of images. Google cannot process the content of some rich media files or dynamic pages.

Serving Results

When a users enters a search query in Google search box, Google machines search the index for matching pages and return web pages  that are most relevant to that search term. Content relevancy is determined by over 200 factors. Google works hard to improve the user experience by identifying spam links and other practices that negatively impact search results. The best types of links are those that are given based on the quality of your content.

Related Articles



3 Responses

  1. Hi ajay,
    Nice article… Would like to add about the importance of keywords and the excellent tools such as keyword selector provided by Google..Yahoo however seem partial towards paid users.. Also Google has more preference towards open source directory dmoz.org..
    Nice blog will put up on blog roll..
    tc
    Dheeraj Suthar (dheerajsuthar[dot]com)
    aka Sherkhan
    (archives on: theanarchia[dot]wordpress[dot]com)

Leave a Reply