...

/

Introduction to Developing Search Engines Using Tries

Introduction to Developing Search Engines Using Tries

Get an introduction to using tries to design search engines.

Search engines

A search engine is a well-managed and sophisticated set of entangled programs that identifies and searches for preexisting information in a datastore and presents it to the user based on a matching criteria. Search engines provide easy access to the available information and are the most common mechanism used on the internet. 

How do search engines work?

Google and Bing are the most commonly used search engines for exploring the World Wide Web. Modern-day search engines perform the following operations: 

Crawling: Crawlers discover existing websites. A search engine maintains a list of available pages and constantly crawls the web for updated pages. This process is called URL discovery. Upon the discovery of a web page, the crawler determines its content. The search engine uses an algorithm to determine the frequency and list of pages to crawl. 

Deduplication: The search engines process the textual content crawled from a web page. The content is tagged with details and attributes called metadata that help the search engine understand the nature of the content. This also enables the search engine to separate duplicate pages and ...