Indexing in a Distributed Search

We’ll first describe what indexing is, and then we’ll make our way toward distributing indexes over many nodes.

Indexing

Indexing is the organization and manipulation of data that’s done to facilitate fast and accurate information retrieval.

Build a searchable index

The simplest way to build a searchable index is to assign a unique ID to each document and store it in a database table, as shown in the following table. The first column in the table is the ID of the text and the second column contains the text from each document.