How to Partition Secondary Indexes

Understand how to partition secondary indexes in databases by exploring two main strategies: partition by document, where each partition contains local indexes, and partition by term, where indexes are globally partitioned. Learn the advantages and disadvantages of each method, focusing on their impact on read and write operations.

We'll cover the following...

Introduction
Partition by document
Partition by term

Introduction

A database index is an additional data structure that allows us to locate the required data quickly without going through the entire dataset in the database. A secondary index is a mechanism to efficiently access records in a database through attributes other than the primary key. While the lookup by primary key always returns a single record, a query on the secondary index can return multiple records.

Partitioning a database with secondary indexes is inherently complex, as the partitioning strategy applies both to the primary dataset and the secondary index.

Broadly speaking, there are two strategies to partition the secondary index:

Partition by document
Partition by term

Partition by document

In the partition by document strategy, every partition acts as an independent database on its own. This is because every partition hosts both the primary dataset and its secondary indexes.

1.Introduction

2.Taxonomy of Databases

3.Database Architecture

4.Data Structures used in Databases

5.Disk Layout

6.Database Index

7.Transaction

8.Replication

9.Partitioning

10.Concurrency Controls

11.Consistency Models

12.Consensus

13.Common Problems Associated with Distributed Databases

14.Conclusion

Assessment

How to Partition Secondary Indexes

Introduction

Partition by document