What Is Partitioning in Databases?

Learn about partitioning in databases, a method for dividing massive datasets across several machines to scale out data storage and query processing. Understand the differences between vertical and horizontal scaling, advantages of partitioning for large datasets and high throughput, and strategies for uniform data distribution and replication. This lesson helps you grasp the core concepts and challenges in scaling databases by partitioning to effectively manage large and high-demand data environments.

We'll cover the following...

Introduction
Path to partitioning
Challenges of partitioning

Introduction

Partitioning a database is the process of breaking down a massive dataset into smaller datasets, called partitions, and distributing them across multiple host machines. Each host can hold more than one partition.
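As a rough illustration of this layout, the sketch below spreads a fixed number of smaller datasets over a few hosts in round-robin order. The host names, the count of eight, and the assign_partitions helper are all assumptions made for the example. Round-robin is only one simple placement policy, not the method of any particular database.

```python
from collections import defaultdict

def assign_partitions(num_partitions, hosts):
    """Place partition ids on hosts in round-robin order (an assumed, simple policy)."""
    placement = defaultdict(list)
    for partition_id in range(num_partitions):
        host = hosts[partition_id % len(hosts)]
        placement[host].append(partition_id)
    return dict(placement)

# Hypothetical cluster: 8 partitions spread over 3 hosts.
placement = assign_partitions(8, ["host-a", "host-b", "host-c"])
for host, partition_ids in placement.items():
    print(host, partition_ids)
```

Each host ends up holding two or three of the smaller datasets, which is the "multiple smaller datasets per host" arrangement described above.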

Every record in the database belongs to exactly one partition. Each partition acts as a small database of its own and can serve read and write operations independently. A query can either be routed to a single partition or scattered across multiple partitions, with the partial results combined afterward.
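The two query styles can be sketched with a toy in-memory store. This is a minimal sketch, assuming hash partitioning on the record key with four partitions. The put, get, and scan helpers are hypothetical names used only for illustration, and real systems may partition by key range or other schemes instead.

```python
import zlib

NUM_PARTITIONS = 4  # assumed partition count for the example

# Each partition is modelled as its own small key-value store.
partitions = [dict() for _ in range(NUM_PARTITIONS)]

def partition_for(key):
    """Map a key to exactly one partition using a stable hash (assumed scheme)."""
    return zlib.crc32(key.encode("utf-8")) % NUM_PARTITIONS

def put(key, value):
    """Write the record to the single partition that owns its key."""
    partitions[partition_for(key)][key] = value

def get(key):
    """Targeted query: only the partition that owns the key is consulted."""
    return partitions[partition_for(key)].get(key)

def scan(predicate):
    """Scatter-gather query: ask every partition, then merge the partial results."""
    results = []
    for store in partitions:
        results.extend(value for value in store.values() if predicate(value))
    return results

for user_id, age in [("alice", 31), ("bob", 24), ("carol", 45), ("dave", 19)]:
    put(user_id, age)

print(get("carol"))                        # lookup served by one partition
print(sorted(scan(lambda age: age > 25)))  # filter fanned out to all partitions
```

A lookup by key touches one partition because the key determines the owner. A filter on a non-key attribute such as age has no such shortcut, so it has to visit every partition.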

There are two ways to scale a database:

Vertical scaling: Vertical scaling, also called scaling up, means upgrading the capacity of existing hardware by adding resources such as disk space, CPU, and memory. It can't continue indefinitely, because a single machine can only be upgraded up to the maximum CPU, memory, and other resources it supports. Vertical scaling is also expensive beyond a certain ...
