Imbalanced Datasets

Explore the fundamentals of imbalanced data, along with its challenges and the best methods to deal with it.

What is an imbalanced dataset?

An imbalanced dataset is one in which the samples are distributed unequally across classes: one or more classes contain many more samples than the others. The image below illustrates an imbalanced dataset with an unequal distribution of samples in classes A and B. Class B contains more samples than class A.
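
The idea can also be checked numerically. The following is a minimal sketch, assuming a hypothetical list of labels with 90 samples in class B and 10 in class A; these counts and the helper names `class_distribution` and `imbalance_ratio` are made up for illustration and are not part of the original lesson. It reports each class's share of the dataset and the ratio between the largest and smallest class.

```python
from collections import Counter

# Hypothetical labels for illustration only: 90 samples of class B, 10 of class A.
labels = ["B"] * 90 + ["A"] * 10


def class_distribution(labels):
    """Return each class's share of the dataset as a fraction of all samples."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: n / total for cls, n in counts.items()}


def imbalance_ratio(labels):
    """Return the largest class count divided by the smallest class count."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


distribution = class_distribution(labels)
ratio = imbalance_ratio(labels)

print(distribution)  # share of each class
print(ratio)         # 1.0 means perfectly balanced; larger means more imbalanced
```

A ratio of 1.0 indicates a perfectly balanced dataset, and the further the ratio rises above 1.0, the more one class dominates. There is no universal cutoff for calling a dataset imbalanced; whether a given ratio is a problem depends on the task.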
