Imbalanced Dataset

Learn how to counter imbalanced data when using PyTorch's binary cross-entropy loss.

Introduction to the imbalanced dataset

In our dummy example with two data points, we had one of each class, positive and negative, so the dataset was perfectly balanced. Let us create another dummy example with an imbalance by adding two extra data points to the negative class. For the sake of simplicity, and to illustrate a quirk in the behavior of BCEWithLogitsLoss, we will give those two extra points the same logit as the existing negative data point. It looks like this:

import math
import torch

# Helper from earlier in the course, restated so this block runs standalone:
# it converts a probability into a logit, log(p / (1 - p))
def log_odds_ratio(prob):
    return math.log(prob / (1 - prob))

logit1 = log_odds_ratio(.9)   # ~2.1972
logit2 = log_odds_ratio(.2)   # ~-1.3863
dummy_imb_labels = torch.tensor([1.0, 0.0, 0.0, 0.0])
dummy_imb_logits = torch.tensor([logit1, logit2, logit2, logit2])
print(dummy_imb_labels, dummy_imb_logits)

Clearly, this is an imbalanced dataset. There are three times more data points in the negative class than in the positive one.
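Before compensating for anything, it helps to have a baseline: the loss on this imbalanced dataset with a plain, unweighted BCEWithLogitsLoss. The sketch below is an illustration rather than the lesson's verbatim code (the variable names are our own), and it reuses the dummy_imb_logits and dummy_imb_labels tensors defined above.

import torch
import torch.nn as nn

# Unweighted baseline on the imbalanced data (continues from the block above)
loss_fn = nn.BCEWithLogitsLoss()
baseline_loss = loss_fn(dummy_imb_logits, dummy_imb_labels)
print(baseline_loss)  # tensor(0.1937): mean of -log(.9) and three copies of -log(.8)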

The pos_weight argument

Now, let us turn to the pos_weight argument of BCEWithLogitsLoss. To compensate for the imbalance, one can set this weight equal to the ratio of negative to positive examples:

$$pos\_weight = \dfrac{\#\,\text{points in negative class}}{\#\,\text{points in positive class}}$$

In our imbalanced dummy example, the result would be 3.0. This way, every point in the positive class would have its corresponding loss multiplied by three. Since there is a single label for each data point (c = 1), ...
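Putting the pieces together, here is a sketch of how this might look in code. It again reuses the dummy tensors from above; computing pos_weight directly from the labels, and the variable names, are our own choices rather than the lesson's verbatim code.

import torch
import torch.nn as nn

# Ratio of negative (3) to positive (1) examples in the dummy data
n_neg = (dummy_imb_labels == 0).sum().float()
n_pos = (dummy_imb_labels == 1).sum().float()
pos_weight = n_neg / n_pos  # tensor(3.)

loss_fn_imb = nn.BCEWithLogitsLoss(pos_weight=pos_weight)
weighted_loss = loss_fn_imb(dummy_imb_logits, dummy_imb_labels)
print(weighted_loss)  # tensor(0.2464)

The printed value works out to (3 · 0.1054 + 3 · 0.2231) / 4 ≈ 0.2464: the single positive term (0.1054) is tripled by pos_weight, the three negative terms contribute 0.2231 each, and, with the default reduction='mean', PyTorch divides by the number of data points (four) rather than by the sum of the weights (six, which would give ≈ 0.1643). This is presumably the quirk that giving every negative point the same logit makes easy to spot.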
