Distillation of Embedding and Prediction Layer

Learn about the distillation of the embedding and prediction layers of TinyBERT.

Embedding layer distillation

In embedding layer distillation, we transfer knowledge from the embedding layer of the teacher to the embedding layer of the student. Let $E^S$ denote the embedding of the student and $E^T$ denote the embedding of the teacher. We train the network to perform embedding layer distillation by minimizing the mean squared error (MSE) between the student embedding $E^S$ and the teacher embedding $E^T$, as shown in the following:

$$L_{\text{embedding}} = \text{MSE}(E^S W_e, E^T)$$

Here, $W_e$ is a learnable matrix that projects the student embedding into the same space as the teacher embedding. This projection is needed because the student typically uses a smaller embedding dimension than the teacher, so the two embeddings cannot be compared directly.
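The following is a minimal PyTorch sketch of this loss. The dimensions (768 for the teacher, 312 for the student) follow the typical BERT-base/TinyBERT setup, and the random tensors are stand-ins for the actual embedding-layer outputs of the two models:

```python
import torch
import torch.nn as nn

# Assumed dimensions: the teacher (e.g., BERT-base) uses 768-dim embeddings,
# while the smaller student uses 312-dim embeddings (as in TinyBERT).
teacher_dim, student_dim = 768, 312
batch_size, seq_len = 8, 128

# Learnable projection W_e maps the student embedding into the
# teacher's embedding space so the two can be compared directly.
W_e = nn.Linear(student_dim, teacher_dim, bias=False)

mse = nn.MSELoss()

# Stand-ins for the embedding-layer outputs of the student and teacher.
E_student = torch.randn(batch_size, seq_len, student_dim)
E_teacher = torch.randn(batch_size, seq_len, teacher_dim)

# Embedding layer distillation loss: MSE(E^S W_e, E^T).
loss_embedding = mse(W_e(E_student), E_teacher)
loss_embedding.backward()  # gradients flow into W_e (and the student model)
```

In practice, `E_student` and `E_teacher` would come from forward passes of the student and teacher models on the same input batch, and this loss would be summed with the other TinyBERT distillation losses during training.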
