What Is a Transformer?
Get introduced to transformer-based machine learning models and their specific applications in natural language processing (NLP).
Transformer overview
The transformer is a deep learning model architecture introduced in the paper "Attention Is All You Need" (Vaswani et al., 2017).
Self-attention
Attention acts like a communication layer placed on top of the tokens in a text. It lets the model learn the contextual connections between words in a sentence and weigh the importance of different words within a sequence without using recurrence or convolution. Essentially, it encodes global information into the model so that it can be used in downstream predictions.
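To make the idea of weighing tokens against each other concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It rests on simplifying assumptions: the queries, keys, and values are the token embeddings themselves (a real transformer first applies learned linear projections), and the names `softmax`, `self_attention`, and `tokens`, along with the toy 2-dimensional embeddings, are hypothetical and chosen only for illustration.

```python
import math

def softmax(scores):
    """Turn a list of raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(embeddings):
    """Scaled dot-product self-attention with identity projections.

    Assumption: queries, keys, and values are the token embeddings
    themselves; a real transformer first applies learned linear maps.
    """
    dim = len(embeddings[0])
    outputs, all_weights = [], []
    for query in embeddings:
        # Compare this token against every token in the sequence.
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in embeddings]
        weights = softmax(scores)
        # Mix all token vectors, weighted by how relevant each one is.
        mixed = [sum(w * value[i] for w, value in zip(weights, embeddings))
                 for i in range(dim)]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional embeddings for a three-token sequence.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens)
```

Each row of `weights` sums to 1 and shows how strongly one token attends to every token in the sequence, itself included. Every output vector is a weighted mix of all token vectors, which is how information from the whole sequence reaches each position without recurrence or convolution.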
Self-attention works as follows: given a text, we first convert it from raw ...