
Is Attention All We Need?

Learn how attention mechanisms solve the limitations of sequential models like RNNs and LSTMs by enabling parallel processing and preserving sequence relationships. This lesson explains the core ideas behind transformers and provides a basic coding example to illustrate how attention scores help generate context-aware outputs in deep learning.

In 2017, Google researchers took attention mechanisms to a whole new level. Rather than relying on traditional recurrent connections, their Transformer architecture embraced pure attention.

The evolution of attention mechanisms

The sequence-to-sequence models we were familiar with used RNNs for both encoding and decoding. As previously mentioned, these models faced fundamental issues. First, the encoder's final hidden state could not hold enough information about the full input sequence. Moreover, they were slow: the entire input had to be processed sequentially to reach that final state before the first output token could even be generated.
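To make the bottleneck concrete, here is a minimal sketch (not taken from the lesson) of an RNN encoder loop in NumPy. All dimensions and weights are hypothetical; the point is that each hidden state depends on the previous one, so the loop cannot be parallelized, and the decoder only ever sees the single final vector.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d_in, d_h = 5, 4, 8          # hypothetical sizes
x = rng.normal(size=(seq_len, d_in))  # input token embeddings
W_xh = rng.normal(size=(d_in, d_h))   # input-to-hidden weights
W_hh = rng.normal(size=(d_h, d_h))    # hidden-to-hidden weights

h = np.zeros(d_h)                     # initial hidden state
for t in range(seq_len):              # steps must run one after another:
    h = np.tanh(x[t] @ W_xh + h @ W_hh)  # h_t depends on h_{t-1}

# The decoder would start from only this one vector -- the bottleneck.
print(h.shape)
```

Every token after the first must wait for its predecessor's hidden state, which is exactly the sequential dependency the next section examines.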

The challenge of sequential processing

Let's look at the encoder-decoder architecture to better understand the challenge of sequential processing.

Encoder-decoder architecture

For instance, all preceding tokens had to be processed sequentially before a single output token could be produced. This approach could not fully exploit parallel hardware such as GPUs. In contrast, models like ConvNets for images use parallel processing efficiently. Consequently, sequential ...