Image Segmentation Using Transformers
Discover transformer applications in semantic segmentation and explore the SETR and Segmenter architectures.
Let's explore the application of transformers to semantic segmentation. Traditional encoder-decoder architectures pose computational challenges when attention is introduced, so we'll examine how transformer-based designs make image segmentation practical.
Encoder-decoder architecture with self-attention
In an encoder-decoder setup, replacing the encoder block with a self-attention mechanism is a viable option. However, the computational cost of attending over every pixel is a concern. Two remedies were discussed: parallelizing the computation with multi-head attention, and operating on image patches (visual "words"), as in the vision transformer (ViT). A sketch of the patch-based idea follows.
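Below is a minimal PyTorch sketch of this idea, assuming a ViT-style setup (the class name `PatchEmbedding`, the 16x16 patch size, and the 768-dimensional tokens are illustrative choices, not a specific model's code). Attending over patch tokens instead of raw pixels shrinks the attention sequence dramatically:

```python
import torch
import torch.nn as nn

# Sketch only: 16x16 patches reduce the attention sequence for a 224x224 image
# from 224*224 = 50,176 pixels to (224/16)^2 = 196 tokens.
class PatchEmbedding(nn.Module):
    def __init__(self, img_size=224, patch_size=16, in_ch=3, dim=768):
        super().__init__()
        # A strided conv splits the image into non-overlapping patches
        # and projects each patch to a dim-dimensional token.
        self.proj = nn.Conv2d(in_ch, dim, kernel_size=patch_size, stride=patch_size)
        num_patches = (img_size // patch_size) ** 2
        self.pos = nn.Parameter(torch.zeros(1, num_patches, dim))  # positional embeddings

    def forward(self, x):                                  # x: (B, 3, 224, 224)
        tokens = self.proj(x).flatten(2).transpose(1, 2)   # (B, 196, 768)
        return tokens + self.pos

patches = PatchEmbedding()(torch.randn(1, 3, 224, 224))
attn = nn.MultiheadAttention(embed_dim=768, num_heads=8, batch_first=True)
out, _ = attn(patches, patches, patches)  # self-attention over 196 tokens, not 50,176 pixels
print(out.shape)                          # torch.Size([1, 196, 768])
```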
Architectures combining approaches
Several architectures seamlessly integrate both approaches. Let's examine two notable models: the SEgmentation TRansformer (SETR) and Segmenter.
SETR architecture
The SETR model is a semantic segmentation transformer that divides the image into patches. While it isn't a pure transformer model, its encoder operates on image patch embeddings with positional embeddings, using self-attention to build the image's encoded representation. The decoder, by contrast, is a conventional convolutional one; the "SETR-Naive" variant, for example, applies direct upsampling.
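As a rough illustration of the SETR-Naive idea, here is a hedged PyTorch sketch (the head's structure and names such as `NaiveUpsampleHead` are assumptions for illustration, not the paper's implementation): the encoder's patch tokens are reshaped back into a 2D feature map, projected to class logits, and bilinearly upsampled straight to the input resolution.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative SETR-Naive-style head: conv projection + direct upsampling.
class NaiveUpsampleHead(nn.Module):
    def __init__(self, dim=768, num_classes=19, grid=14):
        super().__init__()
        self.grid = grid  # 14x14 patch grid for a 224x224 image with 16x16 patches
        self.classify = nn.Sequential(
            nn.Conv2d(dim, dim, kernel_size=1), nn.BatchNorm2d(dim), nn.ReLU(),
            nn.Conv2d(dim, num_classes, kernel_size=1),
        )

    def forward(self, tokens, out_size=(224, 224)):        # tokens: (B, 196, 768)
        B, N, D = tokens.shape
        fmap = tokens.transpose(1, 2).reshape(B, D, self.grid, self.grid)
        logits = self.classify(fmap)                       # (B, classes, 14, 14)
        return F.interpolate(logits, size=out_size,        # direct upsampling
                             mode="bilinear", align_corners=False)

head = NaiveUpsampleHead()
masks = head(torch.randn(2, 196, 768))
print(masks.shape)  # torch.Size([2, 19, 224, 224])
```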
Segmenter architecture
Another architecture, the Segmenter, adopts a similar segmentation transformer approach but integrates a transformer decoder.
The encoder processes image patches with a transformer, while the decoder employs a transformer mechanism with object queries, resembling the detection transformer (DETR). This enables pixel-level class predictions, which are then upsampled to the final output resolution.
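The following is a simplified, hedged sketch of a Segmenter-style mask decoder in PyTorch (the class and variable names, and the use of a plain transformer over the concatenated sequence, are illustrative assumptions): learnable class queries are processed jointly with the patch tokens, and each patch token is scored against each class embedding to form per-class masks, which are then upsampled.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Simplified mask decoder: class queries attend jointly with patch tokens,
# then a scaled dot product between patches and queries yields class masks.
class MaskTransformerDecoder(nn.Module):
    def __init__(self, dim=768, num_classes=19, depth=2):
        super().__init__()
        self.cls_queries = nn.Parameter(torch.randn(1, num_classes, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=8, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=depth)

    def forward(self, patch_tokens, grid=14, out_size=(224, 224)):
        B, N, D = patch_tokens.shape
        queries = self.cls_queries.expand(B, -1, -1)
        # Process patch tokens and class queries together in one sequence.
        x = self.blocks(torch.cat([patch_tokens, queries], dim=1))
        patches, classes = x[:, :N], x[:, N:]
        # Per-patch class scores via scaled dot product with class embeddings.
        logits = patches @ classes.transpose(1, 2) / D ** 0.5  # (B, N, classes)
        logits = logits.transpose(1, 2).reshape(B, -1, grid, grid)
        return F.interpolate(logits, size=out_size,
                             mode="bilinear", align_corners=False)

dec = MaskTransformerDecoder()
out = dec(torch.randn(2, 196, 768))
print(out.shape)  # torch.Size([2, 19, 224, 224])
```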
Panoptic segmentation
Detection transformers extend naturally to panoptic segmentation. Object queries provide ...