The T5 Model

Learn about the structure and the features of the T5 model.

We'll cover the following

Now let’s look at the architecture of the T5 transformer model.

Architecture

Raffel et al. (2019) focused on designing a standard input format to obtain text output. The Google T5 team did not want to try new architectures derived from the original transformer, such as BERT-like encoder-only layers or GPT-like decoder-only layers. Instead, the team focused on defining NLP tasks in a standard format.

Get hands-on with 1200+ tech skills courses.