Post-Layer Normalization and Sublayer 2: Feedforward Network
Learn about how post-layer normalization is performed along with the components of the feedforward network.
We'll cover the following
Layer normalization will now process the attention sublayer.
Post-layer normalization
Each attention sublayer and each feedforward sublayer of the transformer is followed by post-layer normalization (Post-LN):
Get hands-on with 1400+ tech skills courses.