Visualizing Attention Patterns
Learn to visualize attention patterns to gain deeper insights into how they work.
Remember that we specifically defined a model called attention_visualizer to generate attention matrices? With the model trained, we can now look at these attention patterns by feeding data to the model. Here's how the model was defined:
attention_visualizer = tf.keras.models.Model(
    inputs=[encoder.inputs, decoder_input],
    outputs=[attn_weights, decoder_out]
)
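For context, a model of this shape can be wired up roughly as follows. This is a minimal sketch, not the course's actual architecture: the vocabulary sizes, layer dimensions, and the use of Keras's built-in `Attention` layer (called with `return_attention_scores=True`) are illustrative assumptions.

```python
import numpy as np
import tensorflow as tf

# Illustrative sizes; the real model's values will differ.
src_vocab, tgt_vocab, units = 100, 120, 32

# Encoder: embed source token IDs and run them through an LSTM.
encoder_input = tf.keras.Input(shape=(None,), name="encoder_input")
enc_emb = tf.keras.layers.Embedding(src_vocab, units)(encoder_input)
enc_out, enc_h, enc_c = tf.keras.layers.LSTM(
    units, return_sequences=True, return_state=True)(enc_emb)

# Decoder: embed target token IDs, initialize from the encoder state.
decoder_input = tf.keras.Input(shape=(None,), name="decoder_input")
dec_emb = tf.keras.layers.Embedding(tgt_vocab, units)(decoder_input)
dec_out = tf.keras.layers.LSTM(units, return_sequences=True)(
    dec_emb, initial_state=[enc_h, enc_c])

# Attention over encoder outputs; expose the score matrix as a model output.
context, attn_weights = tf.keras.layers.Attention()(
    [dec_out, enc_out], return_attention_scores=True)
concat = tf.keras.layers.Concatenate()([dec_out, context])
decoder_out = tf.keras.layers.Dense(tgt_vocab, activation="softmax")(concat)

attention_visualizer = tf.keras.models.Model(
    inputs=[encoder_input, decoder_input],
    outputs=[attn_weights, decoder_out])
```

Calling `predict()` on this model yields both the attention matrix (shaped `[batch, target_len, source_len]`) and the token probabilities, which is exactly what the visualization function below consumes.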
The get_attention_matrix_for_sampled_data() function
We’ll also define a function to get the processed attention matrix along with label data that we can use directly for visualization purposes:
def get_attention_matrix_for_sampled_data(attention_model, target_lookup_layer, test_xy, n_samples=5):
    test_x, test_y = test_xy
    # Pick n_samples random test examples
    rand_ids = np.random.randint(0, len(test_xy[0]), size=(n_samples,))
    results = []
    for rid in rand_ids:
        en_input = test_x[rid:rid+1]
        de_input = test_y[rid:rid+1, :-1]
        # Get the attention matrix and the predicted token probabilities
        attn_weights, predictions = attention_model.predict([en_input, de_input])
        predicted_word_ids = np.argmax(predictions, axis=-1).ravel()
        predicted_words = [
            target_lookup_layer.get_vocabulary()[wid] for wid in predicted_word_ids
        ]
        # Strip <pad> tokens from the encoder input and truncate at </s>
        clean_en_input = []
        en_start_i = 0
        for i, w in enumerate(en_input.ravel()):
            if w == '<pad>':
                en_start_i = i + 1
                continue
            clean_en_input.append(w)
            if w == '</s>':
                break
        # Truncate the predicted sequence at the first </s>
        clean_predicted_words = []
        for w in predicted_words:
            clean_predicted_words.append(w)
            if w == '</s>':
                break
        # Keep only the attention weights covering the cleaned-up tokens
        results.append({
            "attention_weights": attn_weights[
                0, :len(clean_predicted_words), en_start_i:en_start_i+len(clean_en_input)
            ],
            "input_words": clean_en_input,
            "predicted_words": clean_predicted_words
        })
    return results
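With these dictionaries in hand, each attention matrix can be rendered as a heatmap, with source words on one axis and predicted words on the other. The helper below is a sketch of one way to do this with matplotlib; the function name plot_attention_matrices and the styling choices are our own, not part of the course code.

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_attention_matrices(results):
    """Draw one heatmap per sampled sentence from the result dictionaries
    produced by get_attention_matrix_for_sampled_data()."""
    fig, axes = plt.subplots(1, len(results), figsize=(5 * len(results), 5))
    for ax, res in zip(np.atleast_1d(axes), results):
        # Rows are predicted (target) words, columns are input (source) words.
        ax.imshow(res["attention_weights"], cmap="viridis")
        ax.set_xticks(range(len(res["input_words"])))
        ax.set_xticklabels(res["input_words"], rotation=90)
        ax.set_yticks(range(len(res["predicted_words"])))
        ax.set_yticklabels(res["predicted_words"])
    fig.tight_layout()
    return fig
```

Bright cells in each heatmap indicate which source words the model attended to most strongly when producing a given target word.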
...