...


Part-of-Speech Tagging Using the Viterbi Algorithm

Discover how the Viterbi algorithm performs POS tagging.

Introduction to the Viterbi algorithm

A hidden Markov model (HMM) can tell us the most likely part of speech for a single word. For a sentence, however, we need the most likely sequence of part-of-speech tags rather than a tag for one word. For this task, we can use the Viterbi algorithm, a dynamic programming algorithm that finds the most likely sequence of hidden states, also called the Viterbi path. It does this by systematically evaluating every possible state of our HMM at each step in the sequence of words, and then tracing back through these calculations to recover the most probable sequence of states.

The algorithm can be split into three steps:

  • Initialization step

  • Forward pass

  • Backward pass

It uses the transition probabilities and emission probabilities from the HMM to calculate two matrices:

  • The matrix C (maximal probabilities) holds the intermediate optimal probabilities.
  • The matrix D (maximal transition path) holds the indices of the visited states.

Both are of size n × k, where n is the number of tags (rows) and k is the number of words in the sequence (columns).
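As a rough sketch of this bookkeeping (assuming NumPy; the dimensions match the example we set up below), the two matrices can be allocated like this:

```python
import numpy as np

n = 3  # number of tags: Noun, Verb, Adjective
k = 4  # number of words in the sequence

# C[i, j]: highest probability of any tag sequence that ends in tag i at word j
C = np.zeros((n, k))

# D[i, j]: index of the previous tag on that best path, used for backtracking
D = np.zeros((n, k), dtype=int)
```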

Viterbi setup

To explain the Viterbi algorithm, let's set up a small HMM:

Given a small set of part-of-speech tags: “Noun”, “Verb”, and “Adjective”, we want to find the most likely sequence of POS tags for the sentence "The brown fox jumps." We will also add an “Initial” state to represent the start of the sentence.

Here is our HMM's transition matrix (A):

Transition Matrix (A)

              Noun    Verb    Adjective
  Initial     0.5     0.2     0.3
  Noun        0.3     0.5     0.2
  Verb        0.2     0.1     0.7
  Adjective   0.4     0.3     0.3

Similarly, the emission matrix (B) can be given as follows:

Emission Matrix (B)

              The     brown   fox     jumps
  Noun        0.5     0.1     0.4     0
  Verb        0.1     0.1     0.1     0.7
  Adjective   0.2     0.6     0.1     0.1
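Before walking through the algorithm, here is a minimal sketch of this setup in Python with NumPy. The names `A`, `B`, `tags`, and `words`, as well as the row and column orderings, are just our conventions for this example:

```python
import numpy as np

tags = ["Noun", "Verb", "Adjective"]
words = ["The", "brown", "fox", "jumps"]

# Transition matrix A: rows are the current state (Initial, Noun, Verb,
# Adjective), columns are the next tag (Noun, Verb, Adjective).
A = np.array([
    [0.5, 0.2, 0.3],   # Initial
    [0.3, 0.5, 0.2],   # Noun
    [0.2, 0.1, 0.7],   # Verb
    [0.4, 0.3, 0.3],   # Adjective
])

# Emission matrix B: rows are tags, columns are the words of our sentence.
B = np.array([
    [0.5, 0.1, 0.4, 0.0],   # Noun
    [0.1, 0.1, 0.1, 0.7],   # Verb
    [0.2, 0.6, 0.1, 0.1],   # Adjective
])
```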

With this setup, let's use the Viterbi algorithm to find the most likely sequence of POS tags.

Initializing the maximal probabilities matrix

During the initialization step, we fill in the first column of both our C and D matrices. The first column of C represents the probability of transitioning from our “Initial” state to each possible tag for the first word. For this, we simply take our “Initial” state probabilities of each state from the first row of matrix ...
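The paragraph above is cut off, but the initialization it begins to describe can be sketched as follows, continuing the NumPy snippets above. Note that multiplying in the emission probabilities of the first word is our assumption based on the standard Viterbi initialization; the text so far only mentions the transition probabilities:

```python
# Initialization: first row of A ("Initial" -> tag) times each tag's
# emission probability for the first word, "The".
C[:, 0] = A[0, :] * B[:, 0]
# C[:, 0] = [0.5*0.5, 0.2*0.1, 0.3*0.2] = [0.25, 0.02, 0.06]

# No real predecessor exists for the first word, so D's first column is 0.
D[:, 0] = 0
```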