...

/

Sequence Alignment is the Manhattan Tourist Problem in Disguise

Sequence Alignment is the Manhattan Tourist Problem in Disguise

Let’s relate sequence alignment to the Manhattan Tourist Problem.

We'll cover the following...

Sequence comparison as a graph

In the figure below, we add two arrays of integers to an alignment of ATGTTATA and ATCGTCC. The array [0 1 2 2 3 4 5 6 7 8] holds the number of symbols of ATGTTATA used up to a given column in the alignment. Similarly, the array [0 1 2 3 4 5 5 6 6 7] holds the number of symbols of ATCGTCC used up to a given column. In figure below, we add a third array [↘ ↘ → ↘ ↘ ↓ ] recording whether each column represents a match/mismatch (↘/), an insertion (), or a deletion ().

This third array corresponds to a path from source to sink in an 8×78 \times 7 rectangular grid, shown in the figure below (left). The i-th node of this path is made up of the i-th element of [0 1 2 2 3 4 5 6 7 8] and the i-th element of [0 1 2 3 4 5 5 6 6 7]: ...