String Reconstruction With Overlap Graph: From String To Graph

Let’s explore how a string can be represented using a directed graph.

From a string to a graph

Repeats in a genome necessitate some way of looking ahead to see the correct assembly in advance. Returning to our previous example, you may have already found that TAATGCCATGGGATGTT is a solution to the String Reconstruction Problem for the collection of fifteen 3-mers in the last section, as illustrated below.

Note that we use a different color for each interval of the string between occurrences of ATG.

Get hands-on with 1200+ tech skills courses.