String Reconstruction With Overlap Graph: The Genome Vanishes

Explore how to find the disappeared genome path.

The genome can still be traced in the graph in Figure 3.7 by following the horizontal path from TAA to GTT. In genome sequencing, however, we don’t know in advance how to order the reads correctly. We’ll therefore arrange the 3-mers lexicographically, which produces the overlap graph shown in the figure below. The genome path has disappeared!
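To make the reordering concrete, here is a minimal Python sketch of the step described above: it sorts the 3-mers lexicographically and connects each one to every 3-mer whose prefix matches its suffix. The function names `composition` and `overlap_graph` are hypothetical, and the example genome `TAATGCCATGGGATGTT` is an assumption chosen only because it starts with TAA and ends with GTT; substitute the genome from the lesson's figure if it differs. Nodes are identified by their position in the sorted list because the same 3-mer can occur more than once.

```python
from collections import defaultdict


def composition(text, k):
    """Return all k-mers of text in lexicographic order (repeats kept)."""
    return sorted(text[i:i + k] for i in range(len(text) - k + 1))


def overlap_graph(kmers):
    """Build an adjacency list over positions in kmers.

    There is an edge i -> j when the last k-1 symbols of kmers[i] equal
    the first k-1 symbols of kmers[j]. A node is not connected to itself.
    """
    by_prefix = defaultdict(list)
    for j, kmer in enumerate(kmers):
        by_prefix[kmer[:-1]].append(j)

    graph = {}
    for i, kmer in enumerate(kmers):
        targets = by_prefix.get(kmer[1:], [])
        graph[i] = [j for j in targets if j != i]
    return graph


if __name__ == "__main__":
    # Assumed example genome, not taken from the lesson text.
    genome = "TAATGCCATGGGATGTT"
    kmers = composition(genome, 3)
    graph = overlap_graph(kmers)
    for i, targets in graph.items():
        print(kmers[i], "->", [kmers[j] for j in targets])
```

In the sorted order, TAA no longer sits at the left end of the node list and GTT no longer sits at the right end, so the genome is no longer a visible left-to-right path. It still exists in the graph as a path that visits every node exactly once.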
