Assembling genomes: From Composition to Paired Composition
Let’s find out how composition can be converted to paired composition.
We'll cover the following
Notation of paired compositions
Given a string Text, a (k, d)-mer is a pair of k-mers in Text separated by distance d. We use the notation (Pattern | Pattern) to refer to a (k, d)-mer whose k-mers are Pattern and Pattern. For example, (AAT | TGG) is a (3, 4)-mer in TAATGCCATGGGATGTT. The (k, d)-mer composition of Text, denoted PairedComposition (Text), is the collection of all (k, d)-mers in Text (including repeated (k, d)-mers). For example, here’s PairedComposition(TAATGCCATGGGATGTT):
Get hands-on with 1400+ tech skills courses.