Influence of tree topology restrictions on the complexity of haplotyping with missing data

Michael Elberfeld*, Ilka Schnoor, Till Tantau

*Korrespondierende/r Autor/-in für diese Arbeit

Abstract

Haplotyping, also known as haplotype phase prediction, is the problem of predicting likely haplotypes based on genotype data. One fast haplotyping method is based on an evolutionary model in which a perfect phylogenetic tree is sought that explains the observed data. Unfortunately, when data entries are missing, which is often the case in laboratory data, the resulting formal problem ipph, which stands for incomplete perfect phylogeny haplotyping, is NP-complete. Even radically simplified versions, such as the restriction to phylogenetic trees consisting of just two directed paths from a given root, are still NP-complete; but here, at least, a fixed-parameter algorithm is known. Such drastic and ad hoc simplifications turn out to be unnecessary to make ipph tractable: we present the first theoretical analysis of a parameterized algorithm, which we develop in the course of the paper, that works for arbitrary instances of ipph. This tractability result is optimal insofar as we prove ipph to be NP-complete whenever any of the parameters we consider is not fixed, but part of the input.

OriginalspracheEnglisch
ZeitschriftTheoretical Computer Science
Jahrgang432
Seiten (von - bis)38-51
Seitenumfang14
ISSN0304-3975
DOIs
PublikationsstatusVeröffentlicht - 11.05.2012

Fingerprint

Untersuchen Sie die Forschungsthemen von „Influence of tree topology restrictions on the complexity of haplotyping with missing data“. Zusammen bilden sie einen einzigartigen Fingerprint.
  • Komplexität von Haplotypisierungsproblemen

    Tantau, T. (Projektleiter*in (PI)), Schnoor, I. (Beteiligte*r Wissenschaftler*in), Elberfeld, M. (Beteiligte*r Wissenschaftler*in), Kuczewski, J. (Beteiligte*r Wissenschaftler*in) & Pohlmann, J. (Beteiligte Person)

    01.01.0531.12.10

    Projekt: DFG-ProjekteDFG Einzelförderungen

Zitieren