A large version of the small parsimony problem

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

10 Citations (Scopus)

Abstract

Given a multiple alignment over $k$ sequences, an evolutionary tree
relating the sequences, and a subadditive gap penalty function (e.g.
an affine function), we reconstruct the internal nodes of the tree
optimally: we find the optimal explanation in terms of indels
of the observed gaps and find the most parsimonious assignment of
nucleotides. The gaps of the alignment are represented in a
so-called gap graph, and through theoretically sound preprocessing
the graph is reduced to pave the way for a running time which in all
but the most pathological examples is far better than the
exponential worst case time. E.g. for a tree with nine leaves and a
random alignment of length 10.000 with 60% gaps, the running time
is on average around 45 seconds. For a real alignment of length 9868 of
nine HIV-1 sequences, the running time is less than one second.
Original languageEnglish
Title of host publicationProceedings of the 3rd workshop on algorithms in bioinformatics (WABI 2003)
Number of pages16
Publication date2003
Pages417-432
Publication statusPublished - 2003
EventWorkshop on algorithms in bioinformatics (WABI 2003) - Budapest, Hungary
Duration: 17 Dec 2010 → …
Conference number: 3

Conference

ConferenceWorkshop on algorithms in bioinformatics (WABI 2003)
Number3
Country/TerritoryHungary
CityBudapest
Period17/12/2010 → …

Fingerprint

Dive into the research topics of 'A large version of the small parsimony problem'. Together they form a unique fingerprint.

Cite this