The binary perfect phylogeny model is too restrictive to model biological events such as back mutations. In this paper, we consider a natural generalization of the model that allows a special type of back mutation. We investigate the problem of reconstructing a near perfect phylogeny over a binary set of characters where characters are persistent: characters can be gained and lost at most once. Based on this notion, we define the problem of the Persistent Perfect Phylogeny (referred as P-PP). We restate the P-PP problem as a special case of the Incomplete Directed Perfect Phylogeny, called Incomplete Perfect Phylogeny with Persistent Completion, (refereed as IP-PP), where the instance is an incomplete binary matrix MM having some missing entries, denoted by symbol ??, that must be determined (or completed) as 00 or 11 so that MM admits a binary perfect phylogeny. We show that the IP-PP problem can be reduced to a problem over an edge colored graph since the completion of each column of the input matrix can be represented by a graph operation. Based on this graph formulation, we develop an exact algorithm for solving the P-PP problem that is exponential in the number of characters and polynomial in the number of species

Bonizzoni, P., Braghin, C., Dondi, R., Trucco, G. (2012). The binary perfect phylogeny with persistent characters. THEORETICAL COMPUTER SCIENCE, 454, 51-63 [10.1016/j.tcs.2012.05.035].

The binary perfect phylogeny with persistent characters

BONIZZONI, PAOLA;
2012

Abstract

The binary perfect phylogeny model is too restrictive to model biological events such as back mutations. In this paper, we consider a natural generalization of the model that allows a special type of back mutation. We investigate the problem of reconstructing a near perfect phylogeny over a binary set of characters where characters are persistent: characters can be gained and lost at most once. Based on this notion, we define the problem of the Persistent Perfect Phylogeny (referred as P-PP). We restate the P-PP problem as a special case of the Incomplete Directed Perfect Phylogeny, called Incomplete Perfect Phylogeny with Persistent Completion, (refereed as IP-PP), where the instance is an incomplete binary matrix MM having some missing entries, denoted by symbol ??, that must be determined (or completed) as 00 or 11 so that MM admits a binary perfect phylogeny. We show that the IP-PP problem can be reduced to a problem over an edge colored graph since the completion of each column of the input matrix can be represented by a graph operation. Based on this graph formulation, we develop an exact algorithm for solving the P-PP problem that is exponential in the number of characters and polynomial in the number of species
Articolo in rivista - Articolo scientifico
algorithms; phylogenetic reconstruction; parsimony models
English
2012
454
51
63
none
Bonizzoni, P., Braghin, C., Dondi, R., Trucco, G. (2012). The binary perfect phylogeny with persistent characters. THEORETICAL COMPUTER SCIENCE, 454, 51-63 [10.1016/j.tcs.2012.05.035].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/43721
Citazioni
  • Scopus 24
  • ???jsp.display-item.citation.isi??? 21
Social impact