MBE Advance Access originally published online on March 10, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mol. Biol. Evol. 21(6):1160-1163. 2004
DOI: 10.1093/molbev/msh115
© 2004 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038
On Counting Tandem Duplication Trees
Department of Mathematics, National University of Singapore, Singapore
E-mail: matzlx@nus.edu.sg.
Key Words: molecular phylogeny tandem duplication history duplication tree model
| The first 150 words of the full text of this article appear below. |
Introduction
Large genomes are full of repeated DNA sequences. It was estimated that over half of the human DNA consists of repeated sequences (Baltimore 2001; Eichler 2001; Leem et al. 2002). Tandem duplication is one of the important evolutionary mechanisms for producing repeated DNA sequences, in which the copies that may or may not contain genes are adjacent along the genome. Fitch (1977) first observed that tandem duplication histories are much more constrained than speciation histories and proposed to model them assuming that unequal crossover is the biological mechanism from which they originate. The corresponding trees are now called tandem duplication trees, the term tandem sometimes being omitted for the sake of conciseness. With more and more genomic sequences becoming known, inferring tandem duplication history has again redrawn researchers' attention (Benson and Dong 1999; Tang, Waterman, and Yooseph 2002; Elemento, Gascuel, and Lefranc 2002;
Duplication Tree Model
Counting Rooted Duplication Trees
Unrooted Duplication Trees