MBE Advance Access originally published online on May 12, 2009
Molecular Biology and Evolution 2009 26(8):1901-1908; doi:10.1093/molbev/msp102
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
A Phylogenetic Mixture Model for Gene Family Loss in Parasitic Bacteria
School of Biological Sciences, University of Liverpool, Liverpool, United Kingdom
m.spencer{at}liverpool.ac.uk.
Accepted for publication May 5, 2009.
Gene families are frequently gained and lost from prokaryotic genomes. It is widely believed that the rate of loss was accelerated for some but not all gene families in lineages that became parasites or endosymbionts. This leads to a form of heterotachy that may be responsible for the poor performance of phylogeny estimation based on gene content. We describe a mixture model that accounts for this heterotachy. We show that this model fits data on the distribution of gene families across bacteria from the COG database much better than previous models. However, it still favors an artifactual tree topology in which parasites form a clade over the more plausible 16S topology. In contrast to a previous model of genome dynamics, our model suggests that the ancestral bacterium had a small genome. We suggest that models of gene family gain and loss are likely to be more useful for understanding genome dynamics than for estimating phylogenetic trees.
Key Words: gene families mixture model parasites maximum likelihood phylogenetics
Jennifer Wernegreen, Associate Editor