Molecular Biology and Evolution 18:1024-1033 (2001)
© 2001 Society for Molecular Biology and Evolution
ARTICLE
Effects of Nucleotide Composition Bias on the Success of the Parsimony Criterion in Phylogenetic Inference
Department of Biology, University of New Mexico
Department of Ecology and Evolutionary Biology, University of Connecticut
Convergence in nucleotide composition (CNC) in unrelated lineages is a factor potentially affecting the performance of most phylogeny reconstruction methods. Such convergence has deleterious effects because unrelated lineages show similarities due to similar nucleotide compositions and not shared histories. While some methods (such as the LogDet/paralinear distance measure) avoid this pitfall, the amount of convergence in nucleotide composition necessary to deceive other phylogenetic methods has never been quantified. We examined analytically the relationship between convergence in nucleotide composition and the consistency of parsimony as a phylogenetic estimator for four taxa. Our results show that rather extreme amounts of convergence are necessary before parsimony begins to prefer the incorrect tree. Ancillary observations are that (for unweighted Fitch parsimony) transition/transversion bias contributes to the impact of CNC and, for a given amount of CNC and fixed branch lengths, data sets exhibiting substantial site-to-site rate heterogeneity present fewer difficulties than data sets in which rates are homogeneous. We conclude by reexamining a data set originally used to illustrate the problems caused by CNC. Using simulations, we show that in this case the convergence in nucleotide composition alone is insufficient to cause any commonly used methods to fail, and accounting for other evolutionary factors (such as site-to-site rate heterogeneity) can give a correct inference without accounting for CNC.
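To make the four-taxon setting concrete, the following minimal sketch scores the three possible unrooted quartet trees with unweighted Fitch parsimony and reports the GC content of each taxon. It is only an illustration of the criterion discussed above, not the analytical approach of the article. The taxon names, the toy sequences, and the helper names (`fitch_join`, `quartet_length`, `gc_content`) are all hypothetical and were chosen so that two taxa (A and C) share a GC-rich composition.

```python
# Hypothetical toy alignment: A and C are GC-rich, B and D are AT-rich.
# These sequences are invented for illustration and are not data from the article.
SEQS = {
    "A": "GCGCGCATGC",
    "B": "ATATGCATAT",
    "C": "GCGCATATGC",
    "D": "ATATATATAT",
}

# The three possible unrooted topologies for four taxa.
TOPOLOGIES = {
    "AB|CD": (("A", "B"), ("C", "D")),
    "AC|BD": (("A", "C"), ("B", "D")),
    "AD|BC": (("A", "D"), ("B", "C")),
}


def fitch_join(x, y):
    """One Fitch step: intersect the state sets if possible, else unite them at cost 1."""
    common = x & y
    if common:
        return common, 0
    return x | y, 1


def quartet_length(seqs, split):
    """Unweighted Fitch parsimony length of the quartet tree ((a,b),(c,d))."""
    (a, b), (c, d) = split
    total = 0
    for sa, sb, sc, sd in zip(seqs[a], seqs[b], seqs[c], seqs[d]):
        left, cost_left = fitch_join({sa}, {sb})
        right, cost_right = fitch_join({sc}, {sd})
        _, cost_mid = fitch_join(left, right)  # change along the internal branch
        total += cost_left + cost_right + cost_mid
    return total


def gc_content(seq):
    """Fraction of G or C characters in a sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


lengths = {name: quartet_length(SEQS, split) for name, split in TOPOLOGIES.items()}
best = min(lengths, key=lengths.get)
gc = {taxon: gc_content(seq) for taxon, seq in SEQS.items()}

print("GC content:", gc)
print("Tree lengths:", lengths)
print("Shortest tree:", best)
```

In this invented alignment the shortest tree groups the two GC-rich taxa together, which is the kind of composition-driven grouping the abstract describes. Whether that grouping is wrong depends on the true history, which a toy example like this does not specify.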


