Molecular Biology and Evolution, Vol 11, 316-324, Copyright © 1994 by Society for Molecular Biology and Evolution
Z Yang, N Goldman and A Friday
Using real sequence data, we evaluate the adequacy of assumptions made in
evolutionary models of nucleotide substitution and the effects that these
assumptions have on estimation of evolutionary trees. Two aspects of the
assumptions are evaluated. The first concerns the pattern of nucleotide
substitution, including equilibrium base frequencies and the
transition/transversion-rate ratio. The second concerns the variation of
substitution rates over sites. The maximum-likelihood estimate of tree
topology appears quite robust to both these aspects of the assumptions of
the models, but evaluation of the reliability of the estimated tree by
using simpler, less realistic models can be misleading. Branch lengths are
underestimated when simpler models of substitution are used, but the
underestimation caused by ignoring rate variation over nucleotide sites is
much more serious. The goodness of fit of a model is reduced by ignoring
spatial rate variation, but unrealistic assumptions about the pattern of
nucleotide substitution can lead to an extraordinary reduction in the
likelihood. It seems that evolutionary biologists can obtain accurate
estimates of certain evolutionary parameters even with an incorrect
phylogeny, while systematists cannot get the right tree with confidence
even when a realistic, and more complex, model of evolution is assumed.
ORIGINAL ARTICLE
Comparison of models for nucleotide substitution used in maximum- likelihood phylogenetic estimation
Department of Zoology, University of Cambridge, United Kingdom.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. D. Cutter, J. D. Wasmuth, and N. L. Washington Patterns of Molecular Evolution in Caenorhabditis Preclude Ancient Origins of Selfing Genetics, April 1, 2008; 178(4): 2093 - 2104. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Blankenberg, J. Taylor, I. Schenck, J. He, Y. Zhang, M. Ghent, N. Veeraraghavan, I. Albert, W. Miller, K. D. Makova, et al. A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly Genome Res., June 1, 2007; 17(6): 960 - 964. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. G. Beiko and R. L. Charlebois A simulation test bed for hypotheses of genome evolution Bioinformatics, April 1, 2007; 23(7): 825 - 831. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bofkin and N. Goldman Variation in Evolutionary Processes at Different Codon Positions Mol. Biol. Evol., February 1, 2007; 24(2): 513 - 521. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nielsen, V. L. Bauer DuMont, M. J. Hubisz, and C. F. Aquadro Maximum Likelihood Estimation of Ancestral Codon Usage Bias Parameters in Drosophila Mol. Biol. Evol., January 1, 2007; 24(1): 228 - 235. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Dvornyk Subfamilies of cpmA, a gene involved in circadian output, have different evolutionary histories in cyanobacteria Microbiology, January 1, 2006; 152(1): 75 - 84. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Chen, M. Terai, L. Fu, R. Herrero, R. DeSalle, and R. D. Burk Diversifying Selection in Human Papillomavirus Type 16 Lineages Based on Complete Genome Analyses J. Virol., June 1, 2005; 79(11): 7014 - 7023. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. R. Kitchen, S. Philpott, H. Burger, B. Weiser, K. Anastos, and M. A. Suchard Evolution of Human Immunodeficiency Virus Type 1 Coreceptor Usage during Antiretroviral Therapy: a Bayesian Approach J. Virol., October 15, 2004; 78(20): 11296 - 11302. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Howell, J. L. Elson, D. M. Turnbull, and C. Herrnstadt African Haplogroup L mtDNA Sequences Show Violations of Clock-like Evolution Mol. Biol. Evol., October 1, 2004; 21(10): 1843 - 1854. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. G. Hwang and P. Green Inaugural Article: Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution PNAS, September 28, 2004; 101(39): 13994 - 14001. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Burleigh and S. Mathews Phylogenetic signal in nucleotide data from seed plants: implications for resolving the seed plant tree of life Am. J. Botany, September 1, 2004; 91(10): 1599 - 1613. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan and N. Goldman Estimating the Frequency of Events That Cause Multiple-Nucleotide Changes Genetics, August 1, 2004; 167(4): 2027 - 2043. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. I. Jensen-Seaman, T. S. Furey, B. A. Payseur, Y. Lu, K. M. Roskin, C.-F. Chen, M. A. Thomas, D. Haussler, and H. J. Jacob Comparative Recombination Rates in the Rat, Mouse, and Human Genomes Genome Res., April 1, 2004; 14(4): 528 - 538. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Siepel and D. Haussler Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood Mol. Biol. Evol., March 1, 2004; 21(3): 468 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Faith and D. D. Pollock Likelihood Analysis of Asymmetrical Mutation Bias Gradients in Vertebrate Mitochondrial Genomes Genetics, October 1, 2003; 165(2): 735 - 745. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. W. Chang Ancestral Gene Reconstruction and Synthesis of Ancient Rhodopsins in the Laboratory Integr. Comp. Biol., August 1, 2003; 43(4): 500 - 507. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Civetta Positive Selection Within Sperm-Egg Adhesion Domains of Fertilin: An ADAM Gene with a Potential Role in Fertilization Mol. Biol. Evol., January 1, 2003; 20(1): 21 - 29. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Goldman and S. Whelan A Novel Use of Equilibrium Frequencies in Models of Sequence Evolution Mol. Biol. Evol., November 1, 2002; 19(11): 1821 - 1831. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ranwez and O. Gascuel Improvement of Distance-Based Phylogenetic Methods by a Local Maximum Likelihood Approach Using Triplets Mol. Biol. Evol., November 1, 2002; 19(11): 1952 - 1963. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Kelchner Group II introns as phylogenetic tools: structure, function, and evolutionary constraints Am. J. Botany, October 1, 2002; 89(10): 1651 - 1669. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. W. Chang, K. Jonsson, M. A. Kazmi, M. J. Donoghue, and T. P. Sakmar Recreating a Functional Ancestral Archosaur Visual Pigment Mol. Biol. Evol., September 1, 2002; 19(9): 1483 - 1489. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Gutierrez, D. Sanchez, and A. Marin A Reanalysis of the Ancient Mitochondrial DNA Sequences Recovered from Neandertal Bones Mol. Biol. Evol., August 1, 2002; 19(8): 1359 - 1366. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. A. Labas, N. G. Gurskaya, Y. G. Yanushevich, A. F. Fradkov, K. A. Lukyanov, S. A. Lukyanov, and M. V. Matz Diversity and evolution of the green fluorescent protein family PNAS, April 2, 2002; 99(7): 4256 - 4261. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Guindon and O. Gascuel Efficient Biased Estimation of Evolutionary Distances When Substitution Rates Vary Across Sites Mol. Biol. Evol., April 1, 2002; 19(4): 534 - 543. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. G. v. d. Schulenburg, G. D. D. Hurst, D. Tetzlaff, G. E. Booth, I. A. Zakharov, and M. E. N. Majerus History of Infection With Different Male-Killing Bacteria in the Two-Spot Ladybird Beetle Adalia bipunctata Revealed Through Mitochondrial DNA Sequence Analysis Genetics, March 1, 2002; 160(3): 1075 - 1086. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Sanderson and J. A. Doyle Sources of error and confidence intervals in estimating the age of angiosperms from rbcL and 18S rDNA data Am. J. Botany, August 1, 2001; 88(8): 1499 - 1516. [Full Text] |
||||
![]() |
R. Tarrio, F. Rodriguez-Trelles, and F. J. Ayala Shared Nucleotide Composition Biases Among Species and Their Impact on Phylogenetic Reconstructions of the Drosophilidae Mol. Biol. Evol., August 1, 2001; 18(8): 1464 - 1473. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada and K. A. Crandall Selecting Models of Nucleotide Substitution: An Application to Human Immunodeficiency Virus 1 (HIV-1) Mol. Biol. Evol., June 1, 2001; 18(6): 897 - 906. [Abstract] [Full Text] |
||||
![]() |
S. Whelan and N. Goldman A General Empirical Model of Protein Evolution Derived from Multiple Protein Families Using a Maximum-Likelihood Approach Mol. Biol. Evol., May 1, 2001; 18(5): 691 - 699. [Abstract] [Full Text] |
||||
![]() |
D. Posada and K. A. Crandall Simple (Wrong) Models for Complex Trees: A Case from Retroviridae Mol. Biol. Evol., February 1, 2001; 18(2): 271 - 275. [Full Text] |
||||
![]() |
J. H. G. von der Schulenburg, M. Habig, J. J. Sloggett, K. M. Webberley, D. Bertrand, G. D. D. Hurst, and M. E. N. Majerus Incidence of Male-Killing Rickettsia spp. ({alpha}-Proteobacteria) in the Ten-Spot Ladybird Beetle Adalia decempunctata L. (Coleoptera: Coccinellidae) Appl. Envir. Microbiol., January 1, 2001; 67(1): 270 - 277. [Abstract] [Full Text] |
||||
![]() |
N. J. Savill, D. C. Hoyle, and P. G. Higgs RNA Sequence Evolution With Secondary Structure Constraints: Comparison of Substitution Rate Models Using Maximum-Likelihood Methods Genetics, January 1, 2001; 157(1): 399 - 411. [Abstract] [Full Text] |
||||
![]() |
J. C. Silva and M. G. Kidwell Horizontal Transfer and Selection in the Evolution of P Elements Mol. Biol. Evol., October 1, 2000; 17(10): 1542 - 1557. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. W. Chang and D. L. Campbell Bias in Phylogenetic Reconstruction of Vertebrate Rhodopsin Sequences Mol. Biol. Evol., August 1, 2000; 17(8): 1220 - 1231. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, L. Alarcon, and A. Fontdevila Molecular Evolution and Phylogeny of the buzzatii Complex (Drosophila repleta Group): A Maximum-Likelihood Approach Mol. Biol. Evol., July 1, 2000; 17(7): 1112 - 1122. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Korber, M. Muldoon, J. Theiler, F. Gao, R. Gupta, A. Lapedes, B. H. Hahn, S. Wolinsky, and T. Bhattacharya Timing the Ancestor of the HIV-1 Pandemic Strains Science, June 9, 2000; 288(5472): 1789 - 1796. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Goldman and a. S. Whelan Statistical Tests of Gamma-Distributed Rate Heterogeneity in Models of Sequence Evolution in Phylogenetics Mol. Biol. Evol., June 1, 2000; 17(6): 975 - 978. [Full Text] [PDF] |
||||
![]() |
J. J. Hanger, L. D. Bromham, J. J. McKee, T. M. O'Brien, and W. F. Robinson The Nucleotide Sequence of Koala (Phascolarctos cinereus) Retrovirus: a Novel Type C Endogenous Virus Related to Gibbon Ape Leukemia Virus J. Virol., May 1, 2000; 74(9): 4264 - 4272. [Abstract] [Full Text] |
||||
![]() |
D. L. Campbell, A. V. Z. Brower, and N. E. Pierce Molecular Evolution of the Wingless Gene and Its Implications for the Phylogenetic Placement of the Butterfly Family Riodinidae (Lepidoptera: Papilionoidea) Mol. Biol. Evol., May 1, 2000; 17(5): 684 - 696. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. Arbogast, J. B. Slowinski;, J. Klicka, and R. M. Zink; Pleistocene Speciation and the Mitochondrial DNA Clock Science, December 11, 1998; 282(5396): 1955a - 1955. [Full Text] |
||||
![]() |
P. Liò and N. Goldman Models of Molecular Evolution and Phylogeny Genome Res., December 1, 1998; 8(12): 1233 - 1244. [Abstract] [Full Text] |
||||
![]() |
N. Goldman, J. L. Thorne, and D. T. Jones Assessing the Impact of Secondary Structure and Solvent Accessibility on Protein Evolution Genetics, May 1, 1998; 149(1): 445 - 458. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nielsen and Z. Yang Likelihood Models for Detecting Positively Selected Amino Acid Sites and Applications to the HIV-1 Envelope Gene Genetics, March 1, 1998; 148(3): 929 - 936. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Phylogenetic Methods Come of Age: Testing Hypotheses in an Evolutionary Context Science, April 11, 1997; 276(5310): 227 - 232. [Abstract] [Full Text] |
||||










