MBE Advance Access originally published online on September 21, 2005
Molecular Biology and Evolution 2006 23(1):7-9; doi:10.1093/molbev/msj021
Letter
Choosing Appropriate Substitution Models for the Phylogenetic Analysis of Protein-Coding Sequences
Department of Zoology, University of Oxford, Oxford, United Kingdom
E-mail: andrew.rambaut{at}zoo.ox.ac.uk.
Although phylogenetic inference of protein-coding sequences continues to dominate the literature, few analyses incorporate evolutionary models that consider the genetic code. This problem is exacerbated by the exclusion of codon-based models from commonly employed model selection techniques, presumably due to the computational cost associated with codon models. We investigated an efficient alternative to standard nucleotide substitution models, in which codon position (CP) is incorporated into the model. We determined the most appropriate model for alignments of 177 RNA virus genes and 106 yeast genes, using 11 substitution models including one codon model and four CP models. The majority of analyzed gene alignments are best described by CP substitution models, rather than by standard nucleotide models, and without the computational cost of full codon models. These results have significant implications for phylogenetic inference of coding sequences as they make it clear that substitution models incorporating CPs not only are a computationally realistic alternative to standard models but may also frequently be statistically superior.
Key Words: phylogenetic inference, protein-coding sequences, substitution models
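As an illustration of what incorporating codon position into a nucleotide model requires of the data, the sketch below splits an in-frame coding alignment into three partitions, one per codon position, so that each partition could be given its own substitution parameters. It is a minimal sketch, not the authors' implementation: the function names `split_by_codon_position` and `count_variable_sites`, the toy sequences, and the assumption that the alignment begins at a first codon position and contains no frame-shifting gaps are all hypothetical choices made for this example.

```python
def split_by_codon_position(alignment):
    """Split an in-frame alignment into three per-codon-position alignments.

    `alignment` maps taxon name -> aligned nucleotide string. It is assumed
    (for this sketch) to start at a first codon position and to have a
    length that is a multiple of three.
    """
    partitions = [{}, {}, {}]
    for name, seq in alignment.items():
        if len(seq) % 3 != 0:
            raise ValueError(f"{name}: length {len(seq)} is not a multiple of 3")
        for pos in range(3):
            # Every third site, starting at offset `pos`, shares a codon position.
            partitions[pos][name] = seq[pos::3]
    return partitions


def count_variable_sites(partition):
    """Count alignment columns that contain more than one distinct character."""
    columns = zip(*partition.values())
    return sum(1 for column in columns if len(set(column)) > 1)


# Hypothetical toy alignment of three codons per taxon.
toy_alignment = {
    "taxonA": "ATGGCCAAA",
    "taxonB": "ATGGCTAAG",
    "taxonC": "ATGGCAAAA",
}

parts = split_by_codon_position(toy_alignment)
variable_counts = [count_variable_sites(p) for p in parts]
print(variable_counts)
```

In this toy alignment all variation falls at third positions, which is the kind of among-position heterogeneity that a CP model can accommodate by giving each partition its own parameters. Fitting and comparing the models themselves would be done in a phylogenetics package and is not shown here.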