MBE Advance Access originally published online on October 14, 2008
Molecular Biology and Evolution 2009 26(2):255-271; doi:10.1093/molbev/msn232
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models


* Institute of Computational Science, Swiss Federal Institute of Technology (ETHZ), Zurich, Switzerland
Swiss Institute of Bioinformatics, Lausanne, Switzerland
Department of Biological Statistics and Computational Biology, Cornell University
E-mail: maria.anisimova{at}inf.ethz.ch.
Accepted for publication October 6, 2008.
This review is motivated by the true explosion in the number of recent studies both developing and ameliorating probabilistic models of codon evolution. Traditionally parametric, the first codon models focused on estimating the effects of selective pressure on the protein via an explicit parameter in the maximum likelihood framework. Likelihood ratio tests of nested codon models armed the biologists with powerful tools, which provided unambiguous evidence for positive selection in real data. This, in turn, triggered a new wave of methodological developments. The new generation of models views the codon evolution process in a more sophisticated way, relaxing several mathematical assumptions. These models make a greater use of physicochemical amino acid properties, genetic code machinery, and the large amounts of data from the public domain. The overview of the most recent advances on modeling codon evolution is presented here, and a wide range of their applications to real data is discussed. On the downside, availability of a large variety of models, each accounting for various biological factors, increases the margin for misinterpretation; the biological meaning of certain parameters may vary among models, and model selection procedures also deserve greater attention. Solid understanding of the modeling assumptions and their applicability is essential for successful statistical data analysis.
Key Words: Markov model maximum likelihood Bayesian approach codon evolution positive selection
Arndt von Haeseler, Associate Editor
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Fletcher and Z. Yang INDELible: A Flexible Simulator of Biological Sequence Evolution Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodrigue, C. L. Kleinman, H. Philippe, and N. Lartillot Computational Methods for Evaluating Phylogenetic Models of Coding Sequence Evolution with Dependence between Codons Mol. Biol. Evol., July 1, 2009; 26(7): 1663 - 1676. [Abstract] [Full Text] [PDF] |
||||
