MBE Advance Access originally published online on November 21, 2006
Molecular Biology and Evolution 2007 24(2):513-521; doi:10.1093/molbev/msl178
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2006 The Authors
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Research Articles |
Variation in Evolutionary Processes at Different Codon Positions
European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Hinxton, United Kingdom
E-mail: goldman{at}ebi.ac.uk.
Accepted for publication November 15, 2006.
Evolutionary studies commonly model single nucleotide substitutions and assume that they occur as independent draws from a unique probability distribution across the sequence studied. This assumption is violated for protein-coding sequences, and we consider modeling approaches where codon positions (CPs) are treated as separate categories of sites because within each category the assumption is more reasonable. Such "codon-position" models have been shown to explain the evolution of codon data better than homogenous models in previous studies. This paper examines the ways in which codon-position models outperform homogeneous models and characterizes the differences in estimates of model parameters across CPs. Using the PANDIT database of multiple species DNA sequence alignments, we quantify the differences in the evolutionary processes at the 3 CPs in a systematic and comprehensive manner, characterizing previously undescribed features of protein evolution. We relate our findings to the functional constraints imposed by the genetic code, protein function, and the types of mutation that cause synonymous and nonsynonymous codon changes. The results increase our understanding of selective constraints and could be incorporated into phylogenetic analyses or gene-finding techniques in the future. The methods used are extended to an overlapping reading frame data set, and we discover that overlapping reading frames do not necessarily cause more stringent evolutionary constraints.
Key Words: adaptive evolution codon positions phylogenetic inference protein-coding sequences sequence evolution
Martin Embley, Associate Editor
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. D. Moore and R. G. Allaby TreeMos: a high-throughput phylogenomic approach to find and visualize phylogenetic mosaicism Bioinformatics, March 1, 2008; 24(5): 717 - 718. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kingsford, A. L. Delcher, and S. L. Salzberg A Unified Model Explaining the Offsets of Overlapping and Near-Overlapping Prokaryotic Genes Mol. Biol. Evol., September 1, 2007; 24(9): 2091 - 2098. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang PAML 4: Phylogenetic Analysis by Maximum Likelihood Mol. Biol. Evol., August 1, 2007; 24(8): 1586 - 1591. [Abstract] [Full Text] [PDF] |
||||

