MBE Advance Access published online on January 3, 2008
Molecular Biology and Evolution, doi:10.1093/molbev/msm284
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Mutation-Selection Models of Codon Substitution and Their Use to Estimate Selective Strengths on Codon Usage
1 Department of Biology, Galton Laboratory, University College London, London, UK
2 Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 KBH Ø, Denmark
Address for correspondence: Ziheng Yang, Department of Biology, University College London, Darwin Building, Gower Street, London WC1E 6BT, England, Tel: +44 (20) 7679 4379, Fax: +44 (20) 7679 7096, Email: z.yang{at}ucl.ac.uk
Received for publication September 14, 2007. Revision received December 13, 2007. Accepted for publication December 19, 2007.
Current models of codon substitution are formulated at the levels of nucleotide substitution and do not explicitly consider the separate effects of mutation and selection. They are thus incapable of inferring whether mutation or selection is responsible for evolution at silent sites. Here we implement a few population genetics models of codon substitution that explicitly consider mutation bias and natural selection at the DNA level. Selection on codon usage is modeled by introducing codon-fitness parameters, which, together with mutation-bias parameters, predict optimal codon frequencies for the gene. The selective pressure may be for translational efficiency and accuracy, or for fine-tuning translational kinetics to produce correct protein folding. We apply the models to compare mitochondrial and nuclear genes from several mammalian species. Model assumptions concerning codon usage are found to affect the estimation of sequence distances (such as the synonymous rate dS, the nonsynonymous rate dN, and the rate at the four-fold degenerate sites d4), as found in previous studies, but the new models produced very similar estimates to some old ones. We also develop a likelihood ratio test to test the null hypothesis that codon usage is due to mutation bias alone, not influenced by natural selection. Application of the test to the mammalian data led to rejection of the null hypothesis in most genes, suggesting that natural selection may be a driving force in the evolution of synonymous codon usage in mammals. Estimates of selection coefficients nevertheless suggest that selection on codon usage is weak and most mutations are nearly neutral. The sensitivity of the analysis on the assumed mutation model is discussed.
Key Words: codon substitution model codon usage mutation selection synonymous substitution