MBE Advance Access published online on June 17, 2009
Molecular Biology and Evolution, doi:10.1093/molbev/msp119
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change
1 Centre for the Study of Evolution and School of Life Sciences, University of Sussex, Brighton BN1 9QG, UK
2 Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, West Mains Road, Edinburgh EH9 3JT, UK
3 Corresponding author. Tel +44 (0)131 650 5443, Fax +44 (0)131 650 6564, Email: keightley.mbe2009{at}gmail.com
Received for publication March 23, 2009. Revision received June 10, 2009. Accepted for publication June 11, 2009.
The prevalence of adaptive evolution relative to genetic drift is a central problem in molecular evolution. Methods to estimate the fraction of adaptive nucleotide substitutions (
) have been developed, based on the McDonald-Kreitman test, that contrast polymorphism and divergence between selectively and neutrally evolving sites. However, these methods are expected to give downwardly biased estimates of
if there are slightly deleterious mutations, because these inflate polymorphism relative to divergence. Here, we estimate
by simultaneously estimating the distribution of fitness effects of new mutations at selected sites from the site frequency spectrum, and the number of adaptive substitutions. We test the method using simulations. If data meet the assumptions of the analysis model, estimates of
show little bias, even when there is little or no recombination. However, population size differences between the divergence and polymorphism phases may cause
to be over- or under-estimated by a predictable factor that depends on the magnitude of the population size change and the shape of the distribution of effects of deleterious mutations. We analyse several data sets of protein-coding genes and noncoding regions from hominids and Drosophila. In Drosophila genes, we estimate that c.50% of amino acid substitutions and c.20% of substitutions in introns are adaptive. In protein-coding and noncoding data sets of humans, comparison to macaque sequences reveals little evidence for adaptive substitutions. However, the true frequency of adaptive substitutions in human coding DNA could be as high as 40%, since estimates based on current polymorphism may be strongly downwardly biased by a decrease in the effective population size along the human lineage.
Key Words: Adaptation Humans Drosophila Site frequency spectrum Nucleotide polymorphism