Molecular Biology and Evolution 19:950-958 (2002)
© 2002 Society for Molecular Biology and Evolution
Accuracy and Power of Bayes Prediction of Amino Acid Sites Under Positive Selection
Department of Biology, Galton Laboratory
Center for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London
Bayes prediction quantifies uncertainty by assigning posterior probabilities. It was used to identify amino acids in a protein under recurrent diversifying selection, indicated by higher nonsynonymous (dN) than synonymous (dS) substitution rates, that is, by ω = dN/dS > 1. Parameters were estimated by maximum likelihood under a codon substitution model that assumed several classes of sites with different ω ratios. Bayes' theorem was used to calculate the posterior probability of each site falling into each of these site classes. Here, we evaluate the performance of Bayes prediction of amino acids under positive selection by computer simulation. We measured accuracy as the proportion of predicted sites that were truly under selection, and power as the proportion of truly positively selected sites that were predicted by the method. Accuracy was slightly better for longer sequences, whereas power was largely unaffected by the increase in sequence length. Both accuracy and power were higher for medium or highly diverged sequences than for similar sequences. We found that accuracy and power were unacceptably low when the data contained only a few highly similar sequences. However, sampling a large number of lineages improved performance substantially. Even for very similar sequences, accuracy and power can be high if over 100 taxa are used in the analysis. We make the following recommendations: (1) prediction of positively selected sites is not feasible for a few closely related sequences; (2) using a large number of lineages is the best way to improve the accuracy and power of the prediction; and (3) multiple models of heterogeneous selective pressures among sites should be applied in real data analysis.
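The two quantities defined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy class proportions and site likelihoods, and the 0.95 posterior cutoff are all assumptions made for illustration. The first function applies Bayes' theorem to a single site, given the estimated proportion of each site class and the likelihood of the site's data under each class. The second computes accuracy and power exactly as worded above, from a set of predicted sites and a set of truly selected sites.

```python
from typing import List, Sequence, Set, Tuple


def site_class_posteriors(proportions: Sequence[float],
                          likelihoods: Sequence[float]) -> List[float]:
    """Posterior probability of each site class for one site (Bayes' theorem).

    proportions[k] is the prior proportion of sites in class k;
    likelihoods[k] is the probability of the site's data given class k.
    """
    joint = [p * f for p, f in zip(proportions, likelihoods)]
    total = sum(joint)
    return [j / total for j in joint]


def accuracy_and_power(predicted: Set[int],
                       truly_selected: Set[int]) -> Tuple[float, float]:
    """Accuracy: share of predicted sites that are truly selected.
    Power: share of truly selected sites that were predicted."""
    hits = len(predicted & truly_selected)
    accuracy = hits / len(predicted) if predicted else float("nan")
    power = hits / len(truly_selected) if truly_selected else float("nan")
    return accuracy, power


# Toy example (all numbers hypothetical). The last class stands for omega > 1.
proportions = [0.6, 0.3, 0.1]
site_likelihoods = {
    1: [1e-4, 2e-4, 9e-2],
    2: [5e-3, 4e-3, 1e-4],
    3: [1e-5, 1e-4, 3e-2],
}
CUTOFF = 0.95  # assumed posterior threshold, not taken from the abstract

predicted_sites = {
    site for site, lik in site_likelihoods.items()
    if site_class_posteriors(proportions, lik)[-1] > CUTOFF
}
true_sites = {1, 2}  # in a simulation, the truly selected sites are known
accuracy, power = accuracy_and_power(predicted_sites, true_sites)
print(predicted_sites, accuracy, power)
```

In a simulation study the set of truly selected sites is known by construction, which is what makes both ratios computable; with real data only the posterior probabilities are available.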