MBE Advance Access originally published online on February 9, 2005
Molecular Biology and Evolution 2005 22(5):1208-1222; doi:10.1093/molbev/msi105
Research Article
Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection
Antiviral Research Center, University of California San Diego
E-mail: sdfrost@ucsd.edu.
We consider three approaches for estimating the rates of nonsynonymous and synonymous changes at each site in a sequence alignment in order to identify sites under positive or negative selection: (1) a suite of fast likelihood-based "counting methods" that employ either a single most likely ancestral reconstruction, weighting across all possible ancestral reconstructions, or sampling from ancestral reconstructions; (2) a random effects likelihood (REL) approach, which models variation in nonsynonymous and synonymous rates across sites according to a predefined distribution, with the selection pressure at an individual site inferred using an empirical Bayes approach; and (3) a fixed effects likelihood (FEL) method that directly estimates nonsynonymous and synonymous substitution rates at each site. All three methods incorporate flexible models of nucleotide substitution bias and variation in both nonsynonymous and synonymous substitution rates across sites, facilitating the comparison between the methods. We demonstrate that the results obtained using these approaches show broad agreement in levels of Type I and Type II error and in estimates of substitution rates. Counting methods are well suited for large alignments, for which there is high power to detect positive and negative selection, but appear to underestimate the substitution rate. A REL approach, which is more computationally intensive than counting methods, has higher power than counting methods to detect selection in data sets of intermediate size but may suffer from higher rates of false positives for small data sets. A FEL approach appears to capture the pattern of rate variation better than counting methods or random effects models, does not suffer from as many false positives as random effects models for data sets comprising few sequences, and can be efficiently parallelized. 
Our results suggest that previously reported differences between results obtained by counting methods and random effects models arise due to a combination of the conservative nature of counting-based methods, the failure of current random effects models to allow for variation in synonymous substitution rates, and the naive application of random effects models to extremely sparse data sets. We demonstrate our methods on sequence data from the human immunodeficiency virus type 1 env and pol genes and simulated alignments.
Key Words: positive and negative selection; codon substitution models; substitution rates; parallel algorithms
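The site-level decisions behind two of the approaches summarized above can be illustrated with a minimal sketch. It is not the authors' implementation. The function names `rel_posteriors` and `fel_lrt_pvalue`, the example numbers, and the choice of a one-degree-of-freedom likelihood ratio test for the FEL step are all assumptions made for illustration; the abstract does not specify them. The REL function applies Bayes' rule to a predefined set of rate classes, and the FEL function compares a fit in which the nonsynonymous and synonymous rates at a site are constrained to be equal against a fit in which both are free.

```python
from math import erfc, sqrt


def rel_posteriors(prior_weights, site_likelihoods):
    """Empirical Bayes step (hypothetical helper): posterior probability of
    each rate class at one site, given the class weights estimated from the
    whole alignment and the site's likelihood under each class."""
    joint = [w * lik for w, lik in zip(prior_weights, site_likelihoods)]
    total = sum(joint)
    return [j / total for j in joint]


def fel_lrt_pvalue(lnl_null, lnl_alt):
    """Per-site likelihood ratio test (hypothetical helper, assumed 1 df).

    lnl_null: log-likelihood with nonsynonymous rate == synonymous rate.
    lnl_alt:  log-likelihood with both rates estimated freely.
    For a chi-square variable with one degree of freedom, the upper tail
    probability at x equals erfc(sqrt(x / 2)).
    """
    statistic = max(0.0, 2.0 * (lnl_alt - lnl_null))
    return erfc(sqrt(statistic / 2.0))


if __name__ == "__main__":
    # Three assumed rate classes: negative, neutral, positive selection.
    posteriors = rel_posteriors([0.5, 0.3, 0.2], [0.001, 0.002, 0.01])
    print("posterior per class:", [round(p, 3) for p in posteriors])

    # Assumed per-site log-likelihoods from the two nested fits.
    print("LRT p-value:", round(fel_lrt_pvalue(-12.0, -10.08), 3))
```

In this sketch the REL step shares information across sites through the class weights, whereas the FEL step uses only the data at the site being tested, which is consistent with the abstract's observation that random effects models can produce more false positives on small data sets.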