MBE Advance Access originally published online on October 13, 2004
Molecular Biology and Evolution 2005 22(2):223-234; doi:10.1093/molbev/msi009
Research Article
A Simple Hierarchical Approach to Modeling Distributions of Substitution Rates
Antiviral Research Center, University of California San Diego, San Diego, California
E-mail: spond{at}ucsd.edu.
Genetic sequence data typically exhibit variability in substitution rates across sites. In practice, there is often too little variation to fit a different rate for each site in the alignment, but the distribution of rates across sites may not be well modeled using simple parametric families. Mixtures of different distributions can capture more complex patterns of rate variation, but are often parameter-rich and difficult to fit. We present a simple hierarchical model in which a baseline rate distribution, such as a gamma distribution, is discretized into several categories, the quantiles of which are estimated using a discretized beta distribution. Although this approach adds only two extra parameters to a standard distribution, it can capture a wide range of rate distributions. Using simulated data, we demonstrate that a "beta-gamma" model can reproduce the moments of the rate distribution more accurately than the distribution used to simulate the data, even when the baseline rate distribution is misspecified. Using hepatitis C virus and mammalian mitochondrial sequences, we show that a beta-gamma model can fit as well as or better than a model with multiple discrete rate categories, and compares favorably with a model that fits a separate rate category to each site. We also demonstrate this discretization scheme in the context of codon models specifically aimed at identifying individual sites undergoing adaptive or purifying evolution.
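The discretization described above can be illustrated with a minimal sketch. It reflects one plausible reading of the abstract and is not the authors' implementation: the function names, the choice of beta quantiles at levels i/N as breakpoints, the unit-mean gamma baseline, and the use of conditional means as category rates are all assumptions made here for illustration. The beta quantiles are obtained by a crude midpoint-rule integration, which is adequate for a demonstration but not for production use.

```python
import math


def reg_lower_gamma(a, x):
    """Regularized lower incomplete gamma P(a, x), computed from its power series."""
    if x <= 0.0:
        return 0.0
    term = 1.0 / a
    total = term
    n = 1
    while term > 1e-16 * total:
        term *= x / (a + n)
        total += term
        n += 1
    return min(1.0, total * math.exp(-x + a * math.log(x) - math.lgamma(a)))


def gamma_quantile(u, alpha):
    """Quantile of a unit-mean gamma (shape = rate = alpha), found by bisection."""
    if u <= 0.0:
        return 0.0
    if u >= 1.0:
        return math.inf
    lo, hi = 0.0, 1.0
    while reg_lower_gamma(alpha, alpha * hi) < u:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if reg_lower_gamma(alpha, alpha * mid) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def beta_breakpoints(p, q, n_cat, grid=20000):
    """Assumed scheme: cut [0, 1] at the Beta(p, q) quantiles of levels k / n_cat."""
    h = 1.0 / grid
    dens = [((i + 0.5) * h) ** (p - 1) * (1 - (i + 0.5) * h) ** (q - 1)
            for i in range(grid)]
    total = sum(dens)  # self-normalizing, so the Beta function is not needed
    cuts, acc, k = [0.0], 0.0, 1
    for i, d in enumerate(dens):
        nxt = acc + d / total
        while k < n_cat and nxt >= k / n_cat:
            frac = (k / n_cat - acc) / (nxt - acc)  # interpolate within the cell
            cuts.append((i + frac) * h)
            k += 1
        acc = nxt
    cuts.append(1.0)
    return cuts


def beta_gamma_categories(alpha, p, q, n_cat):
    """Return (weights, rates) for a gamma baseline split at beta-derived quantiles."""
    cuts = beta_breakpoints(p, q, n_cat)
    weights, rates = [], []
    for lo, hi in zip(cuts, cuts[1:]):
        x_lo, x_hi = gamma_quantile(lo, alpha), gamma_quantile(hi, alpha)
        upper = 1.0 if math.isinf(x_hi) else reg_lower_gamma(alpha + 1, alpha * x_hi)
        lower = reg_lower_gamma(alpha + 1, alpha * x_lo)
        w = hi - lo                        # probability mass of this category
        weights.append(w)
        rates.append((upper - lower) / w)  # conditional mean rate in the category
    return weights, rates


if __name__ == "__main__":
    for w, r in zip(*beta_gamma_categories(alpha=0.5, p=2.0, q=0.5, n_cat=4)):
        print(f"weight={w:.4f}  rate={r:.4f}")
```

Under these assumptions, setting p = q = 1 makes the breakpoints uniform, so the scheme reduces to a gamma distribution discretized into equiprobable categories; other values of the two beta parameters shift probability mass between low-rate and high-rate categories while the mean rate stays at 1.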
Key Words: substitution rates; hierarchical model; adaptive evolution; hepatitis C; model selection; parallel algorithms