Skip Navigation


MBE Advance Access originally published online on October 13, 2004
Molecular Biology and Evolution 2005 22(2):223-234; doi:10.1093/molbev/msi009
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
22/2/223    most recent
msi009v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kosakovsky Pond, S. L.
Right arrow Articles by Frost, S. D. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kosakovsky Pond, S. L.
Right arrow Articles by Frost, S. D. W.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Molecular Biology and Evolution vol. 22 no. 2 © Society for Molecular Biology and Evolution 2005; all rights reserved.

Research Article

A Simple Hierarchical Approach to Modeling Distributions of Substitution Rates

Sergei L. Kosakovsky Pond and Simon D. W. Frost

Antiviral Research Center, University of California San Diego, San Diego, California

E-mail: spond{at}ucsd.edu.

Genetic sequence data typically exhibit variability in substitution rates across sites. In practice, there is often too little variation to fit a different rate for each site in the alignment, but the distribution of rates across sites may not be well modeled using simple parametric families. Mixtures of different distributions can capture more complex patterns of rate variation, but are often parameter-rich and difficult to fit. We present a simple hierarchical model in which a baseline rate distribution, such as a gamma distribution, is discretized into several categories, the quantiles of which are estimated using a discretized beta distribution. Although this approach involves adding only two extra parameters to a standard distribution, a wide range of rate distributions can be captured. Using simulated data, we demonstrate that a "beta-" model can reproduce the moments of the rate distribution more accurately than the distribution used to simulate the data, even when the baseline rate distribution is misspecified. Using hepatitis C virus and mammalian mitochondrial sequences, we show that a beta- model can fit as well or better than a model with multiple discrete rate categories, and compares favorably with a model which fits a separate rate category to each site. We also demonstrate this discretization scheme in the context of codon models specifically aimed at identifying individual sites undergoing adaptive or purifying evolution.

Key Words: substitution rates • hierarchical model • adaptive evolution • hepatitis C • model selection • parallel algorithms


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
A. R. Jha, S. K. Pillai, V. A. York, E. R. Sharp, E. C. Storm, D. J. Wachter, J. N. Martin, S. G. Deeks, M. G. Rosenberg, D. F. Nixon, et al.
Cross-Sectional Dating of Novel Haplotypes of HERV-K 113 and HERV-K 115 Indicate These Proviruses Originated in Africa before Homo sapiens
Mol. Biol. Evol., November 1, 2009; 26(11): 2617 - 2626.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. D. Fernandes and W. R. Atchley
Site-specific evolutionary rates in proteins are better modeled as non-independent and strictly relative
Bioinformatics, October 1, 2008; 24(19): 2177 - 2183.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond, A. F.Y. Poon, A. J. Leigh Brown, and S. D.W. Frost
A Maximum Likelihood Method for Detecting Directional Evolution in Protein Sequences and Its Application to Influenza A Virus
Mol. Biol. Evol., September 1, 2008; 25(9): 1809 - 1824.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
J. P. Huelsenbeck and M. A. Suchard
A Nonparametric Method for Accommodating and Testing Across-Site Rate Variation
Syst Biol, December 1, 2007; 56(6): 975 - 987.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
C. M. Noviello, S. L. K. Pond, M. J. Lewis, D. D. Richman, S. K. Pillai, O. O. Yang, S. J. Little, D. M. Smith, and J. C. Guatelli
Maintenance of Nef-Mediated Modulation of Major Histocompatibility Complex Class I and CD4 after Sexual Transmission of Human Immunodeficiency Virus Type 1
J. Virol., May 1, 2007; 81(9): 4776 - 4786.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond, F. V. Mannino, M. B. Gravenor, S. V. Muse, and S. D. W. Frost
Evolutionary Model Selection with a Genetic Algorithm: A Case Study Using Stem RNA
Mol. Biol. Evol., January 1, 2007; 24(1): 159 - 170.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
S. Foret and R. Maleszka
Function and evolution of a gene family encoding odorant binding-like proteins in a social insect, the honey bee (Apis mellifera)
Genome Res., November 1, 2006; 16(11): 1404 - 1413.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond, D. Posada, M. B. Gravenor, C. H. Woelk, and S. D. W. Frost
Automated Phylogenetic Detection of Recombination Using a Genetic Algorithm
Mol. Biol. Evol., October 1, 2006; 23(10): 1891 - 1901.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. P. Huelsenbeck, S. Jain, S. W. D. Frost, and S. L. K. Pond
A Dirichlet process model for detecting positive selection in protein-coding DNA sequences
PNAS, April 18, 2006; 103(16): 6263 - 6268.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. D. W. Frost, T. Wrin, D. M. Smith, S. L. K. Pond, Y. Liu, E. Paxinos, C. Chappey, J. Galovich, J. Beauchaine, C. J. Petropoulos, et al.
Neutralizing antibody responses drive the evolution of human immunodeficiency virus type 1 envelope during recent HIV infection
PNAS, December 20, 2005; 102(51): 18514 - 18519.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Scheffler and C. Seoighe
A Bayesian Model Comparison Approach to Inferring Positive Selection
Mol. Biol. Evol., December 1, 2005; 22(12): 2531 - 2540.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. K. Pond and S. V. Muse
Site-to-Site Variation of Synonymous Substitution Rates
Mol. Biol. Evol., December 1, 2005; 22(12): 2375 - 2385.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. C. Leman, Y. Chen, J. E. Stajich, M. A. F. Noor, and M. K. Uyenoyama
Likelihoods From Summary Statistics: Recent Divergence Between Species
Genetics, November 1, 2005; 171(3): 1419 - 1436.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. L. K. Pond and S. D. W. Frost
Datamonkey: rapid detection of selective pressure on individual sites of codon alignments
Bioinformatics, May 15, 2005; 21(10): 2531 - 2533.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond and S. D. W. Frost
Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection
Mol. Biol. Evol., May 1, 2005; 22(5): 1208 - 1222.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.