MBE Advance Access originally published online on March 19, 2004
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Mol. Biol. Evol. 21(6):1123-1133. 2004
DOI: 10.1093/molbev/msh123
© 2004 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038
Bayesian Phylogenetic Model Selection Using Reversible Jump Markov Chain Monte Carlo
,
* Section of Ecology, Behavior and Evolution, Division of Biological Sciences, University of California, San Diego
Department of Statistics, University of Wisconsin
Department of Botany, University of Wisconsin
E-mail: johnh{at}biomail.ucsd.edu.
A common problem in molecular phylogenetics is choosing a model of DNA substitution that does a good job of explaining the DNA sequence alignment without introducing superfluous parameters. A number of methods have been used to choose among a small set of candidate substitution models, such as the likelihood ratio test, the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and Bayes factors. Current implementations of any of these criteria suffer from the limitation that only a small set of models are examined, or that the test does not allow easy comparison of non-nested models. In this article, we expand the pool of candidate substitution models to include all possible time-reversible models. This set includes seven models that have already been described. We show how Bayes factors can be calculated for these models using reversible jump Markov chain Monte Carlo, and apply the method to 16 DNA sequence alignments. For each data set, we compare the model with the best Bayes factor to the best models chosen using AIC and BIC. We find that the best model under any of these criteria is not necessarily the most complicated one; models with an intermediate number of substitution types typically do best. Moreover, almost all of the models that are chosen as best do not constrain a transition rate to be the same as a transversion rate, suggesting that it is the transition/transversion rate bias that plays the largest role in determining which models are selected. Importantly, the reversible jump Markov chain Monte Carlo algorithm described here allows estimation of phylogeny (and other phylogenetic model parameters) to be performed while accounting for uncertainty in the model of DNA substitution.
Key Words: Bayesian phylogenetic inference Markov chain Monte Carlo maximum likelihood reversible jump Markov chain Monte Carlo substitution models
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. P Huelsenbeck, P. Joyce, C. Lakner, and F. Ronquist Bayesian analysis of amino acid substitution models Phil Trans R Soc B, December 27, 2008; 363(1512): 3941 - 3953. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pagel and A. Meade Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo Phil Trans R Soc B, December 27, 2008; 363(1512): 3955 - 3964. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Kim and M. J. Sanderson Penalized Likelihood Phylogenetic Inference: Bridging the Parsimony-Likelihood Gap Syst Biol, October 1, 2008; 57(5): 665 - 674. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Munch, W. Boomsma, J. P. Huelsenbeck, E. Willerslev, and R. Nielsen Statistical Assignment of DNA Sequences Using Bayesian Phylogenetics Syst Biol, October 1, 2008; 57(5): 750 - 757. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Grzegorczyk, D. Husmeier, K. D. Edwards, P. Ghazal, and A. J. Millar Modelling non-stationary gene regulatory processes with a non-homogeneous Bayesian network and the allocation sampler Bioinformatics, September 15, 2008; 24(18): 2071 - 2078. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Yang and L. Zhang Performance comparison between k-tuple distance and four model-based distances in phylogenetic tree reconstruction Nucleic Acids Res., March 1, 2008; 36(5): e33 - e33. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Chen, S. F. Perry, S. Aris-Brosou, C. Selva, and T. W. Moon Characterization and functional divergence of the {alpha}1-adrenoceptor gene family: insights from rainbow trout (Oncorhynchus mykiss) Physiol Genomics, December 19, 2007; 32(1): 142 - 153. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray A Reversible Jump Method for Bayesian Phylogenetic Inference with a Nonhomogeneous Substitution Model Mol. Biol. Evol., June 1, 2007; 24(6): 1286 - 1299. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Ane, B. Larget, D. A. Baum, S. D. Smith, and A. Rokas Bayesian Estimation of Concordance among Gene Trees Mol. Biol. Evol., February 1, 2007; 24(2): 412 - 426. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond, F. V. Mannino, M. B. Gravenor, S. V. Muse, and S. D. W. Frost Evolutionary Model Selection with a Genetic Algorithm: A Case Study Using Stem RNA Mol. Biol. Evol., January 1, 2007; 24(1): 159 - 170. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Vanderpoorten and B. Goffinet Mapping Uncertainty and Phylogenetic Uncertainty in Ancestral Character State Reconstruction: An Example in the Moss Genus Brachytheciastrum Syst Biol, December 1, 2006; 55(6): 957 - 971. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Perez-Losada, R. G. Christensen, D. A. McClellan, B. J. Adams, R. P. Viscidi, J. C. Demma, and K. A. Crandall Comparing Phylogenetic Codivergence between Polyomaviruses and Their Hosts. J. Virol., June 1, 2006; 80(12): 5663 - 5669. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Lartillot and H. Philippe Computing Bayes Factors Using Thermodynamic Integration Syst Biol, April 1, 2006; 55(2): 195 - 207. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Scheffler and C. Seoighe A Bayesian Model Comparison Approach to Inferring Positive Selection Mol. Biol. Evol., December 1, 2005; 22(12): 2531 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and B. Rannala Branch-Length Prior Influences Bayesian Posterior Probability of Phylogeny Syst Biol, June 1, 2005; 54(3): 455 - 470. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond and S. D. W. Frost Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection Mol. Biol. Evol., May 1, 2005; 22(5): 1208 - 1222. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Kosakovsky Pond and S. D. W. Frost A Simple Hierarchical Approach to Modeling Distributions of Substitution Rates Mol. Biol. Evol., February 1, 2005; 22(2): 223 - 234. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Frequentist Properties of Bayesian Posterior Probabilities of Phylogenetic Trees Under Simple and Complex Substitution Models Syst Biol, December 1, 2004; 53(6): 904 - 913. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Pol Empirical Problems of the Hierarchical Likelihood Ratio Test for Model Selection Syst Biol, December 1, 2004; 53(6): 949 - 962. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada and T. R. Buckley Model Selection and Model Averaging in Phylogenetics: Advantages of Akaike Information Criterion and Bayesian Approaches Over Likelihood Ratio Tests Syst Biol, October 1, 2004; 53(5): 793 - 808. [Abstract] [Full Text] [PDF] |
||||






