MBE Advance Access originally published online on November 17, 2004
Molecular Biology and Evolution 2005 22(3):691-703; doi:10.1093/molbev/msi050
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Accounting for Uncertainty in the Tree Topology Has Little Effect on the Decision-Theoretic Approach to Model Selection in Phylogeny Estimation
,

,
,¶
* Initiative in Bioinformatics and Evolutionary Studies (IBEST),
Program of Bioinformatics and Computational Biology, and
Department of Mathematics, University of Idaho, Moscow;
Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles; and ¶ Department of Biological Science, University of Idaho, Moscow
E-mail: abdo9538{at}uidaho.edu.
Currently available methods for model selection used in phylogenetic analysis are based on an initial fixed-tree topology. Once a model is picked based on this topology, a rigorous search of the tree space is run under that model to find the maximum-likelihood estimate of the tree (topology and branch lengths) and the maximum-likelihood estimates of the model parameters. In this paper, we propose two extensions to the decision-theoretic (DT) approach that relax the fixed-topology restriction. We also relax the fixed-topology restriction for the Bayesian information criterion (BIC) and the Akaike information criterion (AIC) methods. We compare the performance of the different methods (the relaxed, restricted, and the likelihood-ratio test [LRT]) using simulated data. This comparison is done by evaluating the relative complexity of the models resulting from each method and by comparing the performance of the chosen models in estimating the true tree. We also compare the methods relative to one another by measuring the closeness of the estimated trees corresponding to the different chosen models under these methods. We show that varying the topology does not have a major impact on model choice. We also show that the outcome of the two proposed extensions is identical and is comparable to that of the BIC, Extended-BIC, and DT. Hence, using the simpler methods in choosing a model for analyzing the data is more computationally feasible, with results comparable to the more computationally intensive methods. Another outcome of this study is that earlier conclusions about the DT approach are reinforced. That is, LRT, Extended-AIC, and AIC result in more complicated models that do not contribute to the performance of the phylogenetic inference, yet cause a significant increase in the time required for data analysis.
Key Words: Decision-theoretic model selection DT-ModSel Bayesian information criterion Akaike information criterion hierarchical likelihood testing ModelTest
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T.-K. Seo and H. Kishino Statistical Comparison of Nucleotide, Amino Acid, and Codon Substitution Models for Evolutionary Analysis of Protein-Coding Sequences Syst Biol, June 29, 2009; (2009) syp015v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Holder, J. Sukumaran, and P. O. Lewis A Justification for Reporting the Majority-Rule Consensus Tree in Bayesian Phylogenetics Syst Biol, October 1, 2008; 57(5): 814 - 821. [Full Text] [PDF] |
||||
![]() |
D. Posada jModelTest: Phylogenetic Model Averaging Mol. Biol. Evol., July 1, 2008; 25(7): 1253 - 1256. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ripplinger and J. Sullivan Does Choice in Model Selection Affect Maximum Likelihood Analysis? Syst Biol, February 1, 2008; 57(1): 76 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Benavides, R. Baum, D. McClellan, and J. W. Sites Molecular Phylogenetics of the Lizard Genus Microlophus (Squamata:Tropiduridae): Aligning and Retrieving Indel Signal from Nuclear Introns Syst Biol, October 1, 2007; 56(5): 776 - 797. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. McGuire, C. C. Witt, D. L. Altshuler, and J. V. Remsen Phylogenetic Systematics and Biogeography of Hummingbirds: Bayesian and Maximum Likelihood Analyses of Partitioned Data and Selection of an Appropriate Partitioning Strategy Syst Biol, October 1, 2007; 56(5): 837 - 856. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Kosiol, I. Holmes, and N. Goldman An Empirical Codon Model for Protein Sequence Evolution Mol. Biol. Evol., July 1, 2007; 24(7): 1464 - 1479. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Alfaro and J. P. Huelsenbeck Comparative Performance of Bayesian and AIC-Based Measures of Phylogenetic Model Uncertainty Syst Biol, February 1, 2006; 55(1): 89 - 96. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Sullivan, Z. Abdo, P. Joyce, and D. L. Swofford Evaluating the Performance of a Successive-Approximations Approach to Parameter Optimization in Maximum-Likelihood Phylogeny Estimation Mol. Biol. Evol., June 1, 2005; 22(6): 1386 - 1392. [Abstract] [Full Text] [PDF] |
||||

