MBE Advance Access originally published online on March 9, 2005
Molecular Biology and Evolution 2005 22(6):1386-1392; doi:10.1093/molbev/msi129
Evaluating the Performance of a Successive-Approximations Approach to Parameter Optimization in Maximum-Likelihood Phylogeny Estimation


* Department of Biological Sciences,
Initiative in Bioinformatics and Evolutionary Studies and Program in Bioinformatics and Computational Biology, and
Department of Mathematics, University of Idaho; and
School of Computational Science and Department of Biological Science, Florida State University
E-mail: jacks{at}uidaho.edu.
Almost all studies that estimate phylogenies from DNA sequence data under the maximum-likelihood (ML) criterion employ an approximate approach. Most commonly, model parameters are estimated on an initial phylogenetic estimate derived using a rapid method (neighbor-joining or parsimony). Parameters are then held constant during a tree search, and ideally, the procedure is repeated until convergence is achieved. However, the effectiveness of this approximation has not been formally assessed, in part because doing so requires computationally intensive, full-optimization analyses. Here, we report both indirect and direct evaluations of the effectiveness of successive approximations. We obtained an indirect evaluation by comparing the results of replicate runs on real data that use random trees to provide initial parameter estimates. For six real data sets taken from the literature, all replicate iterative searches converged to the same joint estimates of topology and model parameters, suggesting that the approximation is not starting-point dependent, as long as the heuristic searches of tree space are rigorous. We conducted a more direct assessment using simulations in which we compared the accuracy of phylogenies estimated using full optimization of all model parameters on each tree evaluated with the accuracy of trees estimated via successive approximations. We found no significant difference in accuracy between the successive-approximation searches and the full-optimization searches. Our results demonstrate that successive approximation is reliable and provide reassurance that this much faster approach is safe to use for ML estimation of topology.
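The iterative procedure summarized above (estimate parameters on a starting tree, hold them fixed during a tree search, repeat until the tree stops changing) can be sketched as a short loop. This is only an illustrative sketch and not the implementation used in the study: the function names `successive_approximation`, `toy_optimize_params`, and `toy_search_tree`, the three candidate "trees" A, B, and C, and the toy log-likelihood values are all hypothetical stand-ins for a real ML program's parameter optimizer and heuristic tree search.

```python
from typing import Callable, Tuple


def successive_approximation(
    initial_tree: str,
    optimize_params: Callable[[str], float],
    search_tree: Callable[[float], str],
    max_iter: int = 20,
) -> Tuple[str, float, int]:
    """Alternate parameter estimation and tree search until the tree is stable.

    Returns (tree, params, number_of_iterations).
    """
    tree = initial_tree
    for iteration in range(1, max_iter + 1):
        # Step 1: estimate model parameters on the current tree.
        params = optimize_params(tree)
        # Step 2: search tree space with the parameters held constant.
        new_tree = search_tree(params)
        # Step 3: stop when the search returns the tree the parameters came from.
        if new_tree == tree:
            return tree, params, iteration
        tree = new_tree
    raise RuntimeError("no convergence within max_iter iterations")


# Hypothetical toy problem: three candidate "trees" and one model parameter.
# PEAK[t] is the parameter value that maximizes the toy log-likelihood on
# tree t; BONUS[t] is the toy log-likelihood reached at that peak.
PEAK = {"A": 0.0, "B": 0.5, "C": 1.0}
BONUS = {"A": 0.0, "B": 0.5, "C": 0.9}


def toy_lnl(tree: str, theta: float) -> float:
    """Toy log-likelihood of a tree at parameter value theta (assumed form)."""
    return BONUS[tree] - (theta - PEAK[tree]) ** 2


def toy_optimize_params(tree: str) -> float:
    """Parameter value maximizing the toy log-likelihood on a fixed tree."""
    return PEAK[tree]


def toy_search_tree(theta: float) -> str:
    """Exhaustive 'tree search' with the parameter held constant."""
    return max(PEAK, key=lambda t: toy_lnl(t, theta))


if __name__ == "__main__":
    # Replicate runs from different starting trees, echoing the indirect test.
    for start in ("A", "B", "C"):
        print(start, successive_approximation(start, toy_optimize_params, toy_search_tree))
```

In this toy setting every starting tree leads to the same joint estimate of tree and parameter, which mirrors the kind of starting-point check described for the replicate runs on real data. A real analysis would replace the two toy functions with a likelihood optimizer and a rigorous heuristic search.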
Key Words: maximum likelihood; models; phylogeny; successive approximations; parameter estimation





