Molecular Biology and Evolution, Vol 9, 1119-1147, Copyright © 1992 by Society for Molecular Biology and Evolution
A Zharkikh and WH Li
The statistical properties of sample estimation and bootstrap estimation of
phylogenetic variability from a sample of nucleotide sequences are studied
by using model trees of three taxa with an outgroup and by assuming a
constant rate of nucleotide substitution. The maximum-parsimony method of
tree reconstruction is used. An analytic formula is derived for estimating
the sequence length that is required if P, the probability of obtaining the
true tree from the sampled sequences, is to be equal to or higher than a
given value. Bootstrap estimation is formulated as a two-step sampling
procedure: (1) sampling of sequences from the evolutionary process and (2)
resampling of the original sequence sample. The probability that a
bootstrap resampling of an original sequence sample will support the true
tree is found to depend on the model tree, the sequence length, and the
probability that a randomly chosen nucleotide site is an informative site.
When a trifurcating tree is used as the model tree, the probability that
one of the three bifurcating trees will appear in > or = 95% of the
bootstrap replicates is < 5%, even if the number of bootstrap replicates
is only 50; therefore, the probability of accepting an erroneous tree as
the true tree is < 5% if that tree appears in > or = 95% of the
bootstrap replicates and if more than 50 bootstrap replications are
conducted. However, if a particular bifurcating tree is observed in, say,
< 75% of the bootstrap replicates, then it cannot be claimed to be
better than the trifurcating tree even if > or = 1,000 bootstrap
replications are conducted. When a bifurcating tree is used as the model
tree, the bootstrap approach tends to overestimate P when the sequences are
very short, but it tends to underestimate that probability when the
sequences are long. Moreover, simulation results show that, if a tree is
accepted as the true tree only if it has appeared in > or = 95% of the
bootstrap replicates, then the probability of failing to accept any
bifurcating tree can be as large as 58% even when P = 95%, i.e., even when
95% of the samples from the evolutionary process will support the true
tree. Thus, if the rate-constancy assumption holds, bootstrapping is a
conservative approach for estimating the reliability of an inferred
phylogeny for four taxa.
ORIGINAL ARTICLE
Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock
Center for Demographic and Population Genetics, University of Texas, Houston 77225.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. Susko Bootstrap Support Is Not First-Order Correct Syst Biol, July 1, 2009; (2009) syp016v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Krug, M. S. Morley, J. Asif, L. L. Hellyar, and W. M. Blom Molecular confirmation of species status for the rare cephalaspidean Melanochlamys lorrainae (Rudman, 1968), and comparison with its sister species M. cylindrica Cheeseman, 1881 J. Mollus. Stud., August 1, 2008; 74(3): 267 - 276. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, M. M. Gutacker, J. M. Musser, and Y.-X. Fu Evidence for Recombination in Mycobacterium tuberculosis J. Bacteriol., December 1, 2006; 188(23): 8169 - 8177. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Hillis, T. A. Heath, and K. St. John Analysis and Visualization of Tree Space Syst Biol, June 1, 2005; 54(3): 471 - 482. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Frequentist Properties of Bayesian Posterior Probabilities of Phylogenetic Trees Under Simple and Complex Substitution Models Syst Biol, December 1, 2004; 53(6): 904 - 913. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Cai, C. Hu, S. Shen, W. Wang, and W. Huang Characterization of Bacteriophage T3 DNA Ligase J. Biochem., March 1, 2004; 135(3): 397 - 403. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. P. Simmons, K. M. Pickett, and M. Miya How Meaningful Are Bayesian Support Values? Mol. Biol. Evol., January 1, 2004; 21(1): 188 - 199. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Helms, T. Friedl, and G. Rambold Phylogenetic relationships of the Physciaceae inferred from rDNA sequence data and selected phenotypic characters Mycologia, November 1, 2003; 95(6): 1078 - 1099. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-D. Durand, P. G. Bianco, J. Laroche, and A. Gilles Insight Into the Origin of Endemic Mediterranean Ichthyofauna: Phylogeography of Chondrostoma Genus (Teleostei, Cyprinidae) J. Hered., July 1, 2003; 94(4): 315 - 328. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Suzuki, G. V. Glazko, and M. Nei Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics PNAS, December 10, 2002; 99(25): 16138 - 16143. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Metzker, D. P. Mindell, X.-M. Liu, R. G. Ptak, R. A. Gibbs, and D. M. Hillis Molecular evidence of HIV-1 transmission in a criminal case PNAS, October 29, 2002; 99(22): 14292 - 14297. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ota and W.-H. Li NJML: A Hybrid Algorithm for the Neighbor-Joining and Maximum-Likelihood Methods Mol. Biol. Evol., September 1, 2000; 17(9): 1401 - 1409. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Buisson, M. Grandadam, E. Nicand, P. Cheval, H. van Cuyck-Gandre, B. Innis, P. Rehel, P. Coursaget, R. Teyssou, and S. Tsarev Identification of a novel hepatitis E virus in Nigeria J. Gen. Virol., April 1, 2000; 81(4): 903 - 909. [Abstract] [Full Text] |
||||
![]() |
P. Morozov, T. Sitnikova, G. Churchill, F. J. Ayala, and A. Rzhetsky A New Method for Characterizing Replacement Rate Variation in Molecular Sequences: Application of the Fourier and Wavelet Models to Drosophila and Mammalian Proteins Genetics, January 1, 2000; 154(1): 381 - 395. [Abstract] [Full Text] |
||||
![]() |
M. Nei, S. Kumar, and K. Takahashi The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small PNAS, October 13, 1998; 95(21): 12390 - 12397. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Efron, E. Halloran, and S. Holmes Bootstrap confidence levels for phylogenetic trees PNAS, November 12, 1996; 93(23): 13429 - 13429. [Abstract] [Full Text] [PDF] |
||||









