Skip Navigation

This Article
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rzhetsky, A.
Right arrow Articles by Sitnikova, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rzhetsky, A.
Right arrow Articles by Sitnikova, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Molecular Biology and Evolution, Vol 13, 1255-1265, Copyright © 1996 by Society for Molecular Biology and Evolution


ORIGINAL ARTICLE

When is it safe to use an oversimplified substitution model in tree- making?

A Rzhetsky and T Sitnikova
Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park 16802, USA.

The choice of an "optimal" mathematical model for computing evolutionary distances from real sequences is not currently supported by easy-to-use software applicable to large data sets, and an investigator frequently selects one of the simplest models available. Here we study properties of the observed proportion of differences (p- distance) between sequences as an estimator of evolutionary distance for tree-making. We show that p-distances allow for consistent tree- making with any of the popular methods working with evolutionary distances if evolution of sequences obeys a "molecular clock" (more precisely, if it follows a stationary time-reversible Markov model of nucleotide substitution). Next, we show that p-distances seem to be efficient in recovering the correct tree topology under a "molecular clock," but produce "statistically supported" wrong trees when substitutions rates vary among evolutionary lineages. Finally, we outline a practical approach for selecting an "optimal" model of nucleotide substitution in a real data analysis, and obtain a crude estimate of a "prior" distribution of the expected tree branch lengths under the Jukes-Cantor model. We conclude that the use of a model that is obviously oversimplified is inadvisable unless it is justified by a preliminary analysis of the real sequences.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
E. Susko, Y. Inagaki, and A. J. Roger
On Inconsistency of the Neighbor-Joining, Least Squares, and Minimum Evolution Estimation When Substitution Processes Are Incorrectly Modeled
Mol. Biol. Evol., September 1, 2004; 21(9): 1629 - 1642.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
K. Tamura, M. Nei, and S. Kumar
Prospects for inferring very large phylogenies by using the neighbor-joining method
PNAS, July 27, 2004; 101(30): 11030 - 11035.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Guindon and O. Gascuel
Efficient Biased Estimation of Evolutionary Distances When Substitution Rates Vary Across Sites
Mol. Biol. Evol., April 1, 2002; 19(4): 534 - 543.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
D. Posada and K. A. Crandall
Selecting Models of Nucleotide Substitution: An Application to Human Immunodeficiency Virus 1 (HIV-1)
Mol. Biol. Evol., June 1, 2001; 18(6): 897 - 906.
[Abstract] [Full Text]


Home page
Mol Biol EvolHome page
M. A. Suchard, R. E. Weiss, and J. S. Sinsheimer
Bayesian Selection of Continuous-Time Markov Chain Evolutionary Models
Mol. Biol. Evol., June 1, 2001; 18(6): 1001 - 1013.
[Abstract] [Full Text]


Home page
Genome ResHome page
N. V. Grishin, Y. I. Wolf, and E. V. Koonin
From Complete Genomes to Measures of Substitution Rate Variability Within and Between Proteins
Genome Res., July 1, 2000; 10(7): 991 - 1000.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Nei, S. Kumar, and K. Takahashi
The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small
PNAS, October 13, 1998; 95(21): 12390 - 12397.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.