Molecular Biology and Evolution, Vol 12, 131-151, Copyright © 1995 by Society for Molecular Biology and Evolution
A Rzhetsky and M Nei
Using linear invariants for various models of nucleotide substitution, we
developed test statistics for examining the applicability of a specific
model to a given dataset in phylogenetic inference. The models examined are
those developed by Jukes and Cantor (1969), Kimura (1980), Tajima and Nei
(1984), Hasegawa et al. (1985), Tamura (1992), Tamura and Nei (1993), and a
new model called the eight-parameter model. The first six models are
special cases of the last model. The test statistics developed are
independent of evolutionary time and phylogeny, although the variances of
the statistics contain phylogenetic information. Therefore, these
statistics can be used before a phylogenetic tree is estimated. Our
objective is to find the simplest model that is applicable to a given
dataset, keeping in mind that a simple model usually gives an estimate of
evolutionary distance (number of nucleotide substitutions per site) with a
smaller variance than a complicated model when the simple model is correct.
We have also developed a statistical test of the homogeneity of nucleotide
frequencies of a sample of several sequences that takes into account
possible phylogenetic correlations. This test is used to examine the
stationarity in time of the base frequencies in the sample. For Hasegawa et
al.'s and the eight-parameter models, analytical formulas for estimating
evolutionary distances are presented. Application of the above tests to
several sets of real data has shown that the assumption of stationarity of
base composition is usually acceptable when the sequences studied are
closely related but otherwise it is rejected. Similarly, the simple models
of nucleotide substitution are almost always rejected when actual genes are
distantly related and/or the total number of nucleotides examined is large.
ORIGINAL ARTICLE
Tests of applicability of several substitution models for DNA sequence data
Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park 16802.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Z. Yang, J. D. O'Brien, X. Zheng, H.-Q. Zhu, and Z.-S. She Tree and rate estimation by local evaluation of heterochronous nucleotide data Bioinformatics, January 15, 2007; 23(2): 169 - 176. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ababneh, L. S. Jermiin, C. Ma, and J. Robinson Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences Bioinformatics, May 15, 2006; 22(10): 1225 - 1231. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Romeike, T. Friedl, G. Helms, and S. Ott Genetic Diversity of Algal and Fungal Partners in Four Species of Umbilicaria (Lichenized Ascomycetes) Along a Transect of the Antarctic Peninsula Mol. Biol. Evol., August 1, 2002; 19(8): 1209 - 1217. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. R. Castro, A. D. Austin, and M. Dowton Contrasting Rates of Mitochondrial Molecular Evolution in Parasitic Diptera and Hymenoptera Mol. Biol. Evol., July 1, 2002; 19(7): 1100 - 1113. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Tarrio, F. Rodriguez-Trelles, and F. J. Ayala Shared Nucleotide Composition Biases Among Species and Their Impact on Phylogenetic Reconstructions of the Drosophilidae Mol. Biol. Evol., August 1, 2001; 18(8): 1464 - 1473. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kumar and S. R. Gadagkar Disparity Index: A Simple Statistic to Measure and Test the Homogeneity of Substitution Patterns Between Molecular Sequences Genetics, July 1, 2001; 158(3): 1321 - 1327. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada and K. A. Crandall Selecting Models of Nucleotide Substitution: An Application to Human Immunodeficiency Virus 1 (HIV-1) Mol. Biol. Evol., June 1, 2001; 18(6): 897 - 906. [Abstract] [Full Text] |
||||
![]() |
F. Rodriguez-Trelles, R. Tarrio, and F. J. Ayala Evidence for a High Ancestral GC Content in Drosophila Mol. Biol. Evol., November 1, 2000; 17(11): 1710 - 1717. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Rodriguez-Trelles, L. Alarcon, and A. Fontdevila Molecular Evolution and Phylogeny of the buzzatii Complex (Drosophila repleta Group): A Maximum-Likelihood Approach Mol. Biol. Evol., July 1, 2000; 17(7): 1112 - 1122. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Abe, T. Inami, K. Ishikawa, S. Nakamura, and S. Goto TT Virus Infection in Nonhuman Primates and Characterization of the Viral Genome: Identification of Simian TT Virus Isolates J. Virol., February 1, 2000; 74(3): 1549 - 1553. [Abstract] [Full Text] |
||||
![]() |
H. Naito, S. Hayashi, and K. Abe The entire nucleotide sequence of two hepatitis G virus isolates belonging to a novel genotype: isolation in Myanmar and Vietnam J. Gen. Virol., January 1, 2000; 81(1): 189 - 194. [Abstract] [Full Text] |
||||
![]() |
N. Konomi, C. Miyoshi, C. La Fuente Zerain, T.-C. Li, Y. Arakawa, and K. Abe Epidemiology of Hepatitis B, C, E, and G Virus Infections and Molecular Analysis of Hepatitis G Virus Isolates in Bolivia J. Clin. Microbiol., October 1, 1999; 37(10): 3291 - 3295. [Abstract] [Full Text] |
||||
![]() |
H. Naito, K. M. Win, and K. Abe Identification of a Novel Genotype of Hepatitis G Virus in Southeast Asia J. Clin. Microbiol., April 1, 1999; 37(4): 1217 - 1220. [Abstract] [Full Text] |
||||
![]() |
H. Akashi Inferring the Fitness Effects of DNA Mutations From Polymorphism and Divergence Data: Statistical Power to Detect Directional Selection Under Stationarity and Free Recombination Genetics, January 1, 1999; 151(1): 221 - 238. [Abstract] [Full Text] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Phylogenetic Methods Come of Age: Testing Hypotheses in an Evolutionary Context Science, April 11, 1997; 276(5310): 227 - 232. [Abstract] [Full Text] |
||||






