Molecular Biology and Evolution, Vol 15, 910-917, Copyright © 1998 by Society for Molecular Biology and Evolution
AL Halpern and WJ Bruno
Estimation of evolutionary distances from coding sequences must take into
account protein-level selection to avoid relative underestimation of longer
evolutionary distances. Current modeling of selection via site-to-site rate
heterogeneity generally neglects another aspect of selection, namely
position-specific amino acid frequencies. These frequencies determine the
maximum dissimilarity expected for highly diverged but functionally and
structurally conserved sequences, and hence are crucial for estimating long
distances. We introduce a codon- level model of coding sequence evolution
in which position-specific amino acid frequencies are free parameters. In
our implementation, these are estimated from an alignment using methods
described previously. We use simulations to demonstrate the importance and
feasibility of modeling such behavior; our model produces linear distance
estimates over a wide range of distances, while several alternative models
underestimate long distances relative to short distances. Site-to-site
differences in rates, as well as synonymous/nonsynonymous and
first/second/third-codon-position differences, arise as a natural
consequence of the site-to-site differences in amino acid frequencies.
ORIGINAL ARTICLE
Evolutionary distances for protein-coding sequences: modeling site- specific residue frequencies
Los Alamos National Laboratory, New Mexico, USA. ahalpern@ender.unm.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Si Quang, O. Gascuel, and N. Lartillot Empirical profile mixture models for phylogenetic reconstruction Bioinformatics, October 15, 2008; 24(20): 2317 - 2323. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Mustonen, J. Kinney, C. G. Callan Jr, and M. Lassig Energy-dependent fitness: A quantitative model for the evolution of yeast transcription factor binding sites PNAS, August 26, 2008; 105(34): 12376 - 12381. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Kolaczkowski and J. W. Thornton A Mixed Branch Length Model of Heterotachy Improves Phylogenetic Accuracy Mol. Biol. Evol., June 1, 2008; 25(6): 1054 - 1066. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Blanquart and N. Lartillot A Site- and Time-Heterogeneous Model of Amino Acid Replacement Mol. Biol. Evol., May 1, 2008; 25(5): 842 - 858. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and R. Nielsen Mutation-Selection Models of Codon Substitution and Their Use to Estimate Selective Strengths on Codon Usage Mol. Biol. Evol., March 1, 2008; 25(3): 568 - 579. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. S.W. Wong and R. Nielsen Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models Bioinformatics, August 15, 2007; 23(16): 2031 - 2037. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Thorne, S. C. Choi, J. Yu, P. G. Higgs, and H. Kishino Population Genetics Without Intraspecific Data Mol. Biol. Evol., August 1, 2007; 24(8): 1667 - 1677. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Gesell and A. von Haeseler In silico sequence evolution with site-specific interactions along phylogenetic trees Bioinformatics, March 15, 2006; 22(6): 716 - 722. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Mustonen and M. Lassig Evolutionary population genetics of promoters: Predicting binding sites and functional phylogenies PNAS, November 1, 2005; 102(44): 15936 - 15941. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Porto, H. E. Roman, M. Vendruscolo, and U. Bastolla Prediction of Site-Specific Amino Acid Distributions and Limits of Divergent Evolutionary Changes in Protein Sequences Mol. Biol. Evol., March 1, 2005; 22(3): 630 - 638. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Blouin, D. Butt, and A. J. Roger Impact of Taxon Sampling on the Estimation of Rates of Evolution at Sites Mol. Biol. Evol., March 1, 2005; 22(3): 784 - 791. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Lartillot and H. Philippe A Bayesian Mixture Model for Across-Site Heterogeneities in the Amino-Acid Replacement Process Mol. Biol. Evol., June 1, 2004; 21(6): 1095 - 1109. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Nielsen and Z. Yang Estimating the Distribution of Selection Coefficients from Phylogenetic Data with Applications to Mitochondrial and Viral DNA Mol. Biol. Evol., August 1, 2003; 20(8): 1231 - 1239. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Fornasari, G. Parisi, and J. Echave Site-Specific Amino Acid Replacement Matrices from Structurally Constrained Protein Evolution Simulations Mol. Biol. Evol., March 1, 2002; 19(3): 352 - 356. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. B. Levin and B. Whittome Codon usage in nucleopolyhedroviruses J. Gen. Virol., September 1, 2000; 81(9): 2313 - 2325. [Abstract] [Full Text] |
||||
![]() |
M. Steel and D. Penny Parsimony, Likelihood, and the Role of Models in Molecular Phylogenetics Mol. Biol. Evol., June 1, 2000; 17(6): 839 - 850. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. J. Bruno, N. D. Socci, and A. L. Halpern Weighted Neighbor Joining: A Likelihood-Based Approach to Distance-Based Phylogeny Reconstruction Mol. Biol. Evol., January 1, 2000; 17(1): 189 - 197. [Abstract] [Full Text] [PDF] |
||||



