MBE Advance Access published online on June 16, 2004
Molecular Biology and Evolution, doi:10.1093/molbev/msh194
Molecular Biology and Evolution © Society for Molecular Biology and Evolution 2004; all rights reserved
Original Articles
Comparison of Site-Specific Rate-Inference Methods for Protein Sequences: Empirical Bayesian Methods Are Superior
1 Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
2 Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204, USA
3 Department of Biochemistry, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
* To whom correspondence should be addressed. E-mail: dgraur{at}uh.edu.
Abstract
The degree to which an amino-acid site is free to vary depends strongly on its structural and functional importance. An amino acid that plays an essential role is unlikely to change over evolutionary time. Hence, the evolutionary rate at an amino-acid site indicates how conserved the site is and, in turn, allows its importance in maintaining the structure and function of the protein to be evaluated. When probabilistic methods are used for site-specific rate inference, a few alternatives are possible. In this study we use simulations to compare the maximum-likelihood and Bayesian paradigms. We study how inference accuracy depends on parameters such as the number of sequences, the branch lengths, the shape of the rate distribution, and the sequence length. We also study the possibility of simultaneously estimating branch lengths and site-specific rates. Our results show that a Bayesian approach is superior to maximum likelihood under a wide range of conditions, indicating that the prior incorporated into the Bayesian computation significantly improves performance. We show that when branch lengths are unknown, it is better to estimate the branch lengths first and then estimate the site-specific rates. This procedure was found to be superior to estimating the branch lengths and site-specific rates simultaneously. Finally, we illustrate the difference between maximum-likelihood and Bayesian methods when analyzing site conservation for the apoptosis regulator protein Bcl-xL.
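The abstract's central claim, that the prior used in the Bayesian computation improves site-specific rate estimates, can be illustrated with a deliberately simplified sketch. It is not the phylogenetic likelihood used in the study. It assumes a toy model in which each site's rate is drawn from a gamma distribution with mean 1 and the number of substitutions observed at the site is Poisson with mean rate × total tree length. All names (`sample_poisson`, `ml_rate`, `bayes_rate`, `compare`) and parameter values are hypothetical choices made for illustration.

```python
import math
import random


def sample_poisson(lam, rng):
    """Draw a Poisson variate with Knuth's method (fine for small means)."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def ml_rate(count, tree_length):
    """Maximum-likelihood rate under the toy Poisson model."""
    return count / tree_length


def bayes_rate(count, tree_length, alpha):
    """Posterior-mean rate under a Gamma(shape=alpha, rate=alpha) prior.

    The gamma prior is conjugate to the Poisson likelihood, so the
    posterior is Gamma(alpha + count, rate=alpha + tree_length).
    """
    return (alpha + count) / (alpha + tree_length)


def compare(n_sites=20000, tree_length=2.0, alpha=0.5, seed=1):
    """Return (MSE of ML, MSE of Bayes) over simulated sites."""
    rng = random.Random(seed)
    se_ml = se_bayes = 0.0
    for _ in range(n_sites):
        # gammavariate takes (shape, scale); scale = 1/alpha gives mean 1.
        true_rate = rng.gammavariate(alpha, 1.0 / alpha)
        count = sample_poisson(true_rate * tree_length, rng)
        se_ml += (ml_rate(count, tree_length) - true_rate) ** 2
        se_bayes += (bayes_rate(count, tree_length, alpha) - true_rate) ** 2
    return se_ml / n_sites, se_bayes / n_sites


if __name__ == "__main__":
    mse_ml, mse_bayes = compare()
    print(f"ML MSE: {mse_ml:.3f}  Bayes MSE: {mse_bayes:.3f}")
```

Under these toy assumptions the expected squared error is 1/T for the maximum-likelihood estimate and 1/(α + T) for the posterior mean, where T is the tree length and α the gamma shape parameter, so the Bayesian estimate gains most when the data per site are sparse. Whether the same margin holds for real alignments depends on the full phylogenetic model, which this sketch does not attempt to reproduce.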








