Skip Navigation



MBE Advance Access published online on June 16, 2004

Molecular Biology and Evolution, doi:10.1093/molbev/msh194
Molecular Biology and Evolution © Society for Molecular Biology and Evolution 2004; all rights reserved
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow All Versions of this Article:
21/9/1781    most recent
msh194v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Mayrose, I.
Right arrow Articles by Pupko, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mayrose, I.
Right arrow Articles by Pupko, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Accepted June 3, 2004

Original Articles

Comparison of Site-Specific Rate-Inference Methods for Protein Sequences: Empirical Bayesian Methods Are Superior

Itay Mayrose 1, Dan Graur 2*, Nir BenTal 3, Tal Pupko 1

1 Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
2 Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204, USA
3 Department of Biochemistry, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel

* To whom correspondence should be addressed. E-mail: dgraur{at}uh.edu.


   Abstract

The degree to which an amino-acid site is free to vary is strongly dependent on its structural and functional importance. An amino acid that plays an essential role is unlikely to change over evolutionary time. Hence, the evolutionary rate at an amino-acid site is indicative of how conserved this site is, and in turn, allows evaluating its importance in maintaining the structure/function of the protein. When using probabilistic methods for site-specific rate inference few alternatives are possible. In this study we use simulations to compare the maximum likelihood and Bayesian paradigms. We study the dependence of inference accuracy on such parameters as number of sequences, branch lengths, the shape of the rate distribution, and sequence length. We also study the possibility of simultaneously estimating branch lengths and site-specific rates. Our results show that a Bayesian approach is superior to maximum-likelihood under a wide range of conditions, indicating that the prior that is incorporated into the Bayesian computation significantly improves performance. We show that when branch lengths are unknown, it is better first to estimate branch lengths and, then, to estimate site-specific rates. This procedure was found to be superior to estimating both the branch lengths and site-specific rates simultaneously. Finally, we illustrate the difference between maximum likelihood and Bayesian methods when analyzing site-conservation for the apoptosis regulator protein Bcl-xL.

Keywords: rate variation among sites, evolutionary conservation, Empirical Bayesian methods, bioinformatics, Bcl-xL
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Madaoui and R. Guerois
Coevolution at protein complex interfaces can be detected by the complementarity trace with important impact for predictive docking
PNAS, June 3, 2008; 105(22): 7708 - 7713.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. D. Fischer, C. E. Mayer, and J. Soding
Prediction of protein functional residues from sequence by probability density estimation
Bioinformatics, March 1, 2008; 24(5): 613 - 620.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Leulliot, M. T. Bohnsack, M. Graille, D. Tollervey, and H. Van Tilbeurgh
The yeast ribosome synthesis factor Emg1 is a novel member of the superfamily of alpha/beta knot fold methyltransferases
Nucleic Acids Res., February 2, 2008; 36(2): 629 - 639.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
M. Brilli, R. Fani, and P. Lio
Current trends in the bioinformatic sequence analysis of metabolic pathways in prokaryotes
Brief Bioinform, January 1, 2008; 9(1): 34 - 45.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Stern, A. Doron-Faigenboim, E. Erez, E. Martz, E. Bacharach, and T. Pupko
Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W506 - W511.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
I. Mayrose, A. Doron-Faigenboim, E. Bacharach, and T. Pupko
Towards realistic codon models: among site variability and dependency of synonymous and non-synonymous rates
Bioinformatics, July 1, 2007; 23(13): i319 - i327.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Dalhus, I. H. Helle, P. H. Backe, I. Alseth, T. Rognes, M. Bjoras, and J. K. Laerdahl
Structural insight into repair of alkylated DNA by a new superfamily of DNA glycosylases comprising HEAT-like repeats
Nucleic Acids Res., April 1, 2007; 35(7): 2451 - 2459.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. Doron-Faigenboim and T. Pupko
A Combined Empirical and Mechanistic Codon Model
Mol. Biol. Evol., February 1, 2007; 24(2): 388 - 397.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Ninio, E. Privman, T. Pupko, and N. Friedman
Phylogeny reconstruction: increasing the accuracy of pairwise distance estimation using Bayesian inference of evolutionary rates
Bioinformatics, January 15, 2007; 23(2): e136 - e141.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. Taylor, S. Tyekucheva, D. C. King, R. C. Hardison, W. Miller, and F. Chiaromonte
ESPERR: Learning strong and weak signals in genomic sequence alignments to identify functional elements
Genome Res., December 1, 2006; 16(12): 1596 - 1604.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
D. Y. Haubertin, H. Madaoui, A. Sanson, R. Guerois, and S. Orlowski
Molecular Dynamics Simulations of E. coli MsbA Transmembrane Domain: Formation of a Semipore Structure
Biophys. J., October 1, 2006; 91(7): 2517 - 2531.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
V. Gowri-Shankar and M. Rattray
On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference
Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Y. Y. Tseng and J. Liang
Estimation of Amino Acid Residue Substitution Rates at Local Spatial Regions and Application in Protein Function Inference: A Bayesian Monte Carlo Approach
Mol. Biol. Evol., February 1, 2006; 23(2): 421 - 436.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. Stern and T. Pupko
An Evolutionary Space-Time Model with Varying Among-Site Dependencies
Mol. Biol. Evol., February 1, 2006; 23(2): 392 - 400.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Furmonaviciene, B. J. Sutton, F. Glaser, C. A. Laughton, N. Jones, H. F. Sewell, and F. Shakib
An attempt to define allergen-specific molecular surface features: a bioinformatic approach
Bioinformatics, December 1, 2005; 21(23): 4201 - 4204.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Kim and Y. Kliger
Discovering hidden viral piracy
Bioinformatics, December 1, 2005; 21(23): 4216 - 4222.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Dutheil, T. Pupko, A. Jean-Marie, and N. Galtier
A Model-Based Approach for Detecting Coevolving Positions in a Molecule
Mol. Biol. Evol., September 1, 2005; 22(9): 1919 - 1928.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Landau, I. Mayrose, Y. Rosenberg, F. Glaser, E. Martz, T. Pupko, and N. Ben-Tal
ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W299 - W302.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Blouin, D. Butt, and A. J. Roger
Impact of Taxon Sampling on the Estimation of Rates of Evolution at Sites
Mol. Biol. Evol., March 1, 2005; 22(3): 784 - 791.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. L. Kosakovsky Pond and S. D. W. Frost
A Simple Hierarchical Approach to Modeling Distributions of Substitution Rates
Mol. Biol. Evol., February 1, 2005; 22(2): 223 - 234.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.