Skip Navigation



MBE Advance Access published online on October 26, 2005

Molecular Biology and Evolution, doi:10.1093/molbev/msj048
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Material
Right arrow All Versions of this Article:
23/2/421    most recent
msj048v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Tseng, Y. Y.
Right arrow Articles by Liang, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tseng, Y. Y.
Right arrow Articles by Liang, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2005. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org
Accepted October 19, 2005

Research Article

Estimation of Amino Acid Residue Substitution Rates at Local Spatial Regions and Application in Protein Function Inference: A Bayesian Monte Carlo Approach

Yan Yuan Tseng 1 and Jie Liang 1*

1 Department of Bioengineering, SEO, MC-063 University of Illinois at Chicago 851 S. Morgan Street, Room 218 Chicago, IL 60607-7052, U.S.A.

* To whom correspondence should be addressed.
Jie Liang, E-mail: jliang{at}uic.edu


   Abstract

The amino acid sequences of proteins provide rich information for inferring distant phylogenetic relationships and for predicting protein functions. Estimating the rate matrix of residue substitutions from amino acid sequences is also important because the rate matrix can be used to develop scoring matrices for sequence alignment. Here we use a continuous time Markov process to model the substitution rates of residues and develop a Bayesian Markov chain Monte Carlo method for rate estimation. We validate our method using simulated artificial protein sequences. Because different local regions such as binding surfaces and the protein interior core experience different selection pressures due to functional or stability constraints, we use our method to estimate the substitution rates of local regions. Our results show that the substitution rates are very different for residues in the buried core and residues on the solvent exposed surfaces. In addition, the rest of the proteins on the binding surfaces also have very different substitution rates from residues. Based on these findings, we further develop a method for protein function prediction by surface matching using scoring matrices derived from estimated substitution rates for residues located on the binding surfaces. We show with examples that our method is effective in identifying functionally related proteins that have overall low sequence identity, a task known to be very challenging.

Keywords: Continuous time Markov process; Bayesian Markov chain Monte Carlo; amino acid substitution matrix; protein function prediction.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
Y. Y. Tseng, Z. J. Chen, and W.-H. Li
fPOP: footprinting functional pockets of proteins by comparative spatial patterns
Nucleic Acids Res., October 30, 2009; (2009) gkp900v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Y. Tseng, C. Dupree, Z. J. Chen, and W.-H. Li
SplitPocket: identification of protein functional surfaces and characterization of their spatial patterns
Nucleic Acids Res., July 1, 2009; 37(suppl_2): W384 - W389.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Xie, L. Xie, and P. E. Bourne
A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery
Bioinformatics, June 15, 2009; 25(12): i305 - i312.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
N. Tuncbag, G. Kar, O. Keskin, A. Gursoy, and R. Nussinov
A survey of available tools and web servers for analysis of protein-protein interactions and interfaces
Brief Bioinform, May 1, 2009; 10(3): 217 - 232.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Y.-S. Lin, W.-L. Hsu, J.-K. Hwang, and W.-H. Li
Proportion of Solvent-Exposed Amino Acids in a Protein and Rate of Protein Evolution
Mol. Biol. Evol., April 1, 2007; 24(4): 1005 - 1011.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
M. Dai, H. E. Fisher, J. Temirov, C. Kiss, M. E. Phipps, P. Pavlik, J. H. Werner, and A. R.M. Bradbury
The creation of a novel fluorescent protein by guided consensus engineering
Protein Eng. Des. Sel., February 2, 2007; (2007) gzl056v1.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
X. Gu
A Simple Statistical Method for Estimating Type-II (Cluster-Specific) Functional Divergence of Protein Sequences
Mol. Biol. Evol., October 1, 2006; 23(10): 1937 - 1945.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.