MBE Advance Access published online on October 26, 2005
Molecular Biology and Evolution, doi:10.1093/molbev/msj048
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Department of Bioengineering, SEO, MC-063 University of Illinois at Chicago 851 S. Morgan Street, Room 218 Chicago, IL 60607-7052, U.S.A.
* To whom correspondence should be addressed. The amino acid sequences of proteins provide rich information for inferring distant phylogenetic relationships and for predicting protein functions. Estimating the rate matrix of residue substitutions from amino acid sequences is also important because the rate matrix can be used to develop scoring matrices for sequence alignment. Here we use a continuous time Markov process to model the substitution rates of residues and develop a Bayesian Markov chain Monte Carlo method for rate estimation. We validate our method using simulated artificial protein sequences. Because different local regions such as binding surfaces and the protein interior core experience different selection pressures due to functional or stability constraints, we use our method to estimate the substitution rates of local regions. Our results show that the substitution rates are very different for residues in the buried core and residues on the solvent exposed surfaces. In addition, the rest of the proteins on the binding surfaces also have very different substitution rates from residues. Based on these findings, we further develop a method for protein function prediction by surface matching using scoring matrices derived from estimated substitution rates for residues located on the binding surfaces. We show with examples that our method is effective in identifying functionally related proteins that have overall low sequence identity, a task known to be very challenging.
Accepted October 19, 2005
Research Article
Estimation of Amino Acid Residue Substitution Rates at Local Spatial Regions and Application in Protein Function Inference: A Bayesian Monte Carlo Approach
Jie Liang, E-mail: jliang{at}uic.edu
![]()
Abstract ![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Y. Y. Tseng, Z. J. Chen, and W.-H. Li fPOP: footprinting functional pockets of proteins by comparative spatial patterns Nucleic Acids Res., October 30, 2009; (2009) gkp900v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Y. Tseng, C. Dupree, Z. J. Chen, and W.-H. Li SplitPocket: identification of protein functional surfaces and characterization of their spatial patterns Nucleic Acids Res., July 1, 2009; 37(suppl_2): W384 - W389. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Xie, L. Xie, and P. E. Bourne A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery Bioinformatics, June 15, 2009; 25(12): i305 - i312. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Tuncbag, G. Kar, O. Keskin, A. Gursoy, and R. Nussinov A survey of available tools and web servers for analysis of protein-protein interactions and interfaces Brief Bioinform, May 1, 2009; 10(3): 217 - 232. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-S. Lin, W.-L. Hsu, J.-K. Hwang, and W.-H. Li Proportion of Solvent-Exposed Amino Acids in a Protein and Rate of Protein Evolution Mol. Biol. Evol., April 1, 2007; 24(4): 1005 - 1011. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Dai, H. E. Fisher, J. Temirov, C. Kiss, M. E. Phipps, P. Pavlik, J. H. Werner, and A. R.M. Bradbury The creation of a novel fluorescent protein by guided consensus engineering Protein Eng. Des. Sel., February 2, 2007; (2007) gzl056v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Gu A Simple Statistical Method for Estimating Type-II (Cluster-Specific) Functional Divergence of Protein Sequences Mol. Biol. Evol., October 1, 2006; 23(10): 1937 - 1945. [Abstract] [Full Text] [PDF] |
||||




