MBE Advance Access originally published online on May 3, 2006
Molecular Biology and Evolution 2006 23(7):1406-1413; doi:10.1093/molbev/msl002
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Prediction of Function Divergence in Protein Families Using the Substitution Rate Variation Parameter Alpha
Center for Genomics and Bioinformatics, Karolinska Institutet, Stockholm, Sweden
E-mail: Abhiman.Saraswathi{at}cgb.ki.se.
Protein families typically embody a range of related functions and may thus be decomposed into subfamilies with, for example, distinct substrate specificities. Detection of functionally divergent subfamilies is possible by methods for recognizing branches of adaptive evolution in a gene tree. As the number of genome sequences is growing rapidly, it is highly desirable to automatically detect subfamily function divergence.
To this end, we here introduce a method for large-scale prediction of function divergence within protein families. It is called the alpha shift measure (ASM) as it is based on detecting a shift in the shape parameter (alpha [
]) of the substitution rate gamma distribution. Four different methods for estimating
were investigated. We benchmarked the accuracy of ASM using function annotation from Enzyme Commission numbers within Pfam protein families divided into subfamilies by the automatic tree-based method BETE.
In a test using 563 subfamily pairs in 162 families, ASM outperformed functional site-based methods using rate or conservation shifting (rate shift measure [RSM] and conservation shift measure [CSM]). The best results were obtained using the "GZ-Gamma" method for estimating
. By combining ASM with RSM and CSM using linear discriminant analysis, the prediction accuracy was further improved.
Key Words: protein evolution adaptive evolution enzyme protein function protein subfamily substitution rates gamma distribution alpha parameter function shift
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
C. Kemena and C. Notredame Upcoming challenges for multiple sequence alignment methods in the high-throughput era Bioinformatics, October 1, 2009; 25(19): 2455 - 2465. [Abstract] [Full Text] [PDF] |
||||
