Molecular Biology and Evolution 18:1435-1441 (2001)
© 2001 Society for Molecular Biology and Evolution
Secator: A Program for Inferring Protein Subfamilies from Phylogenetic Trees
LSIIT-ICPS (AXE E), UPRES-A CNRS 70005 Université Louis Pasteur, Illkirch, France
Laboratoire de Biologie et Génomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire CNRS/INSERM/ULP, Illkirch, France
With the huge increase of protein data, an important problem is to estimate, within a large protein family, the number of sensible subsets for subsequent in-depth structural, functional, and evolutionary analyses. To tackle this problem, we developed a new program, Secator, which implements the principle of an ascending hierarchical method using a distance matrix based on a multiple alignment of protein sequences. Dissimilarity values assigned to the nodes of a deduced phylogenetic tree are partitioned by a new stopping rule introduced to automatically determine the significant dissimilarity values. The quality of the clusters obtained by Secator is verified by a separate Jackknife study. The method is demonstrated on 24 large protein families covering a wide spectrum of structural and sequence conservation and its usefulness and accuracy with real biological data is illustrated on two well-studied protein families (the Sm proteins and the nuclear receptors).
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. A. Lee, R. Rentzsch, and C. Orengo GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains Nucleic Acids Res., November 18, 2009; (2009) gkp1049v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. L. Ward, J. F. Challacombe, P. H. Janssen, B. Henrissat, P. M. Coutinho, M. Wu, G. Xie, D. H. Haft, M. Sait, J. Badger, et al. Three Genomes from the Phylum Acidobacteria Provide Insight into the Lifestyles of These Microorganisms in Soils Appl. Envir. Microbiol., April 1, 2009; 75(7): 2046 - 2056. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. P. Brown Efficient functional clustering of protein sequences using the Dirichlet process Bioinformatics, August 15, 2008; 24(16): 1765 - 1771. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Stam, E. G.J. Danchin, C. Rancurel, P. M. Coutinho, and B. Henrissat Dividing the large glycoside hydrolase family 13 into subfamilies: towards improved functional annotations of {alpha}-amylase-related proteins Protein Eng. Des. Sel., December 1, 2006; 19(12): 555 - 562. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Sivakumar, C. Wilton, and L. Holm From sequences to a functional unit Physiol Genomics, March 13, 2006; 25(1): 1 - 8. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Geisler-Lee, M. Geisler, P. M. Coutinho, B. Segerman, N. Nishikubo, J. Takahashi, H. Aspeborg, S. Djerbi, E. Master, S. Andersson-Gunneras, et al. Poplar Carbohydrate-Active Enzymes. Gene Identification and Expression Analyses Plant Physiology, March 1, 2006; 140(3): 946 - 962. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei, W. Cai, L. N. Kinch, and N. V. Grishin Prediction of functional specificity determinants from protein sequences using log-likelihood ratios Bioinformatics, January 15, 2006; 22(2): 164 - 171. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Donald and E. I. Shakhnovich Determining functional specificity from protein sequences Bioinformatics, June 1, 2005; 21(11): 2629 - 2635. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Thompson, V. Prigent, and O. Poch LEON: multiple aLignment Evaluation Of Neighbours Nucleic Acids Res., February 24, 2004; 32(4): 1298 - 1307. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Wicker, D. Dembele, W. Raffelsberger, and O. Poch Density of points clustering, application to transcriptomic data analysis Nucleic Acids Res., September 15, 2002; 30(18): 3992 - 4000. [Abstract] [Full Text] [PDF] |
||||





