Molecular Biology and Evolution 17:164-178 (2000)
© 2000 Society for Molecular Biology and Evolution
Article |
Correlations Among Amino Acid Sites in bHLH Protein Domains: An Information Theoretic Analysis



*Department of Genetics, North Carolina State University;
and
Department of Ecology and Evolutionary Biology, University of California at Irvine;
and
Fakultät für Mathematik, Universität Bielefeld, Bielefeld, Germany
An information theoretic approach is used to examine the magnitude and origin of associations among amino acid sites in the basic helix-loop-helix (bHLH) family of transcription factors. Entropy and mutual information values are used to summarize the variability and covariability of amino acids comprising the bHLH domain for 242 sequences. When these quantitative measures are integrated with crystal structure data and summarized using helical wheels, they provide important insights into the evolution of three-dimensional structure in these proteins. We show that amino acid sites in the bHLH domain known to pack against each other have very low entropy values, indicating little residue diversity at these contact sites. Noncontact sites, on the other hand, exhibit significantly larger entropy values, as well as statistically significant levels of mutual information or association among sites. High levels of mutual information indicate significant amounts of intercorrelation among amino acid residues at these various sites. Using computer simulations based on a parametric bootstrap procedure, we are able to partition the observed covariation among various amino acid sites into that arising from phylogenetic (common ancestry) and stochastic causes and those resulting from structural and functional constraints. These results show that a significant amount of the observed covariation among amino acid sites is due to structural/functional constraints, over and above the covariation arising from phylogenetic constraints. These quantitative analyses provide a highly integrated evolutionary picture of the multidimensional dynamics of sequence diversity and protein structure.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. G. Williams and S. C. Lovell The Effect of Sequence Evolution on Protein Structural Divergence Mol. Biol. Evol., May 1, 2009; 26(5): 1055 - 1065. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Weigt, R. A. White, H. Szurmant, J. A. Hoch, and T. Hwa Identification of direct residue contacts in protein-protein interaction by message passing PNAS, January 6, 2009; 106(1): 67 - 72. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liu, E. Eyal, and I. Bahar Analysis of correlated mutations in HIV-1 protease using spectral clustering Bioinformatics, May 15, 2008; 24(10): 1243 - 1250. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S. Horner, W. Pirovano, and G. Pesole Correlated substitution analysis and the prediction of amino acid structural contacts Brief Bioinform, January 1, 2008; 9(1): 46 - 56. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. A. Travers, D. C. Tully, G. P. McCormack, and M. A. Fares A Study of the Coevolutionary Patterns Operating within the env Gene of the HIV-1 Group M Subtypes Mol. Biol. Evol., December 1, 2007; 24(12): 2787 - 2801. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Hernandez, A. Feller, K. Morohashi, K. Frame, and E. Grotewold The basic helix loop helix domain of maize R links transcriptional regulation and histone modifications by recruitment of an EMSY-related factor PNAS, October 23, 2007; 104(43): 17222 - 17227. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Yeang, J. F. J. Darot, H. F. Noller, and D. Haussler Detecting the Coevolution of Biosequences An Example of RNA Interaction Prediction Mol. Biol. Evol., September 1, 2007; 24(9): 2119 - 2131. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Waddell, H. Kishino, and R. Ota Phylogenetic Methodology for Detecting Protein Interactions Mol. Biol. Evol., March 1, 2007; 24(3): 650 - 659. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Atchley and J. Zhao Molecular Architecture of the DNA-Binding Region and Its Relationship to Classification of Basic Helix-Loop-Helix Proteins Mol. Biol. Evol., January 1, 2007; 24(1): 192 - 202. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Feller, J. M. Hernandez, and E. Grotewold An ACT-like Domain Participates in the Dimerization of Several Plant Basic-helix-loop-helix Transcription Factors J. Biol. Chem., September 29, 2006; 281(39): 28964 - 28974. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Zhao, X. Zhang, C. Liang, J. Wu, Q. Bao, and S. Qin Genome-wide analysis of restriction-modification system in unicellular and filamentous cyanobacteria Physiol Genomics, February 23, 2006; 24(3): 181 - 190. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. C. Martin, G. B. Gloor, S. D. Dunn, and L. M. Wahl Using information theory to search for co-evolving residues in proteins Bioinformatics, November 15, 2005; 21(22): 4116 - 4124. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Cowperthwaite, J. J. Bull, and L. A. Meyers Distributions of Beneficial Fitness Effects in RNA Genetics, August 1, 2005; 170(4): 1449 - 1457. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Buck and W. R. Atchley Networks of Coevolving Sites in Structural and Functional Domains of Serpin Proteins Mol. Biol. Evol., July 1, 2005; 22(7): 1627 - 1634. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Atchley, J. Zhao, A. D. Fernandes, and T. Druke Solving the protein sequence metric problem PNAS, May 3, 2005; 102(18): 6395 - 6400. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Atchley and A. D. Fernandes Sequence signatures and the probabilistic identification of proteins in the Myc-Max-Mad network PNAS, May 3, 2005; 102(18): 6401 - 6406. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chakravarty, A. M. Hutson, M. K. Estes, and B. V. V. Prasad Evolutionary Trace Residues in Noroviruses: Importance in Receptor Binding, Antigenicity, Virion Assembly, and Strain Diversity J. Virol., January 1, 2005; 79(1): 554 - 568. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Siep, E. Sleddens-Linkels, S. Mulders, H. van Eenennaam, E. Wassenaar, W. A. Van Cappellen, J. Hoogerbrugge, J. A. Grootegoed, and W. M. Baarends Basic helix-loop-helix transcription factor Tcfl5 interacts with the Calmegin gene promoter in mouse spermatogenesis Nucleic Acids Res., December 7, 2004; 32(21): 6425 - 6436. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Afonnikov and N. A. Kolchanov CRASP: a program for analysis of coordinated substitutions in multiple alignments of protein sequences Nucleic Acids Res., July 1, 2004; 32(suppl_2): W64 - W68. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Vullhorst and A. Buonanno Characterization of General Transcription Factor 3, a Transcription Factor Involved in Slow Muscle-specific Gene Expression J. Biol. Chem., February 28, 2003; 278(10): 8370 - 8379. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Dean, C. Neuhauser, E. Grenier, and G. B. Golding The Pattern of Amino Acid Replacements in {alpha}/{beta}-Barrels Mol. Biol. Evol., November 1, 2002; 19(11): 1846 - 1864. [Abstract] [Full Text] [PDF] |
||||








