MBE Advance Access originally published online on October 13, 2006
Molecular Biology and Evolution 2007 24(1):192-202; doi:10.1093/molbev/msl143
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Molecular Architecture of the DNA-Binding Region and Its Relationship to Classification of Basic Helix–Loop–Helix Proteins
Department of Genetics and Center for Computational Biology, North Carolina State University
E-mail: bill{at}atchleylab.org.
Accepted for publication October 5, 2006.
Multivariate statistical analyses are used to explore the molecular architecture of the DNA-binding and dimerization regions of basic helix–loop–helix (bHLH) proteins. Alphabetic amino acid data are transformed to biologically meaningful quantitative values using a set of 5 multivariate "indices." These multivariate indices summarize variation in a large suite of amino acid physiochemical attributes and reflect variability in polarity–accessibility–hydrophobicity, propensity for secondary structure, molecular size, codon composition, and electrostatic charge. Using these index score data, discriminant analyses describe the multidimensional aspects of physiochemical variation and clarify the structural basis of the prevailing evolutionary classification of bHLH proteins. A small number of amino acids from both the binding dimerization domains, when considered simultaneously, accurately distinguish the 5 known DNA-binding groups. The relevant sites often have well-documented structural and functional characteristics.
Key Words: computational biology bHLH proteins molecular architecture multivariate statistics DNA-binding specificity molecular evolution
William Martin, Associate Editor