MBE Advance Access originally published online on January 30, 2007
Molecular Biology and Evolution 2007 24(4):1032-1044; doi:10.1093/molbev/msm022
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Functional Coevolutionary Networks of the Hsp70HopHsp90 System Revealed through Computational Analyses

* Molecular Evolution and Bioinformatics Laboratory, Department of Biology, National University of Ireland, Maynooth, Ireland
Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin, Ireland
E-mail: faresm{at}tcd.ie.
| Abstract |
|---|
|
|
|---|
Currently, the identification of groups of amino acid residues that are important in the function, structure, or interaction of a protein can be both costly and prohibitively complex, involving vast numbers of mutagenesis experiments. Here, we present the application of a novel computational method, which identifies the presence of coevolution in a data set, thereby enabling the a priori identification of amino acid residues that play an important role in protein function. We have applied this method to the heat shock protein (Hsp) protein-folding system, studying the network between Hsp70, Hsp90, and Hop (heat shockorganizing protein). Our analysis has identified functional residues within the tetratricopeptide repeat (TPR) 1 and 2A domains in Hop, previously shown to be interacting with Hsp70 and Hsp90, respectively. Further, we have identified significant residues elsewhere in Hop within domains that have been recently proposed as being important for Hop interaction with Hsp70 and/or Hsp90. In addition, several amino acid sites present in groups of coevolution were identified as 3-dimensionally or linearly proximal to functionally important sites or domains. Based on our results, we also investigate a further functional domain within Hop, between TPR1 and TPR2A, which we suggest as being functionally important in the interaction of Hop with both Hsp70 and Hsp90 whether directly or otherwise. Our method has identified all the previously characterized functionally important regions in this system, thereby indicating the power of this method in the a priori identification of important regions for site-directed mutagenesis studies
Key Words: Hsp90 Hop Hsp70 functional coevolution
| Introduction |
|---|
|
|
|---|
Uncovering the functional and structural properties of protein residues is challenging both theoretically and experimentally. As a way to identify functional amino acid sites, researchers have investigated the mutational dynamics of residues by the development of mathematical models able to describe the evolution of these amino acid sites throughout millions of years. Accurate description of these mutational dynamics has been, however, halted by the simplicity of the mathematical models proposed so far. As an alternative to this approach, researchers have emulated the evolution of sequences by mutagenesis experiments in genes extracted from single organisms. Unfortunately, this approach can prove both costly and prohibitively slow mainly due to the multifactorial evolutionary and functional nature of amino acid sites, which relies on the coevolutionary relationships between protein functional domains (Fares 2006
The advent of increased computing power and the availability of full-genome sequences from a myriad of organisms have enabled the development of numerous methods for the in silico analysis of proteins. This has been paralleled by an increase in the sensitivity of methods supported by complex and more realistic mathematical models. We have recently described a method, Coevolution Analysis using Protein Sequences (CAPS), to identify intraprotein coevolution (Fares and Travers 2006
). Even though coevolution is not synonymous with proteinprotein interaction (Pritchard and Dufton 2000
; Suel et al. 2003
; Gloor et al. 2005
), identifying coevolutionary events can highlight structural, functional, or interaction dependencies between 2 proteins. Coevolution has been studied, with varying degrees of sensitivity, in heat shock protein 90 (Hsp90), GroEL, the HIV gag and env genes, tetrapod myoglobin sequences, as well as with 846 rathumandog trios for which protein sequences were available (Korber et al. 1993
; Pollock et al. 1999
; Shim Choi et al. 2005
; Fares and Travers 2006
). Describing proteinprotein functional relationships through coevolutionary analyses can help in the identification of the main genetic factors responsible for the robustness of biological systems. Essential cell cycles, such as the ATPase cycle of Hsp90, present strong functional dependencies between their protein components. The identification of these dependencies can aid in the identification of the important factors governing this vital cell cycle. However, most efforts have been devoted to the identification of proteinprotein interactions and little attention has been put on the identification of functional coevolution between proteins. To illustrate the importance of establishing coevolutionary functional relationships between proteins, we completed the development of our previously published method (CAPS; Fares and Travers 2006
) for the detection of interprotein evolutionary dependencies in the ATPase cycle of Hsp90. This cycle is essential for cell viability, and yet, the evolutionary and functional dependencies between its protein components remain obscure.
Hsp play an essential role in cell viability at normal physiological conditions by protecting their cellular protein clients from degradation and misfolding (Hightower 1991
). Hsp90 is a specialized chaperone involved in the folding and maturation of key regulatory proteins, such as steroid hormone receptors, transcription factors, and kinases (Wegele et al. 2004
; Atchley et al. 2005
). Most of the slow-folding client-regulatory proteins of Hsp70 chaperones require further processing by the Hsp90 machinery. Both Hsp70 and Hsp90 chaperones are ATP dependent, and, in both chaperones, ATP hydrolysis results in conformational changes that are responsible for the folding and activation of the client proteins (Prodromou et al. 2000
; Richter et al. 2001
; Shomura et al. 2005
).
Generally, Hsp70 consists of a 44-kDa N-terminal nucleotide-binding domain (NBD), an 18-kDa substrate-binding domain (SBD), and a 10-kDa C-terminal domain (Flaherty et al. 1990
; Zhu et al. 1996
). Three-dimensional structures have been obtained for the Hsp70 NBD (Flaherty et al. 1990
; Harrison et al. 1997
; Osipiuk et al. 1999
) and the SBD (Zhu et al. 1996
), and more recently, the NBD and SBD (amino acids 1501) of Thermus thermophilus DnaK was resolved (Revington et al. 2005
) as was a functionally intact crystal structure of bovine Hsc70 (Jiang et al. 2005
). Hsp90 exists as a dimer and consists of an N-terminal NBD linked by a flexible charged linker to the middle domain and a C-terminal dimerization domain. Three-dimensional structures have been resolved for the Hsp90 NBD (Prodromou, Roe, O'Brien et al. 1997
; Stebbins et al. 1997
) and middle domains (Meyer et al. 2003
). Recently, in a rigorous study the 3-dimensional structure of the full Hsp90 bound to the cochaperone p23 has also been resolved, and the main conformational changes resulted from this binding have been precisely and elegantly detailed (Ali et al. 2006
).
Cochaperones are an integral part of the chaperone proteinfolding mechanisms and are involved in regulation of chaperone activity. They also present client proteins to chaperones and enable the cooperative relationship between different chaperone systems. A minimum system of Hsp40, Hsp70, Hsp90, and the cochaperones Hop (heat shockorganizing protein, Sti1 in yeast) and p23 (Sba1 in yeast) has been observed to be sufficient for the assembly of steroid hormone receptor complexes (Dittmar et al. 1998
; Kosano et al. 1998
). The definitive function of p23 is as yet unknown, although it is known that p23 binds to and stabilizes the ATP-bound form of Hsp90 (Sullivan et al. 1997
). Hop has been shown to bridge and enable the transfer of client proteins from the Hsp70 to the Hsp90 (Smith et al. 1993
; Bose et al. 1996
; Chang et al. 1997
; Zhong et al. 2006
). More recently, it has been suggested that Hop is also involved in modulating the activities of Hsp70 and Hsp90 (Gross and Hessefort 1996
; Owens-Grillo et al. 1996
; Chang et al. 1997
; Richter et al. 2003
; Wegele et al. 2003
) and may bind proteins other than the Hsp70 and Hsp90 chaperones (Glover and Lindquist 1998
; Abbas-Terki et al. 2001
, 2002
). An overview of the Hsp70, Hsp90, and Hop domains and interactions is shown in figure 1.
|
Hop contains 9 tetratricopeptide repeat (TPR) motifs that are evenly clustered into 3 TPR domains, TPR1, TRP2A, and TPR2B. Both Hsp70 and Hsp90 contain an EEVD motif at their C-termini that are known to interact with the Hop TPR1 and TPR2A motifs, respectively (Scheufler et al. 2000
Using CAPS, we have investigated the presence of coevolution in amino acid residues between the chaperones Hsp70 and Hsp90 and their cochaperone Hop. We show that previously reported or suggested interacting regions are identified using our method. Further, we have also observed a novel region within Hop coevolving with both Hsp70 and Hsp90. This region has never before been described as important in the function of Hop with Hsp70 and Hsp90, thereby providing an ideal target for future mutagenesis analyses. These results provide support to our method as a useful computational tool in the future identification of proteinprotein functional domains.
| Materials and Methods |
|---|
|
|
|---|
Sequences and Sequence Alignments
Twenty-five eukaryotic organisms for the Hsp70/Hop analysis and 27 organisms for Hsp90/Hop were explored for interprotein coevolution, including unicellular as well as multicellular eukaryotes (table 1, Supplementary Material online). Individual multiple sequence alignments for Hsp70, Hsp90, and Hop were produced using ClustalW (Bowers et al. 2004
|
Detecting Intermolecular Coevolution
We examined proteinprotein functional coevolution between Hsp and Hop using a parametric method to detect coevolution (Fares and Travers 2006
Briefly, the method compares the correlated variance of the evolutionary rates at 2 amino acid sites, belonging to the same or different proteins, corrected by the time since the divergence of the species from which protein sequences were obtained. Because of the difference in the amino acid composition and evolutionary rates of the different proteins compared, substitutions or conservation at 2 independent sites cannot be directly compared. The method instead compares the transition probability scores between 2 sequences at these particular sites, using BLOSUM (Blocks Substitution Matrix) (Henikoff S and Henikoff JG 1992). For each protein alignment the corresponding BLOSUM is applied depending on the average sequence identity. BLOSUM values are then normalized by the divergence time between sequences. For example, BLOSUM values for the transition between 2 amino acids e and k are weighted using the time (t) of divergence between the sequences compared. Finally, the correlation in the amino acid variability between the sites under analysis is measured using the Pearson's correlation coefficient.
To determine if the correlation coefficient (
AB) is significant, a simulation analysis can be performed by the Monte Carlo generation of K number of simulated sequence alignments that share the same mean parameter values of amino acid proportions, amino acid substitution rates, and phylogenetic relationships between sequences. Simulations were performed using the program evolver from the PAML package version 3.15 (Yang 1997
). We then screened simulated sequence alignments for coevolution and calculated the correlation coefficient for pairs of amino acid sites randomly sampled from the simulated multiple sequence alignments. Finally, we tested these correlation coefficients for significance by comparing the real coefficients with the distribution of correlation coefficients generated from the simulated multiple sequence alignments.
To account for multiple tests and nonindependence of the data, we applied the step-down permutational procedure (Westfall and Young 1993). In order to improve the statistical confidence of our results, we dropped the confidence level (type I error) to 0.001 instead of the 0.05 normally used. We then sorted these significant correlation coefficients and included the coefficients obtained from the real sequence alignment in the sorted list. The probability of the real coefficients was finally calculated by dividing their position in the sorted list by the total number of correlation coefficients (the protocol for the filtering of the correlation coefficients is shown in fig. 2).
|
Identifying Functionally Important Protein Domains
Coevolution between 3-dimensionally distant sites is usually observed when these sites are surrounding functionally important regions. In these cases, coevolution between amino acid sites mainly occurs to maintain the structural characteristics around these regions and consequently to maintain the conformational and functional stability of the domain (Gloor et al. 2005
| Results |
|---|
|
|
|---|
In our previous study (Fares and Travers 2006
Coevolution between Hop and Hsp70
Application of interprotein CAPS analysis using 25 eukaryote Hsp70 and Hop sequences identified the presence of a number of groups of coevolving amino acid residues (table 1). Within Hsp70, 30 amino acid residues were identified as coevolving with amino acid residues in Hop, with the correlation coefficients ranging between a minimum value of 0.35 and a maximum value of 0.73 (both correlation coefficients with probabilities of P < 0.001; table 1). The majority of these coevolving residues were located in either the NBD or the C-terminal domain with only 5 residues (394A, 412P, 542I, 543A and 546L) identified in the SBD. In total, 50% of all the residues within Hsp70 identified as coevolving with Hop were located in the C-terminal domain, the domain known to be involved in functional interactions with Hop. Interestingly, amino acid residues from different domains within Hsp70 were observed as coevolving with the same residues in Hop, suggesting that there is a high codependence of evolution between the functional domains of Hsp70.
Of the 30 amino acids in Hsp70 identified as coevolving with residues in Hop, one-third of these are proximal in the 3-dimensional structure to residues in where mutations somehow affect Hsp70 function. Coevolving residues are observed as clustering together in groups within the bovine Hsc70 3-dimensional structure (Jiang et al. 2005
) (fig. 3). Although not proximal to each other in the 3-dimensional structure, amino acid residues 79P and 236N, both coevolving with residues in the TPR2B and DP2 regions of Hop (table 1), are directly proximal to residues that comprise part of a multiple bacterial Hsp70 homologue (DnaK) mutant able to restore growth at high temperatures of cells containing a D35N mutant in bacterial Hsp40 homologue (DnaJ) (Suh et al. 1998
). Residues 315E and 319R, while coevolving with different residues in Hop (table 1), are both directly adjacent to residues 322K and 527E, the mutation of which affects the ability of Hsc70 to bind auxilin, to disassemble clathrin cages and to hydrolyze ATP (Jiang et al. 2005
) (fig. 3). Both residues, 322K and 527E, seem to be functionally/structurally linked because the effects of mutating one residue can be compensated by the mutation of the second (Jiang et al. 2005
). Within the SBD, residues 394A and 412P (table 1) are both significantly proximal to residues 395P and 413R, which, in turn, are adjacent to 213I, 512I, and 516V (fig. 3). The mutant P395L in yeast abolishes (URE3) prion propagation (Roberts et al. 2004
), whereas the 413R mutant in DnaK (K414I in DnaK numbering) cannot support phage
replication in vivo and also shows an absence of allosteric function in vitro (Pellecchia et al. 2000
). Jiang et al. (2005)
identified 213I, 512I, and 516I as important in interdomain communication within Hsc70. Finally, 542I, 543A, and 546L (fig. 3) are proximal to hydrophobic residues (403A, 427T, 428Y, 440F, and 447T) identified as important in the binding of substrates by Hsp70 (Pellecchia et al. 2000
).
|
The TPR1 and C-terminal DP2 repeat regions in Hop are known to be important for HopHsp70 binding (Scheufler et al. 2000
The only available structural data for Hop are those of the TPR1 domain bound with the Hsc70 EEVD peptide (fig. 4) and the TPR2 domain bound with the Hsp90 MEEVD peptide (Scheufler et al. 2000
). In the absence of complete 3-dimensional data, identifying the proximity of all amino acid residues to each other is impossible. However, of the residues in Hop identified as coevolving with Hsp70, position 77Y in TPR1, coevolving with 106K and 597I in Hsp70, is directly adjacent to 75K and 76G (fig. 4). These residues were proposed by Song and Masison (2005)
as being critical for Sti1 (Hop in yeast) regulation of Hsp70, whereas other studies have confirmed 75K as a site critical for Hsp70 binding (Flom et al. 2005
; Carrigan et al. 2006
) along with 79R (Flom et al. 2005
).
|
Coevolution between Hop and Hsp90
We performed the interprotein coevolution analysis between Hsp90 and Hop using 27 eukaryote sequences for each protein. Within Hsp90, 51 amino acid residues, representing all the major domains within Hsp90, were observed as coevolving with Hop and presented correlation coefficients ranging between a minimum value of 0.33 and a maximum value of 0.75 (P < 0.001; table 2). The majority of residues in Hsp90 coevolving with Hop are located within the NBD and the middle segment (table 2). Interdomain communication is known to be essential for the operation of Hsp90 (Meyer et al. 2003
|
|
When mapped on the Hsp90 NBD structure (Prodromou, Roe, O'Brien et al. 1997
Amino acid residues from all the TPR domains within Hop are observed as coevolving with Hsp90, as are residues in the TPR linker region and the C-terminal DP2 repeat region (table 2). The Hop TPR1 residues (4T, 77Y, 97Y, and 113G) and TPR2A residues (329R and 335I), identified as coevolving with Hsp90, were plotted on their respective three-dimensional structures (fig. 4). Two of the 4 residues within the TPR1 domain identified as coevolving with Hsp90 were also observed coevolving with Hsp70 (fig. 4). The other 2 residues within TPR1, 97Y, and 113G are proximal to each other as well as to 77Y (fig. 4). The crystal structures of TPR1 and TPR2A are highly similar, with the bound peptides in both TPR1 and TPR2A making contact in similar positions with side chains of the same helices in the structure (Scheufler et al. 2000
) (fig. 4). The residues within Hop TPR2A identified as coevolving with Hsp90 (329R and 335I) are located in the region involved in interaction with the Hsp90 peptide (fig. 4). Interestingly, 335I appears to be in a similar position in TPR2A as 77Y is in TPR1, directly adjacent to residues essential in the interaction between the TPR and its respective Hsp peptide (fig. 4).
| Discussion |
|---|
|
|
|---|
A Powerful Computational Method to Detect Proteins Functional Coevolutionary Networks
In this study, we present the application of a powerful computational tool to identify functional coevolutionary networks among proteins within a specific cellular pathway. Our analysis of functional dependencies between proteins in an apparently well-characterized cell cycle leads to a number of conclusions. First, the method is proven powerful as known amino acid regions involved in proteinprotein interactions within the Hsp70HopHsp90 system have been identified. Second, regions previously suggested as important for such interactions (whether in direct interactions or otherwise) have been pinpointed by the method, and, finally, we have identified novel regions that we suggest are involved in the functional relationships between the proteins studied. We propose these regions as a priori targets for mutagenesis experiments and we predict their functional importance in the stability of the Hsp70HopHsp90 cycle. Other studies have conducted similar approaches but with different focus. For instance, coevolutionary analysis has been used to predict protein folds through the study of interatomic evolutionary interactions (Socolich et al. 2005
Despite the fact that our method is not focused on the detection of physical dependencies but rather functional dependencies, studies of coevolution between interacting and noninteracting proteins show a clear positive correlation between the strength of coevolution (correlation coefficient) between amino acids and their physical interaction when comparing both sets of data (data not shown). In addition, proteins that do not interact from the functional point of view (do not belong to the same pathway) and physical point of view show no parsimony-informative coevolution signal when both the stochastic and phylogenetic coevolutions are removed from results (data not shown).
Coevolution Identified between Hsp70 and Hop Shows Not Only Expected But Also Novel Results
Although previous studies have suggested a number of regions within Hop which are important for its interaction with Hsp70 (Scheufler et al. 2000
; Karunakaran et al. 2003
; Odunuga et al. 2004
; Song and Masison 2005
), our understanding of the interactions between these 2 proteins is far from complete. Our analysis has identified groups of amino acid positions within Hsp70 and Hop that are evolving in a codependent fashion. Although coevolution is not an indication of proteinprotein interaction at these positions, the identification of coevolution at these residues is indicative of some manner of dependence (functional, structural, or energetic interaction) in the evolution of these proteins. Coevolution between amino acid sites can be also indirect, for example, due to their functional/structural link to the same active site (Gloor et al. 2005
).
The observation of coevolution between regions in Hop known to be important for the interaction with Hsp70 provides support for the sensitivity of our method in detecting domains involved in proteinprotein interaction. We identified coevolution between the TPR1 domain and the C-terminal DP2 repeat domain in Hop, both known to be important for the interaction of Hop and Hsp70 (Scheufler et al. 2000
; Karunakaran et al. 2003
). Methods to detect coevolution rely on the amount of variability information to infer covariation between 2 amino acid sites. Conserved residues (e.g., amino acid sites with no amino acid mutations) would not be identified as coevolving despite their possible interaction. Unsurprisingly, due to its conservation across all species, we did not observe coevolution between the EEVD motif in the Hsp70 C-terminal and the TPR1 domain of Hop, which are known to interact (Scheufler et al. 2000
).
Our results are highly conservative in that they identify regions within Hop known to be important for Hsp70 binding (TPR1 and the C-terminal DP repeat region) and regions hypothesized to be necessary for Hsp70 binding (TPR2B). The only other residues identified in Hop as coevolving with Hsp70 are located between the TPR1 and TPR2A domains (228D, 229S, and 230T).
Coevolution between Hsp90 and Hop Underlines the Need for Further ProteinProtein Interaction Analyses
Currently, it is known that Hsp90 interacts with the TPR2A domain of Hop and it has also been suggested that Hop TPR2B may act as an Hsp90 ligand and that DP2 may be involved in Hsp90 interaction (Flom et al. 2005
; Song and Masison 2005
). Therefore, the identification of coevolution between amino acids within these domains and residues in Hsp90 is not unexpected (table 2). With the exception of 110A, all the residues within the Hsp90 ATPase domain identified as coevolving with Hop are localized to the 8-stranded ß sheet proximal to the ATP-binding pocket (Prodromou, Roe, Piper, Pearl 1997
). Of these residues, 78R and 174R are directly adjacent to residues comprising the ATP-binding pocket, 2 of which (79D and 171T) are observed as directly interacting with bound ADP (Prodromou, Roe, O'Brien et al. 1997
). In fact mutation at 79D has been shown to affect ATP binding and hydrolysis in Hsp90, resulting in a failure to maintain cell viability in vitro (Ali et al. 2006
). These results are consistent with the observation by Gloor et al. (2005)
that coevolving sites are frequently located in regions critical for protein function such as active sites. It is well documented that the ATPase activity of Hsp90 is regulated by Hop (Prodromou et al. 1999
) and, perhaps, the observed coevolution between the Hsp90 N-domain and Hop occurred in order to maintain 3-dimensional structures in an optimal conformation for the regulation of ATP hydrolysis. Interestingly, the only other residue within the NBD of Hsp90 identified as coevolving with Hop, is located on the lid segment of the NBD which is hypothesized to close following ATP binding which, in turn, promotes N-terminal dimerization, thereby resulting in the closure of the clamp required for optimal ATPase activity (Prodromou et al. 2000
).
Surprisingly, the majority of residues within the middle segment of Hsp90 observed as coevolving with Hop are located in the Hsp90-unique domain identified in comparison with MutL and GyrB structures (Meyer et al. 2003
) (fig. 5). Within this domain, mutation at residue T525, in yeast Hsp90, affected Hsp90 interaction with a number of client proteins (Bohen and Yamamoto 1993
), whereas mutation at the corresponding position in mouse Hsp90 (T541) affected the interaction of Hsp90 with the FKBP51 and p23 cochaperones (Zhong et al. 2006
). We observe a large degree of coevolution between this domain and Hop, and we think that the relationship between this novel region and Hop should be studied further.
Shared Coevolution between Hsp70, Hsp90, and Hop
In a system such as the Hsp70HopHsp90 system, in which the optimal function of the system depends on complex intramolecular and proteinprotein interactions and chains of interactions between its constituent proteins, changes within the domain of one protein can have a knock-on affect on the domain of another protein, even if the 2 domains are not directly interacting. This implies that compensatory changes are needed elsewhere within the system in order to maintain its stability. Therefore, it is not surprising that we observe coevolution between domains of proteins not previously reported to be interacting with, or regulating, each other. In the same vein, the observation of a large degree of overlap in residues within Hop coevolving with residues in both Hsp70 and Hsp90 is not surprising. For example, the TPR1 domain of Hop is proposed to be important for the binding of Hsp70, yet we identified 4 residues within TPR1 as coevolving with Hsp90, 2 of which overlap with those identified as coevolving with Hsp70 (tables 1 and 2). On the other hand, amino acid sites from Hsp90 and Hsp70 detected to be coevolving with the same sites in Hop are also detected to be coevolving between each other when we perform analysis of coevolution between Hsp90 and Hsp70. Also, sites coevolving in intraprotein coevolutionary analyses in Hsp70 or Hsp90 can be also detected in interprotein analyses of these proteins with Hop despite their indirect involvement in the physical interaction between the proteins. Although this problem is unavoidable given the limitation to distinguish between physical and functional coevolution, this approach can also be used to detect functionally related amino acid sites within a protein or between proteins. In such a complex system, as with most protein systems, it is incredibly difficult to categorize coevolving residues (interacting, structural, functional) and to distinguish between direct and indirect interaction. The reason for such difficulty is that if protein A is interacting with protein B and B with C, we would expect certain transitivity in the system such as A coevolves with C. This means that the indirect coevolution between proteins may provide overlapping coevolving amino acid residues between any protein pairs of the system. Future work in this field should aim to extrapolate the various classes of coevolving residues within a protein system. If one can, in fact, separate the interacting, structural and functional classes of coevolving residues within a protein system, it is therefore theoretically possible to utilize coevolution analyses in the prediction of proteinprotein interaction. Also, identifying key coevolving amino acid residues between Hsp and their known clients may provide a mechanism for the computational identification of further, as yet unknown, Hsp clients.
Proximity of Coevolving Residues to Functionally Important Residues
Most proteins exhibit residues at positions within the amino acid sequence that have been conserved throughout the evolutionary history due to functional constraints. Natural selection is responsible for this conservation, as a change at these residues will have a negative affect on protein function. The identification of such functionally conserved residues using coevolution methods is impossible, as no variability can be measured at these sites and hence no information exists to conduct powerful statistical tests. All functional sites, however, are not conserved throughout evolutionary history and, therefore, the in silico prediction of functional residues within proteins remains a realistic endeavor. In these analyses, we have observed many coevolving sites to be spatially proximal to residues previously implicated to be of functional importance. Therefore, it is plausible to suggest that mutation of highly conserved amino acids 3-dimensionally proximal to residues identified as coevolving may, in fact, enable the identification of highly conserved functionally important sites within proteins.
The Identification of a Novel Putative Functional Domain within Hop?
With one exception, all the domains within Hop identified as coevolving with Hsp70 and/or Hsp90 have been previously identified as functionally important in interactions within the Hsp70HopHsp90 proteinfolding system. In our analysis we have observed one region within Hop, termed here as the TPR linker domain, which has not been previously shown to be important for the interaction of Hop with Hsp70/Hsp90. The TPR linker domain is not conserved within lower eukaryotes, whereas it is extremely conserved between higher eukaryotes. In fact, within this domain, when comparing Hop sequences from higher and lower eukaryotes, using the computer software DIVERGE (Gu and Vander Velden 2002
), we identified 2 residues (202N and 206S) as having undergone functional divergence type I (altered evolutionary rates at amino acids in higher eukaryotes when compared with lower eukaryotes, due to novel functional constraints within higher eukaryotes). Although no 3-dimensional structures are available for Hop, neural network analysis using the computer program CONSEQ (Berezin et al. 2004
) predicted that the TPR linker is exposed on the molecule and also that a number of residues within this domain are functionally important in both S. cerevisiae and Homo sapiens Sti1/Hop (data not shown). In the absence of conclusive mutagenesis data and 3-dimensional structures, it is difficult to definitively propose a function for this TPR linker domain, but we feel that, due to its proximity to the TPR1 and TPR2A domains in Hop, it may play a role in binding and/or regulation of Hsp70 and Hsp90 interactions with Hop. Our proposal of a function for the TPR linker region was purely based on the in silico analysis performed in this study.
However, following the in silico analysis, we uncovered data by Longshaw et al. (2004)
proposing that this region may, in fact, perform a role in Hop binding to Hsp70/Hsp90. Within our proposed TPR linker domain, they identified a casein kinase II (CKII) phosphorylation site and a cdc2 kinase phosphorylation site at positions 189S and 198T, respectively, in murine Hop. They observed that mimicking phosphorylation at 189S promotes nuclear localization of Hop, whereas phosphorylation at 198T promotes cytoplasmic localization of Hop at the G1/S-phase transition. Although the mutations at these positions did not seem to affect Hop binding with Hsp70 and Hsp90, they suggested that phosphorylation at these positions may be involved in regulation of the nuclear localization signal and therefore indirectly in the assembly of Hsp70HopHsp90 complexes.
Taking into account the conclusions of Longshaw et al. (2004)
and the observation, in this study, of coevolution only within functionally important domains in Hop, we propose that this TPR linker domain is, in fact, functionally important within the Hsp70HopHsp90 complexes. The fact that many coevolving sites are neighboring or directly contacting functionally important regions or amino acid sites previously proposed is due to that coevolutionary analyses are based upon the presence of evolutionary information. This type of analyses therefore misses highly conserved sites. The a priori assumption of our method that considers important those coevolving sites surrounding functionally important and highly conserved regions (see Materials and Methods) ameliorates this problem and permits the detection of functional sites in the protein. We think that further mutagenesis analysis should be performed in definitively defining the role of this region within the Hsp70HopHsp90 proteinfolding system. We suggest that these analyses should be performed in organisms from both higher and lower eukaryotes as we have observed a degree of functional divergence within this TPR linker domain between them.
| Supplementary Material |
|---|
|
|
|---|
Supplementary materials are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
This work was supported by Science Foundation Ireland under the program of the President of Ireland Young Researcher Award to M.A.F.
| Footnotes |
|---|
Michele Vendruscolo, Associate Editor
| References |
|---|
|
|
|---|
Abbas-Terki T, Briand PA, Donze O, Picard D. (2002) The Hsp90 co-chaperones Cdc37 and Sti1 interact physically and genetically. Biol Chem 383:13351342.[CrossRef][ISI][Medline]
Abbas-Terki T, Donze O, Briand PA, Picard D. (2001) Hsp104 interacts with Hsp90 cochaperones in respiring yeast. Mol Cell Biol 21:75697575.
Ali MM, Roe SM, Vaughan CK, Meyer P, Panaretou B, Piper PW, Prodromou C, Pearl LH. (2006) Crystal structure of an Hsp90-nucleotide-p23/Sba1 closed chaperone complex. Nature 440:10131017.[CrossRef][Medline]
Atchley WR, Zhao J, Fernandes AD, Druke T. (2005) Solving the protein sequence metric problem. Proc Natl Acad Sci USA 102:63956400.
Berezin C, Glaser F, Rosenberg J, Paz I, Pupko T, Fariselli P, Casadio R, Ben-Tal N. (2004) ConSeq: the identification of functionally and structurally important residues in protein sequences. Bioinformatics 20:13221324.
Bohen SP and Yamamoto KR. (1993) Isolation of Hsp90 mutants by screening for decreased steroid receptor function. Proc Natl Acad Sci USA 90:1142411428.
Bose S, Weikl T, Bugl H, Buchner J. (1996) Chaperone function of Hsp90-associated proteins. Science 274:17151717.
Bowers PM, Pellegrini M, Thompson MJ, Fierro J, Yeates TO, Eisenberg D. (2004) Prolinks: a database of protein functional linkages derived from coevolution. Genome Biol 5:R35.[CrossRef][Medline]
Carrigan PE, Sikkink LA, Smith DF, Ramirez-Alvarado M. (2006) Domain:domain interactions within Hop, the Hsp70/Hsp90 organizing protein, are required for protein stability and structure. Protein Sci 15:522532.
Chang HC, Nathan DF, Lindquist S. (1997) In vivo analysis of the Hsp90 cochaperone Sti1 (p60). Mol Cell Biol 17:318325.[Abstract]
Dittmar KD, Banach M, Galigniana MD, Pratt WB. (1998) The role of DnaJ-like proteins in glucocorticoid receptor.hsp90 heterocomplex assembly by the reconstituted Hsp90.p60. Hsp70 foldosome complex. J Biol Chem 273:73587366.
Fares MA. (2006) Computational and statistical methods to explore the various dimensions of protein evolution. Curr Bioinform 1:207217.
Fares MA and McNally D. (2006) CAPS: coevolution analysis using protein sequences. Bioinformatics 22:28212822.
Fares MA and Travers SA. (2006) A novel method for detecting intramolecular coevolution: adding a further dimension to selective constraints analyses. Genetics 173:923.
Flaherty KM, DeLuca-Flaherty C, McKay DB. (1990) Three-dimensional structure of the ATPase fragment of a 70K heat-shock cognate protein. Nature 346:623628.[CrossRef][Medline]
Flom G, Weekes J, Williams JJ, Johnson JL. (2005) Effect of mutation of the TPR and DP2 domains of Sti1 on Hsp90 signaling and interaction in Saccharomyces cerevisiae. Genetics 172:4151.
Gloor GB, Martin LC, Wahl LM, Dunn SD. (2005) Mutual information in protein multiple sequence alignments reveals two classes of coevolving positions. Biochemistry 44:71567165.[CrossRef][Medline]
Glover JR and Lindquist S. (1998) Hsp104, Hsp70, and Hsp40: a novel chaperone system that rescues previously aggregated proteins. Cell 94:7382.[CrossRef][ISI][Medline]
Gross M and Hessefort S. (1996) Purification and characterization of a 66-kDa protein from rabbit reticulocyte lysate which promotes the recycling of hsp 70. J Biol Chem 271:1683316841.
Gu X and Vander Velden K. (2002) DIVERGE: phylogeny-based analysis for functional-structural divergence of a protein family. Bioinformatics 18:500501.
Harrison CJ, Hayer-Hartl M, Di Liberto M, Hartl F, Kuriyan J. (1997) Crystal structure of the nucleotide exchange factor GrpE bound to the ATPase domain of the molecular chaperone DnaK. Science 276:431435.
Henikoff S and Henikoff JG. (1992) Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci USA 89:1091510919.
Hightower LE. (1991) Heat shock, stress proteins, chaperones, and proteotoxicity. Cell 66:191197.[CrossRef][ISI][Medline]
Jiang J, Prasad K, Lafer EM, Sousa R. (2005) Structural basis of interdomain communication in the Hsc70 chaperone. Mol Cell 20:513524.[CrossRef][ISI][Medline]
Karunakaran KP, Noguchi Y, Read TD, Cherkasov A, Kwee J, Shen C, Nelson CC, Brunham RC. (2003) Molecular analysis of the multiple GroEL proteins of Chlamydiae. J Bacteriol 185:19581966.
Korber BT, Farber RM, Wolpert DH, Lapedes AS. (1993) Covariation of mutations in the V3 loop of human immunodeficiency virus type 1 envelope protein: an information theoretic analysis. Proc Natl Acad Sci USA 90:71767180.
Kosano H, Stensgard B, Charlesworth MC, McMahon N, Toft D. (1998) The assembly of progesterone receptor-Hsp90 complexes using purified proteins. J Biol Chem 273:3297332979.
Longshaw VM, Chapple JP, Balda MS, Cheetham ME, Blatch GL. (2004) Nuclear translocation of the Hsp70/Hsp90 organizing protein mSTI1 is regulated by cell cycle kinases. J Cell Sci 117:701710.
Maddison WP and Maddison DR. (1992) MacClade. (Sinauer, Sunderland (MA)).
Meyer P, Prodromou C, Hu B, Vaughan C, Roe SM, Panaretou B, Piper PW, Pearl LH. (2003) Structural and functional analysis of the middle segment of Hsp90: implications for ATP hydrolysis and client protein and cochaperone interactions. Mol Cell 11:647658.[CrossRef][ISI][Medline]
Odunuga OO, Longshaw VM, Blatch GL. (2004) Hop: more than an Hsp70/Hsp90 adaptor protein. Bioessays 26:10581068.[CrossRef][ISI][Medline]
Osipiuk J, Walsh MA, Freeman BC, Morimoto RI, Joachimiak A. (1999) Structure of a new crystal form of human Hsp70 ATPase domain. Acta Crystallogr Sect D Biol Crystallogr 55:11051107.[CrossRef][Medline]
Owens-Grillo JK, Czar MJ, Hutchison KA, Hoffmann K, Perdew GH, Pratt WB. (1996) A model of protein targeting mediated by immunophilins and other proteins that bind to Hsp90 via tetratricopeptide repeat domains. J Biol Chem 271:1346813475.
Pellecchia M, Montgomery DL, Stevens SY, Vander Kooi CW, Feng HP, Gierasch LM, Zuiderweg ER. (2000) Structural insights into substrate binding by the molecular chaperone DnaK. Nat Struct Biol 7:298303.[CrossRef][ISI][Medline]
Pollock DD, Taylor WR, Goldman N. (1999) Coevolving protein residues: maximum likelihood identification and relationship to structure. J Mol Biol 287:187198.[CrossRef][ISI][Medline]
Poole AM and Ranganathan R. (2006) Knowledge-based potentials in protein design. Curr Opin Struct Biol 16:508513.[CrossRef][ISI][Medline]
Pritchard L and Dufton MJ. (2000) Do proteins learn to evolve? The Hopfield network as a basis for the understanding of protein evolution. J Theor Biol 202:7786.[CrossRef][ISI][Medline]
Prodromou C, Panaretou B, Chohan S, Siligardi G, OBrien R, Ladbury JE, Roe SM, Piper PW, Pearl LH. (2000) The ATPase cycle of Hsp90 drives a molecular clamp' via transient dimerization of the N-terminal domains. Embo J 19:43834392.[CrossRef][ISI][Medline]
Prodromou C, Roe SM, O'Brien R, Ladbury JE, Piper PW, Pearl LH. (1997) Identification and structural characterization of the ATP/ADP-binding site in the Hsp90 molecular chaperone. Cell 90:6575.[CrossRef][ISI][Medline]
Prodromou C, Roe SM, Piper PW, Pearl LH. (1997) A molecular clamp in the crystal structure of the N-terminal domain of the yeast Hsp90 chaperone. Nat Struct Biol 4:477482.[CrossRef][ISI][Medline]
Prodromou C, Siligardi G, O'Brien R, Woolfson DN, Regan L, Panaretou B, Ladbury JE, Piper PW, Pearl LH. (1999) Regulation of Hsp90 ATPase activity by tetratricopeptide repeat (TPR)-domain co-chaperones. Embo J 18:754762.[CrossRef][ISI][Medline]
Revington M, Zhang Y, Yip GN, Kurochkin AV, Zuiderweg ER. (2005) NMR investigations of allosteric processes in a two-domain Thermus thermophilus Hsp70 molecular chaperone. J Mol Biol 349:163183.[CrossRef][ISI][Medline]
Richter K, Muschler P, Hainzl O, Buchner J. (2001) Coordinated ATP hydrolysis by the Hsp90 dimer. J Biol Chem 276:3368933696.
Richter K, Muschler P, Hainzl O, Reinstein J, Buchner J. (2003) Sti1 is a non-competitive inhibitor of the Hsp90 ATPase. Binding prevents the N-terminal dimerization reaction during the ATPase cycle. J Biol Chem 278:1032810333.
Roberts BT, Moriyama H, Wickner RB. (2004) [URE3] prion propagation is abolished by a mutation of the primary cytosolic Hsp70 of budding yeast. Yeast 21:107117.[CrossRef][ISI][Medline]
Scheufler C, Brinker A, Bourenkov G, Pegoraro S, Moroder L, Bartunik H, Hartl FU, Moarefi I. (2000) Structure of TPR domain-peptide complexes: critical elements in the assembly of the Hsp70-Hsp90 multichaperone machine. Cell 101:199210.[CrossRef][ISI][Medline]
Shim Choi S, Li W, Lahn BT. (2005) Robust signals of coevolution of interacting residues in mammalian proteomes identified by phylogeny-aided structural analysis. Nat Genet 37:13671371.[CrossRef][ISI][Medline]
Shomura Y, Dragovic Z, Chang HC, Tzvetkov N, Young JC, Brodsky JL, Guerriero V, Hartl FU, Bracher A. (2005) Regulation of Hsp70 function by HspBP1: structural analysis reveals an alternate mechanism for Hsp70 nucleotide exchange. Mol Cell 17:367379.[ISI][Medline]
Smith DF, Sullivan WP, Marion TN, Zaitsu K, Madden B, McCormick DJ, Toft DO. (1993) Identification of a 60-kilodalton stress-related protein, p60, which interacts with Hsp90 and hsp70. Mol Cell Biol 13:869876.
Socolich M, Lockless SW, Russ WP, Lee H, Gardner KH, Ranganathan R. (2005) Evolutionary information for specifying a protein fold. Nature 437:51

) or Hsp90 (*) are also shown.


