MBE Advance Access originally published online on January 19, 2006
Molecular Biology and Evolution 2006 23(4):764-772; doi:10.1093/molbev/msj098
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Diversification and Specialization of HIV Protease Function During In Vitro Evolution
Department of Biochemistry, Center for Fundamental and Applied Molecular Evolution, Emory University School of Medicine, Rollins Research Center, Atlanta, GA, USA
E-mail: imatsum{at}emory.edu.
| Abstract |
|---|
|
|
|---|
Our goal is to understand how enzymes adapt to utilize novel substrates. We and others have shown that directed evolution tends to generate enzyme variants with broadened substrate specificity. Broad-specificity enzymes are generally deleterious to living cells, so this observed trend might be an artifact of the most commonly employed high throughput screens. Here, we demonstrate a more natural and effective screening strategy for directed evolution. The gene encoding model enzyme HIV protease was randomly mutated, and the resulting library was expressed in Escherichia coli cells to eliminate cytotoxic broad-specificity variants. The surviving variants were screened for clones with activity against a reporter enzyme. The wild-type human immunodeficiency virus type I protease (HIV PR) is cytotoxic and exhibits no detectable activity in reactions with beta-galactosidase (BGAL). In contrast, the selected variants were nontoxic and exhibited greater activity and specificity against BGAL than did the wild-type HIV PR in reactions with any substrate. A single round of whole gene random mutagenesis and conventional high-throughput screening does not usually effect complete inversions of substrate specificity. This suggests that a combination of positive and purifying selection engenders more rapid adaptation than positive selection alone.
Key Words: directed evolution random mutagenesis high-throughput screening aspartic protease
| Introduction |
|---|
|
|
|---|
Our goal is to understand how enzymes evolve to recognize novel substrates. Adaptive molecular evolution is worthy of study for several reasons. Metazoan speciation is manifested at the macroscopic level but is driven by microscopic changes in protein structure and function. Population genetic models are generally applicable to evolving populations of proteins as the same rules should apply to complex systems at all levels of biological order. Molecules are easier to manipulate and study than whole organisms and are therefore more amenable to experimental hypothesis testing. Finally, studies of molecular adaptation are very practical as they enable the artificial evolution of new enzymes with useful properties.
Phylogenetic analysis has proven successful at identifying instances of natural adaptive molecular evolution. This contribution is significant because neutral mutations are fixed at a relatively high constant rate due to genetic drift (Kimura 1983
). The most common approach is to compare the rates of nonsynonymous (missense) and synonymous (silent) substitutions; unusually high ratios are taken to indicate adaptive evolution (Yang and Bielawski 2000
; Swanson 2003
). In certain cases, gene synthesis or site-directed mutagenesis has been applied to "resurrect" the ancestral genes predicted by phylogenetic inference algorithms. The subsequent expression, purification, and characterization of the corresponding proteins can support (or reject) predictions based on phylogenetic analyses (Thornton 2004
).
Adaptive mutations that affect substrate specificity are particularly difficult to identify for several reasons. First, enzyme-substrate specificity is often conserved for billions of years (Golding and Dean 1998
). The adaptive mutations that affect specificity have thus become obscured by the accumulation of subsequent neutral mutations. Second, the relationship between the structure and function of existing proteins is often unclear even when an X-ray crystal structure is available. The conformational dynamics of an enzyme is often an important determinant of its specificity, and mutations that affect this property can occur anywhere in the protein (Benkovic and Hammes-Schiffer 2003
; James and Tawfik 2003
). Third, the criteria that determine the overall fitness of any particular protein at any particular time remain incompletely understood. Phylogenetic approaches alone are thus insufficient to understand the rules that govern of adaptive molecular evolution.
We and many others (Stemmer 1994
; Moore and Arnold 1996
; Yano, Oue, and Kagamiyama 1998
; Aharoni et al. 2005
; Varadarajan et al. 2005
) control the evolutionary process in the laboratory and observe populations of proteins as they adapt to new substrates. We can repeat informative experiments, varying population sizes, mutation rates, recombination rates, and the selection criteria. In contrast to those who evolve bacterial populations in vitro, it is easy for us to identify potentially beneficial mutations, infer their mechanistic effects (through computational molecular modeling), and to purify and characterize the ancestral and evolved protein variants. Neutral and slightly deleterious mutations are rarely fixed in any of our experiments, in contrast to the rates observed in natural gene families.
Laboratory evolution can provide insight into the mechanisms of adaptive molecular evolution. We (Matsumura and Ellington 2001
) and others (Aharoni et al. 2005
) have shown that directed evolution tends to produce enzyme variants with broadened substrate specificity. This trend suggests that enzymes fluctuate through "specialist" and "generalist" states as they adapt to new substrates. Laboratory evolution may allow the isolation of mutants that would not be discovered through natural evolution, and vice versa, due to operational differences. First, natural molecular populations are larger than those produced by the polymerase chain reaction (PCR) and cloning. Second, cells in nature are normally challenged with libraries of novel substrates. In contrast, directed evolution experiments are generally contrived with a wild-type enzyme that exhibits modest but detectable activity against a single novel substrate. Third, laboratory evolution is able to isolate certain generalist enzymes that may be deleterious to cells in nature. We are therefore motivated to devise more realistic and effective selection schemes.
The human immunodeficiency virus type I protease (HIV PR) is a good model protein. Its structure and function are well understood, and it is amenable to high throughput assays. HIV PR normally catalyzes the cleavage of nine divergent amino acid sequences within the HIV-1 Gag-Pol polyprotein, thereby activating viral proteins essential for replication (Kohl et al. 1988
). The enzyme is thought to recognize a shape assumed by its polypeptide substrates, rather than a consensus amino acid sequence (Prabu-Jeyabalan, Nalivaika, and Schiffer 2002
). The enzyme is specific in the sense that it does not recognize other sites within the HIV-1 proteome, but we consider it nonspecific because it recognizes essential proteins in Escherichia coli (Baum, Bebernitz, and Gluzman 1990b), yeast, and mammalian cells (Blanco, Carrasco, and Ventoso 2003
) and is cytotoxic to all three species. HIV protease is normally expressed late in the viral life cycle, so the cleavage of host proteins imposes little or no fitness cost to the virus in its natural environment.
Here, we observe the divergence of HIV PR specificity in an artificial system designed to recapitulate the natural selection of a generic protease. Random mutations were introduced by mutagenic PCR. The toxicity of the wild-type enzyme was exploited to eliminate variants with broad specificity, reflecting the fitness costs of generalist enzymes upon living cells. The surviving variants were challenged with a de facto library of peptide substrates consisting of the
1,023 amino acids of a modified form of the E. coli beta-galactosidase (BGAL). Several specific (nontoxic) variants emerged after a single round of random mutagenesis and screening, some of which exhibited wild-typelike levels of activity against novel substrates not recognized by the wild-type enzyme.
| Experimental Procedures |
|---|
|
|
|---|
Materials
The HIV PR/lacZ coexpression vector, p1 + IQ (Baum, Bebernitz, and Gluzman 1990a), was purchased from the American Type Culture Collection (Manassas, Va.). 6his-lacZ-pET28 (induction control E) was from Novagen (Madison, Wisc.), and E. coli strain DH5
lac (DE3) was constructed as previously described (Matsumura and Ellington 2001
F' was from Invitrogen (Carlsbad, Calif.). The GeneAmp XL PCR kit was from Perkin Elmer (Boston, Mass.). Molecular biology enzymes, including restriction enzymes, T4 DNA ligase and T4 DNA polymerase, were from New England Biolabs (Beverly, Mass.). All oligonucleotides were custom synthesized by IDT (Coralville, Iowa); rabbit polyclonal antibodies to BGAL and monoclonal anti-HIV antibodies were from Abcam (ab616); the IRD700 infraredlabeled goat anti-rabbit antibody was obtained from LI-COR (Lincoln, Nebr.). All other chemicals, including the fluorogenic peptide HIV PR substrate I (with the sequence Arg-Glu-(EDANS)-Ser-Gln-Asn-Tyr-Pro-Ile-Val-Gln-Lys-(DABCYL)-Arg), M9 salts, ampicillin, and para-nitrophenyl-beta-D-galactopyranoside (pNP-gal) were from Sigma-Aldrich (St. Louis, Mo.).
Cloning
Vector p1 + IQ (Baum, Bebernitz, and Gluzman 1990a) was used to coexpress the HIV PR and BGAL variants in E. coli. The p6 site in lacZ (after codon 80) was present in the p1 + IQ vector. The TNF-
peptide sequence (NRPDYLLFAE) was cloned into this vector, replacing the p6, by whole plasmid PCR and self-ligation (Geddie and Matsumura 2004
) as follows. The p1 + IQ plasmid was amplified using primers: 5'-CTTCTTTTCGCCGAGGAGGCCGATACTGTCGTCGTCCCC-3' and 5'-GTAATCGGGTCGGTTAGGAAGATCGCACTCCAGCCAGCTT-3' (bold sequence encodes TNF-
peptide and underlined sequence complements lacZ). The presence of the TNF-
site was verified with DNA sequencing. Random mutations were introduced into the HIV PR gene in a error-prone PCR (Cadwell and Joyce 1992
) using the extragenic primers 5'-CTACCATTACCAGTTGGTCTGGTGTCA-3' and 5'-CGTTTCACTTCTGAGTTCGGCATG-3'. In short, the mutations were introduced in an error-prone PCR containing 5 microliters of a 10x mutagenic PCR mix (8 mM dTTP, 8 mM dCTP, 48 mM MgCl2, 5 mM MnCl2) in 50 microliters (total volume). The PCR was cycled 25 times between 94°C for 30 seconds, 65°C for 30 seconds and 72°C for 1 minute. The addition of MnCl2 to the reaction reduced the fidelity of the polymerase and the excess of dTTP and dCTP offset the balance of dNTPs. Adding MnCl2 to the reaction reduces the fidelity of the polymerase, and the excess of dTTP and dCTP offsets the balance of deoxynucleoside triphosphates. The library of mutant HIV PR alleles was cloned into a TNF-
/BGAL variant of the p1 + IQ coexpression vector using the enzymes BsmI and PstI and electroporated (Dower, Miller, and Ragsdale 1988
) into E. coli W2244 (lacZ, lacY+) (Cook and Lederberg 1962
) to produce a library containing 500,000 random mutants.
Western Blot Analysis
p1 + IQ vectors containing various combinations of HIV PR (wild type, D25N, P9S/I50L) and lacZ (wild type, p6, and TNF-
) alleles were propagated in LB-ampicillin media to midlogarithmic phase and induced with 1 mM isopropyl-beta-D-thiogalactopyranoside (IPTG) for 1 h at 37°C. Cells were harvested by micro-centrifugation in microfuge at 3,000 rpm for 10 min and resuspended in sodium dodecyl sulfate (SDS)/beta-mercaptoethanol buffer (Sambrook and Russell 2001
). The proteins in these extracts were electrophoretically separated in a 6% polyacrylamide gel and transferred to a nitrocellulose membrane. The membrane was blocked with 4% nonfat milk, reacted with polyclonal anti-BGAL antibodies and IRD700-labeled secondary antibodies. The immobilized antibody-antigen complexes were visualized and quantified on a LI-COR Odyssey infrared imager.
In Vivo BGAL Activity
P1 + IQ-type vectors for the coexpression of HIV PR (wild type, D25N, or P9S/I50L) and BGAL (wild type, native p6, or novel TNF-
) were constructed. Escherichia coli strain W2244 (lacZ, lacY+) was transformed with each of the nine constructs and separately propagated on LB-ampicillin agar plates. The resulting colonies were scraped off of each plate and resuspended in liquid LB-ampicillin. The OD600 of each culture was measured, and the cells were diluted to the same density. Twenty microliters of each culture (in triplicate) were reacted with 2 mM pNP-gal in 50 mM Tris-HCl/10 mM MgCl2 (pH 7.6). The OD405 was then monitored for 5 h in a Thermo Labsystems Multiskan Ascent microplate spectrophotometer.
Protein Purification
The HIV PR alleles from p1 + IQ were amplified with 5'-NdeI HIV PR (5'-GGAATTCCATATGCCTCAGATCACTCTTTGGCAACGACC-3'), where the bold sequence is an NdeI restriction site and the underlined sequence complements the HIV PR gene and the extragenic pUC-600 (5'-GAAAACGTTCTTCGGGGCGAA-3'). The PCR products were purified and cloned into the pET28+ vector (Novagen) using restriction enzymes NdeI and HindIII. This fused DNA encoding an N-terminal histidine tag and thrombin cleavage site to the 5' end of the HIV PR gene. The clone was sequenced to verify the absence of undesired mutations from the PCR procedure. The protease was purified according to an established protocol (Vickrey et al. 2003
) that describes the purification of the protease from inclusion bodies. Briefly, a 10-ml culture of the his-tagged protease was grown to saturation in LB media supplemented with 25 mg/ml kanamycin at 37°C (LB-kan). The culture was diluted 1:100 in 1 liter LB-kan, grown to midlogarithmic phase, and induced for 3 h with 1 mM IPTG. The cells were harvested by centrifugation, and the pellet was frozen at 80°C.
The cells were resuspended in lysis buffer (50 mM Tris, pH 7.5, 200 mM sodium chloride, and 2 mM ethylenediaminetetraacetic acid [EDTA]) and lysed by the application of 20,000 PSI of force in two passages through a French pressure cell. The inclusion bodies were separated from the soluble protein fraction by centrifugation, resuspended in 6 M urea, and applied to a nickel affinity column equilibrated with lysis buffer and 6 M urea. The column was sequentially washed with three lysis buffers containing 6 M urea and 10, 25, and 50 mM imidazole, respectively. HIV PR was step eluted with 200 mM and 1 M imidazole. The denatured protease was placed in dialysis tubing and dialyzed initially against buffer (20 mM sodium acetate, pH 5, 1 mM dithiothreitol DTT), plus 3 M urea and then the same buffer with 1 M urea and finally two exchanges of buffer without urea. The protein was concentrated using a Centriprep YM-3 concentrator (Millipore, Bedford, Mass.). All HIV PR protein was concentrated to > 1 microgram/microliter, as quantified by a Bradford assay (Bradford 1976
). The proteins were stored at 20°C in 20 mM sodium acetate, pH 5, 1 mM DTT, and 50% glycerol and were stable for over 2 months. The protease was purified to greater than 95% purity, as judged by sodium dodecyl sulfatepolyacrylamide gel electrophoresis (SDS-PAGE) analysis (Sambrook and Russell 2001
).
Enzyme Assays with Fluorogenic Oligopeptide Substrates
The protease assays were initiated by first adding varying amounts of HIV protease substrate I (1.4513.85 µM) to HIV PR assay buffer (0.1 M sodium acetate, 1.0 M sodium chloride, 1.0 mM EDTA, 1.0 mM DTT, 1mg/ml BSA, pH 4.7) and then adding a constant amount of purified protease (191.5 nM) to a total reaction volume of 130 µl. The enzymes and substrates were reacted for 20 min at 25°C on an SLM-Aminco Bowman Series 2 spectrofluorimeter. The assays were performed in triplicate at each substrate concentration. The purified, inactive D25N variant did not react with either peptide substrate (negative control, data not shown). To convert fluorescence units per time to molarity per time, a known amount of the 5-(2-aminoethylamino)-1-naphthalene sulfonate (EDANS) fluorophore was used as a reference compound, and its absorbance was recorded. The equation Vc = Vi(Cref/Iref), where Vc is the concentration of product per unit time, Vi is the fluorescence per unit time, Cref is the known concentration of EDANS, and Iref is the fluorescence intensity of EDANS for the given concentration. The values were also corrected for the differences of the intensities of equimolar quantities of reference compound to that of the free cleavage product by multiplying by this ratio (Matayoshi et al. 1990
).
| Results |
|---|
|
|
|---|
High-Throughput HIV PR Screen
Our objective is to study adaptive enzyme evolution under conditions that simulate the natural selection of a generic protease. Previous workers have found that many high-throughput screens (those divorced from organismal fitness) tend to favor the selection of generalist variants with broad specificity, at least as an intermediate. Here, we describe a system that discriminates against such generalists and challenges each randomly mutated enzyme with a library of possible substrates. It has been shown that HIV PR catalyzes the cleavage of the native p6 peptide (VSFNF
PQITL) in the context of a permissive surface loop (amino acid position 80) of the E. coli BGAL (p6/BGAL). The cleavage reaction occurs in vivo when the HIV PR and p6/BGAL proteins are coexpressed in E. coli, enabling high-throughput assays based on inactivation of BGAL activity (Baum, Bebernitz, and Gluzman 1990a).
We arbitrarily chose a relatively hydrophobic 10-amino acid peptide (NRPDY
LLFAE) as a novel HIV PR substrate; this sequence is normally displayed on a surface loop of TNF-
(Eck and Sprang 1989
), a drug target that causes inflammatory diseases (Beutler and Cerami 1986
). We inserted the TNF-
peptide into lacZ after codon 80 (TNF-
/BGAL) by site-directed insertion mutagenesis. Coexpression of TNF-
/BGAL with either the wild-type or the catalytically inactive (negative control) D25N HIV PR mutant demonstrated that the wild-type protease exhibited detectable activity against the novel substrate (table 1), normally a prerequisite for in vitro evolution. The wild-type HIV PR thus strongly prefers the TNF-
peptide over all the other peptides of TNF-
/BGAL.
|
Random Mutagenesis and Selection
Random mutations were introduced into the HIV PR gene in a error-prone PCR; the cumulative mutation rate of this procedure is about 0.66 ± 0.13% per position (Cadwell and Joyce 1992
/BGAL coexpression vector and electroporated into E. coli W2244 (lacZ, lacY+). Sixty-thousand transformed cells were propagated in LB-ampicillin media to midlogarithmic phase; expression of the HIV PR library and TNF-
/BGAL proteins were induced with IPTG. The wild-type HIV PR catalyzes the cleavage of essential E. coli proteins and is therefore cytotoxic (Baum, Bebernitz, and Gluzman 1990b); this initial selection eliminated active variants with broad specificity. After 2 h of induction, the surviving
15,000 cells were spread onto plates containing 100 µg/µl ampicillin at a density of
500 colony-forming units per plate and grown overnight at 37°C.
The transformed colonies were adsorbed to nitrocellulose filters and transferred (colony side up) onto plates containing ampicillin and 0.08 mg/ml 5-bromo-4-chloro-3-indolyl-beta-D-galactopyranoside (X-gal), a histochemical marker of BGAL activity. Control colonies expressing the wild-type HIV PR and TNF-
/BGAL were a lighter blue, indicating protease activity, compared with those expressing with D25N variant under these conditions, indicating protease activity. We identified 27 light blue colonies (presumably expressing HIV PR variants with altered specificity) and restreaked these onto LB-ampicillinX-gal plates to verify the decrease in BGAL activity. Fewer than half (12 clones) retained the light blue phenotype, and seven of these were discarded after restriction mapping analysis indicated large deletions within the p1 + IQ plasmid. In summary, we identified five protease variants (out of 60,000 clones) that were less toxic than the wild type but exhibited reproducibly greater activity against the TNF-
/BGAL protein.
Sequences of the Evolved HIV PR Variants
The selected HIV PR genes were sequenced (both strands) in order to determine the mutations that altered specificity. The following amino acid replacements were observed: P9S/I50L, Q7L (twice), W6R/F53S/N83I/I93N, L5F. We mapped them upon the crystal structure of the enzyme-substrate complex (Prabu-Jeyabalan, Nalivaika, and Schiffer 2002
) and found that the mutations occurred in residues either in the substrate-binding cleft or at the dimer interface (fig. 1). The possible adaptive mechanisms of these mutations are described in Discussion. We studied the behavior of these mutants in vivo (within E. coli) and in vitro to learn how the amino acid changes altered substrate specificity.
|
In Vivo Behavior
In Vivo Activity
Amongst the four unique HIV PR variants (P9S/I50L, Q7L, W6R/F53S/N83I/I93N, L5F), P9S/I50L was the sole protease that exhibited improved specificity for the TNF-
/BGAL substrate. The three other mutations showed decreased BGAL activity, regardless of the sequence of the inserted peptide. To quantify the inversion of specificity exhibited by the P9S/I50L variant, we co-expressed three HIV PR variants (wild type, D25N, and P9S/I50L) with each of the three BGAL variants (containing the wild type, p6, and TNF-
peptides at amino acid 80) in liquid culture and reacted each culture with pNP-gal (table 1). Increased protease activity is revealed by diminished BGAL activity (relative to the catalytically inactive D25N control). The wild-type HIV PR exhibited similar activity in reactions with p6/BGAL (
21% less BGAL activity than corresponding D25N control) and TNF-
/BGAL (
25% less BGAL activity than D25N). In contrast, the P9S/I50L variant exhibited little activity against p6/BGAL (
6% less than D25N) but significantly improved activity against TNF-
/BGAL (
50% less BGAL activity than D25N). These results suggest that the P9S/I50L mutations confer specificity for the TNF-
peptide.
Toxicity
The heterologous expression of the wild-type protease is toxic to E. coli due to nonspecific cleavage of essential cellular proteins (Baum, Bebernitz, and Gluzman 1990b). Escherichia coli transformed with HIV PR/BGAL coexpression vectors were grown to midlogarithmic phase. The HIV PRs (wild type, D25N, P9S/I50L, Q7L, or W6R/F53S/N83I/I93N, L5F) and BGAL (p6/BGAL) were induced with IPTG. The cultures were diluted 105-fold and plated at time points 0, 1, and 2 h. After overnight incubation at 37°C, the colonies were counted and the viability of each culture was calculated. This survival assay confirmed that the wild-type HIV PR was toxic to E. coli, but the catalytically inactive D25N and artificially selected HIV PR variants were not (Figure 2).
|
Western Blot
Anti-BGAL polyclonal antibodies were employed to visualize the cleavage products of the PRs within cell extracts. Cells transformed with variants of the p1 + IQ plasmid (wild type, D25N or P9S/I50L HIV PR plus wild type, p6/BGAL or TNF-a/BGAL) were propagated to midlogarithmic phase and induced with IPTG. Extracts from these cells were visualized by SDS-PAGE and immunoblotting (fig. 3). The expected fragments of BGAL when cleaved at the engineered site are approximately 17 and 108 kDa. The western blot clearly shows cleavage of the p6/BGAL by wild-type protease (lane 2) but not by D25N (lane 3); a modest product can be seen when coexpressed with P9S, I50L (lane 4). Similarly, the TNF-
site within TNF-
/BGAL is cleaved (albeit incompletely) by the selected P9S/I50L variant (lane 7), but not by D25N (lane 6), and to a lesser degree by the wild-type protease (lane 5). A lower band present in all lanes, including those containing the inactive D25N HIV PR mutant, is probably either a nonspecific cross-reactant with the BGAL antibody or the product of some endogenous protease. These results suggest that the P9S/I50L HIV PR variant recognizes the novel TNF-
peptide rather than some other part of BGAL. To determine if the P9S/I50L mutation affected protein expression levels, soluble and whole cell extracts were analyzed by Western blotting analysis using a monoclonal anti-HIV PR antibody (Figure 4). The levels of expression differed only modestly. The increased activity of the P9S/I50L HIV protease against the TNF-
/BGAL reporter is therefore not likely due to increased expression levels.
|
|
In Vitro Behavior
Synthetic Substrate
The wild-type, D25N, P9S/I50L, W6R/F53S/N83I/I93N, Q7L HIV PRs were purified and refolded by an established protocol (Vickrey et al. 2003
3% wild-type activity. These results are consistent with the in vivo assays above, which show that P9S/I50L HIV PR is relatively inactive against the p6/BGAL protein.
|
BGAL Substrate
We next showed that the purified HIV PR variants could cleave the purified BGAL variants in vitro. The his-tagged wild-type, p6- and TNF-
/ BGAL proteins were purified by immobilized metal affinity chromatography. The HIV PRs were reacted with the BGAL variants in phosphate buffer (25 mM Na2HPO4, pH 7.0; 25 mM NaCl; 1 mM DTT, 30°C for 1 h). The cleavage products were then separated on 6% polyacrylamide gels and stained with SYPRO dye (Invitrogen). Figure 6A shows a representative gel of the degradation of p6/BGAL by HIV PR. Figure 6B shows the quantification of the upper band (full-length BGAL) at each time point. We chose to quantify the band representing the full-length BGAL in each experiment because it is evident that the 100-kDa product is subject to further degradation, possibly because this product is conformationally unstable. The purified P9S/I50L PR is
2.5-fold more active than the wild-type enzyme in reactions with the TNF-
/BGAL. Neither version of BGAL is cleaved by D25N PR; the wild-type BGAL is not cleaved by the wild type, D25N, or P9S/I50L (data not shown). These results suggest that P9S/I50L is specific for the TNF-
peptide within the TNF-
/BGAL reporter.
|
The purified W6R/F53S/N83I/I93N and Q7L HIV PRs exhibited activity in reactions with the TNF-
/BGAL both in vivo and in vitro (fig. 7). The HIV PR variants were cloned into pET28 and transformed into the strain BL21(DE3), which is lacZ+. Cultures transformed with various proteases show a profound decrease in BGAL activity (about 10-fold) when activity was measured with a pNP-gal assay (fig. 7A). In the in vitro assays, the evolved HIV PR variants caused the disappearance of the 116 kDa beta-galactosidase band, but not the appearance of the 100 kDa digestion product (fig. 7b). This result suggests that these mutants evolved to recognize sites within BGAL other than the TNF-
site. The wild-type HIV PR has no detectable activity against the wild-type BGAL either in vivo (fig. 7A) or in vitro (fig. 7B). In contrast, the purified W6R/F53S/N83I/I93N and Q7L HIV PR variants appear to be even more active against wild-type BGAL than is the wild-type HIV PR is against p6/BGAL. We cannot pinpoint the site of cleavage due to the smearing produced by the protease, but it is clear that these HIV PR variants have evolved to recognize sites within BGAL not susceptible to the wild-type protease.
|
| Discussion |
|---|
|
|
|---|
Positive and Purifying Selection
Laboratory evolution experiments can reveal possible mechanisms of adaptive molecular evolution, but their relevance to real-world populations is predicated upon the nature of the high-throughput screen. Protein engineers typically begin with a wild-type enzyme with modest but detectable catalytic activity in reactions with a single novel substrate. In contrast, natural selection adapts entire proteomes to libraries of novel substrates. The simplest high-throughput screens tend to favor enzyme variants with broadened substrate specificity (Aharoni et al. 2005
Our selection system discriminates against protease variants with broad substrate specificity. It also enabled the identification of variants that utilize any part of the 1023 amino acid BGAL protein as a substrate. These features apparently favored variants more specific than those commonly observed in directed evolution experiments. The P9S/I50L PR exhibits inverted specificity toward the TNF-
site within BGAL and is as active and specific in reactions with TNF-
/BGAL as the wild-type HIV PR is against p6/BGAL. The W6R/F53S/N83I/I93N and Q7L HIV PR variants exhibit inverted specificity against unknown sites within the wild-type BGAL protein, even though the wild-type HIV PR shows no detectable activity against this substrate. None of the evolved protease variants are as toxic to E. coli cells as the wild-type enzyme. It is unusual to observe such inversions in substrate specificity after a single round of whole gene mutagenesis and screening. A recent study that utilizes bacterial display and flow cytometry (Varadarajan et al. 2005
) applied positive and negative selection as well. Through this screening procedure, these investigators were able to isolate mutants of the E. coli endopeptidase OmpT that exhibited greater than three million-fold specificity toward a novel substrate over the native substrate. These results further validate this purifying selection strategy.
Structure and Function
We randomly mutated the HIV PR gene and identified a single clone (P9S/I50L) with improved activity against a designated target substrate (TNF-
peptide) in a high-throughput screen. Characterization of the mutant enzyme within the artificial E. coli system and in vitro showed that it had reduced activity against its native substrate and increased activity toward this novel substrate. To rationalize the effect of the P9S and I50L amino acid alterations on the overall structure and activity of the protease, we modeled the TNF-
substrate within the active site of HIV PR (fig. 1). I50 is part of the flap region, which in the closed conformation interacts with the substrate and is involved in hydrogen bonding within the protease (Hoog et al. 1996
). The mutation of this isoleucine to leucine may alter hydrophobic packing within the enzyme-substrate complex. Interestingly, a mutation in I50 has also been identified in a selection searching for mutations in the protease that alleviate its toxicity (Baum, Bebernitz, and Gluzman 1990b). The effect of the P9S substitution is not obvious, but the proline residue is present in the dimer interface of the protease. This region of the protease is primarily hydrophobic, and a substitution from proline to serine may alter this hydrophobicity and thus the dimer interaction. Because dimerization influences the shape of the active site pocket, an alteration may allow the novel peptide substrate to bind (Prabu-Jeyabalan, Nalivaika, and Schiffer 2000
). The substrate preference of other mutations (Q7L, W6R, F53S, N83I, I93N, and L5F) remains unknown, but each clone contains mutations also in the dimerization interface.
| Acknowledgements |
|---|
|
|
|---|
D.N.G. first expressed and purified the hexa-histidine tagged HIV PR, although she did not refold it. We are grateful to Lauren Ancel-Meyers, Stefan Lutz, Wayne Patrick, and Jennifer Stratton for their critical readings of the manuscript. We thank the National Institutes of Health/National Institute of Allergy and Infectious Diseases (1 R21AI054602-01) and National Institute of General Medical Services (1 R01 GM074264-01) for support.
| Footnotes |
|---|
Claudia Schmidt-Dannert, Associate Editor
| References |
|---|
|
|
|---|
Aharoni, A., L. Gaidukov, O. Khersonsky, Q. G. S. Mc, C. Roodveldt, and D. S. Tawfik. 2005. The evolvability of promiscuous protein functions. Nat. Genet. 37:7376.[ISI][Medline]
Baum, E. Z., G. A. Bebernitz, and Y. Gluzman. 1990a. Beta-galactosidase containing a human immunodeficiency virus protease cleavage site is cleaved and inactivated by human immunodeficiency virus protease. Proc. Natl. Acad. Sci. USA 87:1002310027.
. 1990b. Isolation of mutants of human immunodeficiency virus protease based on the toxicity of the enzyme in Escherichia coli. Proc. Natl. Acad. Sci. USA 87:55735577.
Benkovic, S. J., and S. Hammes-Schiffer. 2003. A perspective on enzyme catalysis. Science 301:11961202.
Beutler, B., and A. Cerami. 1986. Cachectin/tumor necrosis factor: an endogenous mediator of shock and inflammation. Immunol. Res. 5:281293.[ISI][Medline]
Blanco, R., L. Carrasco, and I. Ventoso. 2003. Cell killing by HIV-1 protease. J. Biol. Chem. 278:10861093.
Bradford, M. M. 1976. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal. Biochem. 72:248254.[CrossRef][ISI][Medline]
Cadwell, R. C., and G. F. Joyce. 1992. Randomization of genes by PCR mutagenesis. PCR Methods Appl. 2:2833.[Medline]
Cook, A., and J. Lederberg. 1962. Recombination studies of lactose nonfermenting mutants of Escherichia coli K-12. Genetics 47:13351353.
Dower, W. J., J. F. Miller, and C. W. Ragsdale. 1988. High efficiency transformation of E. coli by high voltage electroporation. Nucleic Acids Res. 16:61276145.
Eck, M. J., and S. R. Sprang. 1989. The structure of tumor necrosis factor-alpha at 2.6 A resolution. Implications for receptor binding. J. Biol. Chem. 264:1759517605.
Geddie, M. L., and I. Matsumura. 2004. Rapid evolution of beta-glucuronidase specificity by saturation mutagenesis of an active site loop. J. Biol. Chem. 279:2646226468.
Golding, G. B., and A. M. Dean. 1998. The structural basis of molecular adaptation. Mol. Biol. Evol. 15:355369.[Abstract]
Hoog, S. S., E. M. Towler, B. Zhao, M. L. Doyle, C. Debouck, and S. S. Abdel-Meguid. 1996. Human immunodeficiency virus protease ligand specificity conferred by residues outside of the active site cavity. Biochemistry 35:1027910286.[CrossRef][Medline]
James, L. C., and D. S. Tawfik. 2003. Conformational diversity and protein evolutiona 60-year-old hypothesis revisited. Trends Biochem. Sci. 28:361368.[CrossRef][ISI][Medline]
Kimura, M. 1983. The neutral theory of molecular evolution. Cambridge University Press, Cambridge.
Kohl, N. E., E. A. Emini, W. A. Schleif, L. J. Davis, J. C. Heimbach, R. A. Dixon, E. M. Scolnick, and I. S. Sigal. 1988. Active human immunodeficiency virus protease is required for viral infectivity. Proc. Natl. Acad. Sci. USA 85:46864690.
Matayoshi, E. D., G. T. Wang, G. A. Krafft, and J. Erickson. 1990. Novel fluorogenic substrates for assaying retroviral proteases by resonance energy transfer. Science 247:954958.
Matsumura, I., and A. D. Ellington. 2001. In vitro evolution of beta-glucuronidase into a beta-galactosidase proceeds through non-specific intermediates. J. Mol. Biol. 305:331339.[CrossRef][ISI][Medline]
Moore, J. C., and F. H. Arnold. 1996. Directed evolution of a para-nitrobenzyl esterase for aqueous-organic solvents. Nat. Biotechnol. 14:458467.[CrossRef][ISI][Medline]
Prabu-Jeyabalan, M., E. Nalivaika, and C. A. Schiffer. 2000. How does a symmetric dimer recognize an asymmetric substrate? A substrate complex of HIV-1 protease. J. Mol. Biol. 301:12071220.[CrossRef][ISI][Medline]
. 2002. Substrate shape determines specificity of recognition for HIV-1 protease: analysis of crystal structures of six substrate complexes. Structure (Camb). 10:369381.
Sambrook, J., and D. W. Russell. 2001. Molecular cloning: a laboratory manual, 3rd edition, Vol. 1. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
Stemmer, W. P. 1994. Rapid evolution of a protein in vitro by DNA shuffling. Nature 370:389391.[CrossRef][Medline]
Swanson, W. J. 2003. Adaptive evolution of genes and gene families. Curr. Opin. Genet. Dev. 13:617622.[CrossRef][ISI][Medline]
Thornton, J. W. 2004. Resurrecting ancient genes: experimental analysis of extinct molecules. Nat. Rev. Genet. 5:366375.[CrossRef][ISI][Medline]
Varadarajan, N., J. Gam, M. J. Olsen, G. Georgiou, and B. L. Iverson. 2005. Engineering of protease variants exhibiting high catalytic activity and exquisite substrate selectivity. Proc. Natl. Acad. Sci. USA 102:68556860.
Vickrey, J. F., B. C. Logsdon, G. Proteasa, S. Palmer, M. A. Winters, T. C. Merigan, and L. C. Kovari. 2003. HIV-1 protease variants from 100-fold drug resistant clinical isolates: expression, purification, and crystallization. Protein Expr. Purif. 28:165172.[CrossRef][ISI][Medline]
Yang, Z., and J. P. Bielawski. 2000. Statistical methods for detecting molecular adaptation. Trends Ecol. Evol. 15:496503.[CrossRef][Medline]
Yano, T., S. Oue, and H. Kagamiyama. 1998. Directed evolution of an aspartate aminotransferase with new substrate specificities. Proc. Natl. Acad. Sci. USA 95:55115515.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
T. L. O'Loughlin, W. M. Patrick, and I. Matsumura Natural history as a predictor of protein evolvability Protein Eng. Des. Sel., October 1, 2006; 19(10): 439 - 442. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||






= wild-type HIV PR;
= P9S/I50L PR; = p6/BGAL; - - - - = TNF-
