Skip Navigation


MBE Advance Access originally published online on March 18, 2008
Molecular Biology and Evolution 2008 25(6):1113-1119; doi:10.1093/molbev/msn063
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/6/1113    most recent
msn063v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Toyota, H.
Right arrow Articles by Yomo, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Toyota, H.
Right arrow Articles by Yomo, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Emergence of Polyproline II-Like Structure at Early Stages of Experimental Evolution from Random Polypeptides

Hitoshi Toyota*, Masato Hosokawa*, Itaru Urabe* and Tetsuya Yomo{dagger},{ddagger},§

* Department of Biotechnology, Graduate School of Engineering, Osaka University, Suita, Osaka, Japan
{dagger} Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, Suita, Osaka, Japan
{ddagger} Graduate School of Frontier Science, Osaka University, Suita, Osaka, Japan
§ Complex Systems Biology Project, Exploratory Research for Advanced Technology, Japan Science and Technology Agency, Suita, Osaka, Japan

E-mail: yomo{at}ist.osaka-u.ac.jp


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
To examine whether a primordial functional protein at the early stages of evolution has structural features, we carried out experimental evolution consisting of 25 cycles (generations) of mutation and selection toward DNA-binding function using a random-sequence polypeptide of 139 amino acid residues with no secondary structure as the initial sequence. In each generation, 16 clones were sampled arbitrarily for sequence analysis, and a phylogenetic tree was constructed. Polypeptide evolution proceeded from the initial point on branch I in 2 main directions of branches II and III. The initial and 2 evolved polypeptides (one at the 24th generation on branch III and the other at the 25th generation on branch II) were purified to examine their functional and structural properties. Although binding of the initial polypeptide to the target DNA was not detected by surface plasmon resonance measurements, the 2 evolved polypeptides bound to the DNA with dissociation constants of 1.6 and 1.0 µM, respectively, indicating an increase in affinity during the experimental evolution. Circular dichroism spectra of the evolved polypeptides, but not of the initial polypeptide, showed features characteristic of the polyproline II (PPII)–like structure, a left-handed helical structure commonly found in natural proteins, suggesting that the structure emerged through the experimental evolution. The same structural feature was found in another experimental evolution toward catalytic activity. These results demonstrate that the PPII-like structure is one of the common features that could have appeared in the early evolutionary stages of primordial functional protein.

Key Words: experimental evolution • early evolution • random polypeptide • polyproline II • DNA binding


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
In the early stages of protein evolution, the first functional proteins presumably originated from arbitrary polypeptides with random sequences. As the frequency of functional polypeptides in a random-sequence library was estimated experimentally to be 10–11 (Keefe and Szostak 2001Go), a similar probability is expected if such innate functional polypeptides emerged suddenly at the early stages of evolution (Keefe and Szostak 2001Go). On the other hand, most trials of protein evolution unavoidably start with very low or negligible levels of function, as observed for random polypeptides (Prijambada et al. 1996Go; Yamauchi et al. 1998Go). Therefore, the evolvability of arbitrarily chosen random polypeptides was examined and demonstrated experimentally through cycles of mutation and selection toward several biological functions (Yamauchi et al. 2002Go; Hayashi et al. 2003Go; Ito et al. 2004Go; Nakashima et al. 2007Go).

In the latter case, assuming that the global structures of proteins coevolve gradually with their function, as suggested theoretically (Saito et al. 1997Go; Yomo et al. 1999Go), such primordial functional proteins obtained through experimental evolution may show traces of structural features. However, it is not known whether primordial functional proteins at the early stages of evolution already show some structural characteristics. Here, we report the emergence of a polyproline II (PPII)–like structure when an arbitrarily chosen soluble random polypeptide, RP3-42 (Prijambada et al. 1996Go), was subjected to experimental evolution through cycles of mutation and selection based on binding affinity to DNA. In addition, a similar structure was found in a primordial functional protein obtained previously (Yamauchi et al. 2002Go) at an early stage of experimental evolution based on another biological function. The PPII-like structure, which is not necessarily composed of proline residues, is a left-handed helical structure with 3 residues per turn, and is commonly found in natural proteins (Makarov et al. 1992Go; Adzhubei and Sternberg 1993Go; Sreerama and Woody 1994Go; Bochicchio and Tamburro 2002Go; Hicks and Hsu 2004Go; Cubellis et al. 2005Go). These experimental results support the theoretical proposition that structures coevolve gradually with function (Saito et al. 1997Go; Yomo et al. 1999Go), which is also a scenario proposed for protein diversification (James and Tawfik 2003Go), at the initial stages of protein evolution. The emergence of PPII-like structures by different types of experimental evolution suggests that PPII is one of the common structures that could have appeared in the early evolutionary stages of primordial functional protein.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
Bacterial Strains, Phage, Plasmids, and Oligonucleotides
Escherichia coli strains TG1 (supE, hsd{Delta}5, thi, {Delta}(lac-proAB)/F’[traD36, proAB+, lacIq, lacZ, {Delta}M15]) used for phage display and BL21(DE3) (F, ompT, hsdSB(rB mB), gal, dcm (DE3)) used for protein expression were purchased from Amersham Biosciences (Piscataway, NJ) and Novagen (San Diego, CA), respectively. The phage clone displaying a random-sequence polypeptide, RP3-42, was prepared as described previously (Nakashima et al. 2000Go). The phagemid pCANSS, a derivative of pCANTAB5E (Amersham Biosciences), was prepared previously (Nakashima et al. 2000Go) and used for phage display. The plasmid pET21aSH, a derivative of pET21a(+) (Novagen), was prepared previously (Yamauchi et al. 2002Go) and used for expressing proteins with a C-terminal His6-tag. Oligonucleotides were purchased from Sigma Genosys Japan (Hokkaido, Japan).

Phage Preparation
Phage were prepared as described previously (Yamauchi et al. 2002Go) except that they were suspended in N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid (HEPES)–buffered saline (HBS) (10 mM HEPES–NaOH, 50 mM NaCl, 1 mM MgCl2, pH 7.5) supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A (10 mM Tris–HCl, 50 mM KCl, 0.1 mM ethylenediaminetetraacetic acid, 1 mM dithiothreitol [DTT], 0.2 µg/ml bovine serum albumin, 50% (v/v) glycerol, pH 7.4).

Derivation of Phage Library Displaying Derivatives of RP3-42
The genes for RP3-42 and its derivatives on phagemids were amplified by mutagenic polymerase chain reaction (PCR) under the conditions of low mutational bias (deoxyguanosine triphosphate and deoxyadenosine triphosphate concentrations were each 0.2 mM) (Cadwell and Joyce 1992Go) using {Delta}Tth DNA polymerase (Toyobo, Osaka, Japan) (Arakawa et al. 1996Go) and the primers, 5'-ATCCTCGCAACTGCGGCCCACGTGGCCATGGCTAGCATGACTGGTGGACAGCAAATGGGT-3' and 5'-AGTTTAGGCCACAGAGGCCTGGATCGCGAGATCTGTCGACTC-3'. The SfiI fragment from the PCR products was ligated into pCANSS, and E. coli TG1 cells were transformed with the ligated DNA by electroporation. A phage library was prepared from the phagemid library, the size of which was maintained at about 2 x 106, as described above. The mutation rate was about 0.01 per nucleotide site.

In Vitro Selection
Derivatives of a random polypeptide, RP3-42, displayed on the surface of M13 phage as a fusion protein with the pIII coat protein were selected based on their binding ability with double-stranded target DNA, including one of the artificial sequences that can be recognized by Zif268 reported by Choo et al. (1997)Go and an NcoI recognition site. The design and preparation of the target DNA were almost the same as those described previously (Nakashima et al. 2007Go). The sequence of the biotinylated oligonucleotide was biotin-5'-GACGTACCATGGAATTCGTACGCTCACGCTAGTAGCCCTCGAGATGCAG-3', where the underlined bases indicate the NcoI recognition site and the bases in bold type indicate the zinc-finger recognition site. Selection of phage-displayed polypeptides capable of binding to the target DNA was performed at 20 °C on a Biacore 2000 system (Biacore AB, Uppsala, Sweden). The Biacore instrument and a Sensor Chip SA (streptavidin) were equilibrated with HBS supplement with Tween 20 (10 mM HEPES–NaOH, 50 mM NaCl, 1 mM MgCl2, 0.005% Tween 20, pH 7.5) supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A. The sensor chip was conditioned with NaCl/NaOH solution in accordance with the manufacturer's instructions. The biotinylated target DNA (1 µM) dissolved in 10 mM HEPES–NaOH (pH 7.5) containing 0.5 M NaCl and 1 mM MgCl2 was injected into a flow cell at a flow rate of 2 µl/min for 30 min. As a result, an increase of about 2,500 resonance units (RU) corresponding to the immobilization of 0.1 pmol of the target DNA on the sensor chip was observed. Then, aliquots of 4 x 1013/ml of the phage displaying polypeptides were injected into the flow cell at a flow rate of 1 µl/min for 30 min. After injection, the sensor chip was undocked from the Biacore instrument keeping the sensor surface wet, and background phage bound to the liquid delivery system of the instrument was washed out by 2 consecutive desorb commands followed by rinsing with water: first with 50 mM glycine (pH 9.5) and 0.5% sodium dodecyl sulfate (SDS) and second with 50 mM HCl and 70% ethanol; total washing time was about 1 h. No phage were detectable from the instrument after washing. The sensor chip was then docked again into the instrument and washed with HBST supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A at a rate of 5 µl/min, and the flow-through buffer was recovered at constant time intervals using the inject (15 min) and recovery commands. At the last (fourth) recovery, HBST supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A containing NcoI (final concentration, 4 units/µl; New England Biolabs, Ipswich, MA) was injected to recover the target DNA with bound phage by NcoI digestion. Completion of the digestion reaction was monitored on the Biacore sensorgram (Nilsson et al. 1995Go). The number of phage recovered was measured by determining the number of ampicillin-resistant colonies of phage-infected E. coli TG1 cells (Marks et al. 1991Go). All the colonies obtained from the last recovery were collected and stored at –80 °C.

Sequence Analysis
DNA sequencing was outsourced to Hitachi High-Technologies Corporation (Tokyo, Japan) or performed using a BigDye Terminator v1.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA) and an ABI PRISM 3100 Genetic Analyzer (Applied Biosystems).

Expression and Purification of His-Tagged Polypeptides
The polypeptide gene on the phagemid carried by a selected phage was subcloned into the NheI/SfiI sites of pET21aSH. When the gene had an internal amber stop codon (TAG), which can survive in the supE strain of E. coli TG1, the codon was replaced by the glutamine codon (CAG) using a QuikChange site-directed mutagenesis kit (Stratagene, La Jolla, CA). The polypeptide was expressed in E. coli BL21(DE3) cells harboring the hybrid plasmid by induction with isopropyl-β-D-thiogalactopyranoside. The induction conditions were 5 h at 30 °C for RP3-42H and TDP24-2H or 7 h at 20 °C for TDP25-4H. After induction, the cells were lysed in 0.1 M NaH2PO4, 10 mM Tris (pH 7.5) containing 0.5 M NaCl, 20 mM imidazole, and 8 M urea. The supernatant of the lysate was applied to a Ni-NTA column (Qiagen, Hilden, Germany) equilibrated with the same buffer on an AKTA prime FPLC system (Amersham Biosciences). The column was washed with the same buffer except that the imidazole concentration was increased to 30 mM, and the polypeptide was eluted with an imidazole gradient of 30–80 mM. The fractions showing a single band on sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE) were pooled and concentrated with a Vivaspin centrifugal concentrator (Vivascience, Hannover, Germany). The buffer solution was exchanged with HBS using a PD-10 desalting column (Amersham Biosciences), and the purified polypeptide was stored at 4 °C. All the purified polypeptides were confirmed to be homogeneous by SDS-PAGE and amino acid analysis (Suga et al. 1996Go). Protein concentration was also determined by amino acid analysis (Suga et al. 1996Go).

Gel Filtration
Gel filtration chromatography was performed at room temperature (20–25 °C) on a Superdex 200 PC 3.2/30 column (Amersham Biosciences) with a SMART system (Amersham Biosciences). The column was equilibrated with 10 mM HEPES–NaOH (pH 7.5) containing 150 mM NaCl and 1 mM MgCl2 or the same buffer supplemented with 8 M urea and 10 mM DTT for native or reducing denaturing conditions, respectively. The sample solutions were injected into the column, and the elution profiles were monitored by determining the absorbance at 280, 260, and 237 nm. The void and total volumes of the column were measured with blue dextran (2,000 kDa) and acetone, respectively. The following proteins in a molecular weight (MW) marker kit (Sigma, St. Louis, MO) were used as standards: sweet potato β-amylase, MW 200 kDa and Stokes radius (Rs) 50.4 Å; bovine serum albumin, MW 66 kDa and Rs 33.9 Å; bovine erythrocyte carbonic anhydrase, MW 29 kDa and Rs 23.6 Å; and horse heart cytochrome c, MW 12.4 kDa and Rs 17.0 Å. The Rs values of the polypeptides were calculated from the empirical linear relation between Rs and the inverse of elution volume (Uversky 1993Go).

Surface Plasmon Resonance
Surface plasmon resonance (SPR) measurements were performed at 20 °C in HBST supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A at a flow rate of 30 µl/min on a Biacore 2000. The target DNA was immobilized on a Sensor Chip SA (about 400 RU). Association and dissociation phases were 180 and 300 s, respectively. After each measurement, the surface of the sensor chip was regenerated with pulses of 0.05% SDS solution. For binding experiments, samples with various polypeptide concentrations (2.5–100 nM) were injected. For solution affinity experiments, a sample polypeptide (final concentration 100 nM) was mixed with various concentrations (final 10–300 nM) of target DNA (without biotinylation) or nontarget DNA (double-stranded 5'-GTCGATGGTACCCCGTAGTAGGCCAATGGTAGCAATTGTGCACTAGCTG-3'), and the mixtures were incubated for 1 h at 20 °C before injection. The concentration of free polypeptide, Pfree, in each mixture was estimated using the standard curve obtained from the binding experiments. The dissociation constant, Kd, and the apparent site size, n, occupied by a bound polypeptide were estimated by nonlinear least square fitting to the equation:

Formula
where {nu} is the binding density, that is, the number of bound polypeptides per target DNA, N is the number of base pairs of the target DNA, (f) is the probability that a base pair of the target DNA is free, and (ff) is the conditional probability that the right-side base pair of a free base pair is also free. The equation was derived by extending the McGhee and von Hippel (1974)Go approach for noncooperative binding to a 1-dimensional infinite lattice into binding to a finite lattice by the following assumptions. At sufficiently low binding density, the probabilities (b1) that a residue is the first of an occupied site and (f) are assumed to be {nu}/N and 1 – n({nu}/N), respectively, at every residue of the finite lattice, and hence (ff) becomes (f)/[(f)+(b1)]. In addition, the first residue of a potential binding site is assumed to lie within the first Formula residues of the lattice. Hence, the total number of free binding sites per lattice is Formula.

Circular Dichroism Spectroscopy
Circular dichroism (CD) spectra were recorded on a Jasco J-720 spectropolarimeter equipped with a Peltier temperature controller (PTC-348WI; Jasco, Tokyo, Japan). The spectra were scanned 8 times at a scan rate of 20 nm/min, using a response time of 0.25 s and a spectral band width of 1 nm, and the average of the 8 scans was recorded. The polypeptide concentration was in the range of 5–20 µM, and the light path length of the cell was 1 mm. Data points with high-tension voltage over 600 V were excluded from analysis. Appropriate background spectra were scanned and subtracted from polypeptide spectra.


    Results and Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
We started experimental evolution toward DNA-binding function using a random polypeptide, RP3-42, as the initial sequence. RP3-42 is a soluble random polypeptide of 139 amino acid residues with no secondary structure (Prijambada et al. 1996Go; Yamauchi et al. 1998Go), which was used previously as an arbitrary sequence representing the initial material of an intended evolutionary study toward another biological function related to phage infectivity (Hayashi et al. 2003Go). As RP3-42 was shown to be evolvable in an intended direction (Hayashi et al. 2003Go), testing its evolvability in another direction (i.e., DNA binding) represents a test of the evolutional plasticity of an arbitrary sequence.

First, the RP3-42 gene on the phagemid was subjected to random mutagenesis, and the resulting library of mutant polypeptides displayed on phage was subjected to selection based on binding ability to a target DNA using a Biacore system. The system enables us to monitor the binding and elution processes and to control the selection processes under constant conditions. The target DNA contained a recognition site for the NcoI restriction enzyme, which was used to recover the bound phage as a complex with the target DNA (Nakashima et al. 2007Go). The selected phage formed the first-generation library. Further cycles of mutation and selection were carried out up to the 25th generation. The number of phage selected in each generation increased gradually, and the value for the 25th generation was about 2.2 x 103 times larger than that of the zeroth-generation phage displaying RP3-42 (fig. 1A). The increase in phage selection suggested an increase in the number of phage bound to the target DNA and hence an increase in the binding ability of the polypeptides displayed on the selected phage.


Figure 1
View larger version (37K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Evolution of a random polypeptide, RP3-42, based on DNA-binding ability. (A) Number of selected phage (bars) and cumulative number of amino acid substitutions (colored symbols) in each generation. The phage number for the zeroth generation was obtained from the in vitro selection experiment using phage displaying RP3-42. The number of phage subjected to selection was 1.2 x 1012. The cumulative number of substitutions is expressed as the mean and sample standard deviation calculated from 16 sequences sampled for each generation. (B) Phylogenetic tree constructed by the Neighbor-Joining method (Saitou and Nei 1987Go) implemented in MEGA 4 (Tamura et al. 2007Go) using the distances among all the deduced amino acid sequences determined. The positions of the sequences are indicated by the colored symbols used in (A). Thus, 16 sequences in each generation have the same symbols. The 3 main branches are designated I, II, and III, as indicated. The red symbol of the initial sequence (RP3-42) located on branch I is enclosed within a red circle. The symbols of TDP24-2 and TDP25-4 (the original deduced sequences of TDP24-2H and TDP25-4H without His6-tag) located on branches III and II, respectively, are enclosed within purple and magenta circles, respectively. The scale bar indicates 0.05 amino acid substitutions per site. TDP24-2 and TDP25-4 possess 47 and 49 amino acid substitutions, respectively, from RP3-42, and there are 59 substitutions between the 2, including 15 potentially parallel substitution sites at which both residues are different from the corresponding residues in RP3-42.

 
In each generation, 16 clones were sampled arbitrarily, and the nucleotide sequences of the genes encoding the polypeptides were analyzed. The mean of the cumulative number of amino acid substitutions deduced from the nucleotide sequences increased concomitant with the increase in phage selection (fig. 1A). A phylogenetic tree was constructed based on the distances among the deduced amino acid sequences (fig. 1B). In the tree, the positions of each sequence are indicated by colored symbols, which correspond to the symbols used in figure 1A. The tree is composed of 3 main branches: I, II, and III (fig. 1B). The red symbol of the initial sequence (RP3-42) enclosed within a red circle is located on branch I, and sequences in generations 1–8 (orange to yellow) are also located on branch I forming a group. The positions of the sequences in generations 9–13 (yellow to green) are widely scattered and are located on branches I and II. After the 15th generation (cyan to magenta), no new sequences appeared on branch I, suggesting extinction of the large sequence group of the branch. On the other hand, branch III is formed only by the newer sequences appearing in and after the 14th generation (green to magenta). Overall, the evolution of the polypeptide sequence proceeded from the starting point on branch I in the 2 main directions of branches II and III. These results suggest that evolution under a constant selection pressure proceeds divergently but not isotropically; many sequences appear and become extinct, whereas several sequences survive on a limited number of groups forming small branches in different directions. That is, evolution seems to proceed not monodirectionally but in several directions, retaining the possibility of reaching several different endpoints.

To examine the functional and structural properties of the evolved polypeptides, we chose one clone from each of branches II and III (the magenta and purple circles in fig. 1B), and the polypeptides of these clones and also RP3-42 were expressed with a His6-tag in E. coli cells as soluble forms. The purified polypeptides were named TDP24-2H (selected at the 24th generation on branch III), TDP25-4H (selected at the 25th generation on branch II), and RP3-42H, respectively.

The MWs of RP3-42H, TDP24-2H, and TDP25-4H determined by matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry were 15751, 15621, and 15290, respectively, indicating that one Met residue was missing in each sequence. The N-terminal Met residue was probably removed during expression, as reported previously (Hirel et al. 1989Go). The deduced sequences of TDP24-2H and TDP25-4H are shown in the legend to table 1.


View this table:
[in this window]
[in a new window]

 
Table 1 Physical Properties of the Initial and Evolved Polypeptides

 
To estimate the compactness of RP3-42H, TDP24-2H, and TDP25-4H, their Stokes radii (Rs) were determined by gel filtration chromatography. In the native state, their values were similar and 1.1- to 1.2-fold greater than that expected for a globular protein of similar MW (table 1). In the denatured state with 8 M urea and 10 mM DTT, these values increased to almost the same as those expected for the denatured state of the calculated protein (table 1). These results suggest that these polypeptides in the native state are more compact than the denatured state but not as compact as native globular proteins.

The affinities of RP3-42H, TDP24-2H, and TDP25-4H to the target DNA were estimated by SPR analysis. Although the binding of the initial polypeptide, RP3-42H, to the target DNA was not detected, TDP24-2H and TDP25-4H bound to the DNA with dissociation constants, Kd, of 1.6 and 1.0 µM, respectively (table 1), indicating increases in DNA-binding affinity during the experimental evolution. It should be noted that they showed similar affinity to another DNA with the same composition but different sequence (data not shown), indicating that their DNA-binding ability was not specific to the target DNA sequence used for the selection. Although the level of the evolved function is still in a primitive state, the random polypeptide (RP3-42) was shown to be evolvable toward a DNA-binding protein. As the same random polypeptide was shown previously to be evolvable toward another biological function (Hayashi et al. 2003Go), these results demonstrate the evolutional plasticity of an arbitrary sequence in function.

To examine whether these polypeptides with a primitive function obtained through the experimental evolution have structural characteristics, CD spectra of TDP24-2H and TDP25-4H were measured together with RP3-42H at pH 7.5 and at various temperatures, as indicated in figure 2. The spectra of TDP24-2H and TDP25-4H were similar and showed characteristic features in that they each had a negative band at around 200 nm, and the trough deepened with decreases in temperature with a concomitant increase in a band around 220 nm forming an isodichroic point around 209 nm. As these features are characteristic of the PPII structure (Bochicchio and Tamburro 2002Go), the 2 polypeptides were suggested to contain a PPII-like structure. The PPII-like structure, not necessarily composed of proline residues, is a left-handed helical structure with 3 residues per turn and is commonly found in natural proteins (Makarov et al. 1992Go; Adzhubei and Sternberg 1993Go; Sreerama and Woody 1994Go; Bochicchio and Tamburro 2002Go; Hicks and Hsu 2004Go; Cubellis et al. 2005Go). The presence of the isodichroic point indicates that the polypeptides existed in an equilibrium state between PPII and other conformations. The results of singular value decomposition analysis (Henry and Hofrichter 1992Go; Johnson 1992Go) of the spectra suggested that the significant components of the conformations of the polypeptides were PPII and probably disordered conformations, corresponding to the second and the first basis spectrum, respectively (supplementary fig. S3, Supplementary Material online). The difference spectra generated by subtracting higher temperature spectra from lower temperature spectra clearly showed that the PPII content, indicated by the positive ellipticity around 220 nm, was increased with decreasing temperature in the polypeptides (fig. 2). On the other hand, the spectra of RP3-42H did not show such features (fig. 2) and were similar to those of RP3-42 with no marked secondary structure (Yamauchi et al. 1998Go). Therefore, the PPII-like structure observed for TDP24-2H and TDP25-4H was suggested to have emerged through the experimental evolution with selection based on binding affinity to DNA. In addition, these polypeptides are located on different main branches (fig. 1B). Hence, the emergence of the PPII-like structure was not branch-specific, suggesting that the structure appeared commonly, if not on all the evolved sequences.


Figure 2
View larger version (16K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— CD spectra (mean residue ellipticity, [{theta}]) of the ancestral and evolved polypeptides. (A) RP3-42H at 4 °C (blue), 20 °C (green), and 60 °C (red). (B) TDP24-2H at 4 °C (blue), 20 °C (green), 60 °C (orange), and 70 °C (red). (C) TDP25-4H at 4 °C (blue), 20 °C (green), 60 °C (orange), and 70 °C (red). (D) YSLP1-1 at 5 °C (blue), 20 °C (green), 60 °C (orange), and 70 °C (red). (E) YSLP6-1 at 5 °C (blue), 20 °C (green), 60 °C (orange), and 70 °C (red). RP3-42H, TDP24-2H, and TDP25-4H were measured in HBST (pH 7.5) supplemented with 8% (v/v) NEB restriction enzyme diluent buffer A, and YSLP1-1 and YSLP6-1 reported previously (Yamauchi et al. 2002Go) were measured in 50 mM acetate buffer (pH 5.0). The difference spectra, {Delta}[{theta}], generated by subtracting higher temperature spectrum (60 °C for RP3-42H and 70 °C for the others) from each lower temperature spectrum are shown with the same colors as described above.

 
CD spectra of YSLP1-1 and YSLP6-1 were determined to examine the generality of the emergence of the PPII-like structure. YSLP1-1 and YSLP6-1 are polypeptides selected at the first and sixth generations, respectively, of experimental evolution starting from random polypeptides toward the highest affinity to a transition state analog for esterase reaction (Yamauchi et al. 2002Go). These results showed that the spectra of YSLP6-1 have similar characteristics to those of TDP24-2H and TDP25-4H, but such characteristics were observed to only a small extent for YSLP1-1 (fig. 2), indicating an increase in the PPII-like structure during the experimental evolution. Together with the observations of the present study, these results support the theoretical proposition that structures coevolve gradually with function (Saito et al. 1997Go; Yomo et al. 1999Go), which is also a scenario proposed for protein diversification (James and Tawfik 2003Go), at the initial stages of protein evolution, and suggest that PPII is a common structure in primordial proteins. That is, our experimental evolution may have shown one of the common routes occurring in the early stages of protein evolution. Further development of structure and function of primordial proteins, which can be realized by continuing experimental evolution, will provide insights into the process of the development of protein evolution.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary figures S1S3 are available at Molecular biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 
We thank Dr K. Ogasahara of the Institute for Protein Research, Osaka University, for technical guidance and discussions on CD spectroscopy; I. Nishina and W. Furutani of Center for Medical Research and Education, Graduate School of Medicine, Osaka University, for help and technical support with amino acid analysis and MALDI-TOF mass spectrometry; and also Dr T. Nakashima for valuable advice regarding phage display and in vitro selection experiments. This research was supported in part by "Special Coordination Funds for Promoting Science and Technology: Yuragi Project" and "Global COE (Centers of Excellence) Program" of the Ministry of Education, Culture, Sports, Science, and Technology, Japan.


    Footnotes
 
Takashi Gojobori, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results and Discussion
 Supplementary Material
 Acknowledgements
 References
 

    Adzhubei AA, Sternberg MJ. Left-handed polyproline II helices commonly occur in globular proteins. J Mol Biol (1993) 229:472–493.[CrossRef][Web of Science][Medline]

    Arakawa T, Jongsareejit B, Tatsumi Y, et al. Application of N-terminally truncated DNA polymerase from Thermus thermophilus ({Delta}Tth polymerase) to DNA sequencing and polymerase chain reactions: comparative study of {Delta}Tth and wild-type Tth polymerases. DNA Res (1996) 3:87–92. (14 co-authors).[Abstract]

    Bochicchio B, Tamburro AM. Polyproline II structure in proteins: identification by chiroptical spectroscopies, stability, and functions. Chirality (2002) 14:782–792.[Medline]

    Cadwell RC, Joyce GF. Randomization of genes by PCR mutagenesis. PCR Methods Appl (1992) 2:28–33.[Medline]

    Choo Y, Castellanos A, Garcia-Hernandez B, Sanchez-Garcia I, Klug A. Promoter-specific activation of gene expression directed by bacteriophage-selected zinc fingers. J Mol Biol (1997) 273:525–532.[CrossRef][Web of Science][Medline]

    Cubellis MV, Caillez F, Blundell TL, Lovell SC. Properties of polyproline II, a secondary structure element implicated in protein-protein interactions. Proteins (2005) 58:880–892.[CrossRef][Web of Science][Medline]

    Hayashi Y, Sakata H, Makino Y, Urabe I, Yomo T. Can an arbitrary sequence evolve towards acquiring a biological function? J Mol Evol (2003) 56:162–168.[CrossRef][Web of Science][Medline]

    Henry ER, Hofrichter J. Singular value decomposition: application to analysis of experimental data. Methods Enzymol (1992) 210:129–192.[CrossRef][Web of Science]

    Hicks JM, Hsu VL. The extended left-handed helix: a simple nucleic acid-binding motif. Proteins (2004) 55:330–338.[CrossRef][Web of Science][Medline]

    Hirel PH, Schmitter MJ, Dessen P, Fayat G, Blanquet S. Extent of N-terminal methionine excision from Escherichia coli proteins is governed by the side-chain length of the penultimate amino acid. Proc Natl Acad Sci U S A (1989) 86:8245–8251.

    Ito Y, Kawama T, Urabe I, Yomo T. Evolution of an arbitrary sequence in solubility. J Mol Evol (2004) 58:196–202.[CrossRef][Web of Science][Medline]

    James LC, Tawfik DS. Conformational diversity and protein evolution—a 60-year-old hypothesis revisited. Trends Biochem Sci (2003) 28:361–368.[CrossRef][Web of Science][Medline]

    Johnson CW. Analysis of circular dichroism spectra. Methods Enzymol (1992) 210:426–447.[Web of Science][Medline]

    Keefe AD, Szostak JW. Functional proteins from a random-sequence library. Nature (2001) 410:715–718.[CrossRef][Medline]

    Makarov AA, Lobachov VM, Adzhubei IA, Esipova NG. Natural polypeptides in left-handed helical conformation. A circular dichroism study of the linker histones’ C-terminal fragments and β-endorphin. FEBS Lett (1992) 306:63–65.[CrossRef][Web of Science][Medline]

    Marks JD, Hoogenboom HR, Bonnert TP, McCafferty J, Griffiths AD, Winter G. By-passing immunization. Human antibodies from V-gene libraries displayed on phage. J Mol Biol (1991) 222:581–597.[CrossRef][Web of Science][Medline]

    McGhee JD, von Hippel PH. Theoretical aspects of DNA-protein interactions: co-operative and non-co-operative binding of large ligands to a one-dimensional homogeneous lattice. J Mol Biol (1974) 86:469–489.[CrossRef][Web of Science][Medline]

    Nakashima T, Ishiguro N, Yamaguchi M, Yamauchi A, Shima Y, Nozaki C, Urabe I, Yomo T. Construction and characterization of phage libraries displaying artificial proteins with random sequences. J Biosci Bioeng (2000) 90:253–259.[Web of Science][Medline]

    Nakashima T, Toyota H, Urabe I, Yomo T. Effective selection system for experimental evolution of random polypeptides towards DNA-binding protein. J Biosci Bioeng (2007) 103:155–160.[CrossRef][Web of Science][Medline]

    Nilsson P, Persson B, Uhlen M, Nygren PA. Real-time monitoring of DNA manipulations using biosensor technology. Anal Biochem (1995) 224:400–408.[CrossRef][Web of Science][Medline]

    Prijambada ID, Yomo T, Tanaka F, Kawama T, Yamamoto K, Hasegawa A, Shima Y, Negoro S, Urabe I. Solubility of artificial proteins with random sequences. FEBS Lett (1996) 382:21–25.[CrossRef][Web of Science][Medline]

    Saito S, Sasai M, Yomo T. Evolution of the folding ability of proteins through functional selection. Proc Natl Acad Sci USA (1997) 94:11324–11328.[Abstract/Free Full Text]

    Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol (1987) 4:406–425.[Abstract]

    Sreerama N, Woody RW. Poly(pro)II helices in globular proteins: identification and circular dichroic analysis. Biochemistry (1994) 33:10022–10025.[CrossRef][Web of Science][Medline]

    Suga Y, Yomo T, Urabe I. Heme content of catalase I from Bacillus stearothermophilus. J Ferment Bioeng (1996) 81:259–261.[CrossRef]

    Tamura K, Dudley J, Nei M, Kumar S. MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol (2007) 24:1596–1599.[Abstract/Free Full Text]

    Uversky VN. Use of fast protein size-exclusion liquid chromatography to study the unfolding of proteins which denature through the molten globule. Biochemistry (1993) 32:13288–13298.[CrossRef][Web of Science][Medline]

    Yamauchi A, Nakashima T, Tokuriki N, Hosokawa M, Nogami H, Arioka S, Urabe I, Yomo T. Evolvability of random polypeptides through functional selection within a small library. Protein Eng (2002) 15:619–626.[Abstract/Free Full Text]

    Yamauchi A, Yomo T, Tanaka F. (11 co-authors). Characterization of soluble artificial proteins with random sequences. FEBS Lett (1998) 421:147–151.[CrossRef][Web of Science][Medline]

    Yomo T, Saito S, Sasai M. Gradual development of protein-like global structures through functional selection. Nat Struct Biol (1999) 6:743–746.[CrossRef][Web of Science][Medline]

Accepted for publication February 21, 2008.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/6/1113    most recent
msn063v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Toyota, H.
Right arrow Articles by Yomo, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Toyota, H.
Right arrow Articles by Yomo, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?