Skip to main content

Knottin cyclization: impact on structure and dynamics



Present in various species, the knottins (also referred to as inhibitor cystine knots) constitute a group of extremely stable miniproteins with a plethora of biological activities. Owing to their small size and their high stability, knottins are considered as excellent leads or scaffolds in drug design. Two knottin families contain macrocyclic compounds, namely the cyclotides and the squash inhibitors. The cyclotide family nearly exclusively contains head-to-tail cyclized members. On the other hand, the squash family predominantly contains linear members. Head-to-tail cyclization is intuitively expected to improve bioactivities by increasing stability and lowering flexibility as well as sensitivity to proteolytic attack.


In this paper, we report data on solution structure, thermal stability, and flexibility as inferred from NMR experiments and molecular dynamics simulations of a linear squash inhibitor EETI-II, a circular squash inhibitor MCoTI-II, and a linear analog lin-MCoTI. Strikingly, the head-to-tail linker in cyclic MCoTI-II is by far the most flexible region of all three compounds. Moreover, we show that cyclic and linear squash inhibitors do not display large differences in structure or flexibility in standard conditions, raising the question as to why few squash inhibitors have evolved into cyclic compounds. The simulations revealed however that the cyclization increases resistance to high temperatures by limiting structure unfolding.


In this work, we show that, in contrast to what could have been intuitively expected, cyclization of squash inhibitors does not provide clear stability or flexibility modification. Overall, our results suggest that, for squash inhibitors in standard conditions, the circularization impact might come from incorporation of an additional loop sequence, that can contribute to the miniprotein specificity and affinity, rather than from an increase in conformational rigidity or protein stability. Unfolding simulations showed however that cyclization is a stabilizing factor in strongly denaturing conditions. This information should be useful if one wants to use the squash inhibitor scaffold in drug design.


The knottins are fascinating miniproteins present in many species and featuring various biological actions such as toxic, inhibitory, antimicrobial, insecticidal, cytotoxic, anti-HIV, or hormone-like activities [1]. They share a unique knotted topology of three disulfide bridges, with one disulfide penetrating through a macrocycle formed by the two other disulfides and interconnecting peptide backbones. The KNOTTIN database provides standardized data on sequences, structures and other information on known knottins, also referred to as "inhibitor cystine knot" (ICK) proteins [2, 3]. The main knottin features are a remarkable stability due to the cystine knot, a small size making them readily accessible to chemical synthesis, and an excellent tolerance to sequence variations. Knottins therefore appear as appealing leads or scaffolds for peptide drug design [1, 48]. The knottin scaffold is found in almost 30 different protein families among which conotoxins, spider toxins, squash inhibitors, agouti-related proteins and plant cyclotides are the most populated families. Cyclotides are knottins from plants in the Rubiaceae and Violaceae families that, until recently, were always shown to be head-to-tail macrocyclic peptides [911]. In contrast, all known structurally similar squash inhibitors were linear knottins [12]. This difference held many years until the discovery of the first macrocyclic squash inhibitors Momordica cochinchinensis trypsin inhibitor (MCoTI)-I and -II [13], and, more recently, of a linear cyclotide [14]. The oxidative folding of squash inhibitors and cyclotides has been thoroughly studied [1520]. In both cases, the folding has been shown to occur via a two-disulfide intermediate whose structure is very native-like. This intermediate is the direct precursor of the three-disulfide knotted miniprotein for the squash inhibitors but not for the cyclotide kalata B1. It is now clear that both cyclic and linear variants can exist in different knottin families, but the reasons for this, and the impact of the cyclization, are still poorly understood. Macrocyclic peptides are expected to display improved stability, better resistance to proteases, and reduced flexibility when compared to their linear counterparts, hopefully resulting in enhanced biological activities. On the other hand, although peptide, and, more specifically, knottin cyclization has been shown to be accessible to chemical synthesis [21] and biosynthesis [22], the cost for the cyclization should not be neglected in view of potential pharmaceutical applications of the knottin scaffold. It is thus of interest to carefully evaluate the role and importance of the cyclization in the different knottin families. Despite several studies, the impact of the cyclization in the cyclotide series is still unclear since linearization of kalata B1 was shown to eliminate hemolytic activity [23], whereas a naturally occurring linear cyclotide displayed a reduced but not suppressed hemolytic activity [14]. In both cases, the structures were essentially conserved but higher flexibilities of the linear compounds were described. However, no large differences in thermal and enzymatic stability were described between kalata B1 and acyclic permutants [24]. On the other hand, very little is known on the difference between linear and cyclic squash inhibitors beside the fact that they share similar 3D structures [25, 26]. Nevertheless, it has been stated recently that the rigidifying cyclic backbone of MCoTI-II contributes enhanced stability in comparison to the simpler linear squash inhibitors, prompting the development of improved methods for cyclic knottin production [27].

In this study, we report on structure, stability, and dynamics of cyclic and linear squash inhibitors and analogs. Figure 1 shows the scaffold and the amino acid sequences of the cyclic squash inhibitor, MCoTI-II, of the synthetic linear variant used in this work, lin-MCoTI, and of the naturally occurring linear squash inhibitor, Ecballium elaterium trypsin inhibitor (EETI)-II. The solution structure of the linear analog of MCoTI-II, lin_MCoTI and a refined solution structure of EETI-II were first determined and compared to the MCoTI-II structure. These structures were then used to evaluate the impact of the cyclization on the flexibility and on the stability through NMR experiments, molecular dynamics simulations, and thermal unfolding simulations. We show that, in contrast to what could have been intuitively predicted and has been explicitly stated [27], the head-to-tail cyclization has minimal impact on structure and flexibility of MCoTI-II in standard conditions, possibly explaining why most squash inhibitors are linear. This finding raises the question as to why few squash inhibitors have evolved into cyclic compounds. However, although all compounds are remarkably stable and rigid at room temperature, we also show that the cyclic compound can endure heat better than the linear ones in high temperature simulations.

Figure 1

The knottin fold. (Top) Stereoscopic view of a schematic representation of MCoTI-II, a head-to-tail cyclized squash inhibitor. The head-to-tail linker is shown in blue. Disulfide bridges are shown as ball-and-stick representations. The cystine knot is shown in green (the disulfide macrocycle) and orange (the penetrating disulfide). β-strands are shown as flat arrows and the 310-helix turn is shown in magenta. Cysteines are numbered. (Bottom) Sequences of the squash inhibitors used in this work. Numbering follows MCoTI-II. The disulfide bridge coloring scheme follows the one used in the structure. The colors used in the three-dimensional structure are shown using a colored line below the MCoTI-II sequence. Peptide cyclization is displayed as a black line for MCoTI-II.


Solution structures of lin-MCoTI and of EETI-II

To compare solution structures of free linear squash inhibitors to the solution structure of the cyclic squash inhibitor MCoTI-II, we selected wild type EETI-II and the synthetic linear analog of MCoTI-II, i.e. lin-MCoTI. As the solution structure of EETI-II currently available in the Protein Data Bank [28] (PDB ID: 2eti) was determined in 1989 using NMR data from 360 MHz spectra and corresponds to a crude distance geometry structure without molecular mechanics refinement [29, 30], a new refined EETI-II solution structure was determined. NMR spectra of EETI-II and of lin-MCoTI were therefore recorded to determine their three-dimensional structures. The sequences are displayed in Figure 1 along with the numbering scheme used in this study that follows the MCoTI-II sequence with numbers from 1 to 34.

The assignment of all the 1H and 13C resonances present in the spectra was achieved using well-established techniques [31] and part of the 1H sequential assignment for lin-MCoTI is provided as additional file 1: NMR spectra of lin-MCoTI. 1H and 13C chemical shifts are available from additional file 2: Chemical shifts in ppm for lin-MCoTI. Figure 2 summarizes the sequential and medium range nuclear Overhauser effects (NOEs), 3JHN-Hα coupling constants, slowly exchanging amide protons and the Cα chemical shift differences from random coil values for the lin-MCoTI peptide [32, 33]. The observation of two small 3JHN-Hα coupling constants for residues Asp18 and Ser19, and the dNN(i, i+2), dα N(i, i+2) and dα N(i, i+3) NOEs in the region 16–21 shows the presence of a short 310 helix which is generally detected between the second and the third cysteine of squash inhibitors. The nuclear magnetic resonance (NMR) parameters measured in the region 22–25 are in agreement with a β-turn (dα N(i, i+2) NOE and slowly exchanging amide proton of residue 25). It is now well-established that squash inhibitors share a common structural motif with other knottins. This Cystine Stabilized β-sheet (CSB) motif is essentially made of an anti-parallel triple-stranded β-sheet [1, 34]. The CSB motif is well-defined in EETI-II and lin-MCoTI and is similar to the corresponding region in cyclic MCoTI-II [26]. The three regions of the sequence involved in this motif are 13–15, 26–28 and 32–34. Large 3JHN-Hαcoupling constants and slowly exchanging amide protons measured in these parts of the sequence are in agreement with this structure. In addition, all the characteristic inter-strand NOEs were detected. Region 26–34 is a β-hairpin with a β-turn involving residues 28–31 that was ascertained by the presence of dNN(i, i+2) and dα N(i, i+2) NOEs.

Figure 2

NMR data summary for lin-McoTI. Data on sequential and medium range NOE connectivities, 3JHN-Hαcoupling constants and slowly exchanging amide protons observed for lin-MCoTI are summarized. The height of the bars correspond to the strength of the NOEs. The values of the 3JHN-Hαcoupling constants are indicated by ↓ (< 4 Hz) and ↑ (> 8.5 Hz). Open and filled squares indicate backbone amide protons that were still observed after 3 and 24 h, respectively, in 2H2O. The deviations from random coil values for the 13C chemical shifts of Cα of lin-MCoTI are plotted at the bottom of the figure.

The three-dimensional structures of lin-MCoTI and EETI-II were determined from NMR data using the same strategy previously used for structural studies of the cyclic wild type MCoTI-II [26]. The NMR study led to 78 and 71 sequential and 167 and 122 medium and long range NOEs for lin-MCoTI and EETI-II, respectively. Eight (lin-MCoTI) and thirteen (EETI-II) φ angles were determined from the 3JHN-Hαcoupling constants, and stereospecific assignment of the Hβ protons was achieved for 9 residues (Figure 2 and Table 1). The connectivity of the disulfide bonds for lin-MCoTI was deduced on the basis of the very strong sequence similarity with the parent compound. It is also strongly supported by the similarity between the NMR data sets for MCoTI-II and lin-MCoTI. Moreover, no alternative disulfide connectivity has yet been reported for miniproteins with the knottin fold [3537]. The NMR data were converted into distance and angle constraints and the three-dimensional (3D) structures were obtained as described in Methods. The 30 lowest energy (molecular mechanics plus restraint energy) structures were selected to represent the NMR structures of lin-MCoTI and of EETI-II. The statistics for constraint violations and for molecular mechanics energies are shown in Table 1. Twenty lin-MCoTI solution structures are displayed in Figure 3.

Table 1 Statistics on geometry, energy and NMR data of the EETI-II and lin-MCoTI solution structures
Figure 3

Stereoview of the 20 lowest energy solution structures of lin-McoTI. Structures have been superimposed for their Cα atoms. The coloring scheme is as follows: whole backbone and proline, black; hydrophobic and aromatic residues, green; polar residues, magenta; acidic residues, red, basic residues, blue; disulfide bridges, orange. Cysteines and N- and C-termini are labeled.

The calculated structures of lin-MCoTI and of EETI-II satisfy the NMR data very well with no distance and dihedral violation larger than 0.2 Å or 5°, respectively (Table 1). Statistical analyses using the PROCHECK-NMR software [38] show that the overall stereochemistry of the lin-MCoTI and EETI-II solution structures is correct with all non-glycine and non-proline residues lying in the most favored and additional allowed regions of the Ramachandran map (Table 1). As expected, the refined models also display large negative molecular mechanics energies.

The lin-MCoTI and EETI-II solution structures are well-resolved with pairwise RMS deviations of 0.26 Å and 0.34 Å for superimposition of backbone atoms of residues 8–33, respectively (Table 2). The two average solution structures are close to the EETI-II X-ray structure (PDB ID: 1w7z[39]) with RMS deviations of 0.68 and 0.74 Å for backbone atoms of residues 8–33, i.e. all residues except the C-to-N linker (Table 3). The structures are also close to the cyclic MCoTI-II structure with RMS deviations of 0.78 Å and 0.87 Å, respectively. The EETI-II, lin-MCoTI and MCoTI-II solution structures are available from the Protein Data Bank [28] under PDB IDs 2it7, 2it8, and 1ha9, respectively. The refined EETI-II solution structure described here (PDB ID 2it7) significantly differs from the distance geometry structure published in 1989 (PDB ID 2eti) with RMS deviations of 1.9 Å for backbone atoms of residues 8–33. The main difference lies in the inhibition loop (residues 8–14) as shown by the much lower RMS deviation of 1.0 Å obtained when the loop is omitted from the superimposition.

Table 2 Structural variations of backbone atoms (N, Cα, C, O) in NMR and MD conformational ensembles
Table 3 RMS deviations between average structures

Molecular dynamics simulations of MCoTI-II, lin-MCoTI and EETI-II

Rather than comparison of the structures, one major goal of our study was to determine to which extent the miniprotein flexibility is affected by the circularization. Therefore, the lowest energy solution structures of MCoTI-II, lin-MCoTI and EETI-II were submitted to 22ns-long unrestrained molecular dynamics (MD) simulations in periodic boxes of explicit water molecules at 300 K using the program AMBER [40]. RMS deviations and positional atomic fluctuations were computed on all saved conformations after structural superimposition and are detailed in Table 2, Figure 4 (green lines), and Figure 5 (top). For each simulation, the conformation closest to the average structure was used for comparison with other structures (Table 3 and Figure 6).

Figure 4

Root mean square deviation from the NMR conformation along the MD simulations. Reported values are for backbone atoms (N, Cα, C, O) of residues 8 to 33 at 300 K (green), 400 K (blue) and 500 K (red). Conformations were superimposed for residue ranges 8–33 (heavy colors) and 15–33 (light colors).

Figure 5

Root mean square positional atomic fluctuations in the 300 K, 400 K and 500 K MD simulations. Reported values are for backbone atoms (N, Cα, C, O) and per residue: MCoTI-II (green line), lin-MCoTI (blue line), EETI-II (red line).

Figure 6

Stereoview of average structures from the 300 K simulations. Structures were superimposed on top of the EETI-II X-ray structure (PDB ID: 1w7z[39]), shown in grey, for backbone atoms of residues 15–33. EETI-II is shown in green, MCoTI-II in red, and lin-MCoTI in blue. Cysteines and N- and C-termini of lin-MCoTI are labeled. Disulfide bridges are shown as orange ball-and-stick representations.

The average solution structures are slightly closer to the X-ray EETI-II structure than the MD average structures (Table 3). The fact that no experimental restraints were applied during the dynamics might explain part of the difference. The EETI-II NMR structure is closer from the X-ray structure than the MCoTI-II and lin-MCoTI structures for the 15–33 region (i.e. the CSB motif), but not for the 8–33 region with RMS deviations of 0.54 Å and 0.74 Å, respectively (Table 3). The RMS deviations are displayed in Figure 4, green lines. All three compounds display low deviations from the initial conformation, close to 1 Å. Interestingly, the linear EETI-II showed the lowest deviation for the CSB motif (residues 15–33).

To evaluate the flexibility of each compound, the per-residue atomic fluctuations in the three simulations have been plotted and are displayed in Figure 5 (top). The most striking feature is that the head-to-tail linker in cyclic MCoTI-II is by far the most flexible region. Clearly, the cyclization does not modify the flexibility significantly. This observation is consistent with previous NMR structure calculations [25, 26]. Except for the linker, the three compounds display similar flexibilities. But, interestingly, data in Table 2 and Figure 5 (top) show that the two linear peptides display the lowest average fluctuations although differences remain limited. Lin-MCoTI appears slightly more mobile than EETI-II in the 8–21 region corresponding to the inhibitory loop and the 310 helical loop. The fact that the linear EETI-II is the less flexible compound strongly suggests that head-to-tail cyclization is not a key element of structure stabilization in these conditions. Hydrogen bonding in the simulations are summarized in Table 4. Analysis of hydrogen bonding reinforces previous analyses since the percent occurrences and the average donor-acceptor distances are not significantly affected by the cyclization.

Table 4 Hydrogen bond occurrences during the molecular dynamics simulations

Experimental and simulated thermal unfolding

NMR thermal unfolding experiments were performed on wild type circular MCoTI-II and on the linear analog. In these analyses, the chemical shift variations due to temperature increases between 10°C and 80°C are considered to result solely from a simple two-state unfolding mechanism. It is worth noting that the thermal unfolding has been studied without reduction of disulfide bridges since reduced squash inhibitors are essentially unstructured [16]. The chemical shift for the theoretical fully unfolded species is approximated as the random coil values for short peptides. Protons that could be followed in accumulated spectra at various temperatures were used to compute Tm values [34, 41]. Unfolding curves calculated for the Hβ proton of Cys27 in cyclic MCoTI-II and its linear analog are displayed in Figure 7, and Tm values calculated from data recorded for few protons are tabulated in Table 5. Data from previous experiments on linear EETI-II and on a shorter analog Min-23 [34], a derived two-disulfide peptide containing the CSB motif (Figure 1), have been included for comparison. As expected, the shorter two-disulfide Min-23 peptide consistently displays lower Tm values by approximately 30°C [34]. On the other hand, the differences between the three other compounds appear to be marginal and to fall within experimental errors, thus precluding any detailed comparisons. Although no large stability increase due to cyclization is apparent from unfolding experiments no definitive conclusion could be drawn.

Table 5 Estimated Tm values from NMR thermal unfolding
Figure 7

Thermal unfolding curves. The fraction unfolded calculated from the chemical shift (see Methods) is plotted as a function of temperature. Protein identification is as follows: Min-23 (purple, ), EETI-II (red, ■), MCoTI-II (green, ), lin-MCoTI (blue, ▲).

To provide further information, unfolding simulations were performed using molecular dynamics simulation at high temperature (400 K and 500 K). The RMSDs from the starting conformations are shown in Figure 4 (blue and orange-red curves), and a nativeness score, the Q-score [42], in Figure 8. The Q-score has been used as a measure of structural similarity to the native conformation and indicates the fraction of native non-bonded contacts. In the Q-score implementation used here, Cα inter-residue distances are used as contact measures. The use of other implementations based on all heavy atom distances led to very similar results (data not shown). A Q-score equal to 1 means that the structure is fully native whereas a Q-score below 0.4–0.6 means that the structure is significantly unfolded or incorrectly folded. Average RMSDs and Q-scores plotted as a function of temperature are displayed in Figure 9. The RMSD from the starting structure and the nativeness Q-score criterion are not independent and display similar profiles. While the two linear knottins display close results with almost identical slopes for variation of RMSD or Q-score with temperature, the cyclic MCoTI-II appears less sensitive to temperature increase (Figure 9). It is likely that the constraints imposed by the covalent connection between the N- and C-termini, in addition to the disulfide bridges, slow down the unfolding process during the highest temperature (500 K) simulation. Comparison of the RMSDs of the Cystine-Stabilized Beta-sheet motif in the simulations at 500 K (orange curves in Figure 4) highlights drastic differences between the linear and the cyclic compounds. Despite the constraints imposed by the conserved disulfide bridges, significant unfolding of the CSB motif (residues 15–33) is apparent for the linear compounds with average RMSD values between 2 and 3 Å. In contrast, the CSB motif of the cyclic compound is essentially conserved in the simulation at 500 K with RMSD values that remain as low as 1 Å.

Figure 8

Q-scores of conformations explored in the unfolding simulations. The minimized starting NMR conformation is used as the reference native structure. For each compound, the Q-score evolution is shown at 300 K (green), 400 K (blue) and 500 K (red).

Figure 9

Variation of the mean Root Mean Square deviations and Q-scores with temperature. MCoTI-II is shown as green lines, lin-MCoTI as blue lines and EETI-II as red lines.


The 3D structure of squash inhibitors is known since the late 1980s [29, 30, 43]. They were soon recognized as a prototypic member of a structural class of disulfide-rich miniproteins known as the knottins or Inhibitor Cystine Knots (ICK) [44, 45]. The first cyclic knottin, kalata B1, was described shortly after, in which the N- and C-termini are connected through a regular peptide bond, yielding a head-to-tail macrocyclic and knotted peptide. Although this circularization could have been an exception, it soon appeared that macrocyclic knottins were common in plants in the Rubiaceae and Violaceae families, constituting the large cyclotide family [10, 11, 46]. The only other example of macrocyclic knottin has been found more recently in the squash inhibitor family which otherwise contains more than thirty linear miniproteins [1, 3, 13]. Until now, the structural and functional impact of the circularization remains poorly understood, and very little is known on dynamics and thermal stability of cyclic squash inhibitors when compared to their linear counterparts. This prompted us to investigate structures, dynamics and stabilities of cyclic and linear squash inhibitors and analogs.

NMR experiments and molecular dynamics simulations evidence a limited impact of cyclization on structure and dynamics

The average NMR structures of the three studied compounds are close from each other with most RMS deviations lying below 0.9 Å for backbone atoms of residues 8–33 (Table 3). The differences between NMR and MD structures are only slightly higher and mostly remain near 1.0 Å, a reasonably low value. Moreover, all NMR and MD structures are close to the EETI-II X-ray structure with RMS deviations below 0.9 Å. MD simulations are expected to provide a more energetically rigorous conformational sampling than NMR structure determination. This is mainly because NOE averaging on the NMR time scale emphasizes short distances and because instantaneous and continuous application of these average constraints biases the conformational sampling towards shorter distances. Since no unrealistic deviation occurred during the simulations, the MD conformations were used for comparisons and assessment of flexibilities.

A superimposition of the average MD structures from simulations at 300 K onto the EETI-II X-ray structure is displayed in Figure 6. The only significant difference between EETI-II and MCoTI-II, beside the cyclization, concerns the 22–25 turn. A PROMOTIF [47] analysis indicates that this β-turn is of type I in EETI-II but of type II' in MCoTI-II and lin-MCoTI. This modification is likely a consequence of the different turn sequences (LAGC and PGAC in EETI-II and MCoTI-II, respectively). Interestingly, despite their flexibility, the inhibitory loops of EETI-II, MCoTI-II and lin-MCoTI (residues 8–15) remain reasonably close to the conformation in the EETI-II X-ray structure (Figure 6).

In all three compounds, the Asp20 side chain forms strong hydrogen bonds with amides of residues 16 and 17, but, surprisingly, the percent occurrences are lower in MCoTI-II and lin-MCoTI (Table 4). It is worth noting, however, that in MCoTI-II and lin-MCoTI, Asp20 is surrounded by four positively charged residues at positions Lys13, Lys14, Arg16 and Arg17. Instead of Lys13 and Arg17, corresponding residues in EETI-II are Met and Gln, respectively. Examination of Asp20 interactions in MCoTI-II and lin-MCoTI (Table 4) shows that, beside interactions with backbone amides of residues 16 and 17, Asp20 is also interacting with the Arg17 side chain in both compounds (percent occurrences 23.2 and 5.3, respectively), and with Arg16 or Lys13 in lin-MCoTI and MCoTI-II, respectively (percent occurrence 21.5 and 10.1). The latter interaction, which necessitates a small displacement of the Asp20 carboxylate, supports a higher flexibility of lin-MCoTI in this region, in agreement with the residue fluctuations shown in Figure 5. In contrast to the stronger Asp20-amides interactions in EETI-II, the hydrogen bonding of the Asp18 side chain with the amide of residue 27 is much weaker in EETI-II (22%) than in the two other compounds (94 and 91% in MCoTI-II and lin-MCoTI, respectively). This might be related to the H-bonding in the 3–10 helix, with both stronger (N20-O17, 97.7%) and weaker (N21-018, 70.6%) interactions in EETI-II.

In general, there are no large differences in hydrogen bonding (percent occupancy and average distance) between MCoTI-II and lin-MCoTI. However, in MCoTI-II, the Arg28 side chain is hydrogen bonding 50% of the time with the carbonyl of Ser1 (Table 4), a residue which does not exist in lin-MCoTI. Instead, in lin-MCoTI, Arg28 is hydrogen bonding with the carbonyl of Cys33 and, to a minor extent, Tyr32 (Table 4). There is no such interaction in EETI-II since residue 28 is a glycine. The 310 helix region also differ slightly with weaker N21-O18 and N17-Asp20 hydrogen bonds in the linear compound (Table 4). This difference is surprising because MCoTI-II and lin-MCoTI sequences are strictly identical on this side of the molecule. It thus appears that the modifications near the N- and C-termini propagate toward the 310 helix, probably via subtle modifications of the hydrogen bonding scheme due to modified charges and hydrogen bond acceptors near the C-terminus. This observation is consistent with the fact that the amide proton of residue Arg17 is exchanged more rapidly in lin-MCoTI than in MCoTI-II. In fact several other amide protons display a similar behavior, in particular those of Arg28, Gly31 and Gly34, that all belong to the triple-stranded β-sheet (Figure 2 and [26]), indicating a slightly higher flexibility of the sheet in lin-MCoTI.

Cyclic and linear squash inhibitors exhibit close thermal stabilities

With the hope to better evaluate the influence of cyclization on stability, we have performed NMR-monitored thermal unfolding experiments on MCoTI-II and lin-MCoTI. Due to the extremely high stabilities of the compounds, only limited unfolding could be achieved at temperatures compatible with NMR experiments which resulted in large uncertainties in curve fitting and calculation of thermodynamics parameters. From results in Table 5, it appears that, as expected [34], the shorter two-disulfide Min-23 peptide is clearly less stable than the three-disulfide compounds with Tm values lowered by about 10–40°C. In contrast, differences between Tm values for the three-disulfide knottins are smaller than standard deviations and may not be significant. Nevertheless, most calculated Tm values for the circular MCoTI-II are slightly higher than those calculated for the linear analog, suggesting a small stability increase in the circular compound. This would be consistent with the results of the MD simulations that showed a small flexibility increase in lin-MCoTI, and with the NMR data indicating a faster exchange of several amide protons in this compound.

However, (i) even the highest temperature used for NMR unfolding experiments remained too low to achieve a significant overall unfolding, (ii) the thermodynamics data are scarce and, (iii) more importantly, the two-state unfolding hypothesis and the use of the random coil values for chemical shifts of the unfolded species may not be valid in this case, especially because of the high disulfide bridge content. It is worth noting also that, for the cyclotide kalata B1, no significant changes were observed in circular dichroism spectra in absence or presence of 8 M urea and in the temperature range 5–90°C [24]. Therefore, to get further information on stability while avoiding the experimental limitations, we have used high temperature molecular dynamics simulations that make possible to study processes that are difficult to investigate experimentally.

Cyclic MCoTI-II displays better thermoresistance to unfolding

It has been suggested that molecular dynamics simulations at high temperature accelerate protein unfolding without changing significantly the unfolding pathway and provide useful information on thermal events [4851]. Therefore, in addition to the simulations at 300 K (22 ns), simulations at 400 K (30 ns) and 500 K (30 ns) were performed for the two linear knottins EETI-II and lin-MCoTI and for the cyclic compound MCoTI-II. The curves in Figure 4, 5 and 8 suggest that the unfolding of MCoTI-II at 500 K is approximately similar to what is displayed by the linear compounds at 400 K, strongly suggesting a better resistance to temperature for the cyclic compound. This can be compared to recent reports on family 11 xylanase where MD simulations at 600 K, but not 300 K, revealed significant differences between mesophilic and thermophilic enzymes [51].

Nevertheless, all three studied compounds displayed high stabilities and no significant structure or flexibility differences between linear and cyclic compounds appeared in simulations at room temperature or in NMR experiments up to ~80°C. Rather, there are sequence differences that seem to be as or more important than circularization for structure and flexibility in standard conditions. For example, the sequence and type of the 22–25 β-turn differ in EETI-II and lin-MCoTI resulting in flexibility differences in this area (Figure 5), with EETI-II displaying lower flexibilities at 300 K and 500 K. Also, the addition of a charged residue (Lysine) in MCoTI-II and lin-MCoTI at position 13 induces transient salt bridging with Asp20 thus perturbing the Asp20-O17 hydrogen bonding at the N-terminus of the 3–10 helix. This may be, at least partly, responsible for the lower flexibility at 300 K of the 3–10 helix in EETI-II in comparison to lin-MCoTI. More generally, the charged termini in linear compounds can in principle provide additional stabilizing electrostatic interactions when compared to cyclic analogs. As an example, a salt-bridge between the C-terminus and the side chain of an arginine in position 7 (Figure 1 numbering) has been reported in several linear squash inhibitors [43, 5254]. There is no such interaction here because both EETI-II and lin-MCoTI lack the Arg7, and, moreover, lin-MCoTI is uncharged at the C-terminus due to amidation.

Therefore circularization does not seem to be a key element of the squash inhibitors structure or flexibility in standard conditions. The cystine knot itself appears as the main factor responsible for the high stability and cyclization or sequence modifications seem to only induce marginal modifications. This does not hold however in high temperature simulations (500 K) where cyclization slowed the unfolding significantly, indicating that the cyclic squash inhibitor is more thermoresistant than linear counterparts. Despite the high flexibility of the head-to-tail linker in MCoTI-II, the cyclization is therefore likely to enhance resistance to strongly unfolding conditions.

Comparison of cyclization in squash inhibitors and in cyclotides

Comparisons of solution structures suggested that the circularization in cyclotides provides a reduction of the flexibility, and this was supposed to be at the origin of the cancellation [23] or reduction [14] of the hemolytic activity in wild type or synthetic linear cyclotides. However it has been mentioned that the hemolytic activity could be related to hydrophobicity [55], and that the wild type linear cyclotide, violacin A, is less hydrophobic than other cyclotides [14]. It is thus unclear which of the linear or hydrophilic feature of violacin A is responsible for the reduced hemolytic activity. On the other hand, the linear synthetic cyclotides lacked few residues that could have impact on hemolytic activity [23, 56], and to which extent linearization itself is involved in reduced activity remains to be determined. Moreover, stabilities of kalata B1 and acyclic permutants were shown to be very similar [24], suggesting that it is indeed the cystine knot rather than the circularization that provides most of the stability in cyclotides. It would be interesting to examine carefully the structural/functional differences between circular cyclotides and linear standard (hydrophobic) wild cyclotide when one is discovered.

Although the conclusions regarding the cyclization impact in the cyclotide family are roughly consistent with our results, it is worth noting that the head-to-tail linker in cyclotides is significantly different from the one in the cyclic squash inhibitor MCoTI-II. While the latter includes four glycines and no proline out of eight residues, the linker in most cyclotides includes one proline but only one glycine out of seven residues (see residues with numbers < 20 or > 100 in standardized alignments of cyclotides provided in the KNOTTIN database at [2, 3]). These sequence differences are probably sufficient to explain why this region is by far the most flexible part in MCoTI-II but not in the cyclotides since glycines are the most flexible residues and prolines the most rigid ones. One could then hypothesize that this well-structured linker is an integral part of the cyclotide structure possibly explaining why all cyclotides but one are circular. Conversely, the flexible linker in MCoTI-II is probably nothing more than an additional and very flexible loop, consistent with the fact that most squash inhibitors are linear. The presence of the linker in MCoTI-II could possibly provide extra contacts with the protease as well as protection to degradations by exoproteases or resistance to denaturing conditions.

Biological role of knottin cyclization

The observations that the head-to-tail linker in MCoTI-II is the most flexible part of the molecule, and that linear squash inhibitors (e.g. EETI-II) can be as rigid as the circular MCoTI-II in standard conditions strongly suggests that the biological role of the circularization in squash inhibitors is not to reduce flexibility or to render the molecule more rigid. Nevertheless, better affinities for trypsin were achieved by circular squash inhibitors [Ki = 3.10-11 M for MCoTI-II vs. 3.10-10 M for lin-MCoTI and 8.10-11 M for EETI-II [57]]. From the results in this work, it is unlikely that the improved affinity arises from conformational effects, either directly, or by reducing the flexibility of the free inhibitor, hence lowering the entropic loss due to binding. Our work rather suggests that the enhanced affinity of the circular compound could be due to direct binding of the head-to-tail linker with trypsin, as previously suggested by molecular modeling of the complex [13]. If this is the case, then the linker would have minimal impact in engineered squash inhibitor-based knottin variants, except in such particular cases where the linker bears a supplementary interaction site.

It is tempting to speculate that the biological role of the cyclization in squash inhibitors is for enzymatic rather than for thermodynamic reasons, and it has been suggested earlier that cyclization could prevent degradation by exoproteases [13, 24]. The enzymatic stability of several knottins has been reported recently and both circular cyclotides and linear squash inhibitors were shown to display excellent resistance to proteases, except for the enzymes specific to the miniprotein, as e.g. serine proteases for squash inhibitors [4, 24]. Even the naturally occurring linear cyclotide, violacin A, was shown to survive for 6 h in presence of trypsin or thermolysin reinforcing the idea that cyclization is not the main determinant for resistance to endoproteases [14]. Individual sequences may however display different enzymatic stabilities, and an example is provided by the human agouti-related protein that was shown to be proteolized more rapidly than squash inhibitors [4]. Interestingly, reduced cyclic kalata B1 has been shown to be more resistant to proteolysis than reduced linear conotoxin PVIIA [24]. This suggests that cyclization could also have some influence by slowing down enzymatic degradation of reduced knottins. Sensitivity of knottins to exoproteases, however, has not yet been systematically studied. Certainly, macrocyclic knottins will remain unaffected by exoproteases, but to which extent various linear knottins are sensitive to exoproteases remains to be determined. Both N- and C-terminal segments before and after the first and last well-structured half-cystines are rather short in many knottins, and it is unclear if these will be very sensitive to exoproteases. To our knowledge the only reported example comes from the linear cyclotide violacin A [14]. Violacin A was shown to be proteolysed by aminopeptidase M, which cleaved only the first two N-terminal residues. The third and fourth residues that precede the first cysteine of the knot were not cleaved, most likely because of their proximity to the cystine knot [14]. It can thus be hypothesized that many linear knottins with short N- and C-termini will not be easily degraded by exoproteases. Moreover, limited degradation of longer termini might not always be very deleterious since active residues are mostly located in inter-cysteine loops rather than in the termini. An example of this is provided by the minimized 34-residue agouti-related protein (AGRP) analog containing only the cystine knot domain, and which maintains the melanocortin receptor pharmacological profile of AGRP(87–132) [58]. Beside stability or protease resistance, cyclization was shown to facilitate in vitro oxidative folding of Kalata B1 that otherwise needs a hydrophobic environment to attain the native fold [21]. However it is worth noting that (i) in vivo, the formation of disulfide bridges is generally assumed to occur prior cyclization, thus bringing the termini close to each other for subsequent cyclization, and (ii) the phenomenon is specific to cyclotides since, in contrast, there is no need of cyclization or particular environment in the oxidative folding of squash inhibitors. The positive impact of cyclization on folding and/or resistance to denaturing conditions might well be important when membrane crossing is involved.


We have shown that only small conformational differences are displayed by circular squash inhibitors and that linear compounds may display sufficient stabilities without the need for cyclization.

The cyclization observed in MCoTI-II, with no strong impact on thermodynamic or enzymatic stability, might therefore be an exceptional event rather than a general process in the squash family. One can even wonder if the increased affinity towards trypsin afforded by the cyclization (see above) is important for the plant itself since the affinities displayed by the linear compounds are already quite strong. Indeed, determining to which extent cyclization of squash inhibitors is a general process must await further large-scale studies in search of naturally occurring cyclized squash inhibitors in cucurbit seeds.

Protein circularization has been proposed as an interesting tool to stabilize engineered peptides in drug design studies [5961]. Thanks to their small size, knottins have been shown to be readily accessible to chemical synthesis and routes to macrocyclic knottins are available [1, 21, 59, 62, 63]. Interestingly macrocyclic knottins were also recently shown to be accessible from bioengineering or biomimetic routes [22, 27]. Nevertheless, peptide cyclization induces constraints on peptide synthesis and is expected to significantly lower the yields, especially when engineered sequences, that could be non-optimal for native-like folding, will be grafted onto the knottin scaffold. Thus the cost increase should be taken into account along with the potential benefit of circularization, if any, when considering circular knottin-based engineered molecules in drug design studies, since linear scaffolds with similar features are available, e.g. the squash inhibitor EETI-II. However, it is possible that stability can vary from one sequence to the other and that in particular cases circularization will be one good option to increase stability. But in many cases, the knottin scaffold itself and sequence optimization might be sufficient to insure high stabilities of linear compounds based on the squash inhibitor scaffold. An excellent example of knottin sequence optimization has been provided by incorporation of pairwise β-sheet stabilizing residues that improved folding and stability of the C-terminal knottin fold of the agouti signaling protein [64]. Nevertheless, the high temperature simulations we have performed suggest that the head-to-tail cyclization might help to stabilize squash inhibitors in strongly denaturing conditions and this may be important for membrane crossing. It is worth reminding also that when using the squash inhibitor scaffold in drug design, degradation by specific enzymes should generally be avoided. This can be easily achieved by mutating the protease sensitive site [44, 57].

Finally, it can be concluded that the cystine knot is the main determinant for stability and for resistance to proteolysis of knottins. Then the specific sequence can help increasing both stability and resistance to proteolysis. Circularization can probably enhance resistance to strongly denaturing conditions and possibly facilitate in vitro folding in particular cases. Circularization may not be however a general prerequisite in knottin based drug design, especially when using the squash inhibitor scaffold.


Peptide synthesis

Wild type MCoTI-II was purified from seeds of the Gâc fruit (Momordica cochinchinensis) collected in Vietnam [13]. The linear variant lin-MCoTI was chemically synthesized using Fmoc solid phase peptide synthesis [57, 65]. EETI-II was chemically synthesized as previously described [66].

NMR experiments

Samples were prepared by dissolving peptides in either 90% H2O/10% 2H2O (v/v) or 100% 2H2O to a concentration of approximately 1.2 mM with the pH adjusted to 3.0 by addition of dilute HCl or NaOH. NMR spectra were recorded on a Bruker Avance-600 spectrometer equipped with a triple resonance inverse Cryoprobe with a single axis z gradient. Data were acquired at 12°C and 27°C, and TSP-d4 was used as an internal reference. All two-dimensional (2D) experiments, correlated spectroscopy (COSY), total correlated spectroscopy (TOCSY) and nuclear Overhauser effect spectroscopy (NOESY), were performed according to standard procedures [31] using quadrature detection in both dimensions with spectral widths of 6849.3 Hz in both dimensions. The carrier frequency was centred on the water signal and the solvent water resonance was suppressed by using the WATERGATE [67] method for experiments in H2O and by applying continuous low power irradiation during the relaxation delay and during the mixing time for NOESY spectra for experiments in 2H2O. The 2D spectra were obtained using 2048 or 4096 points for each t1 value, and 512 t1 experiments were acquired for COSY, TOCSY and NOESY experiments. TOCSY spectra were recorded with spin lock times of 30 and 60 ms. The mixing time was 150 and 300 ms in NOESY spectra. Spectra were processed using XWINNMR (Bruker). The t1 dimension was zero filled to 1024 points and π/3 or π/4 shifted sine bell functions were applied in t1 and t2 domains, respectively, prior to Fourier transform. 3JNH-Hαcoupling constants were measured on one-dimensional (1D) spectra. The exchange of amide protons with deuterium was studied at 12°C on samples lyophilized from H2O at pH 3.0 and dissolved in 2H2O. A series of 1D, TOCSY, and NOESY spectra were acquired over a 48-h period. 1H-13C hetero single quantum coherence spectroscopy (HSQC) and 1H-13C HSQC TOCSY spectra [68, 69] were recorded on the samples in 2H2O. Spectral widths were 6849.3 Hz and 25000 Hz in the 1H and 13C dimensions respectively. 2048 data points were acquired with 512 t1 increments.

Structure calculations

All calculations and analyses were performed on PC Linux boxes. The structures were displayed and analyzed using PyMOL (Warren L. DeLano "The PyMOL Molecular Graphics System." DeLano Scientific LLC, San Carlos, CA, USA. The NOE intensities were classified as strong, medium, and weak, and converted into distance constraints as previously described [26]. When necessary, the distance constraints were corrected for pseudoatoms [70]. Φ angles of residues with small or large 3JHN-Hαcoupling constants (< 4 Hz or > 8.5 Hz) were constrained into the -90° to -40° or -160° to -80° ranges, respectively. χ1 angles of residues for which stereospecific attribution of the β protons could be achieved were constrained in the corresponding range. Disulfide bridges were imposed through distance constraints of 2.0–2.1, 3.0–3.1, and 3.75–3.95 Å on Si-Sj, Si-Cβ j and Sj-Cβ i, and Cβ i-Cβ j distances, respectively. No hydrogen bond was imposed. Three hundred 3D structures were obtained as previously [26] from the distance and angle restraints using the torsion angle molecular dynamics method available in the CYANA program [71]. The fifty structures with the lowest violation of the target function were submitted to molecular mechanics energy refinement with the SANDER module of the AMBER 8 program [40], using the ff03 force field [72] and the GB/SA implicit solvation scheme [72]. During the restrained molecular dynamics runs the covalent bond lengths were kept constant by applying the SHAKE algorithm [73] allowing a 2 fs time step to be used. Distance and angle NMR restraints were applied using square bottom wells with parabolic sides. The sides become linear for large deviations [26]. When no stereospecific assignment could be achieved for methyl or methylene protons, an < r-6 > -1/6 averaging scheme was used instead of pseudo-atoms. No constraints were applied to the disulfide bridges. Five thousand cycles of restrained energy minimization were first carried out followed by a 100-ps long simulated annealing procedure in which the temperature was raised to 900 K for 40 ps then gradually lowered to 300 K. During this stage, the force constant for the NMR distance and dihedral restraints were gradually increased from 3 to 30 kcal.mol-1-2 or kcal.mol-1.rad-2.

Thermal unfolding

The thermal unfolding was monitored using 1D NMR data obtained on samples in 2H2O previously used for exchange experiments. The temperature within the NMR samples was calibrated with ethylene glycol and varied between 10°C and 80°C. Chemical shifts for fully unfolded species δU could not be attained experimentally and were taken as random coil chemical shifts of the corresponding protons [7476]. Since we could not precisely determine chemical shifts for fully folded species δF, these were obtained from fitting experimental δ(T), the chemical shift at temperature T, to a simple two-state unfolding mechanism [34, 41] using Kaleidagraph (Synergy Software, Reading):δ(T) = α δF + (1-α) δU

where α = 1/1 + eG|RT

Then the two-state equilibrium constant for thermal unfolding,

K U t h e r m a l = f U / ( 1 f U ) = ( δ F δ ( T ) ) / ( δ ( T ) δ U ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4saS0aa0baaSqaaiabdwfavbqaaiabdsha0jabdIgaOjabdwgaLjabdkhaYjabd2gaTjabdggaHjabdYgaSbaakiabg2da9maalyaabaGaemOzay2aaSbaaSqaaiabdwfavbqabaaakeaacqGGOaakcqaIXaqmcqGHsislcqWGMbGzdaWgaaWcbaGaemyvaufabeaaaaGccqGGPaqkcqGH9aqpdaWcgaqaamaabmaabaGaeqiTdq2aaSbaaSqaaiabdAeagbqabaGccqGHsislcqaH0oazdaqadaqaaiabdsfaubGaayjkaiaawMcaaaGaayjkaiaawMcaaaqaamaabmaabaGaeqiTdq2aaeWaaeaacqWGubavaiaawIcacaGLPaaacqGHsislcqaH0oazdaWgaaWcbaGaemyvaufabeaaaOGaayjkaiaawMcaaaaaaaa@5730@


Δ G U t h e r m a l = R T ln K U t h e r m a l MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdqKaem4raC0aa0baaSqaaiabdwfavbqaaiabdsha0jabdIgaOjabdwgaLjabdkhaYjabd2gaTjabdggaHjabdYgaSbaakiabg2da9iabgkHiTiabdkfasjabdsfaujGbcYgaSjabc6gaUjabdUealnaaDaaaleaacqWGvbqvaeaacqWG0baDcqWGObaAcqWGLbqzcqWGYbGCcqWGTbqBcqWGHbqycqWGSbaBaaaaaa@4CD2@

was determined at each experimental temperature (fU, fraction unfolded). The Tm values for reversible thermal unfolding of the peptides were calculated by linear fitting of Δ G U t h e r m a l MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdqKaem4raC0aa0baaSqaaiabdwfavbqaaiabdsha0jabdIgaOjabdwgaLjabdkhaYjabd2gaTjabdggaHjabdYgaSbaaaaa@394B@ versus T for each proton.

The statistical error on experimental chemical shifts was estimated to ± 0.02 ppm. To obtain a rough estimate of the corresponding error on the calculated thermodynamics parameters, the above calculations were repeated 100 times with chemical shifts randomly picked in the range of x ± 0.02 ppm, where x is the experimentally determined value. From the resulting distributions of parameters, mean values of Tm, and associated standard deviations were calculated and are reported in Table 5.

Molecular dynamics simulations

Molecular dynamics simulations were carried out on an AMD Opteron cluster using the PGI compilers (The Portland Group, Inc., Portland, USA) and the AMBER 8.0 program [40]. The starting models were immersed into a truncated octahedron of TIP3P explicit water models [77], with minimal distances of 15 Å between any protein atom and the box boundaries. Periodic boundary conditions were imposed and the total charge of the system was compensated for by using a neutralizing plasma. Lennard-Jones and electrostatic interactions were calculated using the Particle-mesh Ewald (PME) summation scheme [78], with a cut-off of 8 Å for the separation of the direct and reciprocal space summation.

Water molecules were first energy minimized while restraining the protein atoms. Then, the whole system was equilibrated for 0.5 ns at the target temperature and 1 bar using the weak coupling algorithm (temperature and pressure relaxation times = 2 ps) [79]. For production runs, the temperature was regulated using the Langevin dynamics with a collision frequency of 3 ps-1, and bonds involving hydrogen atoms were constrained using the SHAKE algorithm [73]. The conformations were stored every 1 ps, and the trajectories were analyzed with the Ptraj program of the Amber 8.0 suite. Room temperature molecular dynamics simulations were performed at 300 K for 22 ns. Unfolding simulations were performed at higher temperatures (400 K and 500 K) for 30 ns. The structural criteria used to monitor protein unfolding were the RMSD and a nativeness score, the Q-score. The Q-score was computed using the MMTSB tool available at It is calculated using a Gaussian function of the inter-residue Cα distance centered at zero with standard deviation of |j-i|0.15 and normalized by the number of non-bonded-contacts [42]


  1. 1.

    Chiche L, Heitz A, Gelly JC, Gracy J, Chau PT, Ha PT, Hernandez JF, Le-Nguyen D: Squash inhibitors: from structural motifs to macrocyclic knottins. Curr Protein Pept Sci 2004, 5(5):341–349.

    CAS  Article  Google Scholar 

  2. 2.

    Gelly JC, Gracy J, Kaas Q, Le-Nguyen D, Heitz A, Chiche L: The KNOTTIN website and database: a new information system dedicated to the knottin scaffold. Nucleic Acids Res 2004, 32(Database issue):D152–9.

    Google Scholar 

  3. 3.

    Gracy J, Le-Nguyen D, Gelly JC, Kaas Q, Heitz A, Chiche L: KNOTTIN: the knottin or inhibitor cystine knot scaffold in 2007. Nucleic Acids Res 2008, 36(Database issue):D314–9.

    CAS  Google Scholar 

  4. 4.

    Werle M, Schmitz T, Huang HL, Wentzel A, Kolmar H, Bernkop-Schnurch A: The potential of cystine-knot microproteins as novel pharmacophoric scaffolds in oral peptide drug delivery. J Drug Target 2006, 14(3):137–146.

    CAS  Article  Google Scholar 

  5. 5.

    Reiss S, Sieber M, Oberle V, Wentzel A, Spangenberg P, Claus R, Kolmar H, Losche W: Inhibition of platelet aggregation by grafting RGD and KGD sequences on the structural scaffold of small disulfide-rich proteins. Platelets 2006, 17(3):153–157.

    CAS  Article  Google Scholar 

  6. 6.

    Souriau C, Chiche L, Irving R, Hudson P: New binding specificities derived from Min-23, a small cystine-stabilized peptidic scaffold. Biochemistry 2005, 44(19):7143–7155.

    CAS  Article  Google Scholar 

  7. 7.

    Hosse RJ, Rothe A, Power BE: A new generation of protein display scaffolds for molecular recognition. Protein Sci 2006, 15(1):14–27.

    CAS  Article  Google Scholar 

  8. 8.

    Craik DJ, Cemazar M, Daly NL: The cyclotides and related macrocyclic peptides as scaffolds in drug design. Curr Opin Drug Discov Devel 2006, 9(2):251–260.

    CAS  Google Scholar 

  9. 9.

    Craik DJ, Daly NL, Mulvenna J, Plan MR, Trabi M: Discovery, structure and biological activities of the cyclotides. Curr Protein Pept Sci 2004, 5(5):297–315.

    CAS  Article  Google Scholar 

  10. 10.

    Craik DJ, Daly NL, Bond T, Waine C: Plant cyclotides: A unique family of cyclic and knotted proteins that defines the cyclic cystine knot structural motif. J Mol Biol 1999, 294(5):1327–1336.

    CAS  Article  Google Scholar 

  11. 11.

    Wang CK, Kaas Q, Chiche L, Craik DJ: CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering. Nucleic Acids Res 2008, 36(Database issue):D206–10.

    CAS  Google Scholar 

  12. 12.

    Otlewski J, Krowarsch D: Squash inhibitor family of serine proteinases. Acta Biochim Pol 1996, 43(3):431–444.

    CAS  Google Scholar 

  13. 13.

    Hernandez JF, Gagnon J, Chiche L, Nguyen TM, Andrieu JP, Heitz A, Trinh Hong T, Pham TT, Le Nguyen D: Squash trypsin inhibitors from Momordica cochinchinensis exhibit an atypical macrocyclic structure. Biochemistry 2000, 39(19):5722–5730.

    CAS  Article  Google Scholar 

  14. 14.

    Ireland DC, Colgrave ML, Nguyencong P, Daly NL, Craik DJ: Discovery and characterization of a linear cyclotide from Viola odorata: implications for the processing of circular proteins. J Mol Biol 2006, 357(5):1522–1535.

    CAS  Article  Google Scholar 

  15. 15.

    Le-Nguyen D, Heitz A, Chiche L, el Hajji M, Castro B: Characterization and 2D NMR study of the stable [9–21, 15–27] 2 disulfide intermediate in the folding of the 3 disulfide trypsin inhibitor EETI II. Protein Sci 1993, 2(2):165–174.

    CAS  Article  Google Scholar 

  16. 16.

    Heitz A, Chiche L, Le-Nguyen D, Castro B: Folding of the squash trypsin inhibitor EETI II. Evidence of native and non-native local structural preferences in a linear analogue. Eur J Biochem 1995, 233(3):837–846.

    CAS  Article  Google Scholar 

  17. 17.

    Heitz A, Le-Nguyen D, Castro B, Chiche L: Conformational study of a native monodisulfide bridge analogue of EETI-II. Lett in Pep Sci 1997, 4: 245–249.

    CAS  Google Scholar 

  18. 18.

    Daly NL, Clark RJ, Craik DJ: Disulfide folding pathways of cystine knot proteins. Tying the knot within the circular backbone of the cyclotides. J Biol Chem 2003, 278(8):6314–6322.

    CAS  Article  Google Scholar 

  19. 19.

    Cemazar M, Daly NL, Haggblad S, Lo KP, Yulyaningsih E, Craik DJ: Knots in rings. The circular knotted protein Momordica cochinchinensis trypsin inhibitor-II folds via a stable two-disulfide intermediate. J Biol Chem 2006, 281(12):8224–8232.

    CAS  Article  Google Scholar 

  20. 20.

    Cemazar M, Joshi A, Daly NL, Mark AE, Craik DJ: The structure of a two-disulfide intermediate assists in elucidating the oxidative folding pathway of a cyclic cystine knot protein. Structure 2008, 16(6):842–851.

    CAS  Article  Google Scholar 

  21. 21.

    Daly NL, Love S, Alewood PF, Craik DJ: Chemical synthesis and folding pathways of large cyclic polypeptides: studies of the cystine knot polypeptide kalata B1. Biochemistry 1999, 38(32):10606–10614.

    CAS  Article  Google Scholar 

  22. 22.

    Camarero JA, Kimura RH, Woo YH, Shekhtman A, Cantor J: Biosynthesis of a fully functional cyclotide inside living bacterial cells. Chembiochem 2007, 8(12):1363–1366.

    CAS  Article  Google Scholar 

  23. 23.

    Barry DG, Daly NL, Clark RJ, Sando L, Craik DJ: Linearization of a naturally occurring circular protein maintains structure but eliminates hemolytic activity. Biochemistry 2003, 42(22):6688–6695.

    CAS  Article  Google Scholar 

  24. 24.

    Colgrave ML, Craik DJ: Thermal, chemical, and enzymatic stability of the cyclotide kalata B1: the importance of the cyclic cystine knot. Biochemistry 2004, 43(20):5965–5975.

    CAS  Article  Google Scholar 

  25. 25.

    Felizmenio-Quimio ME, Daly NL, Craik DJ: Circular proteins in plants: solution structure of a novel macrocyclic trypsin inhibitor from Momordica cochinchinensis. J Biol Chem 2001, 276(25):22875–22882.

    CAS  Article  Google Scholar 

  26. 26.

    Heitz A, Hernandez JF, Gagnon J, Hong TT, Pham TT, Nguyen TM, Le-Nguyen D, Chiche L: Solution structure of the squash trypsin inhibitor MCoTI-II. A new family for cyclic knottins. Biochemistry 2001, 40(27):7973–7983.

    CAS  Article  Google Scholar 

  27. 27.

    Thongyoo P, Roque-Rosell N, Leatherbarrow RJ, Tate EW: Chemical and biomimetic total syntheses of natural and engineered MCoTI cyclotides. Org Biomol Chem 2008, 6(8):1462–1470.

    CAS  Article  Google Scholar 

  28. 28.

    Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res 2000, 28(1):235–242.

    CAS  Article  Google Scholar 

  29. 29.

    Heitz A, Chiche L, Le-Nguyen D, Castro B: 1H 2D NMR and distance geometry study of the folding of Ecballium elaterium trypsin inhibitor, a member of the squash inhibitors family. Biochemistry 1989, 28(6):2392–2398.

    Article  Google Scholar 

  30. 30.

    Chiche L, Gaboriaud C, Heitz A, Mornon JP, Castro B, Kollman PA: Use of restrained molecular dynamics in water to determine three-dimensional protein structure: prediction of the three-dimensional structure of Ecballium elaterium trypsin inhibitor II. Proteins 1989, 6(4):405–417.

    CAS  Article  Google Scholar 

  31. 31.

    Wuthrich K: NMR of Proteins and Nucleic Acids. NY: John Wiley & Sons Inc; 1986.

    Google Scholar 

  32. 32.

    Wishart DS, Sykes BD: The 13C chemical-shift index: a simple method for the identification of protein secondary structure using 13C chemical-shift data. J Biomol NMR 1994, 4(2):171–180.

    CAS  Article  Google Scholar 

  33. 33.

    Wishart DS, Sykes BD, Richards FM: The chemical shift index: a fast and simple method for the assignment of protein secondary structure through NMR spectroscopy. Biochemistry 1992, 31(6):1647–1651.

    CAS  Article  Google Scholar 

  34. 34.

    Heitz A, Le-Nguyen D, Chiche L: Min-21 and min-23, the smallest peptides that fold like a cystine-stabilized beta-sheet motif: design, solution structure, and thermal stability. Biochemistry 1999, 38(32):10615–10625.

    CAS  Article  Google Scholar 

  35. 35.

    Combelles C, Gracy J, Heitz A, Craik DJ, Chiche L: Structure and folding of disulfide-rich miniproteins: Insights from molecular dynamics simulations and MM-PBSA free energy calculations. Proteins 2008.

    Google Scholar 

  36. 36.

    Goransson U, Craik DJ: Disulfide mapping of the cyclotide kalata B1. Chemical proof of the cystic cystine knot motif. J Biol Chem 2003, 278(48):48188–48196.

    Article  Google Scholar 

  37. 37.

    Rosengren KJ, Daly NL, Plan MR, Waine C, Craik DJ: Twists, knots, and rings in proteins. Structural definition of the cyclotide framework. J Biol Chem 2003, 278(10):8606–8616.

    CAS  Article  Google Scholar 

  38. 38.

    Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM: AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR 1996, 8(4):477–486.

    CAS  Article  Google Scholar 

  39. 39.

    Kratzner R, Debreczeni JE, Pape T, Schneider TR, Wentzel A, Kolmar H, Sheldrick GM, Uson I: Structure of Ecballium elaterium trypsin inhibitor II (EETI-II): a rigid molecular scaffold. Acta Crystallogr D Biol Crystallogr 2005, 61(Pt 9):1255–1262.

    Article  Google Scholar 

  40. 40.

    Case DA, Darden TA, Cheatham TE 3rd, Simmerling C, Wang J, Duke RE, Luo R, Merz KM Jr, Wang B, Pearlman DA, et al.: AMBER 8. San Francisco.: University of California; 2004.

    Google Scholar 

  41. 41.

    Mer G, Hietter H, Lefevre JF: Stabilization of proteins by glycosylation examined by NMR analysis of a fucosylated proteinase inhibitor. Nat Struct Biol 1996, 3(1):45–53.

    CAS  Article  Google Scholar 

  42. 42.

    Panchenko AR, Luthey-Schulten Z, Cole R, Wolynes PG: The foldon universe: a survey of structural similarity and self-recognition of independently folding units. J Mol Biol 1997, 272(1):95–105.

    CAS  Article  Google Scholar 

  43. 43.

    Bode W, Greyling HJ, Huber R, Otlewski J, Wilusz T: The refined 2.0 A X-ray crystal structure of the complex formed between bovine beta-trypsin and CMTI-I, a trypsin inhibitor from squash seeds (Cucurbita maxima). Topological similarity of the squash seed inhibitors with the carboxypeptidase A inhibitor from potatoes. FEBS Lett 1989, 242(2):285–292.

    CAS  Article  Google Scholar 

  44. 44.

    Le-Nguyen D, Heitz A, Chiche L, Castro B, Boigegrain RA, Favel A, Coletti-Previero MA: Molecular recognition between serine proteases and new bioactive microproteins with a knotted structure. Biochimie 1990, 72(6–7):431–435.

    CAS  Article  Google Scholar 

  45. 45.

    Pallaghy PK, Nielsen KJ, Craik DJ, Norton RS: A common structural motif incorporating a cystine knot and a triple-stranded beta-sheet in toxic and inhibitory polypeptides. Protein Sci 1994, 3(10):1833–1839.

    CAS  Article  Google Scholar 

  46. 46.

    Simonsen SM, Sando L, Ireland DC, Colgrave ML, Bharathi R, Goransson U, Craik DJ: A continent of plant defense peptide diversity: cyclotides in Australian Hybanthus (Violaceae). Plant Cell 2005, 17(11):3176–3189.

    CAS  Article  Google Scholar 

  47. 47.

    Hutchinson EG, Thornton JM: PROMOTIF – a program to identify and analyze structural motifs in proteins. Protein Sci 1996, 5(2):212–220.

    CAS  Article  Google Scholar 

  48. 48.

    Beck DA, Daggett V: Methods for molecular dynamics simulations of protein folding/unfolding in solution. Methods 2004, 34(1):112–120.

    CAS  Article  Google Scholar 

  49. 49.

    Beck DA, White GW, Daggett V: Exploring the energy landscape of protein folding using replica-exchange and conventional molecular dynamics simulations. J Struct Biol 2007, 157(3):514–523.

    CAS  Article  Google Scholar 

  50. 50.

    Day R, Bennion BJ, Ham S, Daggett V: Increasing temperature accelerates protein unfolding without changing the pathway of unfolding. J Mol Biol 2002, 322(1):189–203.

    CAS  Article  Google Scholar 

  51. 51.

    Purmonen M, Valjakka J, Takkinen K, Laitinen T, Rouvinen J: Molecular dynamics studies on the thermostability of family 11 xylanases. Protein Eng Des Sel 2007, 20(11):551–559.

    CAS  Article  Google Scholar 

  52. 52.

    Huang Q, Liu S, Tang Y: Refined 1.6 A resolution crystal structure of the complex formed between porcine beta-trypsin and MCTI-A, a trypsin inhibitor of the squash family. Detailed comparison with bovine beta-trypsin and its complex. J Mol Biol 1993, 229(4):1022–1036.

    CAS  Article  Google Scholar 

  53. 53.

    Zhu Y, Huang Q, Qian M, Jia Y, Tang Y: Crystal structure of the complex formed between bovine beta-trypsin and MCTI-A, a trypsin inhibitor of squash family, at 1.8-A resolution. J Protein Chem 1999, 18(5):505–509.

    CAS  Article  Google Scholar 

  54. 54.

    Helland R, Berglund GI, Otlewski J, Apostoluk W, Andersen OA, Willassen NP, Smalas AO: High-resolution structures of three new trypsin-squash-inhibitor complexes: a detailed comparison with other trypsins and their complexes. Acta Crystallogr D Biol Crystallogr 1999, 55(Pt 1):139–148.

    CAS  Article  Google Scholar 

  55. 55.

    Chen B, Colgrave ML, Daly NL, Rosengren KJ, Gustafson KR, Craik DJ: Isolation and characterization of novel cyclotides from Viola hederaceae: solution structure and anti-HIV activity of vhl-1, a leaf-specific expressed cyclotide. J Biol Chem 2005, 280(23):22395–22405.

    CAS  Article  Google Scholar 

  56. 56.

    Simonsen SM, Daly NL, Craik DJ: Capped acyclic permutants of the circular protein kalata B1. FEBS Lett 2004, 577(3):399–402.

    CAS  Article  Google Scholar 

  57. 57.

    Avrutina O, Schmoldt HU, Gabrijelcic-Geiger D, Le Nguyen D, Sommerhoff CP, Diederichsen U, Kolmar H: Trypsin inhibition by macrocyclic and open-chain variants of the squash inhibitor MCoTI-II. Biol Chem 2005, 386(12):1301–1306.

    CAS  Article  Google Scholar 

  58. 58.

    Jackson PJ, McNulty JC, Yang YK, Thompson DA, Chai B, Gantz I, Barsh GS, Millhauser GL: Design, pharmacology, and NMR structure of a minimized cystine knot with agouti-related protein activity. Biochemistry 2002, 41(24):7565–7572.

    CAS  Article  Google Scholar 

  59. 59.

    Clark RJ, Fischer H, Dempster L, Daly NL, Rosengren KJ, Nevin ST, Meunier FA, Adams DJ, Craik DJ: Engineering stable peptide toxins by means of backbone cyclization: stabilization of the alpha-conotoxin MII. Proc Natl Acad Sci USA 2005, 102(39):13767–13772.

    CAS  Article  Google Scholar 

  60. 60.

    Trabi M, Craik DJ: Circular proteins – no end in sight. Trends Biochem Sci 2002, 27(3):132–138.

    CAS  Article  Google Scholar 

  61. 61.

    Craik DJ, Simonsen S, Daly NL: The cyclotides: novel macrocyclic peptides as scaffolds in drug design. Curr Opin Drug Discov Devel 2002, 5(2):251–260.

    CAS  Google Scholar 

  62. 62.

    Tam JP, Lu Y-A: Synthesis of large cyclic cystine-knot peptide by orthogonal coupling strategy using unprotected peptide precursor. Tetrahedron Lett 1997, 38(32):5599–5602.

    CAS  Article  Google Scholar 

  63. 63.

    Thongyoo P, Tate EW, Leatherbarrow RJ: Total synthesis of the macrocyclic cysteine knot microprotein MCoTI-II. Chem Commun (Camb) 2006, (27):2848–2850.

  64. 64.

    McNulty JC, Jackson PJ, Thompson DA, Chai B, Gantz I, Barsh GS, Dawson PE, Millhauser GL: Structures of the agouti signaling protein. J Mol Biol 2005, 346(4):1059–1070.

    CAS  Article  Google Scholar 

  65. 65.

    Avrutina O, Schmoldt HU, Kolmar H, Diederichsen U: Fmoc-assisted synthesis of a 29-residue cystine-knot trypsin inhibitor containing a guanyl amino acid at the P1-position. Eur J Org Chem 2004, (23):4931–4935.

  66. 66.

    Le-Nguyen D, Nalis D, Castro B: Solid phase synthesis of a trypsin inhibitor isolated from the Cucurbitaceae Ecballium elaterium. Int J Pept Protein Res 1989, 34(6):492–497.

    CAS  Article  Google Scholar 

  67. 67.

    Piotto M, Saudek V, Sklenar V: Gradient-tailored excitation for single-quantum NMR spectroscopy of aqueous solutions. J Biomol NMR 1992, 2(6):661–665.

    CAS  Article  Google Scholar 

  68. 68.

    Bodenhausen G, Ruben DJ: Natural Abundance Nitrogen-15 NMR by Enhanced Heteronuclear Spectroscopy. Chem Phys Lett 1980, 69: 185–189.

    CAS  Article  Google Scholar 

  69. 69.

    Bax A, Ikura M, Kay LE, Torchia DA, Tschudin R: Comparison of different modes of two-dimensional reverse-correlation NMR for the study of proteins. J Magn Reson 1990, 86: 304–318.

    CAS  Google Scholar 

  70. 70.

    Wuthrich K, Billeter M, Braun W: Pseudo-structures for the 20 common amino acids for use in studies of protein conformations by measurements of intramolecular proton-proton distance constraints with nuclear magnetic resonance. J Mol Biol 1983, 169(4):949–961.

    CAS  Article  Google Scholar 

  71. 71.

    Guntert P, Mumenthaler C, Wüthrich K: Torsion angle dynamics for NMR structure calculation with the new program DYANA. J Mol Biol 1997, 273: 283–298.

    CAS  Article  Google Scholar 

  72. 72.

    Onufriev A, Bashford D, Case DA: Exploring protein native states and large-scale conformational changes with a modified generalized born model. Proteins 2004, 55(2):383–394.

    CAS  Article  Google Scholar 

  73. 73.

    van Gunsteren WF, Berendsen HJC: Algorithms for macromolecular dynamics and constrained dynamics. Mol Phys 1977, 34: 1311–1327.

    CAS  Article  Google Scholar 

  74. 74.

    Bundi A, Wüthrich K: 1H-NMR parameters of the common amino acid residues measured in aqueous solutions of the linear tetrapeptides H-Gly-Gly-X-L-Ala-OH. Biopolymers 1979, 18(2):285–297.

    CAS  Article  Google Scholar 

  75. 75.

    Wishart DS, Sykes BD: Chemical shifts as a tool for structure determination. Methods Enzymol 1994, 239: 363–392.

    CAS  Article  Google Scholar 

  76. 76.

    Merutka G, Dyson HJ, Wright PE: 'Random coil' 1H chemical shifts obtained as a function of temperature and trifluoroethanol concentration for the peptide series GGXGG. J Biomol NMR 1995, 5(1):14–24.

    CAS  Article  Google Scholar 

  77. 77.

    Jorgensen WL, Chandreskhar J, Madura JD, Imprey RW, Klein ML: Comparison of simple potential functions for simulating liquid water. J Chem Phys 1983, 79: 926–935.

    CAS  Article  Google Scholar 

  78. 78.

    Darden TA, York D, Pedersen L: Particle Mesh Ewald: An N log(N) method for Ewald sums in large systems. J Chem Phys 1993, 98: 10089.

    CAS  Article  Google Scholar 

  79. 79.

    Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR: Molecular dynamics with coupling to an external bath. J Chem Phys, 3684–3690 1984, 81: 3684–3690.

    CAS  Article  Google Scholar 

Download references

Author information



Corresponding author

Correspondence to Laurent Chiche.

Additional information

Authors' contributions

AH participated in the design of the study, carried out the NMR study and helped to draft the manuscript. OA carried out the chemical synthesis of lin-MCoTI. DLN carried out the chemical synthesis of EETI-II and helped to draft the manuscript. UD supervised the synthesis of lin-MCoTI. JFH participated in the design of the study and drafted the manuscript. JG implemented computer tools, carried out simulations and helped to draft the manuscript. HK conceived the study and participated in its design and coordination. LC conceived the study, participated in its design and coordination, carried out simulations, and drafted the manuscript.

Electronic supplementary material

NMR spectra of lin-MCoTI

Additional file 1: . 600 MHz spectra at 300 K in 90% H2O/10% 2H2O at pH 3.0 (Top) TOCSY spectrum. Amino acid spin systems are labeled. (Bottom) Fingerprint region of the NOESY spectrum showing sequential connectivities between the residues for the 10–21 and 23–34 regions. (PNG 782 KB)

Chemical shifts in ppm for lin-MCoTI

Additional file 2: . 1H chemical shifts (12 and 27°C) and 13C chemical shifts (27°C) in ppm for lin-MCoTI. Values were measured at pH 3.0 in H2O/D2O (1H) or in D2O (13C) relative to TSP-d4 as internal reference. The numbering starts at 1 for the first residue and does not follow the numbering used in the text and shown in Figure 1. (PDF 83 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Heitz, A., Avrutina, O., Le-Nguyen, D. et al. Knottin cyclization: impact on structure and dynamics. BMC Struct Biol 8, 54 (2008).

Download citation


  • Nuclear Magnetic Resonance
  • Hemolytic Activity
  • Nuclear Magnetic Resonance Data
  • Nuclear Magnetic Resonance Experiment
  • Nuclear Magnetic Resonance Structure