The Ramachandran plots of glycine and pre-proline
© Ho and Brasseur; licensee BioMed Central Ltd. 2005
Received: 15 March 2005
Accepted: 16 August 2005
Published: 16 August 2005
Skip to main content
© Ho and Brasseur; licensee BioMed Central Ltd. 2005
Received: 15 March 2005
Accepted: 16 August 2005
Published: 16 August 2005
The Ramachandran plot is a fundamental tool in the analysis of protein structures. Of the 4 basic types of Ramachandran plots, the interactions that determine the generic and proline Ramachandran plots are well understood. The interactions of the glycine and pre-proline Ramachandran plots are not.
In glycine, the ψ angle is typically clustered at ψ = 180° and ψ = 0°. We show that these clusters correspond to conformations where either the Ni+1 or O atom is sandwiched between the two Hα atoms of glycine. We show that the shape of the 5 distinct regions of density (the α, αL, βS, βP and βPR regions) can be reproduced with electrostatic dipole-dipole interactions. In pre-proline, we analyse the origin of the ζ region of the Ramachandran plot, a region unique to pre-proline. We show that it is stabilized by a COi-1···CδHδi+1 weak hydrogen bond. This is analogous to the COi-1···NHi+1 hydrogen bond that stabilizes the γ region in the generic Ramachandran plot.
We have identified the specific interactions that affect the backbone of glycine and pre-proline. Knowledge of these interactions will improve current force-fields, and help understand structural motifs containing these residues.
The Ramachandran plot  is the 2d plot of the φ-ψ torsion angles of the protein backbone. It provides a simple view of the conformation of a protein. The φ-ψ angles cluster into distinct regions in the Ramachandran plot where each region corresponds to a particular secondary structure. There are four basic types of Ramachandran plots, depending on the stereo-chemistry of the amino acid: generic (which refers to the 18 non-glycine non-proline amino acids), glycine, proline, and pre-proline (which refers to residues preceding a proline ). The generic and proline Ramachandran plots are now well understood  but the glycine and pre-proline Ramachandran plots are not.
The generic Ramachandran plot was first explained by Ramachandran and co-workers in terms of steric clashes . This has become the standard explanation for the observed regions in the Ramachandran plot [4, 5]. However, recent studies found significant discrepancies between the classic steric map and the Ramachandran plot of high-resolution protein structures [6–9]. These discrepancies have now been resolved. The first discrepancy is that the N···Hi+1 and Oi-1···C steric clashes in the classic steric map have no effect in the observed Ramachandran plot . By removing these steric clashes, a better steric map can be constructed. The second discrepancy is that the Ramachandran plot cluster into distinct regions within the sterically-allowed regions of the Ramachandran plot [8, 10]. These clusters have now been explained in terms of backbone dipole-dipole interactions [3, 11, 12].
The proline Ramachandran plot has been reproduced in a calculation . The proline Ramachandran plot is severely restricted by the pyrrolidine ring, where the flexibility in the pyrrolidine ring couples to the backbone .
Although the overall shape of the pre-proline Ramachandran plot (Figure 1B) is well understood, there exists a region unique to pre-proline that remains unexplained. The basic shape of pre-proline was predicted by Flory using steric interactions . This was later confirmed in a statistical analysis of the protein database . However, the statistical analysis also revealed the existence of a little leg of density poking out below the β-region (Figure 1B; purple in Figure 2C), which Karplus called the ζ region . More recent calculations using standard molecular mechanics force-fields reproduced the energy surface of the original Flory calculation [13, 18] but not the ζ region. In this study, we focus on the physical origin of the ζ region.
To extract the statistical distributions of the glycine and pre-proline Ramachandran plots, we chose a high-resolution subset of the PDB  provided by the Richardson lab  of 500 non-homologous proteins. These proteins have a resolution of better than 1.8 Å where all hydrogen atoms have been projected from the backbone and optimized in terms of packing. Following the Richardsons, we only consider atoms that have a B-factor of less than 30.
Glycine is fundamentally different to the other amino acids in that it lacks a sidechain. In particular, glycine does not have the Cβ atom, which induces many steric clashes in the generic Ramachandran plot. We call the hydrogen atom that is shared with the other amino acids, the Hα1 atom. We call the hydrogen atom that replaces the Cβ atom, the Hα2 atom. The absence of the Cβ atom allows the glycine Ramachandran plot to run over the borders at -180° and 180° (Figure 1A).
The original steric map of glycine (Figure 2A)  fails to explain large parts of the observed glycine Ramachandran plot (Figure 1A). In the observed glycine Ramachandran (Figure 3A), there are two large excluded horizontal strips at 50° < ψ' < 120° and -120° < ψ' < -50°, which are not excluded in the glycine steric map (Figure 2A). Conversely, the glycine steric map excludes a horizontal strip at -30° < ψ' < 30° (Figure 2A), but this region is populated in the observed plot (Figure 1A). There are also diagonal steric boundaries in the observed glycine Ramachandran plot (Figure 1A), whereas the steric map predicts vertical boundaries (Figure 2A).
We carried out a re-evaluation of the steric map of glycine (Figure 2B) by following the methodology of Ho and co-workers . For each interaction in the glycine backbone, we consider the variation of the inter-atomic distance with respect to the φ'-ψ' angles. We compare the observed variation to the variation generated from a model that uses canonical backbone geometry. We divide these interactions into 3 categories: the φ' dependent, ψ' dependent and φ'-ψ' co-dependent distances.
For some of the interactions, the results for glycine are identical to that of the generic Ramachandran plot . For brevity, we omit the analysis of these interactions and summarize the results. The excluded horizontal strip -30° < ψ' < 30°, due to the N···Hi+1 steric interaction in the glycine steric map (Figure 2A), does not exist in the observed distribution (Figure 1A). Similarly, the Oi-1···C steric clash in the original glycine steric map, which excludes a vertical strip centered on φ' = 0° (Figure 2A), does not exist in the observed distribution (Figure 1A). We ignore the effect of the N···Hi+1 and Oi-1···C steric clashes. The diagonal boundaries of the observed distribution are defined by the φ'-ψ' co-dependent steric interactions Oi-1···O and Oi-1···Ni+1. In Figure 3A, we show the fit of these steric interactions to the data.
Here, we analyze the most distinctive feature of the glycine Ramachandran plot – the tendency for ψ' to cluster near 180° and 0°. We focus on the ψ'-dependent interactions. For each interaction, we first calculate the model curve of the corresponding inter-atomic distance as a function of ψ' (see Methods). We then compare the observed ψ' distribution (bottom of Figure 3B) to the curve. If a hard-sphere repulsion restricts ψ', then, in regions of ψ' where the model curve is below the van der Waals (VDW) diameter (horizontal dashed line in Figure 3B), the ψ' frequency distribution should drop correspondingly.
In the region (60° < ψ' < 100°), we find that the drop-off in the ψ frequency distribution (bottom of Figure 3B) corresponds to values of Hα1···Ni+1 (bottom of Figure 3B) and Hα2···O (top of Figure 3B) that are smaller than their VDW diameters. In the region (-90° < ψ' < -60; 210° < ψ' < 270°), the drop-off in the ψ frequency distribution corresponds to regions where Hα2···Ni+1 and Hα1···O are found below their VDW radii. In contrast, the values of Hα1···Hi+1 and Hα2···Hi+1 are never found significantly below their VDW diameter (middle of Figure 3B).
The revised glycine steric map does not explain the diagonal shape of the α, αL, βP, βPR and βS regions. In the generic Ramachandran plot, it was found that the diagonal shape of regions could be reproduced using electrostatic dipole-dipole interactions  but only when the dipole-dipole interactions were considered individually. The overall electrostatic interaction does not reproduce the observed Ramachandran plot . Here, we use the same approach of treating individual electrostatic dipole-dipole interactions along the glycine backbone.
We calculate the energy map of φ-ψ for the 4 dipole-dipole interactions in the glycine backbone interaction: COi-1···CO, NH···NHi+1, CO···NH and COi-1···NHi+1 (Figure 5C-F). The electrostatic interactions are calculated with the Lennard-Jones potentials of the steric clashes identified in the section above. We find that the shapes of the different regions of the glycine Ramachandran plot (Figure 3A) are reproduced (Figure 5). The CO···NH interaction produces the diagonal αL, α and βS region (Figure 5E). The NH···NHi+1 interaction also produces a diagonal αL and α region (Figure 5D). The α region is symmetric to the αL region. The COi-1···CO interaction produces minima corresponding to the βP and βPR regions (Figure 5C).
In the original glycine steric map (Figure 2A), the region near (φ, ψ) = (-180°, 180°) is forbidden due to a steric clash between O and H. Yet glycine has density in this region in the observed Ramachandran plot (Figure 3A). This can also be seen in the frequency distribution of d(O···H) (Figure 3C), where there is a peak at d(O···H) ~ 2.4 Å. At this peak, the O and H atoms are in contact, as the VDW diameter is 2.5 Å. Thus, in the βS region of glycine, the favorable CO···HN dipole-dipole interaction overcomes the steric repulsion of the O and H atoms (Figure 5E).
Schimmel and Flory argued in 1968 that pre-proline – amino acids preceding proline – has a particularly restricted Ramchandran plot, compared to the generic Ramachandran plot . This was finally observed in the protein database by MacArthur and Thornton (Figure 1B) .
There are three main differences between the pre-proline Ramachandran plot and the generic Ramachandran plot. In the pre-proline Ramachandran plot, there is a large excluded horizontal strip at -40° < ψ < 50°, which restricts αL and α regions. The αL region is shifted up higher. These two features were reproduced in the Schimmel-Flory calculation  and subsequent calculations [13, 18]. The third feature is a little leg of density poking out below the β-region (Figure 1B; purple in Figure 2C). Karplus called this the ζ region , which is unique to pre-proline.
Previous calculations [2, 17, 18] did not focus on the individual interactions, and did not account for the ζ region. Here, we identify the exact steric clashes that determine the pre-proline Ramachandran plot. We will then analyse the interactions responsible for the ζ region.
In pre-proline, instead of an interaction with the NH atom in the succeeding generic amino acid, the pre-proline interacts with a CH2 group of the succeeding proline (Figure 1B). The CH2 group exerts a much larger steric effect on the pre-proline Ramachandran plot. MacArthur and Thornton  suggested that the dominant effect is due to the N···Cδi+1 and Cβ···Cδi+1 steric clashes. Here we can analyse the efficacy of each clash by analysing the statistical distributions directly.
We consider the φ-ψ co-dependent interactions that involve the Cδ, Hδ1 and Hδ2 atoms of the succeeding proline (Figure 1B). For each interaction, we generate the contour plot in φ-ψ of the VDW diameter distance. By comparing the contour plot to the observed density in the pre-proline Ramachandran plot, we identify the interactions that induce the best match in the boundaries (Figure 6A, the interactions are identified in Figure 2C). We found that the chunk taken out of the bottom-left β-region of the observed density is due to the Oi-1···Cδi+1 steric clash. Another restriction on the αL and α regions is due to the H···Cδi+1 steric clash.
Parameters of the CO···HX hydrogen bond
γ region of the generic amino acid
ζ region of pre-proline: UP PUCKER
ζ region of pre-proline: DOWN PUCKER
Can the Cδ Hδi+1 group interact with COi-1? Such an interaction would fall under the class of the CH···O weak hydrogen bond, a well-documented interaction in proteins . Studies of the CH···O weak hydrogen bond use a distance criteria of d(H···O) < 2.8 Å [25–27]. There is little angular dependence found in the CH···O bond around the H atom where an angle criteria of ∠OHX > 90° is often used. This is much more permissive than the geometry of the canonical hydrogen bond. In Table 1, we list the hydrogen bond parameters of the COi-1···CδHδi+1 interaction in the ζ region. As proline can take on two different major conformations, the UP and DOWN pucker, measurements of the geometry of the COi-1···CδHδi+1 interaction must also be divided in terms of the UP and DOWN pucker. The observed geometry of the COi-1···CδHδi+1 geometry satisfies the geometric criteria of the weak hydrogen bond (Table 1).
As the COi-1···CδHδi+1 weak hydrogen bond is a close contact, we need to model the interaction in order to understand its dependence on the φ-ψ angles. For the modelling, we consider strategies that have been used for the analogous COi-1···HNi+1 hydrogen bond. The COi-1···HNi+1 hydrogen bond has been modelled in quantum-mechanical studies where the γ region was found to be the minimum energy conformation in vacuum . A simpler approach, which modelled the hydrogen bond with electrostatic dipole-dipole interactions, also find a minimum in the γ region .
Here, we model the COi-1···CδHδi+1 weak hydrogen bond as an electrostatic dipole-dipole interaction (see Methods). How do we model the CδHδi+1 group as an electrostatic dipole? Bhattacharyya and Chakrabarti  found that, of the CH groups in proline, the CδHδ group forms the most CH···O hydrogen bonds. The Cδ atom sits next to the electron-withdrawing N atom and thus, is more acidic than the other C atoms. Consequently, we place a small negative partial charge on the Cδ atom. In our model, we find an energy minimum in the ζ region for both the UP pucker (Figure 7B) and the DOWN pucker (Figure 7C). We conclude that the COi-1···Cδi+1Hδ1i+1 weak hydrogen bond stabilizes the ζ region in pre-proline.
We have identified the interactions that determine the high-resolution Ramachandran plots of glycine and pre-proline.
For glycine, the Ramachandran plot of the glycine backbone modeled by standard force-fields fails to reproduce the observed Ramachandran plot . Instead the modeled Ramachandran plot resembles the original steric map of glycine . The failure of these calculations arises from the inadequate treatment of the Hα atoms. We have identified a revised set of steric interactions that can reproduce the observed glycine Ramachandran plot. These are Oi-1···O, Oi-1···Ni+1, Hα1···O, Hα2···O, Hα1···Ni+1 and Hα2···Ni+1 (Figure 2B). These steric interactions constrain either the Ni+1 or O atom to be sandwiched between the two Hα atoms, which clusters glycine to ψ = 180° and ψ = 0°. The five clustered regions can be traced to electrostatic dipole-dipole interactions: the CO···NH interaction induces diagonal αL, α and βS regions; and the COi-1···CO interaction induces the diagonal βP and βPR regions.
Previous calculations of the pre-proline Ramachandran reproduced most of the observed pre-proline Ramachandran plot with the notable exception of the ζ region. Previous studies did not identify the specific steric interactions involved in defining the pre-proline Ramachandran plot. Here, we have identified them: N···Cδi+1, Oi-1···Cδi+1 and H···Cδi+1 (Figure 2C). We have also identified the physical mechanism that stabilizes the ζ region (purple in Figure 2C). It is the COi-1···CδHδi+1 weak hydrogen bond, which is directly analogous to the COi-1···NHi+1 hydrogen bond that stabilizes γ-turns in the generic amino acid.
Combined with the analysis of the generic Ramachandran plot  and the proline Ramachandran plot [13, 14], we have identified the interactions that define the high-resolution Ramachandran plots of all 20 amino acids. Although our analysis uses simple modeling techniques, the interactions identified here suggest concrete ways to resolve the inadequacies in current force-fields.
In the steric clash analysis, we used the VDW radii given by the Richardson lab : Hα = 1.17Å, H = 1.00Å, C = 1.65Å, Cα = Cβ = 1.75Å, O = 1.40Å and N = 1.55Å. From the database, we extracted 7277 glycine and 4336 pre-proline residues.
To calculate the model curves of the inter-atomic distances as a function of the φ-ψ angles, we modeled the glycine and pre-proline protein fragments shown in Figure 1. Covalent bond lengths and angles were fixed to CHARMM22 values . Only the φ-ψ angles vary. The φ-ψ angles of the central residue were incremented in 5° steps and the corresponding distance parameters and energies of the inter-atomic interactions were calculated. We used 2 types of interactions, partial charge electrostatics, Eelec = 331·(q1·q2) kcal·mol-1, and Lennard-Jones 12-6 potentials, ELJ = ε (σ/d)12 – 2 (σ/d)6) kcal·mol-1, where the parameters were taken from CHARMM22 . There are no parameters in CHARMM22 for the Hδ and Cδ atoms. As such, we have assigned a partial charge of -0.20 to Cδ and 0.10 to Hδ1 and Hδ2. These are not based on any detailed arguments but are merely used to estimate the effect that such charges would have.
BKH was supported by a post-doctoral grant from the Fonds National de la Recherche Scientifique (FNRS), Belgium. RB is director of research of FNRS.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.