- Research article
- Open Access
Helical ambivalency induced by point mutations
BMC Structural Biology volume 13, Article number: 9 (2013)
Mutation of amino acid sequences in a protein may have diverse effects on its structure and function. Point mutations of even a single amino acid residue in the helices of the non-redundant database may lead to sequentially identical peptides which adopt different secondary structures in different proteins. However, various physico-chemical factors which govern the formation of these ambivalent helices generated by point mutations of a sequence are not clearly known.
Sequences generated by point mutations of helices are mapped on to their non-helical counterparts in the SCOP database. The results show that short helices are prone to transform into non-helical conformations upon point mutations. Mutation of amino acid residues by helix breakers preferentially yield non-helical conformations, while mutation with residues of intermediate helix propensity display least preferences for non-helical conformations. Differences in the solvent accessibility of the mutating/mutated residues are found to be a major criteria for these sequences to conform to non-helical conformations. Even with minimal differences in the amino acid distributions of the sequences flanking the helical and non-helical conformations, helix-flanking sequences are found be more solvent accessible.
All types of mutations from helical to non-helical conformations are investigated. The primary factors attributing such changes in conformation can be: i) type/propensity of the mutating and mutant residues ii) solvent accessibility of the residue at the mutation site iii) context/environment dependence of the flanking sequences. The results from the present study may be used to design de novo proteins via point mutations.
Under physiological conditions, folding of proteins to well-defined three dimensional structures is crucial for executing their specific biological functions. To the extent that the folding pattern of proteins dictates function, modifying the structure may entail in either altering or disrupting its function. Point mutations in a protein sequence, which are introduced by single amino acid residue replacements by site-directed mutagenesis, may yield different phenotypes by changing the native structure. The effect of point mutations are important in exploring various functional and structural features viz. protein sequenceŰstructure relationships , engineering protein stability [2, 3], predicting the evolutionary dynamics of proteins [4–7], validating and refining various protein models and simulations [8–10], and designing de novo proteins . Mutations may also influence the local structure of a protein by changing the secondary structures of peptide sequences from one conformation to another [12, 13]. It is established that the inherited prion diseases of humans are all linked to specific point and insertion mutations, which are responsible for the α→β transition leading to aggregation due to the formation of amyloid fibrils . Similar conclusions are drawn from the theoretical studies of prion protein .
Experimental studies of mutating different positions in a protein structure with various amino acid residues are time-consuming and expensive process. Such an investigation is facilitated by the three-dimensional modeling of polypeptide chain mutations. Experimental determinations of secondary structure propensity and stability [16–19] are usually studied through point mutations. The results obtained from the mutation of helices are used for rationalizing helix-coil transitions and compared to that of Lifson-Roig  and Zimm-Bragg  models. However, the knowledge of the type and position of mutations required for the transformation of helical → non-helical conformation or vice versa is largely unclear. The computational demands of such problems limit the number of mutations and size of sequences that may be considered. Alternatively, crystallographic structures from the Protein Data Bank  may be used as templates to explore point mutations leading to helical ambivalency.
The present work aims at investigating the physico-chemical properties of ambivalent helices generated by point mutations. Ambivalent sequences conform to different secondary structures in two different proteins. Kabsch and Sander  first reported the occurrence and physico-chemical properties of these sequences which were studied in great detail in the subsequent works [24–30]. Importance of these sequences lie in their implication in the pathogenesis of amyloid diseases including Alzheimer’s disease, transmissible bovine spongiform encephalopathy etc [31, 32]. These sequences also challenge the existing protein structure prediction methods based on sequence homology . Recently we have shown that variable (ambivalent) helices, sequences which are in helical conformations in one protein and non-helical in other, have different physico-chemical, context and dynamical properties in comparison to conserved helices i.e. sequences which remain in helical conformation throughout [34–36]. Although Argos  reported ambivalent sequences which differ by a single amino acid, no subsequent studies on the occurrence and properties of these sequences were performed. However, these type of sequences are important as point mutations may cause secondary structure transformation, which may have significant implications in various misfolding diseases. Recent studies demonstrate that a conformational switch between two proteins with completely different structures and functions occurs via a single amino acid mutation .
This switching between the folds can occur in multiple ways via successive single amino acid mutation at different positions . Both theoretical  and computational  studies have highlighted the bifunctionality of proteins via single point mutation.
This work presents a detailed study of helical ambivalency induced by point mutations. Helices from the non-redundant database are point mutated at all residue positions and the resulting sequences are mapped onto the SCOP database to obtain completely non-helical conformations. Previous analysis of physico-chemical properties of ambivalent sequences were restricted to the local structure neglecting the global influence of the other parts of the protein [25, 27–30, 34]. Along similar lines, the present study also analyzes the physico-chemical properties of ambivalent sequences induced by point mutations disregarding the effect on the remaining parts of the protein. The results show that smaller helices readily transform into non-helical conformations after point mutations. Sequences obtained by point mutations with helix-breaking residues usually correspond to non-helical conformations in the SCOP database. However, point mutations of helix indifferent residues yield mutated sequences which has a low probability of conforming to non-helical conformations. Differences in the solvent accessibility of the mutating/mutated residues in helical and non-helical conformations is found to be a major factor for the mutated sequence conforming to non-helical structures. Most of the mutated residues display different degree of solvent accessibility in helical and non-helical conformations. The non-helical conformations are found to be less solvent-accessible as compared to the original helices. Despite marginal differences in the amino acid distributions of the sequences flanking helical and non-helical conformations, the solvent accessibility of the sequences flanking helices are greater than that of the corresponding non-helical conformations.
Database and mutation
The database used in the present study comprises the crystal structures from May-2008 release of PDB-select , which were compiled to create a database of non-redundant proteins from PDB  (Protein Data Bank). The database comprises protein chains with a sequence identity of 25% or less. All protein chains considered in this study have resolution ≤ 3Å and crystallographic R-factor, R≤0.3. The selected database consists of 2586 non-redundant protein chains from 2466 protein structures. Secondary structures are annotated residue-wise with the help of DSSP software . According to the widely used definition, H and G denote helical conformations while all other classes (B, E, I, S, T, -) are considered to be non-helical [43–45]. Neglecting helices with less than 5 residues long, there are 11592 helices in the non-redundant database. These helices were point mutated at each position by all 20 amino acids (excluding the amino acid present at the given position in wild type helix). So for a helix of length 5 amino acids this method will generate (5X 19) 95 mutated sequences. The total number of mutated helices generated from 11592 helices is 2662375 which constitute the database of mutated sequences.
Proteins from nine SCOP (Structural Classification of Proteins)  classes viz. (I)All alpha proteins, (II)All beta proteins, (III)Alpha and beta proteins(a+b), (IV)Alpha and beta proteins(a/b), (V)Coiled coiled proteins, (VI)Membrane and cell surface proteins and peptides, (VII)Multi-domain proteins(alpha and beta), (VIII)Peptides and (IX)Small proteins were compiled to obtain sequences identical to the mutated sequences. A structural cut-off resolution ≤ 3Å and R≤0.3 were applied on these proteins with the PISCES server . The resultant SCOP database consists of 48244 protein chains from 22309 protein structures.
Both the non-redundant protein and SCOP database used in the present study are similar to our recent work on ambivalent helices  which helps in comparison between both the works.
Identical sequence search
The method for identifying the sequences identical to the mutated sequences in SCOP database in non-helical conformations is similar to previous method of searching variable and conserved helices . However, a slight modification of the method was performed due to the large number of mutation generated sequences as compared to the original wild type helices. The method is outlined as follows: for a mutated sequence of N residues to be mapped onto a protein in the SCOP database of M residues, an NXM matrix is created where an element of the matrix, A(i, j)[i=1→N, j=1→M], is equal to 1 if ith position of the helix and jth position of the protein chain have identical residues and the conformation of the residue in jth position of the protein in SCOP database is non-helical. Now if an element A(k, l)[k ε i, l ε j] = 1 and , where m is a running index, then the helix from non-redundant database is mapped from l to l+N−1 position of the SCOP database protein with non-helical conformations. Sequences identical to the mutated sequences with non-helical conformations are only identified in the present study. This method yields sequences identical to the mutated sequences with completely non-helical conformations only. Among 2662375 mutated sequences, 28957 were mapped in 253728 sequences in the SCOP database with completely non-helical conformations. The list of 28957 mutation generated helical sequences are provided in Additional file 1: Table S1 of supplementary material. These 28957 mutated sequences are generated from 1775 non-redundant helices through point mutations. These 1775 helices are denoted as wild type helices, 28957 mutated sequences as ambivalent mutated sequences and 253728 mapped sequences in the SCOP database as non-helical sequences respectively.
Results and discussion
Number distribution of mutation generated ambivalent helices
The number of ambivalent mutated sequences (28957) which conform to non-helical structures in the SCOP database is plotted as a function of sequence length in Figure 1. From Figure 1 it is observed that the shorter mutated helical sequences (≤6) are more probable to conform to non-helical conformations in the SCOP database. However, a small number of sequences (∼6%) of length ≥7 are found to conform to non-helical conformations in the SCOP database. This further dips to ∼2% when mapped 253728 non-helical sequences in SCOP are considered. For long helices, hydrogen bonds render stability to the overall structure which is not disrupted by the point mutations. This explains why few long helices generated by point mutations switch to non-helical conformations.
The inset of Figure 1 highlights the region of sequence length ≥7. An interesting phenomenon may be observed here. The number of mutated sequences with length in multiples of 3 (i.e. 9, 12,...) either shows a peak or a trough, which is similar to the trend observed in helices . A larger number of these sequences imply a greater probability of switching into non-helical conformations subject to point mutations.
Propensity of conformation change with different mutations
Table 1 depicts the number of times an ambivalent mutated sequence is mapped into non-helical conformations in the SCOP database when a particular amino acid (first column) is mutated by 19 other amino acids (first row). A glance at the table shows that the number of non-helical conformations increases either by mutating an amino acid with helix breaking residues (e.g. columns starting with G, P) or substituting for a helix forming amino acid (e.g. row starting with L).
It is computationally infeasible to generate sequences identical to the mutated helical sequences. However, it is possible to obtain the number of mutated sequences which retain their helical conformations in the SCOP database. To estimate the number of different type of mutations, which retains the helical conformations, the probability of occurence of 20 amino acids in conserved helices is used from our earlier study . The number of X → Y mutations leading to the retention of helical sequences is obtained by multiplying the total number of mutations to generate mutated sequences in the database with the fraction of occurence of amino acids, Y, in the conserved helices. Table 2 depicts the estimated number of each type of amino acid mutations which preserve helical conformations. The results show that a large number of sequences mutated by helix forming residues like Ala, Val etc. retain their helical conformation when mapped into SCOP database.
For simplification we divide the 20 amino acids into three groups based on their helix forming propensity: F for helix forming amino acids which include A, E, F, H, L, M, Q, V and W, I for helix indifferent amino acids viz. C, D, I, K, R, S and T and B for helix breaking amino acids like G, N, P and Y [49, 50]. This division is primarily based upon the statistical analysis values rather than experimental scales like thermodynamic stability . The number of different types of mutation is reduced from 380(20X 19) to 3X 3=9 viz. F →F, F →I, F →B,...etc. The propensity of a particular mutation X →Y, which transforms the sequence from helical to non-helical conformation, is defined as
where p is the probability of occurrence of X →Y mutation leading to 28957 ambivalent mutated sequences, while q denotes the probability of the same mutation in the database of mutated sequences. A propensity value >1 denotes a preference for the specific mutation to conform to the non-helical structures while <1 implies that the mutation may not preferably yield the same.
Figure 2 shows the propensity of a particular mutation leading to non-helical conformations with 9 different types of mutations. Propensity of mutations with helix-breaking residues have a higher probability to form non-helical structures in the SCOP database.
However, propensity of mutation of helix indifferent residues shows that it is not favored for conforming into non-helical structures.
Earlier results depict that amino acids have different preferences for the two termini of helices [48, 52–57]. However, the present work did not find any perceptible difference in the propensities of different types of mutations at N- and C-terminus of the helical sequences.
Interestingly propensity for B →F, B →I and B →B are found to be greater than 1 in Figure 2. While higher propensities for B →B are expected, a similar trend for B →I and B →F may be explained by a marked increase in the solvent accessibility, which outweighs the decrease in propensity yielding predominantly non-helical conformations. Previous studies have shown that amino acid propensities are position specific with respect to both termini displaying an oscillatory behavior .
Hence, mutation of helix breaking residues with helix-forming and helix-indifferent residues are position-specific, which may account for the behavior observed in Figure 2.
Mutation generated sequences leading to non-helical conformations show change in solvent accessibility of the mutated site
The solvent accessibility of each residue is calculated using DSSP . The normalized solvent accessibility is calculated as the ratio of this absolute value to the maximum solvent accessibility of amino acid residues found in Gly-X-Gly . In this study, the residues are classified as buried or exposed according to their different degree of solvent accessibility. Buried residues are ≤7% solvent accessible, while the exposed surface residues have ≥37% relative solvent accessibility. Residues in the intermediate zone have relative solvent accessibility in the range between 7% and 37% .
Propensity of a X →Y mutation for different solvent accessible (buried/ intermediate/ exposed) residues of type X in the wild type helices is defined by equation 1. For a given solvent accessible residue of type X, p is the probability of occurrence of X →Y mutation in the ambivalent mutated sequences, while q is the probability of the same mutation for a residue of the same solvent accessibility in database of mutated sequences. Figure 3 depicts the propensity of a given type of mutation for residues of different solvent accessibility in the helices of the non-redundant database, corresponding to non-helical conformations in the SCOP database. The results show that mutations of helix-forming and helix-indifferent residues at solvent accessible sites mostly yields non-helical conformations followed by mutation of residues having intermediate solvent accessibility, while an opposite trend is observed for mutations of helix-breaking residues.
Mapping the entire dataset of all mutation generated sequences (2662375) in the SCOP database demands enormous computational time and resources and hence is out of the scope of the present work. Hence the propensity of mutation X →Y for residues of different solvent accessibility (buried/ intermediate/ exposed) of Y in 253728 non-helical sequences of the SCOP database may be approximately calculated as the ratio of probabilities given by
where p is the probability of X →Y mutation in the non-helical sequences (253728) with a particular type of solvent accessibility of Y and q 1 and q 2 are the probabilities of residue X and Y of same/different solvent accessibility in the non-redundant database and the SCOP database respectively. The denominator is normalized to unity for the different types of mutations. Figure 4 show the propensity of various types of mutations leading to different solvent accessibility of the mutated position. Excluding mutation of the helix forming residues (F) a completely opposite trend is observed in Figure 4 compared to that of Figure 3. This shows that mutating the helix-breaking and helix-indifferent residues the solvent accessibility changes, which makes the sequence non-helical.
The above results motivated us to investigate the overall solvent accessibility of the wild type helices, yielding the ambivalent mutated sequences which corresponds to the non-helical sequences in the SCOP database. Figure 5 shows the fraction of wild type helices and their corresponding non-helical sequences in the SCOP database at different values of the average normalized solvent accessibility. The results show that mutated sequences in the non-helical conformations have lower solvent accessibility i.e. they are buried in the interior of the protein compared to the parent helices. The data displayed in Figure 5 are validated through the t-test and the difference is found to be highly significant with P-value P<0.001 and T-value T=25.3569. For more than 65% cases, the non-helical conformations are found to be less solvent accessible as compared to the original helical conformations.
Flanking sequences have vast differences in solvent accessibility though amino acid distributions are similar in them
Previous studies have established that context plays a major role in determining the structure of an ambivalent sequences [28, 30, 34]. It was suggested earlier that the flanking amino acids of ambivalent sequences play an important role in determining the conformations of these sequences. This finding is also validated experimentally . An analysis of the amino acid occurrence frequency in the flanking sequences of the wild type helices and those of the corresponding non-helical conformations are also presented in this work. The flanking sequences are differentiated as N-terminus flanking sequences (four residues preceding the sequences) and C-terminus flanking sequences (four residues succeeding the sequences). Figure 6 shows the distribution of 20 amino acids in these flanking sequences. The distribution is depicted as the conformational parameter which is defined by,
where P i j is the probability of ith residue in jth flanking sequence while P i is the probability of ith residue in the non-redundant database.
Conformational parameter >1 depicts that the amino acid is preferred in that sequence while a value <1 shows it is not preferred.
Minimal differences are noticed for the flanking sequences of the wild type helices and their corresponding non-helical counterparts. This minimal differences in conformational parameters are in sharp contrast to those observed in chameleon/ambivalent sequences earlier [25, 28, 30, 34, 64]. In previous studies of chameleon/ambivalent sequences, significant differences in the amino acid conformational parameters were observed in sequences flanking helical and non-helical conformations. In the present study significant differences are found only for the amino acids C, D, H and S at the N-terminus and amino acids K, L, Q, T and V for the C-terminus flanking sequences. Since Cysteines tend to form disulfide bonds, their high occurrence in the flanking sequences of the wild type helices imparts extra stability to these structures.
The solvent accessibility of the flanking sequences are also studied in the present work. Figure 7 shows the fraction of flanking sequences at different average normalized solvent accessibility. Despite close similarity in their amino acid preferences, these flanking sequences have vast differences in solvent accessibility. The sequences flanking the non-helical conformations at both N- and C-terminus are less solvent accessible compared to those flanking the wild type helices. The significance of the result is validated by the t-test, which gives a P-value of P<0.001 and T-value T=20.3459. Similar differences in the solvent accessibility of the flanking sequences were observed in earlier studies for chameleon/ambivalent sequences [25, 28, 30, 34].
Helices from the non-redundant database are point mutated to yield mutated sequences which are mapped in the SCOP database to obtain identical sequences with non-helical conformations. The results of this study confirm that small helices are more prone to non-helical conformations on point mutations. Mutations with helix breaking residues mostly transform the mutated sequence to non-helical conformations, while mutations of helix indifferent residues hardly yield non-helical conformations.
Solvent accessibility of mutating/mutated residues can be a primary factor for the helices to conform to non-helical conformations following point mutations. Most mutated residues are found to differ in the degree of solvent accessibility in helical and non-helical conformations. The non-helical conformations obtained after point mutations are found to be less solvent accessible as compared to the original helical conformations. Minimal differences appear in the amino acid distributions for the flanking sequences of helices in non-redundant database and their corresponding non-helical conformations in the SCOP database. However, the solvent accessibility of the flanking sequences of helices are vastly different from those of the respective non-helical conformations which are buried in the protein interior.
Voigt CA, Kauffman S, Wang ZG: Rational evolutionary design: the theory of in vitro protein evolution. Advan Protein Chem 2000, 55: 79–160.
Lehmann M, Pasamontes L, Lassen SF, Wyss M: The consensus concept for thermostability engineering of proteins. Biochim Biophys Acta 2000, 1543: 408–415. 10.1016/S0167-4838(00)00238-7
van den Burg B: Selection of mutations for increased protein stability. Curr Opin Biotechnol 2002, 13: 333–337. 10.1016/S0958-1669(02)00325-7
DePristo MA, Weinreich DM, Hartl DL: Missense meanderings in sequence space: a biophysical view of protein evolution. Nat Rev Genet 2005, 6: 678–687. 10.1038/nrg1672
Pal C, Papp B, Lercher MJ: An integrated view of protein evolution. Nat Rev Genet 2006, 7: 337–348. 10.1038/nrg1838
Bloom JD, Silberg JJ, Wilke CO, Drummond DA, Adami C, Arnold FH: Thermodynamic prediction of protein neutrality. Proc Natl Acad Sci USA 2005, 102: 606–611. 10.1073/pnas.0406744102
Bershtein S, Segal M, Bekerman R, Tokuriki N, Tawfik DS: Robustness-epistasis link shapes the fitness landscape of a randomly drifting protein. Nature 2006, 444: 929–932. 10.1038/nature05385
England JL, Shakhnovich BE, Shakhnovich EI: Natural selection of more designable folds: a mechanism for thermophilic adaptation. Proc Natl Acad Sci USA 2003, 100: 8727–8731. 10.1073/pnas.1530713100
Govindarajan S, Goldstein RA: Evolution of model proteins on a foldability landscape. Proteins 1997, 29: 461–466. 10.1002/(SICI)1097-0134(199712)29:4<461::AID-PROT6>3.0.CO;2-B
Bornberg-Bauer E, Chan HS: Modeling evolutionary landscapes: mutational stability, topology, and superfunnels in sequence space. Proc Natl Acad Sci USA 1999, 96: 10689–10694. 10.1073/pnas.96.19.10689
Butterfoss GL, Kuhlman B: Computerbased design of novel protein structures. Annu Rev Biophys Biomol Struct 2006, 35: 49–65. 10.1146/annurev.biophys.35.040405.102046
Yang W, Ko T, Corselli L, Johnson RC, Hanna SY: Conversion of a β-strand to an α-helix induced by a single-site mutation observed in the crystal structure of Fis mutant Pro26Ala. Protein Sci 1998, 7: 1875–1883. 10.1002/pro.5560070904
Tidow H, Lauber T, Vitzithum K, Sommerhoff CP, Rosch P, Marx UC: The solution structure of a chimeric LEKTI domain reveals a chameleon sequence. Biochemistry 2004, 43: 11238–11247. 10.1021/bi0492399
Prusiner SB: Molecular biology and genetics of prion disease. Philos Trans R Soc London Ser B 1994, 343: 447–463. 10.1098/rstb.1994.0043
Harrison PM, Chan HS, Prusiner SB, Cohen FE: Thermodynamics of model prions and its implications for the problem of prion protein folding. J Mol Biol 1999, 286: 593–606. 10.1006/jmbi.1998.2497
Bell JA, Becktel WJ, Sauer C, Baase WA, Matthews BW: Dissection of helix capping in T4 lysozyme by structural and thermodynamic analysis of 6 amino acid substitutions at Thr 59. Biochemistry 1992, 31: 3590–3596. 10.1021/bi00129a006
Pinker RJ, Lin L, Rose GD, Kallenbach NR: Effects of alanine substitutions in α-helices of sperm whale myoglobin on protein stability. Protein Sci 1993, 2: 1099–1105. 10.1002/pro.5560020704
Chakrabartty A, Baldwin RL: Stability of α-helices. Adv Protein Chem 1995, 46: 141–176.
Smith CK, Regan L: Construction and design of β-sheets. Acc Chem Res 1997, 30: 153–161. 10.1021/ar9601048
Lifson S, Roig A: On the theory of helix-coil transition in polypeptides. J Chem Phys 1961, 34: 1963–1974. 10.1063/1.1731802
Zimm BH, Bragg JK: Theory of the phase transition between helix and random coil in polypeptide chains. J Chem Phys 1959, 31: 326–335.
Bernstein FC, Koetzle TF, Williams GJB, Meyer EFJ, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M: The protein data bank: A computer-based archival file for macromolecular structures. J Mol Biol 1977, 112: 535–542. 10.1016/S0022-2836(77)80200-3
Kabsch W, Sander C: On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. Proc Natl Acad Sci USA 1984, 81: 1075–1078. 10.1073/pnas.81.4.1075
Argos P: Analysis of sequence-similar pentapeptides in unrelated protein tertiary structures: Strategies for protein folding and a guide for site-directed mutagenesis. J Mol Biol 1987, 197: 331–348. 10.1016/0022-2836(87)90127-6
Cohen BI, Presnell SR, Cohen FE: Origins of structural diversity within sequentially identical hexapeptides. Protein Sci 1993, 2: 2134–2145. 10.1002/pro.5560021213
Han KF, Baker D: Global properties of the mapping between local amino acid sequence and local structure in proteins. Proc Natl Acad Sci USA 1996, 93: 5814–5818. 10.1073/pnas.93.12.5814
Mezei M: Chameleon sequences in the PDB. Protein Eng 1998, 11: 411–414. 10.1093/protein/11.6.411
Zhou X, Alber F, Folkers G, Gonnet GH, Chelvanayagam G: An analysis of the helix-to-strand transition between peptides with identical sequenc. Proteins 2000, 41: 248–256. 10.1002/1097-0134(20001101)41:2<248::AID-PROT90>3.0.CO;2-J
Kuznetsov IB, Rackovsky S: On the properties and sequence context of structurally ambivalent fragments in proteins. Protein Sci 2003, 12: 2420–2433.
Guo J, Jaromczyk JW, Xu Y: Analysis of chameleon sequences and their implications in biological processes. Proteins 2007, 67: 548–558. 10.1002/prot.21285
Kelly JW: The alternative conformations of amyloidogenic proteins and their multi-step assembly pathways. Curr Opin Struct Biol 1998, 8: 101–106. 10.1016/S0959-440X(98)80016-X
Serpell LC: Alzheimer’s amyloid fibrils: structure and assembly. Biochim Biophys Acta 2000, 1502: 16–30. 10.1016/S0925-4439(00)00029-6
Rost B: Review: protein secondary structure prediction continues to rise. J Struct Biol 2001, 134: 204–218.
Bhattacharjee N, Biswas P: Statistical analysis and molecular dynamics simulations of ambivalent α-helices. BMC Bioinformatics 2010, 11: 519. 10.1186/1471-2105-11-519
Bhattacharjee N, Biswas P: Local order and mobility of water molecules around ambivalent helices. J Phys Chem B 2011, 117: 12257–12265.
Bhattacharjee N: Are ambivalent α-helices entropically driven? Protein Eng Des Sel 2012, 25: 73–79. 10.1093/protein/gzr059
Alexander PA, He Y, Chen Y, Orban J, Bryan PN: A minimal sequence code for switching protein structure and function. Proc Natl Acad Sci USA 2009, 106: 21149–21154. 10.1073/pnas.0906408106
He Y, Chen Y, Alexander PA, Bryan PN, Orban J: Mutational tipping points for switching protein folds and functions. Structure 2012, 20: 283–291. 10.1016/j.str.2011.11.018
Sikoseka T, Chanb HS, Bornberg-Bauer E: Escape from adaptive conflict follows from weak functional trade-offs and mutational robustness. Proc Natl Acad Sci USA 2012, 109: 14888–14893. 10.1073/pnas.1115620109
Sikosek T: Evolutionary dynamics on protein bi-stability landscapes can potentially resolve adaptive conflicts. PLOS Comput Biol 2012, 8: e1002659. 10.1371/journal.pcbi.1002659
Rost B, Sander C: Combining evolutionary information and neural networks to predict protein secondary structure. Proteins 1994, 19: 55–72. 10.1002/prot.340190108
Kabsch W, Sander C: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 1983, 22: 2577–2637. 10.1002/bip.360221211
Guo J, Chen H, Sun ZR, Lin YL: A novel method for protein secondary structure prediction using dual-layer SVM and profiles. Proteins 2004, 54: 738–743. 10.1002/prot.10634
Hua SJ, Sun ZR: A novel method of protein secondary structure prediction with high segment overlap measure: Support vector machine approach. J Mol Biol 2001, 308: 397–407. 10.1006/jmbi.2001.4580
Kim H, Park H: Protein secondary structure prediction based on an improved support vector machines approach. Protein Eng 2003, 16: 553–560. 10.1093/protein/gzg072
Murzin AG, Brenner SE, Hubbard T, Chothia C: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 1995, 247: 536–540.
Wang G, Dunbrack RL: PISCES: a protein sequence culling server. Bioinformatics 2003, 19: 1589–1591. 10.1093/bioinformatics/btg224
Engel DE, DeGrado WF: Amino acid propensities are position-dependent throughout the length of α-helices. J Mol Biol 2004, 337: 1195–1205. 10.1016/j.jmb.2004.02.004
Chou PY, Fasman GD: Prediction of protein conformation. Biochemistry 1974, 13: 222–245. 10.1021/bi00699a002
Koehl P, Levitt M: Structure-based conformational preferences of amino acids. Proc Natl Acad Sci USA 1999, 96: 12524–12529. 10.1073/pnas.96.22.12524
O’Neil KT, DeGrado WF: A thermodynamic Scale for the helix-forming tendencies of the commonly occurring amino acids. Science 1991, 250: 646–651.
Richardson JS, Richardson DC: Amino acid preferences for specific locations at the ends of α-helices. Science 1988, 240: 1648–1652. 10.1126/science.3381086
Argos P, Palau J: Amino acid distribution in protein secondary structures. Int J Prot Pept Res 1982, 19: 380–393.
Aurora R, Srinivasan R, Rose GD: Rules for α-helix termination by glycine. Science 1994, 264: 1126–1130. 10.1126/science.8178170
Blader M, Zhang XJ, Matthews BW: Structural basis of amino acid α-helix propensity. Science 1993, 260: 1637–1640. 10.1126/science.8503008
Padmanabhan S, Marquesee S, Ridgeway T, Laue TM, Baldwin RL: Relative helix-forming tendencies of nonpolar amino acids. Nature 1990, 344: 268–270. 10.1038/344268a0
Kumar S, Bansal M: Dissecting α-helices: position-specific analysis of α-helices in globular proteins. Proteins 1998, 31: 460–476. 10.1002/(SICI)1097-0134(19980601)31:4<460::AID-PROT12>3.0.CO;2-D
Wuilmart C, Urbain J: Alpha secondary structures generate weak but recurrent periodicity in proteins. Eur J Biochem 1984, 139: 35–40. 10.1111/j.1432-1033.1984.tb07972.x
Kanehisa MI, Tsong TY: Local hydrophobicity stabilizes secondary structures in proteins. Biopolymers 1980, 19: 1617–1628. 10.1002/bip.1980.360190906
Horne DS: Prediction of protein helix content from an autocorrelation analysis of sequence hydrophobicities. Biopolymers 1988, 27: 451–477. 10.1002/bip.360270308
Creighton TE: Proteins: Structures and Molecular Properties. New York: W. H. Freeman; 1992.
Ki D, Xu D, Guo JT, Ellortt K: Xu Y: PROSPECT II: protein structure prediction program for genome-scale applications. Protein Eng 2003, 16: 641–650. 10.1093/protein/gzg081
Minor DLJ, Kim PS: Context-dependent secondary structure formation of a designed protein sequence. Nature 1996, 380: 730–734. 10.1038/380730a0
Ghozlane A, Agnel Praveen Joseph AP: Analysis of protein chameleon sequence characteristics. Bioinformation 2009, 3: 367–369. 10.6026/97320630003367
The authors gratefully acknowledge the financial assistance from Delhi University research grant. N.B. acknowledges CSIR, INDIA for providing financial assistance in form of SRF.
Both authors declare that they have no competing interests.
NB and PB have designed research. NB has performed research. PB and NB have analyzed data, written the manuscript and approved the final version. Both authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.