Investigating dynamic and energetic determinants of protein nucleic acid recognition: analysis of the zinc finger zif268-DNA complexes
- Rubben Torella†1,
- Elisabetta Moroni†1, 2,
- Michele Caselle2,
- Giulia Morra1Email author and
- Giorgio Colombo1Email author
© Torella et al; licensee BioMed Central Ltd. 2010
Received: 14 June 2010
Accepted: 24 November 2010
Published: 24 November 2010
Protein-DNA recognition underlies fundamental biological processes ranging from transcription to replication and modification. Herein, we present a computational study of the sequence modulation of internal dynamic properties and of intraprotein networks of aminoacid interactions that determine the stability and specificity of protein-DNA complexes.
To this aim, we apply novel theoretical approaches to analyze the dynamics and energetics of biological systems starting from MD trajectories. As model system, we chose different sequences of Zinc Fingers (ZF) of the Zif268 family bound with different sequences of DNA. The complexes differ for their experimental stability properties, but share the same overall 3 D structure and do not undergo structural modifications during the simulations. The results of our analysis suggest that the energy landscape for DNA binding may be populated by dynamically different states, even in the absence of major conformational changes. Energetic couplings between residues change in response to protein and/or DNA sequence variations thus modulating the selectivity of recognition and the relative importance of different regions for binding.
The results show differences in the organization of the intra-protein energy-networks responsible for the stabilization of the protein conformations recognizing and binding DNA. These, in turn, are reflected into different modulation of the ZF's internal dynamics. The results also show a correlation between energetic and dynamic properties of the different proteins and their specificity/selectivity for DNA sequences. Finally, a dynamic and energetic model for the recognition of DNA by Zinc Fingers is proposed.
Protein-DNA recognition mechanisms underlie the functioning and regulation of several cellular processes ranging from transcription to replication, modification and restriction. Consequently, it is not surprising that questions on how to achieve a detailed molecular understanding of these phenomena have emerged since the first X-ray structures of complexes appeared.
One of the central problems involves the understanding of how a certain binding protein efficiently selects a specific target sequence from a large number of possible sites . Initial studies concentrated on the specific hydrogen bonding between aminoacid side-chains and DNA bases . This initial picture evolved to a more complex one  in which several additional factors have to be taken into account: electrostatics [4–9], the effects of localized water molecules [10, 11] and general solvation effects [12–14], shape complementarity , DNA deformation have all been shown to play a critical role [16–23].
However, despite significant progress at the experimental and theoretical level, the molecular determinants of the events at the basis of protein-DNA recognition have not been fully characterized.
In this study, we apply all-atom, explicit solvent Molecular Dynamics (MD) simulations to protein-DNA complexes that show the same overall 3-Dimensional (3D) structures but differ for point mutations in either the protein or the DNA. Experimental data show that these sequence-differences have an impact on the affinity and specificity in recognition. Our goal here is to study the applicability of novel theoretical/computational approaches to map the principal energetic interactions and internal dynamic properties of complexes to investigate the determinants of stability, selectivity and specificity of different mutants with the same 3 D organization for selected DNA sequences.
As a model system we chose the Zinc Finger (ZF) proteins of the Zif268 family [24, 25]. Zinc fingers represent one of the most recurrent motifs among eukaryotic DNA-binding proteins. ZFs specifically recognize and bind their target nucleotide sequences . In particular, Zif268 (subsequently re-named Egr1) is a nuclear protein with transcriptional regulating functions: the transcripts activated by this molecule code for proteins required for cell differentiation and mitogenesis. The importance of this protein family increased after its relationships with p53-regulated apoptotic pathways were clarified [26–28].
Zinc Fingers of Zif268 belong to the C2H2 family (where Zn is coordinated by two Cys and two His residues) and are characterized by a modular structure featuring three repeated domains [24, 25]. Each finger consists of about 30 aminoacids and contains a short β-sheet and one α-helix. The two secondary structures are held in a compact conformation by a small hydrophobic core and the presence of the Zn ion that coordinates two Cys residues from the β-sheet and two His residues from the α-helix.
Analyses of X-ray data of the Zif268-DNA complexes revealed that residues at the four specific positions, -1, 2, 3 and 6 (numbering with respect to the start of the α-helix) in helix 1 make most of the contacts to the DNA stretch [24, 25]. To evaluate the effects of variations in the protein sequence on the DNA binding specificities, Rebar and Pabo used phage display approaches to prepare a library of variants randomizing the four critical aminoacids in the first Zinc Finger of Zif268 . Affinity selections using DNA sequences with base variations in the region recognized by the protein, allowed to identify protein variants that could bind specifically to new DNA sites. Dissociation constants were then determined for each selected protein in complex with its DNA-target sequence . Crystal structures of the complexes between wild type or mutated proteins with their target sequences were also obtained. Overall, the different structures showed a high degree of similarity .
Complexes and Dissociation Constants.
Aminoacid sequence at positions -1, 2, 3, 6
Nucleotidic sequence at positions 8-11
GCGT (wild type)
GCGT (wild type)
RDER (wild type)
RDER (wild type)
GCGT (wild type)
Analysis of the trajectories of each complex was then carried out to map the dynamic residue-residue coordination and rigidity distribution, the principal energetic interactions and the differences in their profiles.
The results showed differences in terms of the organization of the intra-protein energy-networks responsible for the selection and stabilization of the protein conformations recognizing DNA, and consequently for the specificity. Moreover, sequence differences in the protein and DNA-mutations are reflected into different modulation of the ZF's internal dynamics. The results also showed a correlation between energetic and dynamic properties of the different proteins and their selectivity/specificity for DNA sequences. Finally, we propose a dynamic and energetic model for the recognition of DNA by Zinc Fingers that may be useful to improve our understanding of the physico-chemical bases of protein-DNA recognition mechanisms.
The time evolution of each protein's atom-positional Root Mean Square Deviation (RMSD) from the initial X-ray structures was evaluated after combining the data from the three independent trajectories for each complex, as described in Materials and Methods. In all cases, RMSD values stabilized around 0.2 nm, showing the substantial stability of the complexes in the simulation conditions (Additional file 1, Figure S2). The calculation of average RMSD values obtained by comparing all the structures visited by each trajectory with all the structures visited by the other trajectories also yielded an average value of 0.2nm, showing high degrees of structural similarity among the different complexes. The overall conservation of structural properties was also apparent in the secondary structure analysis. No major variation could be detected, indicating the absence of large conformational changes or folding-unfolding events.
Protein Flexibility in Relation to Affinity and Specificity
Next, possible correlations between the dynamics of each protein and its DNA-binding characteristics were evaluated. First, Covariance analysis and Essential Dynamics (ED) were carried out on the trajectories of the complexes. ED identifies relevant low-energy displacements of groups of residues and emphasizes the amplitude and direction of dominant protein motions by projecting the trajectories on a subset of the principal eigenvalues and eigenvectors of the residue pair covariance matrix calculated from MD [29, 30]. Using this approach, protein regions responsible for the most relevant collective motions could be identified, and the information used to illuminate the effects of the protein and/or DNA sequences on recognition and binding.
Differential fluctuations can also be noticed in the analysis of RADR sequences. Consistently with the previous observation, the proteins with higher fluctuations display better DNA-binding properties in their respective complexes (1A1I and 1A1J), compared to 1A1K. (Figure 2).
The trend of flexible protein and rigid DNA was also conserved also in the REDR sequences (Figure 2).
Fluctuations and Specificity.
Aminoacid sequence at positions -1, 2, 3, 6
Total Protein Flexibility (RMSF sum (nm))
Total Complex Flexibility (RMSF sum (nm))
RDER (wild type)
RDER (wild type)
First, we summed up all the residue-based RMSF values for each protein and plotted the resulting value of the flexibility parameter against the respective log(Kd) value. The log of Kd was chosen to calculate the correlations, as this quantity is directly related to differences in free energy or entropy between different complexes (vide infra). It is to be noted that the direct linear correlation of Kd values with calculated parameters yields comparable correlation values, in all cases. Interestingly, the calculation of correlation between log(Kd) and the RMSF flexibility parameter yielded a correlation coefficient of -0.92. When considering also the fluctuations of the DNA stretch in the calculation, the correlation coefficient resulted -0.88 (Table 2).
A modulation of internal flexibility may also be related to a modulation of the conformational entropy. Indeed, upon evaluating each protein's conformational entropy with the Schlitter's approximation , and plotting it against the respective log(Kd), a correlation coefficient of -0.63 was found (Figure 3b). It worth noting, at this point, that these quantitative correlations can be obtained only after combining the statistics from different trajectories. This underlines the importance of significant sampling, even when starting simulations from well-defined X-ray structures.
Our results indicate a clear, non-trivial anticorrelation between affinity and flexibility for different sequences, suggesting that in general ZF proteins may specifically select oligonucleotide sequences by adapting their conformational ensemble to the rather rigid DNA target. In this framework, the possibility to access a diverse and larger pool of conformations for more flexible proteins (compared to rigid ones) favors the selection of the fine-tuned interaction motifs necessary to stabilize a certain complex.
Protein Internal Dynamics, Coordination and Rigidity
Differential flexibility plays a key role in helping different proteins select different, and relatively rigid, DNA targets [6, 32]. Analysis of helix 1, the secondary structure element where mutations are located and which should directly sense the binding to DNA, does not highlight significant differences among mean fluctuations in different proteins. In contrast, differences can be noticed in the fluctuations of helix 2 and 3 (Figure 2). This suggests that the structural and conformational effects of protein-DNA interaction directly involving helix 1 can be transmitted long-range to different regions of the protein. Variations due to changes in the side-chain interactions may thus be reflected in the collective modification of the dynamics allowing a certain protein sequence to adapt to a specific DNA stretch.
To gain more insights into these points, we applied a novel method for the analysis of the dynamic connectivity within a protein. Our approach aims at the quantitative description of the degree of internal coordination between residue pairs in the presence of dynamics.
Internal dynamic coordination is recapitulated by means of the ICRM (Internal Coordination and Rigidity Matrix) matrix (See Materials and Methods). According to the definitions presented in Materials and Methods, the elements of the matrix, Rij, describe how residue pairs are dynamically connected: high Rij values are due to low distance fluctuations and therefore detect residue pairs characterized by high dynamical coordination. On the other hand low Rij values describe poorly correlated moving pairs. Coordination between neighbouring residues may be a simple consequence of local interactions, while strong coordination between residues located at high distances sheds light on long-range correlations. In this model, the lower the distance fluctuation between two residues, the better their coordination. Groups of locally highly coordinated residues identify protein's rigid sub-structures that may have specific functional roles.
Dynamic coordination and rigidity (flexibility) can be unequally distributed among different secondary structures within each protein, and this distribution may change when considering different systems.
In this frame of thought, the components of the principal eigenvector weighted by the respective eigenvalue were calculated for each complex. Figure 5 reports on the different collective coordination profiles, ordered based on the respective protein sequence. Dynamic differences between the complexes seem to be especially evident for the helices 2, 3 and for the loop connecting helix 1 and 2. This observation is consistent with a model where the effects of mutations on helix 1 are propagated long-range across the protein structure, modulating the recognition and binding properties of the different mutants.
Overall, these data suggest that ZF proteins may exploit flexibility-rigidity modulation to facilitate DNA recognition and binding. The affinity for a DNA stretch may thus be linked to differences in the underlying dynamics as a function of small changes in the sequence and of the identity of the binding partner. These considerations corroborate the hypotheses of Elrod-Ericksson and co-workers who postulated flexibility as a necessary property for a protein to bind and optimally adapt to the relatively rigid DNA structure .
Energetics of Complexes
Next, we set out to calculate the principal energetic interactions that are involved in the stabilization of each complex, and are ultimately responsible for specificity. To this end, we applied the energy decomposition approach [33–39]. This method was introduced to extract the major contributions to energetic stability of the native structure of a protein from all-atom molecular dynamics (MD) simulations, and its results have been verified and benchmarked against a diverse set of experimental data [33–39]. For a system of N residues, the matrix of average non-bonded interactions between pairs of residues is built from an MD trajectory. All the matrices with average energy values and corresponding error bars are provided in Additional files 2, 3, 4, 5, 6, 7 and 8. The energy map is then simplified through eigenvalue decomposition. Analysis of the N components of the eigenvector associated with the lowest eigenvalue (called first eigenvector and first eigenvalue, respectively) was shown to single out those residues (hot sites) behaving as strongly interacting and possible stabilizing centers. The eigenvector associated with the main eigenvalue provides a compact description of the participation of each aminoacid to the global stabilization of a structure or a complex. The properties of this first eigenvector (labelled Sequence Eigenvector, SE) ultimately depend on the sequence.
Thus, the approximated energy obtained after the matrix decomposition (see Materials and Methods) accounts for the main attractive interactions that stabilize a certain state.
According to our approximation, any two residues i and j in a system interact with an energy , where λ1 is the first eigenvalue and wi 1 is the component of the associated first eigenvector contributed by residue i. The summation over all possible residue-pairs of the energetic couplings provides the effective approximation to the stabilization energy. The contribution of a specific residue i to the overall effective stabilization energy is thus the product of wi 1 with all other wj.1
Energetics of Complexes.
Total Stabilization Energy
Analysis of the results shows that the highest contributions to the global stabilization of the complexes are provided by DNA-DNA (60% of the total stabilizing energy) and protein-DNA interactions (35%).
From the physico-chemical point of view, this result indicates that the internal energetic distribution of the protein reorganizes specifically in response to the presence of a certain DNA stretch and in response to specific sequence mutations.
Summarizing, the results of our energy analysis suggest that recognition and binding properties are linked to a specific distribution of the stabilization energy. As expected, the global stabilization of the complex is mainly due to electrostatic interactions bringing the protein and DNA together. Modulation of affinity for a specific DNA sequence is further regulated through a redistribution of the stabilizing intraprotein interactions that strongly depends on the protein sequence.
The energy contributions we calculate with the energy decomposition method are to be considered as the effective energies approximating the free energy of binding in a situation where the unbound state is set to a common reference state (ensemble of unbound states) in which the non-bonded energy is equal for all sequences. Given the high degree of similarities of the sequences, this is a viable hypothesis, and was already shown to hold for the study of folding-unfolding of related proteins .
In this paper, we have concentrated on protein-DNA recognition, using MD simulations to investigate the global mechanisms by which Zinc Fingers bind to and modulate their affinities for given oligonucleotide sequences. DNA-binding proteins are in general able to efficiently find their binding sites, amongst large numbers of alternative genomic sequences [1, 40, 41]. Conformational dynamics and specific energetic factors underlie the process. In some cases one of the two factors may be prevalent, so that either enthalpic or entropic factors mainly determine the complex formation reaction.
In order to evaluate these contributions and shed light on the molecular details underlying protein-DNA recognition and specificity, we have applied novel methods to the analysis of simulation data. To this end, we have extended our analysis of signal transduction [42, 43] and protein energetics [33–39, 42] to obtain a compact description of the effects of mutations in the sequence of either the protein or the target DNA (or both) on internal collective dynamics and on interaction networks responsible for the stabilization of a certain complex.
The results of our analysis, based on the statistics obtained from the combination of multiple trajectories for each system, show quantitative correlations between the degree of protein flexibility and intraprotein residue-residue coupling energies with Kd values for the protein-DNA complexes under study.
The emerging picture is that selectivity and specificity are strictly correlated to lower rigidity and higher conformational entropy of the protein. In particular, the loop regions connecting different secondary structures emerge as the least coordinated and rigid motifs in the proteins showing the highest affinities. The possibility to visit a higher number of conformational states on the binding energy landscape may represent an advantage for the protein in the adaptation to the rigid-body like structure of DNA, and would allow the search for the best possible set of stabilizing interactions. Moreover, our results show that sequence variations in helix 1 determine long-range effects altering the dynamics of helices 2 and 3, suggesting a cooperative perturbation of the conformational dynamics that extends beyond the point of mutation and influences the dynamics of the whole protein in the complex.
From the energetic point of view, the application of the energy decomposition method to the trajectories of the different complexes revealed that the modulation of specific intraprotein interaction networks in response to the presence of a certain DNA-stretch quantitatively impacts on the binding affinities. Interestingly, the proteins with higher affinities (1A1F, 1A1J and 1AAY) are characterized by a more spread distribution of stabilizing residues, distributed all along the sequence. A diffuse network of strongly interacting residues once more suggests the importance of cooperative effects in determining complex stabilization and specificity.
These observations are consistent with the recent observations by Miyazono et al . The authors demonstrated, through X-ray crystallography and binding assays, the fundamental role played by extended regions in determining the specificity of homeodomain proteins for DNA and the cooperativity of the binding mechanisms. Extended, flexible regions were suggested to promote the diversity of recognition mechanisms necessary for DNA-recognition, and their deletion was shown to dramatically decrease the cooperative character of the DNA-binding event.
Overall, our calculations and models suggest that both flexibility (i.e. entropic factors) and energy modulation (enthalpic factors) contribute to the affinity and selectivity of the Zinc Fingers examined here for their target DNA sequences. The two factors are strictly interconnected: the distribution and modulation of dominant interactions reverberates in the corresponding dynamics of the complexes. Indeed, from the thermodynamic point of view, Kd is related to the complexation reaction free energy (ΔG), which is ultimately determined by the combination of internal energy and entropy. The good correlations with Kd obtained for intraprotein energy, entropy and flexibility reflect this point.
From the mechanistic point of view, our calculations suggest that binding to DNA and selection of a certain sequence can be part of a hierarchical process: at the first level, electrostatic interactions due to the charged nature of the oligonucleotide stretch and of the binding site on the protein contribute to stabilize the complex. These interactions are typically long-range and would not allow a fine-tuned discrimination of sequences. Interestingly, all variations at the dynamic and/or energetic level do not involve any significant collective DNA-protein deformations, in line with what was already highlighted by Lavery and coworkers [17, 32].
Specificity and selectivity, in turn, can be achieved through the modulation of interactions of differential intensity among specific subsets of residues. In other words, specific energetic interaction patterns throughout the protein structure can determine the accessibility of different sub-sets of conformations that allow the protein to optimally adapt to the structure and sequence of the target DNA.
Our analyses of collective properties allowed us to appreciate the pervasive effects of perturbations in one limited region of the protein on the global dynamic and energetic determinants of recognition and binding.
Overall, the results of our MD simulations and analyses have suggested that the energy landscape for DNA binding may be populated by dynamically different states, even in the absence of major conformational changes. Energetic couplings between residues may change in response to sequence variations thus modulating the importance of different regions for binding, and the consequent dynamics of protein-DNA complex formation.
From the applicative point of view , given the quantitative agreement with experimental data, we conjecture that the approach presented here may be used for the computational design and modification of (small) proteins specific for given DNA sequences. Iterative cycles of in silico mutations and evaluation of the dynamic and energetic properties with the methods presented here, would allow to select protein sequences with dynamic and energetic profiles that have maximal similarity with the ones of known proteins with high affinity and specificity.
MD set-up and simulations
All MD simulations were performed using the AMBER 9.0 package  with the ff03 force field. Each protein was solvated in a cubic box large enough to contain 0.8 nm of solvent around the complex. The TIP3P water model was used for solvation . A 1 nm non-bonded cutoff was used for van der Waals interactions, while the Particle Mesh Ewald summation method (PME) was used to deal with long-range Coulomb interactions . The Berendsen thermostat was used to control temperature and pressure . Charges on sidechains were chosen to correspond to a pH value of 7. Na+ counterions were added to ensure electroneutrality. All the structural and dynamical analyses were performed with analyses programs from the GROMACS package, after the trajectories were translated to the suitable format.
In the Zif268 protein, each of the three zinc ions is coordinated to two cysteines and two histidines, one ion for each helix. The zinc was modeled as covalently bound to the ligands, and the parameters used to describe the metal, its charge and bonding properties were the ones developed by Merz and coworkers .
In our work, 7 protein-DNA complexes were considered: they differ, with respect of the aminoacids placed on residues 16, 18, 19, 22 and of the nucleotides placed on the position 8, 9, 10, 11 and their complementary (Table 1) .
Every complex was initially minimized in vacuo by multiple minimizations (200 steps steepest descent plus 200 steps conjugate gradient). After this, each system was solvated. Multiple solute equilibration processes were performed, in order to reorganize the water molecules (100 steps steepest descent + 50 ps equilibration dynamics at constant P and T = 100K).
After the solute equilibration, another system minimization was performed (200 steps, steepest descent); afterwards, the temperature of the system was slowly brought to the desired value of 300K in 3 steps, with subsequent 100 ps NVT equilibration processes at 100K, 200K and 300K).
Finally, a last 100 ps equilibration NPT process was performed. From this final structure a set of 3 different 20 ns MD simulations was performed for each system. Three different sets of initial velocities obtained from a Maxwellian velocity distribution at the desired temperature of 300K were used to yield three different production runs for each complex.
Overall, this protocol resulted in 3 runs for each of the seven complexes studies, providing a total of 420 ns of simulation time. All the analyses were carried out on the combined trajectories obtained by concatenating each of the 3 runs for each complex, after eliminating the first 5 ns of each trajectory to allow for equilibration.
Covariance matrices were built by averaging motions of Cα atoms of the aminoacids and of the C1 atoms of the deoxyribose ring of the nucleotides, deviating from the mean structure, with the latter calculated over the trajectory. The essential directions of correlated motions during dynamics were then calculated by means of the Essential Dynamics method , or principal component analysis of the 3N × 3N covariance matrix Cij.
The covariance matrix was also used to calculate the entropic content of the complexes using the Schlitter's approach .
Analysis of Internal Coordination and Rigidity
Where Δxijk represents the instant value of the k-th cartesian component of the distance between Cα of residues i and j and yields the fluctuation of the distance component when averaged over the trajectory. We set Rij = 0 on the diagonal and for nearest-neighbours to avoid divergence. In this way, the ICRM matrix describes how residue pairs are dynamically connected: high Rij values are due to low distance fluctuations and therefore detect residue pairs characterized by high dynamical coordination. On the other hand low Rij values describe poorly correlated moving pairs (i.e., they are poorly coordinated and are characterized by low communication propensity due to high distance fluctuations). Coordination between neighbouring residues may be a trivial consequence of local interactions, while strong coordination between residues located at high distances sheds light on long range correlations. Hence, to summarize, the lower the distance fluctuation between two residues, the better they are coordinated and behaving like two points of a rigid body. Groups of locally highly coordinated residues identify protein's rigid sub-structures, such as secondary structure elements. On the other hand, residue pairs at long distance having low distance fluctuations can be due to mutually coordinated protein sub-domains possibly related to long range correlations.
Energy Decomposition Method
The energy decomposition method (EDM) [33–39] aims at the identification of crucial residues (hotspots) for the stabilization of a certain structure and its energetic stability. As first step, the method computes the matrix of non-bonded interaction energies (namely, van der Waals and electrostatic interactions) between pairs of residues. This matrix is afterwards diagonalized and, from the analysis of the eigenvector associated with the lowest eigenvalue, it is possible to determine those residues that behave as strongly interacting and stabilizing centers.
Herein, the structures sampled every 1 ns for each complex were taken into account and the average non-bonded interaction energy matrix was computed from averaging non-bonded pair interactions over all the protein structures saved. The solvent is directly taken into account using the GBSA method  in the calculation of the non-bonded interactions. This type of averaging calculation allows to obtain strong correlations with experimental free-energy related values, in contrast to the simple use of representative structures of main clusters.
N is the sum of the aminoacids and nucleotides in the complex;
λk is the k-th eigenvalue;
W ik is the i-th component of the k-th eigenvector;
Eigenvalues and eigenvectors are usually labeled following an increasing order. Therefore, λ1 is the lowest eigenvalue and, from now on, we will refer to the first eigenvector as the eigenvector corresponding to the eigenvalue λ1.
As mentioned above, the eigenvector associated with the lowest eigenvalue is used to identify the most stabilizing aminoacids. In particular, considering its squared components as the weights of the corresponding residues in the structural stabilization, we can define "hot spots" those residues with a weight higher than a threshold t. This threshold is chosen equal to the squared component of a normalized "flat eigenvector" (namely, a normalized vector whose components provide the same contribution for each site). This corresponds to a case in which each residue equally contributes to the structural stability and, therefore, the threshold t is equal to 1/N, where N is the number of the eigenvector components.
This work was supported by a grant from AIRC (Associazione Italiana Ricerca sul Cancro) to GC, from the Fondazione Cariplo grant Nr. 2008.2198, and from the Italian Ministry of Health GR-2007 programme.
- ICRM Matrix:
Internal Coordination and Rigidity Matrix.
- Berger MF, Philippakis AA, Qureshi AM, He FH, Estep III PW, Bulyk ML: Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities. Nature Biotech 2006, 23: 1429–1435. 10.1038/nbt1246View ArticleGoogle Scholar
- Seaman NC, Rosenberg JM, Rich A: Sequence specific recognition of double helical nucleic acids by proteins. Proc Natl Acad Sci USA 1976, 73: 804–808. 10.1073/pnas.73.3.804View ArticleGoogle Scholar
- MacKerell AD, Nilsson L: Molecular dynamics simulations of nucleic acid-protein complexes. Curr Op Struct Biol 2008, 18(2):194–199.View ArticleGoogle Scholar
- Aci-Seche S, Garnier N, Goffinont S, Ganest D, Spotheim-Maurizot M, Genest M: Comparing native and irradiated E. coli lactose repressor-operator complex by molecular dynamics simulation. Eur Biophys J with Biophis Lett 2010, 39(10):1375–1384.View ArticleGoogle Scholar
- La Penna G, Perico A: Wrapped around models for Lac Operon Complex. Biophys J 2010, 98(12):2964–2973. 10.1016/j.bpj.2010.03.024PubMed CentralView ArticlePubMedGoogle Scholar
- Temiz AN, Benos PV, Camacho CJ: Electrostatic hot spot on DNA-binding domains mediates phosphate desolvation and the preorganization of specificity determinant side chains. NAR 2010, 38(7):2134–2144. 10.1093/nar/gkp1132PubMed CentralView ArticlePubMedGoogle Scholar
- Bahadur RP, Kannan S, Zacharias M: Binding of the Bacteriophage P22 N-Peptide to the boxB RNA Motif Studied by Molecular Dynamics Simulations. Biophys J 2009, 97(12):3139–3149. 10.1016/j.bpj.2009.09.035PubMed CentralView ArticlePubMedGoogle Scholar
- Sanchez IE, Ferreiro DU, Dellarole M, de Prat-Gray G: Experimental snapshots of a protein-dna binding landscape. Proc Natl Acad Sci USA 2010, 107(17):7751–7756. 10.1073/pnas.0911734107PubMed CentralView ArticlePubMedGoogle Scholar
- Givaty O, Levy Y: Protein sliding along DNA: dynamics and structural characterization. J Mol Biol 2009, 385(4):1087–1097. 10.1016/j.jmb.2008.11.016View ArticlePubMedGoogle Scholar
- Yamane T, Okamura H, Ikeguchi M, Nishimura Y, Kidera A: Water-mediated interactions between DNA and PhoB DNA-binding/transactivation domain: NMR-restrained molecular dynamics in explicit water environment. Proteins 2008, 71(4):1970–1983. 10.1002/prot.21874View ArticlePubMedGoogle Scholar
- Mishra SH, Spring AM, Germann MW: Thermodynamic profiling of HIV RREIIB RNA-Zinc Finger Interactions. J Mol Biol 2009, 393(2):369–382. 10.1016/j.jmb.2009.07.066PubMed CentralView ArticlePubMedGoogle Scholar
- Ponomarev SY, Putkaradze V, Bishop TC: Relaxation dynamics of nucleosomal DNA. PhysChemChemPhys 2009, 11(45):10633–10643.Google Scholar
- Singhal P, Jayaram B, Dixit SB, Beveridgey DL: Prokaryotic gene finding based on physicochemical characteristics of codons calculated from molecular dynamics Simulations. Biophys J 2008, 94(11):4173–4183. 10.1529/biophysj.107.116392PubMed CentralView ArticlePubMedGoogle Scholar
- Sands ZA, Laughton CA: Molecular dynamics simulations of DNA using the generalized born solvation model: Quantitative comparisons with explicit solvation results. J Phys Chem B 2004, 108(28):10113–10119. 10.1021/jp048757qView ArticleGoogle Scholar
- Lian P, Liu LA, Shi YX, Bu YX, Wei DQ: Tethered-Hopping Model for Protein-DNA Binding and Unbinding Based on Sox2-Oct1-Hoxb1 Ternary Complex Simulations. Biophys J 2010, 98(7):1285–1293. 10.1016/j.bpj.2009.12.4274PubMed CentralView ArticlePubMedGoogle Scholar
- Kannan S, Zacharias M: Simulation of DNA double-strand dissociation and formation during replica-exchange molecular dynamics simulations. PhysChemChemPhys 2009, 11(45):10589–10595.Google Scholar
- Paillard G, Lavery R: Analyzing protein-DNA recognition mechanisms. Structure 2004, 12: 113–122. 10.1016/j.str.2003.11.022View ArticlePubMedGoogle Scholar
- Rohs R, West SM, Sosinsky A, Liu P, Mann RS, Honig B: The role of DNA shape in protein-DNA recognition. Nature 2009, 461(7268):1248. 10.1038/nature08473PubMed CentralView ArticlePubMedGoogle Scholar
- Cheatham TEI: Simulation and modeling of nucleic acid structure, dynamics and interactions. Curr Op Struct Biol 2004, 14: 360–367. 10.1016/j.sbi.2004.05.001View ArticleGoogle Scholar
- Tsui V, Radhakrishnan I, Wright PE, Case DA: NMR and molecular dynamics studies of the hydration of a zinc finger-DNA complex. J Mol Biol 2000, 302: 1101–1117. 10.1006/jmbi.2000.4108View ArticlePubMedGoogle Scholar
- Kamberaj H, van der Vaart A: Extracting the casuality of correlated motions from molecular dynamics simulations. Biophys J 2009, 97: 1747–1755. 10.1016/j.bpj.2009.07.019PubMed CentralView ArticlePubMedGoogle Scholar
- van der Vaart A, Bursulaya BD, Brooks CLI, Merz KMMJ: Are many body effects important in protein folding? J Phys Chem B 2000, 104: 9554–9563. 10.1021/jp001193fView ArticleGoogle Scholar
- Bradley MJ, Chivers PT, Baker NA: Molecular dynamics simulation of the Escherichia coli NikR protein: Equilibrium conformational fluctuations reveal interdomain allosteric communication pathways. J Mol Biol 2008, 378(5):1155–1173. 10.1016/j.jmb.2008.03.010PubMed CentralView ArticlePubMedGoogle Scholar
- Rebar EJ, Pabo CO: Zinc Finger Phage: Affinity selection of fingers with new DNA-binding specificities. Science 1994, 263: 671–673. 10.1126/science.8303274View ArticlePubMedGoogle Scholar
- Elrod-Erickson M, Benson TE, Pabo CO: High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition. Structure 1998, 6: 451–464. 10.1016/S0969-2126(98)00047-1View ArticlePubMedGoogle Scholar
- Kitayner M, Rozenberg H, Rohs R, Suad O, Rabinovich D, Honig B, Shakked Z: Diversity in DNA recognition by p53 revealed by crystal structures with Hoogsteen base pairs. Nat Struct Mol Biol 2010, 17(4):423. 10.1038/nsmb.1800PubMed CentralView ArticlePubMedGoogle Scholar
- Weisz L, Zalcenstein A, Stambolsky P, Cohen Y, Goldfinger N, Oren M, Rotter V: Transactivation of the EGR1 gene contributes to mutant p53 gain of function. Cancer Res 2004, 64(22):8318–8327. 10.1158/0008-5472.CAN-04-1145View ArticlePubMedGoogle Scholar
- Okamura H, Yoshida K, Morimoto H, Haneji T: PTEN expression elicted by EGR-1 transcriptio factor in calyucin A-induced apoptotic cells. J Cell Biochem 2005, 94(1):117–125. 10.1002/jcb.20283View ArticlePubMedGoogle Scholar
- Amadei A, Linssen ABM, Berendsen HJC: Essential dynamics of proteins. Proteins: Struct, Funct, and Gen 1993, 17: 412–425. 10.1002/prot.340170408View ArticleGoogle Scholar
- García A: Large-amplitude nonlinear motions in proteins. Phys Rev Lett 1992, 66: 2696–2699. 10.1103/PhysRevLett.68.2696View ArticleGoogle Scholar
- Schlitter J: Estimation of absolute and relative entropies of macromolecules using the covariance matrix. Chem Phys Lett 1993, 215(6):617–621. 10.1016/0009-2614(93)89366-PView ArticleGoogle Scholar
- Bouvier B, Lavery R: A Free Energy Pathway for the Interaction of the SRY Protein with Its Binding Site on DNA from Atomistic Simulations. J Am Chem Soc 2009, 131(29):9864. 10.1021/ja901761aView ArticlePubMedGoogle Scholar
- Morra G, Colombo G: Relationship between energy distribution and fold stability: Insights from molecular dynamics simulations of native and mutant proteins. Proteins: Structure, Function and Bioinformatics 2008, 72(2):660–672. 10.1002/prot.21963View ArticleGoogle Scholar
- Ragona L, Colombo G, Catalano M, Molinari H: Determinants of protein stability and folding: Comparative analysis of beta-lactoglobulins and liver basic fatty acid binding protein. Proteins: Structure, Function and Bioinformatics 2005, 61(2):366–376. 10.1002/prot.20493View ArticleGoogle Scholar
- Colacino S, Tiana G, Colombo G: Similar folds with different stabilization mechanisms: the cases of Prion and Doppel proteins. BMC Struct Biol 2006., 6(17):Google Scholar
- Tiana G, Simona F, De Mori GMS, Broglia RA, Colombo G: Understanding the determinants of stability and folding of small globular proteins from their energetics. Protein Science 2004, 13(1):113–124. 10.1110/ps.03223804PubMed CentralView ArticlePubMedGoogle Scholar
- Morra G, Baragli C, Colombo G: Selecting sequences that fold into a defined 3 D structure: A new approach for protein design based on molecular dynamics and energetics. Biophys Chem 2010, 146(32):76–84. 10.1016/j.bpc.2009.10.007View ArticlePubMedGoogle Scholar
- Scarabelli G, Morra G, Colombo G: Predicting interaction sited from the energetics of isolated proteins: a new approach to epitope mapping. Biophys J 2010, 98(9):1966–1975. 10.1016/j.bpj.2010.01.014PubMed CentralView ArticlePubMedGoogle Scholar
- Genoni A, Morra G, Merz KMM, Colombo G: Computational Study of the Resistance Shown by the Subtype B/HIV-1 Protease to Currently Known Inhibitors. Biochemistry 2010, 49(19):4283–4295. 10.1021/bi100569uPubMed CentralView ArticlePubMedGoogle Scholar
- Berg OG, Winter RB, von Hippel PH: Diffusion driven mechanisms of protein translocation on nucleic acids. 1. Models and theory. Biochemistry 1981, 20: 6929–6948. 10.1021/bi00527a028View ArticlePubMedGoogle Scholar
- Halford SE, Marko JF: How do site-specific DNA-binding proteins find their targets? Nucleic Acid Res 2004, 32: 3040–3052. 10.1093/nar/gkh624PubMed CentralView ArticlePubMedGoogle Scholar
- Morra G, Verkhivker GM, Colombo G: Modeling signal propagation mechanisms and ligand-based conformational dynamics of the Hsp90 molecular chaperone full length dimer. PLOS Comp Biol 2009, 5(3):e1000323. 10.1371/journal.pcbi.1000323View ArticleGoogle Scholar
- Morra G, Neves MAC, Plescia CJ, Tsutsumi S, Neckers L, Verkhivker G, Altieri DC, Colombo G: Dynamics-Based Discovery of Allosteric Inhibitors: Selection of New Ligands for the C-terminal Domain of Hsp90. J Chem Theory and Computation 2010, 6(9):2978–2989. 10.1021/ct100334nView ArticleGoogle Scholar
- Miyazono K, Zhi Y, Takamura Y, Nagata K, Saigo K, Kojima T, Tanokura M: Cooperative DNA-Binding and sequence-recognition mechanisms of aristaless and clawless. EMBO J 2010, 29: 1613–1623. 10.1038/emboj.2010.53PubMed CentralView ArticlePubMedGoogle Scholar
- Pabo CO: Specificity by design. Nat Biotech 2006, 24(8):954–955. 10.1038/nbt0806-954View ArticleGoogle Scholar
- Case DA, Darden TA, Cheatham TEI, Simmerling CL, Wang J, Duke RE, Luo R, Merz KM, Pearlman DA, Crowley M, Walker RC, Zhang W, Wang B, Hayik S, Roitberg A, Seabra G, Wong KF, Paesani F, Wu X, Brozell S, Tsui V, Gohlke H, Yang L, Tan C, Mongan J, Hornak V, Cui G, Beroza P, Mathews DH, Schafmeister C, Ross WS, Kollman PA: AMBER 9. University of California 2006.Google Scholar
- Jorgensen WL, Chandrasekhar J, Madura J, Impey RW, Klein ML: Comparison of simple potential functions for simulating liquid water. J Chem Phys 1983, 79: 926–935. 10.1063/1.445869View ArticleGoogle Scholar
- Darden T, York D, Pedersen L: Particle mesh Ewald: An N-log(N) method for Ewald sums in large systems. J Chem Phys 1993., 98(10089–10092):Google Scholar
- Berendsen HJC, Postma JPM, Gunsteren WFv, Nola AD, Haak JR: Molecular dynamics with coupling to an external bath. J Chem Phys 1984, 81: 3684–3690. 10.1063/1.448118View ArticleGoogle Scholar
- Merz KMMJ, Murcko MA, Kollman PA: Inhibition of carbonic anhydrase. J Am Chem Soc 1991, 113(12):4484–4490. 10.1021/ja00012a017View ArticleGoogle Scholar
- Amadei A, Linssen ABM, Berendsen HJC: Essential dynamics of proteins. Proteins: Struct Funct Genet 1993, 17: 412–425. 10.1002/prot.340170408View ArticleGoogle Scholar
- Chennubhotla C, Yang Z, Bahar I: Coupling between global dynamics and signal transduction pathways: a mechanism of allostery for chaperonin GroEL. Mol BioSyst 2008, 4: 287–292. 10.1039/b717819kView ArticlePubMedGoogle Scholar
- Still WC, Tempczyk A, Hawley RC, Hendrickson T: Semianalytical Treatment of Solvation for Molecular Mechanics and Dynamics. J Am Chem Soc 1990, 112: 6127–6129. 10.1021/ja00172a038View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.