Crosslinking and mass spectrometry suggest that the isolated NTD domain dimer of Moloney murine leukemia virus integrase adopts a parallel arrangement in solution
© Henriquez et al.; licensee BioMed Central Ltd. 2013
Received: 7 January 2013
Accepted: 8 July 2013
Published: 11 July 2013
Retroviral integrases (INs) catalyze the integration of viral DNA in the chromosomal DNA of the infected cell. This reaction requires the multimerization of IN to coordinate a nucleophilic attack of the 3’ ends of viral DNA at two staggered phosphodiester bonds on the recipient DNA. Several models indicate that a tetramer of IN would be required for two-end concerted integration. Complementation assays have shown that the N-terminal domain (NTD) of integrase is essential for concerted integration, contributing to the formation of a multimer through protein-protein interaction. The isolated NTD of Mo-MLV integrase behave as a dimer in solution however the structure of the dimer in solution is not known.
In this work, crosslinking and mass spectrometry were used to identify regions involved in the dimerization of the isolated Mo-MLV NTD. The distances between the crosslinked lysines within the monomer are in agreement with the structure of the NTD monomer found in 3NNQ. The intermolecular crosslinked peptides corresponding to Lys 20-Lys 31, Lys 24-Lys 24 and Lys 68-Lys 88 were identified. The 3D coordinates of 3NNQ were used to derive a theoretical structure of the NTD dimer with the suite 3D-Dock, based on shape and electrostatics complementarity, and filtered with the distance restraints determined in the crosslinking experiments.
The crosslinking results are consistent with the monomeric structure of NTD in 3NNQ, but for the dimer, in our model both polypeptides are oriented in parallel with each other and the contacting areas between the monomers would involve the interactions between helices 1 and helices 3 and 4.
Protein-protein interactions play a fundamental role on the assembly of multimeric complexes of IN to carry out two-end concerted DNA integration [1–3]. Retroviral integrase structures are organized in three domains, a central domain that contain the catalytic site (CCD), a N-terminal domain (NTD) that binds zinc and a C-terminal domain (CTD) having a SH3 fold .
Biochemical and genetic studies have shown that the NTD of integrase (IN) is involved in protein-protein interactions favoring protein multimerization and stabilization of the DNA-IN complex [5–8]. Prototype foamy virus (PFV) structure shows binding of this domain to LTR . A polypeptide containing the first 105 amino acids of Mo-MLV IN, expressed in E. coli, is functionally active since it complements in trans an IN mutant lacking the NTD. Gel filtration chromatography indicates that this NTD behaves as a dimer in solution . Similar observations have been made in HIV-1 IN . The 3D structure of HIV-1 IN NTD, solved by NMR showed the dimer interface is highly hydrophobic and it includes the α-helices 3 and 4, and the N-terminal of helix 1 .
The NTD is essential for 3’ processing and strand transfer, however determining its role in the integration process in lentiviruses and oncogenic viruses has been difficult due to the absence of the full-length structure of IN and the complexity of the protein-protein and protein-DNA interactions involved in the synaptic complex. Several models based on the partial 3D structure of IN fragments have been proposed for HIV-1 IN and ASV [11–13]. The X-ray structure of a tetrameric integrase complex of the PFV IN and processed U5 DNA was solved [14–16]. In this complex, the NTDs of the external subunits have been located at distal positions of the complex , however their structure was unresolved in the crystal structure. The quaternary structure of HIV integrase in solution has been examined by small and wide X-ray angle scattering and chemical crosslinking . It has been concluded that integrase can assemble as a tetramer by the interaction of two different dimers: one of them is stabilized by interactions between the CCDs of two subunits while the other dimer is stabilized by interactions of one NTD with the CCD, CTD, and NTD of the other subunit. The interaction between the NTDs in the latter dimer was detected by chemical crosslinking .
Results and discussion
The NTD of Mo-MLV integrase behaves as a dimer in solution according to gel filtration on Superose 12 and favors multimerization of IN [6, 8]. In complementation studies it has been shown that addition of an isolated NTD domain to an IN construct lacking this domain restores the activities of IN. However, there is no information on the structure of the dimer of Mo-MLV in solution and how it interacts with the other IN domains. In this work we carried out chemical crosslinking to identify lysine residues that were located on the interface of the dimer in order to determine the regions involved in dimerization. This procedure involved the analysis of the crosslinked peptides on the isolated dimer and uncrosslinked monomers by mass spectrometry [19–21].
Chemical cross-linking of IN 1–105 with BS3
Identification of intramolecular and intermolecular crosslinking peptide with BS3
Looplinked peptides sequences of BS 3 crosslinked peptides of Mo-MLV IN NTD and the Cβ distances between modified lysine residues in the generated model
88 – 95
31 – 33
95 – 104
TLK88NITETCK - ACAQVNASK104S
88 – 104
Intermolecular peptides sequences of BS 3 crosslinked peptides of Mo-MLV IN NTD and the Cβ distances between modified lysine residues (bold type) in the dimeric model
24 – 24 *
31 – 20/24
31 – 20 *
68 – 88
Model of the NTD dimer of Mo-MLV integrase
The solution structure of the isolated HIV-1 NTD dimer also exhibits a parallel arrangement , however, structural information on isolated NTDs of other integrases is not available.
It is generally thought that a tetramer is involved in the integration process. It has been proposed that this tetramer is assembled by two dimers that differ in their conformation. Differences in conformation reflect different combinations of interactions between the three domains of the enzyme. Crosslinking experiments in solution have revealed NTD-NTD interactions in the tetramer  that were not observed in the crystal structure of fragments of HIV-1 IN or PFV IN structure. It has been speculated that this kind of interaction may be related to a “domain swapping” phenomenon in which an interaction between the NTD and CCD domains is substituted by an interaction between the NTDs . It is possible that this interaction may be evident in the isolated NTDs due to the absence of the CCD domain.
In order to confirm that crosslinking was not produced by the reaction between dimers due to the high reactivity of BMOE with SH groups, the crosslinked reaction mixture was further analyzed by gel filtration on superdex S-200. After crosslinking with BMOE most of the protein eluted as a dimer (not shown).
All retroviral integrases contain a HHCC zinc finger motif at the N-terminal domain that is involved in protein-protein interactions, which is essential for retroviral integration. The structure of NTD of HIV-1 IN has been solved by X-ray crystallography and NMR. However for Mo-MLV with a larger NTD domain, the only information available is the deposited x-ray structure 3NNQ. The dimeric structure represented in the asymmetric unit of 3NNQ fails to explain our crosslinking results. Studies of the isolated Mo-MLV NTD indicates that the domain behaves as a dimer in solution and as such is active in complementation of the IN activities. Therefore our aim was to determine the contacting regions of the NTD domain using chemical crosslinking and mass spectrometry to identify the dimer present in solution. Identification of intramolecular crosslinks of lysines using BS3 as a spacer agree with the conformation of the monomer in 3NNQ. However, the intermolecular crosslinking results did not match the interacting regions found in 3NNQ dimer.
The current approach to determine protein-protein interaction in the NTD domain of Mo-MLV integrase includes the use of the 3D coordinates of the 3NNQ monomer and experimental distance restraints obtained using lysines crosslinking and MS/MS sequencing methodologies. We propose a homodimeric model where both polypeptides are aligned in parallel with the β strands away from the interface, formed mainly by helices 1 and 4.
Our model was tested by cysteine crosslinking with BMOE and as expected a high crosslinking yield was achieved. We expect that this theoretical model could be useful to test other properties of the NTD such as its interactions with the other domains of IN and to understand the mechanisms used by this HHCC domains to regulate protein interactions in different contexts.
E. coli BL21 (DE3) harboring the plasmid pET/IN 1–105 was used to express IN 1–105 . Protein production was induced with lactose according to the protocol of Studier, F. . Cells were grown at 20°C up to 5 O.D. at 600 nm in 400 mL. Cells were collected by centrifugation at 6,000 rpm for 30 min in a Sorvall RC-5, GSA rotor. The pellet was resuspended in 40 mL of lysis buffer (10 mM CHAPS, 10 mM imidazole, 300 mM NaCl, 50 mM NaH2PO4 pH 8,0 and protease inhibitor tablet (Roche) was sonicated for 6 pulses of 30 s at maximum intensity (Branson) and the cell debris was removed by centrifugation at 9,000 rpm for 40 min. 1 mL of Ni-NTA-agarose was added to the supernatant and mixed overnight by rotation in a twister VS-96 TW at 4°C. The resin was washed with 40 mL of buffer 1 (50 mM NaH2PO4 pH 8.0, 10 mM CHAPS, 1 M NaCl, 10 mM 2-mercaptoethanol, 20% glycerol for 10 min and the resin was set on a column (BioRad). The column was washed sequentially with 4 mL of: 25, 50, 100 and 250 mM imidazole dissolved in buffer 1. Fractions of 1 mL were collected. Protein was quantified by Bradford (BioRad) and analyzed by SDS-PAGE on 15% acrylamide gels.
Construction of K104C IN 1–105 mutant
This construct was made by PCR using the plasmid pET IN1-105 as a template  and the mutagenic primer K104C 5’-AAA AGG ATC CTA AGA GCA GCT GGC GTT GA -3’ that included the Bam HI site (underlined) and the T7 promoter 5’- CTATAGTGAGTCGTATTA -3’. The 480 bp PCR product was digested with Nde I and Bam HI (NEB) and purified by electrophoresis in 1.2% agarose gels and ligated to pET 11b digested with Nde I and Bam HI. The mutation was confirmed by DNA sequencing.
Chemical crosslinking of IN 1–105
The experimental approach to identify crosslinked lysines is based on the use of homo- bifunctional cross-linking agents that are directed primarily towards amino groups. ϵ-NH2 groups of lysine or the α-NH2 of the N-terminal amino acid of the protein would potentially react with the N-hydroxysuccinimide ester. This in turn produces intramolecular crosslinking when lysines of the same polypeptide are located at the appropriate distance. Intermolecular crosslinking can also be produced when the reactive lysines are located in different polypeptides. In our approach the reacted protein was digested with sequencing grade proteases and analyzed by MALDI TOF/TOF and LC-MS/MS to identify the position of crosslinking. For the first round analysis, the data was searched using Bioworks with +138 (for intra peptide link) and +156 as lysine variable modifications. In these studies the homobifuntional crosslinker bis(sulfosuccimidyl) suberate (BS3, Pierce) was used. This crosslinker spans 11.4 Å. 0.4 nmoles of the IN 1–105 (10 μM) was incubated at 25°C in 50 mM Hepes pH 7.8 and 100 mM NaCl with 100 μM of BS3 for 35 min. The crosslinking reaction mixture was quenched with a loading buffer of protein and 50 mM lysine pH 8.0 and subjected to SDS-PAGE and stained with Coomasie Brilliant Blue. Under these conditions 50% of IN 1–105 was crosslinked. The protein band corresponding to the crosslinked dimer was excised and digested with trypsin for mass spectrometry analysis.
Characterization of crosslinked products of IN 1–105 by gel filtration
1.3 mg of protein in 2 mL was crosslinked as described above. The reaction products were concentrated 10-fold in an Amicon filter (10000 MWCO, Millipore) and 1.3 mg of protein (200 μL) was loaded on a Superdex S-200 column (30 × 1.7 cm Pharmacia), equilibrated with 10 mM Tris HCl pH 7.5, 0.5 M NaCl, 1 mM DTT and 5% glycerol which were connected to a diode array detector (Jasco). Fractions of 0.5 mL were collected and the protein visualized by SDS-PAGE 12% and Coomasie Brilliant Blue staining.
Proteolytic digestion in gel
The stained gel piece was incubated with 1 mL of 50 mM of NH4HCO3 in 50% v/v acetonitrile for 1 h or until the stain disappeared. Then, the protein was reduced with 60 μL of 20 mM DTT in 50 mM NH4HCO3 for 15 min at 60°C. The solution was removed and the gel was treated with 40 mM iodoacetamide in 50 mM NH4HCO3 for 30 min at 37°C, in the dark. The gel piece was washed with 50 mM NH4HCO3 in 50% v/v acetonitrile and with 100% acetonitrile twice and dried 10 min at room temperature, then 0.8 μg of trypsin in 40 μL of 25 mM NH4HCO3, pH 7.8 were added and incubated at 37°C for 16 h. Peptides were recovered by concentration in speed vac and analyzed by LC-MS/MS (U3000 from Dionex and LTQ from Thermofisher) or MALDI-TOF/TOF (Applied biosystem).
Proteolytic digestion in solution
The crosslinking reaction mixture (0.4 nmoles of protein) in 50 mM NH4HCO3 was treated with 20 mM iodoacetamide for 30 min at 37°C. The reaction was stopped by the addition of 20 mM DTT for 15 min. Then, trypsin in a ratio 1:20 (wt/wt) was added, and the digestion was carried out at 37°C for 16 h. The digested peptides were then analyzed by nano LC-MS/MS (U3000 from Dionex and LTQ from Thermofisher) or 4800 MALDI-TOF/TOF instrument (Applied biosystem).
Peptide crosslinked analysis
A list of likely peptides, containing one undigested lysine for trypsin or with more than one lysine for chymotrypsin, was used as lysine modification in order to compare with unmodified peptide profile of the control without BS3 and search LC-MS/MS data by Bioworks software or for manual interpretation of MALDI-MS/MS data. For intra-peptide cross-linking and BS3 single residue modification, 138.068 and 156.0786 were added on lysine as modifications respectively. The MS/MS spectra of cross-linked peptides were manually confirmed.
Concerted two-end integration assay
The concerted two-end integration assay is a modification of that previously described [24–26]. One pmole of the 5’-labeled substrate used for strand transfer was incubated in SST buffer (20 mM MES pH 6,4, 20 mM KCl, 5% glycerol, 10 mM DTT, 20 mM MnCl2 and 10% DMSO) for 30 min on ice with 7 pmol of IN wt or 56 pmoles of IN 1–105 (or mutant K105C) and 7 pmoles of IN 106–404 for complementation assay. Then 100 pmol of pUC18 DNA were added and incubated at 37°C for 1 h in a final volume of 15 μL. The reaction was stopped with 3 μL of 0.1 M EDTA, 5% SDS plus 1 μL of proteinase K (20 mg/mL) and incubated at 55°C for 1 h. The reaction products were separated by electrophoresis on 1% agarose gels. The gel was dried and exposed on a Phosphor-Imager (Bio-Rad).
Just prior to use, the IN 1–105 and K104C IN 1–105 were adjusted to 0.8 mg/mL and ZnSO4 (10 μM final) was added to chelate the cysteines of the zinc binding motif. The proteins were kept on ice for 30 min. The bifunctional maleimide-coupled crosslinking agent BMOE (span arm 8 Å) and BM(PEG)2 (span arm 14.7 Å) were prepared in DMSO just prior to use. The crosslinker was added to the protein solution at a different crosslinker/protein ratio. The reaction mixture was left on ice for 60 min and the crosslinker excess was quenched with 10 mM DTT for 30 min with further 30 min on ice. The reaction products were analyzed by SDS-PAGE and protein were stained with Coomasie Brilliant Blue.
Monomer of N-terminal integrase
Integrase N-terminal 3D coordinates, residues 11 to 105, was a modification of 3NNQ pdb file, where the three selenomethionine residues in Chain A were edited to methionine residues.
Integrase N-terminal dimer was built using 3D-Dock suite and experimental crosslinking restraints. In summary, one monomer of N-terminal integrase was defined to the program as a fixed structure and the other monomer as mobile. FTDock program, based on the correlation algorithm of Katchalski-Katzir plus an electrostatic function, generated 10,000 possible complexes through rotation and translation of the mobile monomer previous to the correlation calculations. Experimental cross linking information was used as a filter (filter routine in the suite) to select the complex that agreed with the distance restraint data . After the model was selected, Multidock routine was used to refine the side chains of the amino acids involved in the interface and to perform a rigid body energy minimization of the complex.
- Lutzke RA, Plasterk RH: Structure-based mutational analysis of the C-terminal DNA-binding domain of human immunodeficiency virus type 1 integrase: critical residues for protein oligomerization and DNA binding. J Virol 1998, 72: 4841–4848.PubMed CentralPubMedGoogle Scholar
- Li M, Mizuuchi M, Burke TR Jr, Craigie R: Retroviral DNA integration: reaction pathway and critical intermediates. EMBO J 2006, 25: 1295–1304. 10.1038/sj.emboj.7601005PubMed CentralView ArticlePubMedGoogle Scholar
- Jenkins TM, Engelman A, Ghirlando R, Craigie R: A soluble active mutant of HIV-1 integrase: involvement of both the core and carboxyl-terminal domains in multimerization. J Biol Chem 1996, 271: 7712–7718. 10.1074/jbc.271.13.7712View ArticlePubMedGoogle Scholar
- Eijkelenboom AP, Lutzke RA, Boelens R, Plasterk RH, Kaptein R, Hard K: The DNA-binding domain of HIV-1 integrase has an SH3-like fold. Nat Struct Biol 1995, 2: 807–810. 10.1038/nsb0995-807View ArticlePubMedGoogle Scholar
- Ellison V, Brown PO: A stable complex between integrase and viral DNA ends mediates human immunodeficiency virus integration in vitro. Proc Natl Acad Sci U S A 1994, 91: 7316–7320. 10.1073/pnas.91.15.7316PubMed CentralView ArticlePubMedGoogle Scholar
- Zheng R, Jenkins TM, Craigie R: Zinc folds the N-terminal domain of HIV-1 integrase, promotes multimerization, and enhances catalytic activity. Proc Natl Acad Sci U S A 1996, 93: 13659–13664. 10.1073/pnas.93.24.13659PubMed CentralView ArticlePubMedGoogle Scholar
- Donzella GA, Leon O, Roth MJ: Implication of a central cysteine residue and the HHCC domain of Moloney murine leukemia virus integrase protein in functional multimerization. J Virol 1998, 72: 1691–1698.PubMed CentralPubMedGoogle Scholar
- Yang F, Seamon JA, Roth MJ: Mutational analysis of the N-terminus of Moloney murine leukemia virus integrase. Virology 2001, 291: 32–45. 10.1006/viro.2001.1218View ArticlePubMedGoogle Scholar
- Krishnan L, Li X, Naraharisetty HL, Hare S, Cherepanov P, Engelman A: Structure-based modeling of the functional HIV-1 intasome and its inhibition. Proc Natl Acad Sci U S A 2010, 107: 15910–15915. 10.1073/pnas.1002346107PubMed CentralView ArticlePubMedGoogle Scholar
- Cai M, Zheng R, Caffrey M, Craigie R, Clore GM, Gronenborn AM: Solution structure of the N-terminal zinc binding domain of HIV-1 integrase. Nat Struct Biol 1997, 4: 567–577. 10.1038/nsb0797-567View ArticlePubMedGoogle Scholar
- Podtelezhnikov AA, Gao K, Bushman FD, McCammon JA: Modeling HIV-1 integrase complexes based on their hydrodynamic properties. Biopolymers 2003, 68: 110–120. 10.1002/bip.10217View ArticlePubMedGoogle Scholar
- Chen A, Weber IT, Harrison RW, Leis J: Identification of amino acids in HIV-1 and avian sarcoma virus integrase subsites required for specific recognition of the long terminal repeat Ends. J Biol Chem 2006, 281: 4173–4182. 10.1074/jbc.M510628200PubMed CentralView ArticlePubMedGoogle Scholar
- Chen X, Tsiang M, Yu F, Hung M, Jones GS, Zeynalzadegan A, Qi X, Jin H, Kim CU, Swaminathan S, Chen JM: Modeling, analysis, and validation of a novel HIV integrase structure provide insights into the binding modes of potent integrase inhibitors. J Mol Biol 2008, 380: 504–519. 10.1016/j.jmb.2008.04.054View ArticlePubMedGoogle Scholar
- Hare S, Gupta SS, Valkov E, Engelman A, Cherepanov P: Retroviral intasome assembly and inhibition of DNA strand transfer. Nature 2010, 464: 232–236. 10.1038/nature08784PubMed CentralView ArticlePubMedGoogle Scholar
- Hare S, Vos AM, Clayton RF, Thuring JW, Cummings MD, Cherepanov P: Molecular mechanisms of retroviral integrase inhibition and the evolution of viral resistance. Proc Natl Acad Sci U S A 2010, 107: 20057–20062. 10.1073/pnas.1010246107PubMed CentralView ArticlePubMedGoogle Scholar
- Hare S, Maertens GN, Cherepanov P: 3’-processing and strand transfer catalysed by retroviral integrase in crystallo. EMBO J 2012, 31: 3020–3028. 10.1038/emboj.2012.118PubMed CentralView ArticlePubMedGoogle Scholar
- Gupta K, Curtis JE, Krueger S, Hwang Y, Cherepanov P, Bushman FD, Van Duyne GD: Solution conformations of prototype foamy virus integrase and its stable synaptic complex with u5 viral DNA. Structure 2012, 20: 1918–1928. 10.1016/j.str.2012.08.023PubMed CentralView ArticlePubMedGoogle Scholar
- Bojja RS, Andrake MD, Merkel G, Weigand S, Dunbrack RL Jr, Skalka AM: Architecture and assembly of HIV integrase multimers in the absence of DNA substrates. J Biol Chem 2013, 288: 7373–7386. 10.1074/jbc.M112.434431PubMed CentralView ArticlePubMedGoogle Scholar
- Young MM, Tang N, Hempel JC, Oshiro CM, Taylor EW, Kuntz ID, Gibson BW, Dollinger G: High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry. Proc Natl Acad Sci U S A 2000, 97: 5802–5806. 10.1073/pnas.090099097PubMed CentralView ArticlePubMedGoogle Scholar
- Livney YD, Schwan AL, Dalgleish DG: A study of beta-casein tertiary structure by intramolecular crosslinking and mass spectrometry. J Dairy Sci 2004, 87: 3638–3647. 10.3168/jds.S0022-0302(04)73502-XView ArticlePubMedGoogle Scholar
- Sinz A: Chemical cross-linking and mass spectrometry to map three-dimensional protein structures and protein-protein interactions. Mass Spectrom Rev 2006, 25: 663–682. 10.1002/mas.20082View ArticlePubMedGoogle Scholar
- Balasu MC, Spiridon LN, Miron S, Craescu CT, Scheidig AJ, Petrescu AJ, Szedlacsek SE: Interface analysis of the complex between ERK2 and PTP-SL. PLoS One 2009, 4: e5432. 10.1371/journal.pone.0005432PubMed CentralView ArticlePubMedGoogle Scholar
- Studier FW: Protein production by auto-induction in high density shaking cultures. Protein Expr Purif 2005, 41: 207–234. 10.1016/j.pep.2005.01.016View ArticlePubMedGoogle Scholar
- Yang F, Roth MJ: Assembly and catalysis of concerted two-end integration events by Moloney murine leukemia virus integrase. J Virol 2001, 75: 9561–9570. 10.1128/JVI.75.20.9561-9570.2001PubMed CentralView ArticlePubMedGoogle Scholar
- Acevedo ML, Arbildua JJ, Monasterio O, Toledo H, Leon O: Role of the 207–218 peptide region of Moloney murine leukemia virus integrase in enzyme catalysis. Arch Biochem Biophys 2010, 495: 28–34. 10.1016/j.abb.2009.12.018View ArticlePubMedGoogle Scholar
- Villanueva RA, Jonsson CB, Jones J, Georgiadis MM, Roth MJ: Differential multimerization of Moloney murine leukemia virus integrase purified under nondenaturing conditions. Virology 2003, 316: 146–160. 10.1016/S0042-6822(03)00559-2View ArticlePubMedGoogle Scholar
- Aloy P, Moont G, Gabb HA, Querol E, Aviles FX, Sternberg MJ: Modelling repressor proteins docking to DNA. Proteins 1998, 33: 535–549. 10.1002/(SICI)1097-0134(19981201)33:4<535::AID-PROT6>3.0.CO;2-DView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.