Comparative sequence and structure analysis of eIF1A and eIF1AD

Yu, Jielin; Marintchev, Assen

doi:10.1186/s12900-018-0091-6

Research article
Open access
Published: 04 September 2018

Comparative sequence and structure analysis of eIF1A and eIF1AD

BMC Structural Biology volume 18, Article number: 11 (2018) Cite this article

3538 Accesses
2 Citations
1 Altmetric
Metrics details

Abstract

Background

Eukaryotic translation initiation factor 1A (eIF1A) is universally conserved in all organisms. It has multiple functions in translation initiation, including assembly of the ribosomal pre-initiation complexes, mRNA binding, scanning, and ribosomal subunit joining. eIF1A binds directly to the small ribosomal subunit, as well as to several other translation initiation factors. The structure of an eIF1A homolog, the eIF1A domain-containing protein (eIF1AD) was recently determined but its biological functions are unknown. Since eIF1AD has a known structure, as well as a homolog, whose structure and functions have been extensively studied, it is a very attractive target for sequence and structure analysis.

Results

Structure/sequence analysis of eIF1AD found significant conservation in the surfaces corresponding to the ribosome-binding surfaces of its paralog eIF1A, including a nearly invariant surface-exposed tryptophan residue, which plays an important role in the interaction of eIF1A with the ribosome. These results indicate that eIF1AD may bind to the ribosome, similar to its paralog eIF1A, and could have roles in ribosome biogenenesis or regulation of translation. We identified conserved surfaces and sequence motifs in the folded domain as well as the C-terminal tail of eIF1AD, which are likely protein-protein interaction sites. The roles of these regions for eIF1AD function remain to be determined. We have also identified a set of trypanosomatid-specific surface determinants in eIF1A that could be a promising target for development of treatments against these parasites.

Conclusions

The results described here identify regions in eIF1A and eIF1AD that are likely to play major functional roles and are promising therapeutic targets. Our findings and hypotheses will promote new research and help elucidate the functions of eIF1AD.

Background

Translation initiation in eukaryotes is a multistep process involving over ten eukaryotic translation initiation factors (eIFs) (reviewed in [1,2,3,4]).

(1)
Several eIFs and the initiator Metionyl-tRNA (Met-tRNA_i) bind to the small ribosomal subunit, forming the pre-initiation complex (PIC). Met-tRNA_i is recruited to the ribosome in complex with the GTPase eIF2. eIF2 is an αβγ heterotrimer. eIF2γ is the actual GTPase, responsible for the bulk of the interaction with Met-tRNA_i, while eIF2α and β play accessory and regulatory roles. The N-terminal tail of eIF2β (eIF2β-NTT) contains three conserved poly-lysine stretches (K-boxes) that mediate binding to eIF5, which is the GTPase-activating protein (GAP) of eIF2.
(2)
The PIC is recruited to the 5′-end of the mRNA by the Cap-binding complex, composed of eIF4E, 4G, and 4A.
(3)
The PIC then scans the mRNA until it reaches a start codon in a proper nucleotide context.
(4)
Start codon recognition (basepairing of the Met-tRNA_i anticodon with the start codon) triggers major conformational rearrangements in the PIC, leading to the release of most eIFs and preparing the PIC for ribosomal subunit joining.
(5)
The last step in translation initiation is ribosomal subunit joining (binding of the large ribosomal subunit to the PIC), promoted by the GTPase eIF5B and eIF1A. eIF5B then hydrolyzes GTP and is released together with eIF1A, leaving behind a ribosome ready to translate the mRNA.

Translation initiation in bacteria is less complex, involving only three translation initiation factors (IFs), two of which, IF1 and IF2, are homologs of eIF1A and eIF5B, respectively. There is no scanning; instead, the small ribosomal subunit binds directly at the translation start site (reviewed in [3]). eIF1A is universally conserved in all Kingdoms of life. It shares with its bacterial homolog, IF1 the same binding site on the ribosome [5,6,7,8] and common functions. They both: (i) bind in the Aminoacyl-tRNA binding site (A-site) of the small ribosomal subunit and induce conformational changes in the ribosome, mimicking those caused by the binding of an aminoacyl-tRNA in the A-site [5, 7]; (ii) promote the assembly of the PIC at the start codon; and (iii) play a role in ribosomal subunit joining. Both IF1 and eIF1A have an oligonucleotide/oligosaccharide binding fold (OB) domain. eIF1A also has a helical subdomain, as well as N- and C-terminal tails (NTT and CTT, respectively) which are intrinsically disordered [9, 10] (Fig. 1a). eIF1A has acquired a number of eukaryote-specific functions and plays a role in virtually every step of the process of translation initiation. Together with other eIFs, eIF1A promotes PIC formation, mRNA binding, scanning, start codon selection, and ribosomal subunit joining. eIF1A has been reported to bind to several other eIFs: eIF2, eIF3, eIF5, and eIF5B; however, only the interaction interfaces with eIF5B have been mapped (reviewed in [1,2,3,4]). eIF1A and eIF1 were found to bind to the ribosome immediately adjacent to each other, although no productive interactions between the two proteins were observed [6, 7].

The first evidence for a second eIF1A homolog in eukaryotes, the eIF1A domain containing protein (eIF1AD), came from genome sequencing projects (see e.g. Human, Schizosaccharomyces pombe (S. pombe), Caenorhabditis elegans (C. elegans)). The protein has also been called Haponin in human [11, 12] and Obelix in chicken [13]. eIF1AD is typically annotated in databases as an RNA-binding protein and a translation initiation factor, owing to its homology to eIF1A; however, there is no available supporting experimental evidence for either. High-throughput expression, interaction and phenotype studies have provided limited information about the function of eIF1AD. Deletion of the gene in S. pombe caused abnormal cell shape, but showed that eIF1AD is not essential [14]. A number of alleles are reported in C. elegans, including an embryonic lethal (www.wormbase.org, gene ID ZK856.11), indicating that eIF1AD is essential in this organism. The protein was found to be preferentially localized to the nucleus in human [12], chicken [13], as well as S. pombe [15], which makes a role at least in canonical translation initiation unlikely. Yeast two-hybrid (Y2H) studies indicate that human eIF1AD interacts with the signal transducer and activator of transcription 1 (STAT1) transcription factor [16] and glyceraldehyde 3-phosphate dehydrogenase (GAPDH) [11]. The C. elegans eIF1AD homolog was reported to interact with ferritin heavy chains 1 and 2 in Y2H screens, while the Drosophila melanogaster (D. melanogaster) protein was found to bind to the transcription factor Extradenticle (Exd) [17, 18]. eIF1AD was found to be highly expressed in testes and ovaries in C. elegans and Drosophila (www.wormbase.org, gene ID ZK856.11; flybase.org, gene ID FBgn0051957) and upon neural induction in chicken [13]. Its overexpression in mammalian cells was reported to increase sensitivity to oxidative stress [12]. Thus, the available data fail to offer insights into the functions of eIF1AD. Recently, the Nuclear Magnetic Resonance (NMR) solution structure of human eIF1AD was solved by the Yokoyama group as part of the RIKEN Structural Genomics/Proteomics Initiative (2dgy.pdb). The structure shows significant similarity to the structure of its paralog eIF1A [9], as expected from the sequence homology between the two proteins. Like eIF1A, eIF1AD consists of a folded domain composed of an OB-fold and helical subdomains, flanked by intrinsically disordered N- and C-terminal tails.

eIF1AD is interesting in that its cellular function is unknown while at the same time its structure has been solved and it has a paralog (eIF1A) with extensively characterized functions and interactions. Based on the sequence and structure homology between eIF1A and eIF1AD, we reasoned that there may be conservation of the interaction surfaces of these two proteins. For example, eIF1A and eIF1AD could use the same surfaces for interactions with their respective ligands, or could even have a common interacting partner. If this is indeed the case, one can expect the corresponding surfaces to be conserved between the two proteins. Therefore, eIF1AD is a very promising candidate for applying bioinformatics sequence and structure analysis to generate hypotheses about its functions and interactions. In this work, we report remarkable conservation between the ribosome-binding surfaces of eIF1A and the corresponding regions in eIF1AD. These results indicate that eIF1AD may bind to the ribosome, similar to its paralog eIF1A, and could have roles in ribosome biogenesis or regulation of translation. We also identified potential protein-protein interaction motifs in eIF1AD. Our analysis of eIF1A identified a set of trypanosomatid-specific surface determinants that could be a promising target for development of treatments against these parasites.

The main goals of this work were to:

1.
Identify regions on the eIF1AD surface with high degree of sequence conservation, since these are likely to be functionally important ligand-binding sites.
2.
Compare surfaces conserved in eIF1AD with the corresponding regions in eIF1A. A surface conserved between the two proteins may serve the same function/bind to the same ligand.
3.
Analyze the sequence conservation of regions in eIF1A and eIF1AD in individual branches of the eukaryotic domain. The goal was to obtain insights into whether any functions/interactions mapped to the respective region are conserved among all eukaryotes or are restricted to certain groups of organisms. This analysis allows determining when it is appropriate to extrapolate results obtained from one species to others. Conversely, it can also point out important functional differences between model organisms.

Methods

Sequence homology searches and sequence alignments

We used a non-redundant protein PSI-BLAST [19] tool from NCBI (http://blast.ncbi.nlm.nih.gov/Blast.cgi) with maximum target sequences set to 20,000. The results were then curated manually based on E-value and protein length, to eliminate incomplete sequences. We extracted representative sets of sequences from the alignments, based of pairwise sequence identity and minimum coverage using HHfilter [20, 21] from the Max-Planck Institute for Developmental Biology Bioinformatics Toolkit (http://toolkit.tuebingen.mpg.de). The sequences were aligned using ClustalW [22, 23] through the Max-Planck Institute for Developmental Biology Bioinformatics Toolkit (http://toolkit.tuebingen.mpg.de/). The alignment results were checked manually. ClustalW multiple sequence alignments and protein structures were used as input for ESPript [24] (http://espript.ibcp.fr) to produce sequence alignments color-coded for sequence conservation, also showing secondary structure elements and solvent accessibility.

Protein structure analysis

We used Molmol [25] for structure analysis and visualization. For homology modeling of protein structures we used Swiss Model [26], in alignment mode. The sequence alignments were obtained using ClustalW [22, 23]. Sequence conservation was mapped onto the protein structures in Molmol, using Protskin [27] (http://www.mcgnmr.mcgill.ca/ProtSkin/). The consensus sequence as well as the conservation scores were both recorded. A threshold similarity score for conservation was selected based on the distribution of scores for the particular set of sequences as well as the percent identity.

Results

As the first step in this work, we analyzed the sequence conservation of functionally important regions in eIF1A. The goals were twofold. The first one was to provide a reference point required for the subsequent analysis of eIF1AD. The second goal was to find out whether individual eIF1A interactions are conserved among all eukaryotes or only within certain branches of the eukaryotic domain, since this type of analysis has not been previously performed on eIF1A.

Sequence conservation of surfaces in eIF1A

eIF1A is involved in a number of protein/protein and protein/RNA interactions [6,7,8, 28, 29]. Therefore, we set out to analyze whether each of the respective interaction surfaces in the protein is conserved. As described above, eIF1A is composed of a folded domain surrounded by an N-terminal and a C-terminal tails, which are intrinsically disordered (Fig. 1a). The folded domain itself consists of an OB-fold subdomain and a helical subdomain [9] (Fig. 1a). Extensive surfaces in the folded domain, as well as the NTT bind to the small 40S ribosomal subunit [6,7,8] (Fig. 1b, colored blue in Fig. 2a). On the 40S subunit, eIF1A also comes in close proximity to eIF1 (Fig. 1b, the corresponding eIF1A surface is colored cyan in Fig. 2a); however there are no obvious productive interactions [6, 7]. The extreme C-terminus of eIF1A binds to the C-terminal domain of eIF5B (eIF5B-CTD) [28, 29] (red in Fig. 2a). Another segment of eIF1A-CTT, closer to the folded domain (orange in Fig. 2a), plays a role in maintaining the stringency of start codon recognition [30], but it is not clear whether this is mediated by protein/protein interactions. The folded domain of eIF1A also contacts eIF5B (gold in Fig. 2a) [6,7,8, 31, 32]. The remaining surfaces of eIF1A (grey in Fig. 2a) were considered a separate group in the analysis.

The eIF1A sequence conservation is very high: nearly identical among all mammals; > 80% identity among vertebrates, with the zebra fish (Danio rerio) sequence, for instance, being 99% identical to that of human eIF1A. Even the sequence and length of the intrinsically disordered tails are conserved. Comparisons among the different eukaryotic kingdoms show that the NTT and the folded domain of eIF1A remain well conserved, whereas the sequence and length of the CTT are less conserved (Fig. 3, Fig. 2b). For example, human and Saccharomyces cerevisiae (S. cerevisiae) eIF1A sequences have 62% identity overall and 69% identity with no gaps over the NTT and the folded domain (excluding the CTT).

As expected, the ribosome-binding surfaces are most conserved, with clear conservation of positively charged residues (compare Fig. 2a, b, and c). By far the most conserved surface-exposed residue is the invariant W69 (Fig. 3, labeled in Fig. 2b, c), at the ribosome binding surface. The eIF1A surface facing eIF1 is least conserved (compare Fig. 2a, b, and c).

Overall, we did not observe differential conservation in individual branches of the eukaryotic lineage (Fig. 3), with the two notable exceptions discussed below.

In roundworms (Phylum Nematoda) and flatworms (Phylum Platyhelminthes), the eIF1A C-terminus has no discernible eIF5B-CTD binding motif and carries a positive charge. Nematodes and Platyhelminthes belong to different clades: Ecdysozoans and Lophotrochozoans, respectively. eIF1A sequences from species belonging to other phyla from both of these clades, e.g. Arthropoda (Ecdysozoans) or Annelida and Mollusca (Lophotrochozoans) have a conserved eIF5B-CTD binding site and the entire eIF1A-CTT is negatively charged (Fig. 3). Therefore, the loss of the eIF5B-CTD binding site and the added positive charges must have occurred twice in evolution.

In trypanosomatids, eIF1A shows markedly different sequence conservation pattern, compared to any other group of organisms (Fig. 3, Fig. 4a). There is a trypanosomatid-specific area with substantial hydrophobicity located on the ribosome-binding surface (compare the circled region on the Trypanosoma vivax (T. vivax) eIF1A structure, Fig. 4a, left, with the corresponding region in human eIF1A, Fig. 4a, right, and Fig. 4b, left). The eIF1A C-terminus also has a segment with high hydrophobicity unique to trypanosomatids (Fig. 3, Fig. 4a, left).

Homology between eIF1A and eIF1AD

eIF1AD is present only in eukaryotes. It must have been present in the last common ancestor of all eukaryotes, because some eukaryotes that have branched out early, e.g. Giardia theta, have an eIF1AD gene. At the same time, eIF1AD has been lost in a number of eukaryotes, including S. cerevisiae.

Like eIF1A, eIF1AD has an OB domain surrounded by two intrinsically disordered tails (Fig. 5a). The sequence homology between the two proteins is highest in the folded domain with 23% identity and 34% homology (Fig. 5b). While the NTT sequences are not well conserved, they both have an overall positive charge. No similarity exists between the CTT sequences of eIF1A and eIF1AD (compare Figs. 3 and 6).

The eIF1AD sequence conservation (Fig. 6) is not as high as that of eIF1A (see Fig. 2): > 80% identity among mammals; > 50% identity among vertebrates. As with eIF1A, the NTT and the folded domain of eIF1AD are well conserved, whereas the sequence and length of the CTT are not (Fig. 6). Analysis of surface-exposed residues in eIF1AD shows that the surfaces corresponding to ribosome-binding surfaces in eIF1A are among the best-conserved, with a substantial number of positively charged residues (Fig. 7a, b, compare with Fig. 2).

The folded domain of eIF1AD has a conserved surface with significant hydrophobicity (circled in Fig. 7b, right), which corresponds to part of the ribosome-binding surface in eIF1A, but is less conserved. This surface is a likely site of protein-protein interactions, since solvent-exposed hydrophobic residues are both energetically unfavorable and can contribute to stability and specificity of interactions. If eIF1AD does indeed bind to the ribosome in the same way as eIF1A, this surface would become buried at the interface.

While the eIF1AD C-terminal tail as a whole is not conserved, it contains highly conserved sequence motifs, which are likely protein-protein interactions sites (Fig. 6). One of the conserved motifs is found in most eukaryotes, but absent in Fungi and one branch of Metazoans: members of the phylum Platyhelminthes. We designate this motif here as “NTNR” (Asn-Thr-Asn-Arg), after the most conserved core of residues. The second motif is found in Fungi and the phyla Platyhelminthes, Mollusca, Anelida, and Nematoda. We designate this motif here as “LPPS” (Leu-Pro-Pro-Ser), after the most conserved core of residues. eIF1AD from Mollusca, Anelida, and Nematoda has both motifs (Fig. 6), which indicates that these are two independent sequence motifs, with independent functions. In plants, the eIF1AD-CTT contains an extended NTNR sequence motif, where the first portion of the sequence resembles the LPPS motif. It is thus possible that this motif has resulted from merging the NTNR and LPPS motifs in tandem in the plant eIF1AD sequence. Thus, the C-terminal tail of eIF1AD contains a set of sequence motifs that vary in consensus, and likely also function, among different branches of eukaryotes.

Discussion

eIF1A is one of only two universally conserved translation initiation factors, found in every organism, from bacteria to human. There is also very high degree of conservation in eIF1A sequences among eukaryotes (Fig. 3). The invariant W69, at the ribosome binding surface (Fig. 3, labeled in Fig. 2b, c), is also conserved in the archaeal eIF1A homolog aIF1A, whereas the bacterial homolog IF1 has an arginine at this position (not shown). Remarkably, W69 was recently found to form a stacking interaction with a functionally important base A1709 in the Tetrahymena 18S small ribosomal RNA (rRNA), stabilizing it in a flipped-out conformation [7]. A1709 in helix 44 of the Tetrahymena 18S rRNA (A1819 in rabbit) corresponds to A1493 in Escherichia coli (E. coli), which in its flipped-out form “inspects” the proper codon-anticodon basepairing in the Aminoacyl-tRNA site (A-site) of the ribosome [33]. During translation initiation in bacteria, A1493 is also flipped out by the bacterial eIF1A homolog IF1, thus inducing conformational changes in the ribosome that can mimic the presence of a tRNA in the A-site [5]. A W69A mutant was found to cause a defect in start codon recognition (48S initiation complex formation) in vitro and appearance of aberrant 48S complexes with mRNA not positioned correctly on the ribosome [9]. The defect in 48S complex formation was not as drastic as could have been expected from the exceptionally high degree of conservation of W69 and its observed interaction with A1709, and the mutation had little effect on the assembly of the pre-initiation complex off mRNA (43S complex formation) [9]. Therefore, it appears that the main role of W69 is to induce conformational changes in the 40S ribosomal subunit, rather than in eIF1A binding to the ribosome per se.

The eIF1A surface facing eIF1 is least conserved (compare Fig. 2a, b, and c), consistent with the observation that there are no productive contacts between the two proteins in the 40S/eIF1A/eIF1 crystal structure [6, 7]. This indicates that the observed ~ 10-fold cooperativity of eIF1A and eIF1 binding to the 40S ribosomal subunit [34] is likely mediated by both proteins promoting similar conformational changes in the ribosome. It is also possible that direct contacts between eIF1 and eIF1A do contribute to the cooperativity, since binding to the ribosome places eIF1A and eIF1 in such close proximity that even weak interactions between the two proteins could have a stabilizing effect.

The eIF5B-CTD binding motif at the eIF1A C-terminus is conserved in almost all eukaryotic species, except roundworms (Phylum Nematoda) and flatworms (Phylum Platyhelminthes), where the eIF1A-CTT carries a positive charge, instead (Fig. 3). The interaction between eIF1A-CTT and eIF5B-CTD was found to be important for ribosomal subunit joining in S. cerevisiae [35]. It is thus interesting to know whether eIF1A-CTT still plays the same role in these worms. There are no compensatory changes in the respective surface on eIF5B-CTD (not shown); therefore, it is highly unlikely that the eIF1A-CTT can still bind there. Alternatively, since eIF5B-CTD interacts with the large ribosomal subunit, the positive charge of eIF1A in these species could allow it to bind to the rRNA in the vicinity of eIF5B-CTD.

As described above (Fig. 3), trypanosomatid eIF1A sequences appear to have diverged from the consensus in the rest of eukaryotes. Regions with increased hydrophobicity, conserved among trypanosomatids, but not other species, are observed in the NTT, CTT, and certain surfaces of the folded domain (Fig. 3, Fig. 4a). These regions may be sites of novel trypanosomatid-specific interactions. While trypanosomatid translation initiation is still not fully understood, there are a number of known differences from other eukaryotes. We reported recently that the eIF1A C-terminus dynamically contacts the ribosome-binding surface of eIF1A, an interaction that is disrupted when eIF1A binds to the ribosome ([36], Fig. 4b, right). Therefore, the two trypanosomatid-specific hydrophobic segments likely contact each other when eIF1A is not ribosome-bound. However, once eIF1A is bound to the ribosome, both its NTT and CTT are free to interact with other proteins [8, 30]. eIF1A interacts with eIF5B via regions adjacent to the trypanosomatid-specific hydrophobic surfaces [28, 29, 36] (see also Fig. 2a). eIF1A is known to interact with eIF2, eIF3 and the C-terminal domain of eIF5 (eIF5-CTD) [29, 37, 38], and on the ribosome, both eIF1A-NTT and -CTT are in proximity to eIF2, eIF3c, and eIF5, as well as to eIF1 [8, 30, 38,39,40]. We did not observe any trypanosomatid-specific hydrophobic surfaces in eIF1 or eIF5B (data not shown). Therefore, it is unlikely that the trypanosomatid-specific hydrophobic surfaces in eIF1A affect the interactions with eIF1 or eIF5B. The degree of sequence conservation of eIF3c and eIF5-CTD is too low for meaningful analysis and we cannot make any predictions about their interactions with eIF1A. Two of the three eIF2 subunits, α and β, have trypanosomatid-specific characteristics: eIF2α has a small N-terminal domain not found in most other eukaryotes, while eIF2β-NTT lacks the three conserved K-boxes that in other eukaryotes bind eIF5-CTD (see above). Thus, while the functional significance of these differences is not known, both eIF2α and -β could interact with the unique hydrophobic surfaces in trypanosomatid eIF1A. It is of course possible that the trypanosomatid-specific surfaces in eIF1A form novel interactions unique to trypanosomatids. For instance, trypanosomatid mRNAs are first transcribed as large polycistronic mRNAs and their maturation involves trans-splicing, adding a ~ 40 nt capped leader sequence to every mRNA. Trypanosomatids contain multiple eIF4E and eIF4G isoforms [41,42,43,44,45,46]. It was recently reported that in mammals, the PIC inspects the mRNA from the very 5′-end, placing the cap-binding complex in the vicinity of the ribosomal A- and P-sites at the beginning of scanning [47]. If this is also the case in trypanosomatids, then eIF1A could also be involved in trypanosomatid-specific interactions with eIF4A, 4E, and/or 4G. Since a number of trypanosomatids are parasites, a unique hydrophobic region on the ribosome-binding surface of an essential protein like eIF1A is a promising therapeutic target.

While eIF1AD is present in most eukaryotes, its sequence conservation s not as high as that of eIF1A (Fig. 6). Nevertheless, there is significant conservation not only among eIF1AD sequences, but also between eIF1A and eIF1AD (Fig. 5b). The majority of surface-exposed residues conserved between eIF1A and eIF1AD map to the ribosome-binding surface of eIF1A (Fig. 7c). The most conserved surface-exposed residue in eIF1AD, the almost invariant W62 (labeled in Fig. 7) also maps to the ribosome-binding surface. Remarkably, eIF1AD W62 corresponds to the most conserved residue in eIF1A, W69 (Fig. 2, Fig. 3b). As discussed above, W69 is involved in promoting conformational changes in the ribosome upon eIF1A binding [7]. The significant sequence conservation between the ribosome-binding surfaces of eIF1A and the corresponding regions of eIF1AD indicates that eIF1AD is also likely to bind to the ribosome, or rRNA. This hypothesis is further strengthened by the observation that the same tryptophan residue known to be important for eIF1A function is the most conserved surface-exposed residue in both proteins. Furthermore, the high degree of sequence conservation in the positively charged eIF1AD-NTT indicates that it may also be involved in ribosome binding, similar to eIF1A-NTT. If this is indeed the case, the function of such eIF1AD interaction remains to be determined. The eIF5B-binding regions of eIF1A are not conserved with eIF1AD, indicating that eIF1AD does not interact with eIF5B. Therefore, eIF1AD either does not act as an alternative translation initiation factor or it functions in a unique pathway that does not involve eIF5B. A number of homologs of canonical translation initiation factors have been described. In most, if not all, of the cases where the functions of these homologs are known, they are in some way or other related to translation. For example, Pdcd4, PAIP1, 5MP1, and 5MP2, which are homologs of eIF4G and eIF5, are translation regulators [48,49,50,51]. The nuclear Cap-Binding Complex, composed of CBP80 (an eIF4G homolog) and CBP20 (homologous to eIF4B and 4H) [52] plays multiple roles in transcription, mRNA maturation and export, as well as the pioneer round of translation and Nonsense-Mediated Decay (NMD) [53, 54]. eIF4A3 (a homolog of eIF4A) is part of the Exon Junction Complex and is involved in mRNA export and NMD [54,55,56], as well as in rRNA biogenesis, together with another eIF4G homolog, NOM1 [57]. Therefore, eIF1AD is likely involved in one or more of these processes. Since eIF1AD has been found to be localized predominantly in the nucleus [12, 13, 15], direct role in translation regulation is somewhat less likely. Instead, it could play roles in regulation of ribosome biogenesis or mRNA maturation.

Conclusions

In summary, our structure/sequence analysis of eIF1AD found significant conservation in the surfaces corresponding to the ribosome-binding surfaces of its paralog eIF1A. Remarkably, both protein families share a nearly invariant surface-exposed tryptophan residue, which plays an important role in the interaction of eIF1A with the ribosome. These results indicate that eIF1AD may bind to the ribosome, similar to its paralog eIF1A, and could have roles in ribosome biogenenesis or regulation of translation. We also identified conserved surfaces and sequence motifs in the folded domain as well as the CTT of eIF1AD, which are likely protein-protein interaction sites. The roles of these regions for eIF1AD function remain to be determined. Furthermore, our analysis of eIF1A identified a set of trypanosomatid-specific surface determinants that could be a promising target for development of treatments against these parasites. We expect that the results and hypotheses described here will promote new research and help elucidate the functions of eIF1AD.

Abbreviations

A-site:: Aminoacyl-tRNA site
C. elegans :: Caenorhabditis elegans
CTD:: C-terminal domain
CTT:: C-terminal tail
D. melanogaster :: Drosophila melanogaster
E. coli :: Escherichia coli
eIF1A:: Eukaryotic translation initiation factor 1A
eIF1AD:: eIF1A domain-containing protein
Exd :: Extradenticle
GAPDH:: Glyceraldehyde 3-phosphate dehydrogenase
IF1:: Initiation factor 1
NMD:: Nonsense-Mediated Decay
NMR:: Nuclear Magnetic Resonance
NTT:: N-terminal tail
OB:: Oligonucleotide/oligosaccharide binding fold
rpS:: Small ribosomal subunit protein
rRNA:: ribosomal RNA
S. cerevisiae :: Saccharomyces cerevisiae
S. pombe :: Schizosaccharomyces pombe
STAT1:: Signal transducer and activator of transcription 1
T. vivax :: Trypanosoma vivax
Y2H:: Yeast two-hybrid

References

Hinnebusch AG. Molecular mechanism of scanning and start codon selection in eukaryotes. Microbiol Mol Biol Rev. 2011;75(3):434–67.
Article PubMed PubMed Central CAS Google Scholar
Jackson RJ, Hellen CU, Pestova TV. The mechanism of eukaryotic translation initiation and principles of its regulation. Nat Rev Mol Cell Biol. 2010;11(2):113–27.
Article PubMed PubMed Central CAS Google Scholar
Marintchev A, Wagner G. Translation initiation: structures, mechanisms and evolution. Q Rev Biophys. 2004;37(3–4):197–284.
PubMed CAS Google Scholar
Sonenberg N, Hinnebusch AG. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell. 2009;136(4):731–45.
Article PubMed PubMed Central CAS Google Scholar
Carter AP, Clemons WM Jr, Brodersen DE, Morgan-Warren RJ, Hartsch T, Wimberly BT, Ramakrishnan V. Crystal structure of an initiation factor bound to the 30S ribosomal subunit. Science. 2001;291(5503):498–501.
Article PubMed CAS Google Scholar
Lomakin IB, Steitz TA. The initiation of mammalian protein synthesis and mRNA scanning mechanism. Nature. 2013;500(7462):307–11.
Article PubMed PubMed Central CAS Google Scholar
Weisser M, Voigts-Hoffmann F, Rabl J, Leibundgut M, Ban N. The crystal structure of the eukaryotic 40S ribosomal subunit in complex with eIF1 and eIF1A. Nat Struct Mol Biol. 2013;20(8):1015–7.
Article PubMed CAS Google Scholar
Yu Y, Marintchev A, Kolupaeva VG, Unbehaun A, Veryasova T, Lai SC, Hong P, Wagner G, Hellen CU, Pestova TV. Position of eukaryotic translation initiation factor eIF1A on the 40S ribosomal subunit mapped by directed hydroxyl radical probing. Nucleic Acids Res. 2009;37(15):5167–82.
Article PubMed PubMed Central CAS Google Scholar
Battiste JL, Pestova TV, Hellen CU, Wagner G. The eIF1A solution structure reveals a large RNA-binding surface important for scanning function. Mol Cell. 2000;5(1):109–19.
Article PubMed CAS Google Scholar
Sette M, van Tilborg P, Spurio R, Kaptein R, Paci M, Gualerzi CO, Boelens R. The structure of the translational initiation factor IF1 from E.Coli contains an oligomer-binding motif. EMBO J. 1997;16(6):1436–43.
Article PubMed PubMed Central CAS Google Scholar
Rakitina TV, Bogatova OV, Smirnova EV, Pozdeev VI, Kostanian IA, Lipkin VM. Haponin (eIF1AD) interacts with glyceraldehyde 3-phosphate dehydrogenase in the CHO-K1 cell line. Bioorg Khim. 2010;36(3):312–8.
PubMed CAS Google Scholar
Smirnova EV, Rakitina TV, Bogatova OV, Ivanova DL, Vorobyeva EE, Lipkin AV, Kostanyan IA, Lipkin VM. Novel protein haponin regulates cellular response to oxidative stress. Dokl Biochem Biophys. 2011;440:225–7.
Article PubMed CAS Google Scholar
Pinho S, Simonsson PR, Trevers KE, Stower MJ, Sherlock WT, Khan M, Streit A, Sheng G, Stern CD. Distinct steps of neural induction revealed by Asterix, Obelix and TrkC, genes induced by different signals from the organizer. PLoS One. 2011;6(4):e19157.
Article PubMed PubMed Central CAS Google Scholar
Hayles J, Wood V, Jeffery L, Hoe KL, Kim DU, Park HO, Salas-Pino S, Heichinger C, Nurse P. A genome-wide resource of cell cycle and cell shape genes of fission yeast. Open Biol. 2013;3(5):130053.
Article PubMed PubMed Central CAS Google Scholar
Matsuyama A, Arai R, Yashiroda Y, Shirai A, Kamata A, Sekido S, Kobayashi Y, Hashimoto A, Hamamoto M, Hiraoka Y, et al. ORFeome cloning and global analysis of protein localization in the fission yeast Schizosaccharomyces pombe. Nat Biotechnol. 2006;24(7):841–7.
Article PubMed CAS Google Scholar
Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature. 2005;437(7062):1173–8.
Article PubMed CAS Google Scholar
Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E, et al. A protein interaction map of Drosophila melanogaster. Science. 2003;302(5651):1727–36.
Article PubMed CAS Google Scholar
Li S, Armstrong CM, Bertin N, Ge H, Milstein S, Boxem M, Vidalain PO, Han JD, Chesneau A, Hao T, et al. A map of the interactome network of the metazoan C. Elegans. Science. 2004;303(5657):540–3.
Article PubMed PubMed Central CAS Google Scholar
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
Article PubMed PubMed Central CAS Google Scholar
Remmert M, Biegert A, Hauser A, Soding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods. 2011;9(2):173–5.
Article PubMed CAS Google Scholar
Zimmermann L, Stephens A, Nam SZ, Rau D, Kubler J, Lozajic M, Gabler F, Soding J, Lupas AN, Alva V. A Completely Reimplemented MPI bioinformatics toolkit with a new HHpred server at its Core. J Mol Biol. 2017;430(15):2237–43.
Article PubMed CAS Google Scholar
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947–8.
Article PubMed CAS Google Scholar
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673–80.
Article PubMed PubMed Central CAS Google Scholar
Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014;42(Web Server issue):W320–4.
Article PubMed PubMed Central CAS Google Scholar
Koradi R, Billeter M, Wuthrich K. MOLMOL: a program for display and analysis of macromolecular structures. J Mol Graph. 1996;14(1):51–5. 29-32
Article PubMed CAS Google Scholar
Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, Kiefer F, Gallo Cassarino T, Bertoni M, Bordoli L, et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 2014;42(Web Server issue):W252–8.
Article PubMed PubMed Central CAS Google Scholar
Ritter B, Denisov AY, Philie J, Deprez C, Tung EC, Gehring K, McPherson PS. Two WXXF-based motifs in NECAPs define the specificity of accessory protein binding to AP-1 and AP-2. EMBO J. 2004;23(19):3701–10.
Article PubMed PubMed Central CAS Google Scholar
Marintchev A, Kolupaeva VG, Pestova TV, Wagner G. Mapping the binding interface between human eukaryotic initiation factors 1A and 5B: a new interaction between old partners. Proc Natl Acad Sci U S A. 2003;100(4):1535–40.
Article PubMed PubMed Central CAS Google Scholar
Olsen DS, Savner EM, Mathew A, Zhang F, Krishnamoorthy T, Phan L, Hinnebusch AG. Domains of eIF1A that mediate binding to eIF2, eIF3 and eIF5B and promote ternary complex recruitment in vivo. EMBO J. 2003;22(2):193–204.
Article PubMed PubMed Central CAS Google Scholar
Saini AK, Nanda JS, Lorsch JR, Hinnebusch AG. Regulatory elements in eIF1A control the fidelity of start codon selection by modulating tRNA(i) (met) binding to the ribosome. Genes Dev. 2010;24(1):97–110.
Article PubMed PubMed Central CAS Google Scholar
Unbehaun A, Marintchev A, Lomakin IB, Didenko T, Wagner G, Hellen CU, Pestova TV. Position of eukaryotic initiation factor eIF5B on the 80S ribosome mapped by directed hydroxyl radical probing. EMBO J. 2007;26(13):3109–23.
Article PubMed PubMed Central CAS Google Scholar
Fernandez IS, Bai XC, Hussain T, Kelley AC, Lorsch JR, Ramakrishnan V, Scheres SH. Molecular architecture of a eukaryotic translational initiation complex. Science. 2013;342(6160):1240585.
Article PubMed CAS Google Scholar
Ogle JM, Brodersen DE, Clemons WM Jr, Tarry MJ, Carter AP, Ramakrishnan V. Recognition of cognate transfer RNA by the 30S ribosomal subunit. Science. 2001;292(5518):897–902.
Article PubMed CAS Google Scholar
Maag D, Lorsch JR. Communication between eukaryotic translation initiation factors 1 and 1A on the yeast small ribosomal subunit. J Mol Biol. 2003;330(5):917–24.
Article PubMed CAS Google Scholar
Acker MG, Shin BS, Dever TE, Lorsch JR. Interaction between eukaryotic initiation factors 1A and 5B is required for efficient ribosomal subunit joining. J Biol Chem. 2006;281(13):8469–75.
Article PubMed CAS Google Scholar
Nag N, Lin KY, Edmonds KA, Yu J, Nadkarni D, Marintcheva B, Marintchev A. eIF1A/eIF5B interaction network and its functions in translation initiation complex assembly and remodeling. Nucleic Acids Res. 2016;44(15):7441–56.
PubMed PubMed Central CAS Google Scholar
Luna RE, Arthanari H, Hiraishi H, Akabayov B, Tang L, Cox C, Markus MA, Luna LE, Ikeda Y, Watanabe R, et al. The interaction between eukaryotic initiation factor 1A and eIF5 retains eIF1 within scanning preinitiation complexes. Biochemistry. 2013;52(52):9510–8.
Article PubMed PubMed Central CAS Google Scholar
Luna RE, Arthanari H, Hiraishi H, Nanda J, Martin-Marcos P, Markus MA, Akabayov B, Milbradt AG, Luna LE, Seo HC, et al. The C-terminal domain of eukaryotic initiation factor 5 promotes start codon recognition by its dynamic interplay with eIF1 and eIF2beta. Cell Rep. 2012;1(6):689–702.
Article PubMed PubMed Central CAS Google Scholar
Hussain T, Llacer JL, Fernandez IS, Munoz A, Martin-Marcos P, Savva CG, Lorsch JR, Hinnebusch AG, Ramakrishnan V. Structural changes enable start codon recognition by the eukaryotic translation initiation complex. Cell. 2014;159(3):597–607.
Article PubMed PubMed Central CAS Google Scholar
Llacer JL, Hussain T, Marler L, Aitken CE, Thakur A, Lorsch JR, Hinnebusch AG, Ramakrishnan V. Conformational differences between open and closed states of the eukaryotic translation initiation complex. Mol Cell. 2015;59(3):399–412.
Article PubMed PubMed Central CAS Google Scholar
Agabian N. Trans splicing of nuclear pre-mRNAs. Cell. 1990;61(7):1157–60.
Article PubMed CAS Google Scholar
Bonen L. Trans-splicing of pre-mRNA in plants, animals, and protists. FASEB J. 1993;7(1):40–6.
Article PubMed CAS Google Scholar
Fernandez-Moya SM, Estevez AM. Posttranscriptional control and the role of RNA-binding proteins in gene regulation in trypanosomatid protozoan parasites. Wiley Interdiscip Rev RNA. 2010;1(1):34–46.
Article PubMed CAS Google Scholar
Freire ER, Sturm NR, Campbell DA, de Melo Neto OP. The role of cytoplasmic mRNA cap-binding protein complexes in Trypanosoma brucei and other Trypanosomatids. Pathogens. 2017;6:4.
Article Google Scholar
Pereira MM, Malvezzi AM, Nascimento LM, Lima TD, Alves VS, Palma ML, Freire ER, Moura DM, Reis CR, de Melo Neto OP. The eIF4E subunits of two distinct trypanosomatid eIF4F complexes are subjected to differential post-translational modifications associated to distinct growth phases in culture. Mol Biochem Parasitol. 2013;190(2):82–6.
Article PubMed CAS Google Scholar
Yoffe Y, Leger M, Zinoviev A, Zuberek J, Darzynkiewicz E, Wagner G, Shapira M. Evolutionary changes in the Leishmania eIF4F complex involve variations in the eIF4E-eIF4G interactions. Nucleic Acids Res. 2009;37(10):3243–53.
Article PubMed PubMed Central CAS Google Scholar
Kumar P, Hellen CU, Pestova TV. Toward the mechanism of eIF4F-mediated ribosomal attachment to mammalian capped mRNAs. Genes Dev. 2016;30(13):1573–88.
Article PubMed PubMed Central CAS Google Scholar
Craig AW, Haghighat A, Yu AT, Sonenberg N. Interaction of polyadenylate-binding protein with the eIF4G homologue PAIP enhances translation. Nature. 1998;392(6675):520–3.
Article PubMed CAS Google Scholar
Loughran G, Firth AE, Atkins JF, Ivanov IP. Translational autoregulation of BZW1 and BZW2 expression by modulating the stringency of start codon selection. PLoS One. 2018;13(2):e0192648.
Article PubMed PubMed Central CAS Google Scholar
Tang L, Morris J, Wan J, Moore C, Fujita Y, Gillaspie S, Aube E, Nanda J, Marques M, Jangal M, et al. Competition between translation initiation factor eIF5 and its mimic protein 5MP determines non-AUG initiation rate genome-wide. Nucleic Acids Res. 2017;45(20):11941–53.
Article PubMed PubMed Central CAS Google Scholar
Yang HS, Jansen AP, Komar AA, Zheng X, Merrick WC, Costes S, Lockett SJ, Sonenberg N, Colburn NH. The transformation suppressor Pdcd4 is a novel eukaryotic translation initiation factor 4A binding protein that inhibits translation. Mol Cell Biol. 2003;23(1):26–37.
Article PubMed PubMed Central CAS Google Scholar
Marintchev A, Wagner G. eIF4G and CBP80 share a common origin and similar domain organization: implications for the structure and function of eIF4G. Biochemistry. 2005;44(37):12265–72.
Article PubMed CAS Google Scholar
Gonatopoulos-Pournatzis T, Cowling VH. Cap-binding complex (CBC). Biochem J. 2014;457(2):231–42.
Article PubMed CAS Google Scholar
Maquat LE, Tarn WY, Isken O. The pioneer round of translation: features and functions. Cell. 2010;142(3):368–74.
Article PubMed PubMed Central CAS Google Scholar
Andersen CB, Ballut L, Johansen JS, Chamieh H, Nielsen KH, Oliveira CL, Pedersen JS, Seraphin B, Le Hir H, Andersen GR. Structure of the exon junction core complex with a trapped DEAD-box ATPase bound to RNA. Science. 2006;313(5795):1968–72.
Article PubMed CAS Google Scholar
Bono F, Ebert J, Lorentzen E, Conti E. The crystal structure of the exon junction complex reveals how it maintains a stable grip on mRNA. Cell. 2006;126(4):713–25.
Article PubMed CAS Google Scholar
Alexandrov A, Colognori D, Steitz JA. Human eIF4AIII interacts with an eIF4G-like partner, NOM1, revealing an evolutionarily conserved function outside the exon junction complex. Genes Dev. 2011;25(10):1078–90.
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

The authors thank the team of scientists developing and maintaining the Bioinformatics Toolkit at the Max-Planck Institute, which was an extremely valuable resource.

Funding

This work was supported by National Institutes of Health Grant GM095720 to A.M.

Availability of data and materials

The raw datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations

Department of Physiology and Biophysics, Boston University School of Medicine, Boston, MA, USA
Jielin Yu & Assen Marintchev

Authors

Jielin Yu
View author publications
You can also search for this author in PubMed Google Scholar
Assen Marintchev
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JY helped design the study, performed sequence and structure analysis, and helped draft the manuscript. AM conceived and designed the study, carried out homology modeling, and drafted the manuscript. Both authors contributed to analysis and interpretation of data and finalized and approved the manuscript.

Corresponding author

Correspondence to Assen Marintchev.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Yu, J., Marintchev, A. Comparative sequence and structure analysis of eIF1A and eIF1AD. BMC Struct Biol 18, 11 (2018). https://doi.org/10.1186/s12900-018-0091-6

Download citation

Received: 13 March 2018
Accepted: 24 August 2018
Published: 04 September 2018
DOI: https://doi.org/10.1186/s12900-018-0091-6

Comparative sequence and structure analysis of eIF1A and eIF1AD