Identification of hemagglutinin structural domain and polymorphisms which may modulate swine H1N1 interactions with human receptor
© Veljkovic et al; licensee BioMed Central Ltd. 2009
Received: 18 June 2009
Accepted: 28 September 2009
Published: 28 September 2009
The novel A/H1N1 influenza virus, which recently emerged in North America, is most closely related to North American H1N1/N2 swine viruses. Until the beginning of 2009, North American swine H1N1/N2 viruses had only sporadically infected humans as dead-end hosts. In 2009 the A/H1N1 virus acquired the capacity to spread efficiently by human-to-human transmission. The novel A/H1N1 influenza virus has struck thousands of people in more than 70 countries and killed more than 140, representing a public health emergency of international concern. Here we have studied properties of the hemagglutinin of A/H1N1 which may modulate virus/receptor interaction.
Analyses by the ISM bioinformatics platform of the HA1 protein of North American swine H1N1/N2 viruses and the new A/H1N1 showed that the two groups of viruses differed in conserved characteristics that reflect a distinct propensity of these viruses to undergo a specific interaction with swine or human host proteins or receptors. Swine H1N1/N2 viruses that sporadically infected humans featured both the swine and the human interaction pattern. Substitutions F71S, T128S, E302K, M314L in HA1 of swine H1N1 viruses from North America are identified as critical for the human interaction pattern of A/H1N1 and residues D94, D196 and D274 are predicted to be "hot-spots" for polymorphisms which could increase infectivity of A/H1N1 virus. At least one of these residues has already emerged in the A/H1N1 isolates from Spain, Italy and USA. The domain 286-326 was identified to be involved in virus/receptor interaction.
Our results (i) contribute to better understanding of the origin of the novel A/H1N1 influenza virus, (ii) provide a tool for monitoring its molecular evolution, (iii) predict hotspots associated with enhanced infectivity in humans and (iv) identify therapeutic and diagnostic targets for prevention and treatment of A/H1N1 infection.
Sporadic infections of humans by swine influenza viruses have been reported from the United States and worldwide, mostly from classical swine influenza [1–3]. During the late nineties multiple subtypes of triple reassortant influenza viruses with genes of avian, human and pig origin emerged and became predominant in North American swine [1, 4]. Triple reassortant H1N1 and H1N2 subtypes occasionally infected humans, but human-to-human transmission was rare and always very limited. However, disease severity and clinical outcome were always unpredictable [5–7]. In April 2009 an H1N1 triple reassortant swine influenza virus infected humans in North America and continued to transmit effectively from human to human (REF). The virus spread rapidly within Mexico and, with some delay, across the United States before spreading to other continents. From 19 April to 1 August 2009, 60 655 specimens tested positive for influenza and were reported to Flu-Net by 73 countries, areas and territories. As a result, WHO declared the virus a pandemic threat. While most fatal cases occurred in Mexico and the United States, the virus appeared to be less aggressive in Europe and Asia. The rapid spread of this swine influenza virus, mainly among young healthy adults and outside of the classical influenza season, added to the unpredictability of this virus. Thus the virus and its molecular evolution raise a number of questions that are of prime international public health concern.
Recently, we applied the Informational Spectrum Method (ISM) bioinformatics platform for analysis of the structure and function of the HA subunit 1 (HA1) of H5N1 influenza viruses. Results of this analysis revealed that HA1 of H5N1 viruses encodes specific and highly conserved information which may determine the recognition and targeting of these highly pathogenic avian influenza (HPAI) viruses to their receptor. We also showed that a subset of H5N1 in Egypt may be evolving toward an H1N1-like receptor usage, indicating more efficient human-to-human transmission. This prediction is in accord with recently observed H5N1 subclinical cases in Egypt. This silent spread of H5N1 in human populations sets the stage for increased transmission efficiencies and represents another threat with pandemic potential.
Here we used the ISM platform to compare North American swine H1N1/N2 influenza viruses with the new pandemic A/H1N1 virus. Our results showed that both groups of viruses differed in conserved characteristics that reflect a distinct propensity of these viruses to undergo a specific interaction with swine or human host proteins or receptors. Swine H1N1/N2 viruses that sporadically infected humans featured both the swine and the human interaction pattern. Furthermore, we identified several amino acid positions that are predicted to be "hot-spots" for polymorphisms which could increase human infectivity of A/H1N1 virus. At least one of these residues has already emerged in the A/H1N1 isolates from Spain, Italy and USA.
The tree was calculated with the Neighbour-Joining method (Kimura-2-parameter) using MEGA 4 software.
All HA1 sequences were retrieved from GenBank database. For the analysis of swine H1N1 and H1N2 influenza viruses found in North America between 1931 and 2008 all sequences were downloaded from GenBank:
A/Swine/Minnesota/55551/00 (H1N2) [AF455678]; A/Swine/Indiana/P12439/00 (H1N2) [AF455680]; A/Swine/Ohio/891/01(H1N2) [AF455675]; A/Swine/North Carolina/93523/01 (H1N2) [AF455677]; A/Swine/North Carolina/98225/01(H1N2) [AF455676]; A/Swine/Illinois/100084/01 (H1N2) [AF455682]; A/Swine/Illinois/100085A/01 (H1N2) [AF455681]; A/Swine/Iowa/930/01(H1N2) [AF455679]; A/SW/CO/17871/01(H1N2) [AY060046]; A/SW/MO/1877/01(H1N2) [AY060049]; A/SW/MN/23124-S/01(H1N2) [AY060048]; A/SW/MN/17138/01(H1N2) [AY060052]; A/Swine/Indiana/P12439/00 (H1N2) [AF455680]; A/Swine/Indiana/9K035/99 (H1N2) [AF250124]; A/swine/Minnesota/1192/2001(H1N2) [EU139828]; A/Swine/Ohio/891/01(H1N2) [AF455675]; A/swine/Guangxi/17/2005(H1N2) [EF556201]; A/SW/MN/23124-T/01(H1N2) [AY060047]; A/SW/MN/16419/01(H1N2) [AY060050]; A/SW/MN/23124-S/01(H1N2) [AY060048]; A/Swine/Illinois/100085A/01 (H1N2) [AF455681]; A/Swine/Illinois/100084/01 (H1N2) [AF455682]; A/swine/Minnesota/00194/2003(H1N2) [EU139830]; A/swine/Kansas/00246/2004(H1N2) [EU139831]; A/swine/OH/511445/2007(H1N1) [EU604689]; A/Swine/North Carolina/93523/01 (H1N2) [AF455677]; A/swine/Memphis/1/1990(H1N1) [CY035070]; A/swine/Maryland/23239/1991(H1N1) [CY022477]; A/swine/California/T9001707/1991(H1N1) [CY028780]; A/swine/Iowa/24297/1991(H1N1) [CY027155]; A/swine/Ohio/C62006/06(H1N1) [EU409960]; A/swine/Ohio/24366/07(H1N1) [EU409948]; A/swine/Iowa/17672/1988(H1N1) [CY022333]; A/swine/Wisconsin/1915/1988(H1N1) [CY022429]; A/swine/Iowa/31483/1988(H1N1) [CY022970]; A/Swine/Indiana/1726/1988(H1N1) [M81707]; A/swine/Kansas/3024/1987(H1N1) [CY025010]; A/swine/Kansas/3228/1987(H1N1) [CY022469]; A/swine/Iowa/2/1985(H1N1) [CY027507]; A/swine/Iowa/3/1985(H1N1) [CY022325]; A/swine/Iowa/1/1985(H1N1) [CY022317]; A/swine/Iowa/1/1987(H1N1) [CY022962]; A/swine/Iowa/1/1986(H1N1) [CY028788]; A/swine/Tennessee/82/1977(H1N1) [CY022301]; A/swine/Minnesota/5892-7/1979(H1N1) [CY022365]; A/swine/Tennessee/4/1978(H1N1) [CY028427]; A/swine/Wisconsin/641/1980(H1N1) [CY022445];
A/swine/Wisconsin/629/1980(H1N1) [CY022994]; A/swine/Tennessee/87/1977(H1N1) [CY024970]; A/swine/Iowa/2/1987(H1N1) [CY028171]; A/swine/Wisconsin/663/1980(H1N1) [CY024994]; A/swine/Wisconsin/661/1980(H1N1) [CY022453]; A/swine/Tennessee/84/1977(H1N1) [CY024954]; A/swine/Tennessee/88/1977(H1N1) [CY024978]; A/swine/Ontario/11112/04(H1N1) [DQ280250]; A/swine/Tennessee/5/1978(H1N1) [CY027515]; A/swine/Minnesota/37866/1999(H1N1) [EU139827]; A/swine/Iowa/00239/2004(H1N1) [EU139832]; A/Swine/Wisconsin/457/98(H1N1) [AF222034]; A/Swine/Wisconsin/168/97(H1N1) [AF222031]; A/Swine/Wisconsin/163/97(H1N1) [AF222028]; A/Swine/Wisconsin/166/97(H1N1) [AF222030]; A/Swine/Wisconsin/458/98(H1N1) [AF222035]; A/Swine/Wisconsin/136/97(H1N1) [AF222027]; A/Swine/Wisconsin/164/97(H1N1) [AF222029]; A/Swine/Wisconsin/238/97(H1N1) [AF222033]; A/Swine/Wisconsin/235/97(H1N1) [AF222032]; A/Swine/Wisconsin/464/98(H1N1) [AF222036]; A/Swine/Wisconsin/125/97(H1N1) [AF222026]; A/swine/Iowa/1976/1931(H1N1) [U11858]; A/swine/OH/511445/2007(H1N1) [EU604689].
Informational spectrum method
where Z_i is the valence number of the i-th atomic component, n_i is the number of atoms of the i-th component, m is the number of atomic components in the molecule, and N is the total number of atoms. The EIIP values calculated according to equations (1) and (2) are in Rydbergs (Ry).
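The two expressions that these symbols belong to are not reproduced in this text. As a sketch only, and assuming the form in which the EIIP is usually written in the general-model pseudopotential and ISM literature, equations (1) and (2) can be written as follows, with W the EIIP and Z* the average quasi-valence number:

```latex
W = \frac{0.25\, Z^{*} \sin\left(1.04\, \pi Z^{*}\right)}{2\pi} \tag{1}

Z^{*} = \frac{1}{N} \sum_{i=1}^{m} n_i Z_i \tag{2}
```

The symbols Z_i, n_i, m and N are those defined above; the names W and Z* are the conventional ones and are an assumption here, since the original equations are missing.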
The EIIP parameter was used as a basis of the informational spectrum method (ISM) for structure/function analysis of proteins, analysis of protein-protein interaction and de novo design of biologically active peptides [for reviews see Ref. and references therein]. Here we will only briefly present this bioinformatics method.
The electron-ion interaction potential (EIIP) of amino acids.
In this way, sequences are analyzed as discrete signals. It is assumed that their points are equidistant with the distance d = 1. The maximal frequency in a spectrum defined as above is F = 1/(2d) = 0.5. The frequency range is independent of the total number of points in the sequence. The total number of points in a sequence influences only the resolution of the spectrum. The resolution of the N-point sequence is f = 1/N. The n-th point in the spectral function corresponds to a frequency f(n) = nf = n/N. Thus, the initial information defined by the sequence of amino acids can now be presented in the form of the informational spectrum (IS), representing the series of frequencies and their amplitudes.
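The transformation described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the EIIP values are those commonly tabulated in the ISM literature (assumed here, because the table is not reproduced in this text), the function name `informational_spectrum` is hypothetical, and the 20-residue input is a toy string rather than a real HA1 sequence.

```python
import cmath

# EIIP values in Ry, as commonly tabulated in the ISM literature (assumed here).
EIIP = {
    "L": 0.0000, "I": 0.0000, "N": 0.0036, "G": 0.0050, "V": 0.0057,
    "E": 0.0058, "P": 0.0198, "H": 0.0242, "K": 0.0371, "A": 0.0373,
    "Y": 0.0516, "W": 0.0548, "Q": 0.0761, "M": 0.0823, "S": 0.0829,
    "C": 0.0829, "T": 0.0941, "F": 0.0946, "R": 0.0959, "D": 0.1263,
}


def informational_spectrum(sequence):
    """Return (frequency, energy) pairs for the points n = 1 .. N/2."""
    # Step 1: encode each residue by its EIIP value (points at distance d = 1).
    signal = [EIIP[aa] for aa in sequence]
    n_points = len(signal)
    spectrum = []
    # Step 2: discrete Fourier transform; point n corresponds to f(n) = n / N.
    for n in range(1, n_points // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * n * m / n_points)
            for m, x in enumerate(signal)
        )
        # Step 3: keep the energy (squared modulus) at this frequency.
        spectrum.append((n / n_points, abs(coeff) ** 2))
    return spectrum


# Toy input containing each of the 20 amino acids once (not a real protein).
spec = informational_spectrum("ACDEFGHIKLMNPQRSTVWY")
```

For this 20-point toy input the resolution is 1/20 = 0.05 and the highest frequency returned is 0.5, in line with the statement that sequence length changes only the resolution, not the frequency range.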
where Π(i,j) is the j-th element of the i-th power spectrum and C(j) is the j-th element of CIS. Thus, CIS is the Fourier transform of the correlation function for the spectrum, and any spectral component (frequency) not present in all compared informational spectra is eliminated. Peak frequencies in CIS are common frequency components for the analyzed sequences. A measure of similarity for each peak is a signal-to-noise ratio (S/N), which represents a ratio between signal intensity at one particular IS frequency and the mean value of the whole spectrum. If one calculates a CIS for a group of proteins which have different primary structures, and finds strictly defined peak frequencies, it means that the primary structures of the analyzed proteins encode the same information. It has been demonstrated that: 1) such a peak exists only for a group of proteins with the same biological function; 2) no significant peaks exist for biologically unrelated proteins; 3) peak frequencies are different for different biological functions. Furthermore, it was shown that proteins and their targets (ligand/receptor, antibody/antigen, etc.) have the same characteristic frequency in common. Thus, it can be postulated that IS frequencies characterize not only general function but also recognition and interaction between a particular protein and its target. Once the characteristic frequency for a particular protein function/interaction is identified, it is then possible to utilize the ISM approach to predict the amino acids in the sequence which essentially contribute to this frequency and are likely to be crucial for the observed function. The server for free on-line ISM analysis can be accessed at http://www.vinca.rs/180 and http://www.bioprotection.org.
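The consensus spectrum and its S/N measure can be illustrated with a short Python sketch. It assumes that CIS is the point-by-point product of the individual energy spectra, that shorter sequences are zero-padded to the length of the longest one so that all spectra share the same frequency grid, and that the EIIP values are those commonly tabulated in the ISM literature. The function names and the toy sequences are hypothetical and are not taken from the authors' software.

```python
import cmath

# EIIP values in Ry, as commonly tabulated in the ISM literature (assumed here).
EIIP = {
    "L": 0.0000, "I": 0.0000, "N": 0.0036, "G": 0.0050, "V": 0.0057,
    "E": 0.0058, "P": 0.0198, "H": 0.0242, "K": 0.0371, "A": 0.0373,
    "Y": 0.0516, "W": 0.0548, "Q": 0.0761, "M": 0.0823, "S": 0.0829,
    "C": 0.0829, "T": 0.0941, "F": 0.0946, "R": 0.0959, "D": 0.1263,
}


def energy_spectrum(sequence, n_points):
    """Energy spectrum of one sequence, zero-padded to n_points (assumption)."""
    signal = [EIIP[aa] for aa in sequence] + [0.0] * (n_points - len(sequence))
    return [
        abs(sum(x * cmath.exp(-2j * cmath.pi * n * m / n_points)
                for m, x in enumerate(signal))) ** 2
        for n in range(1, n_points // 2 + 1)
    ]


def consensus_spectrum(sequences):
    """Return (frequencies, CIS), where CIS[j] is the product over all spectra."""
    n_points = max(len(seq) for seq in sequences)
    cis = [1.0] * (n_points // 2)
    for seq in sequences:
        for j, value in enumerate(energy_spectrum(seq, n_points)):
            cis[j] *= value  # a component missing in any one spectrum is suppressed
    freqs = [(j + 1) / n_points for j in range(n_points // 2)]
    return freqs, cis


def signal_to_noise(cis):
    """Ratio of each CIS component to the mean value of the whole spectrum."""
    mean = sum(cis) / len(cis)
    return [c / mean for c in cis]


# Toy sequences (not real HA1 data), used only to exercise the functions.
toy_group = ["ACDEFGHIKLMNPQRSTVWY", "ACDEFGHIKLMNPQRSTVWA", "MKDEFGHIKLMNPQRS"]
freqs, cis = consensus_spectrum(toy_group)
snr = signal_to_noise(cis)
peak_frequency = freqs[snr.index(max(snr))]
```

Because the spectra are multiplied, a frequency survives in CIS only if it carries energy in every sequence of the group, which is the elimination property stated above.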
Results and discussion
The CIS of HA1 of swine H1N2/H1N1 and A/H1N1 strains have characteristic dominant peaks at the IS frequencies F(0.055) and F(0.295), respectively (Figure 2a and 2c). According to the ISM concept this reflects a differential interaction pattern of the HA1 of the two groups of viruses. The tropism of the viruses suggests that a high amplitude at F(0.055) corresponds to a higher propensity to interact with swine protein(s) while the high amplitude at F(0.295) may correspond to a preferred interaction with human protein(s). CIS of the HA1 gene of swine H1N1 viruses isolated from humans in the US (before 2008) contains characteristic peaks at both frequencies F(0.055) and F(0.295) (Figure 2b). This suggests that these viruses, which sporadically infected humans, display both the distinct "swine" interaction pattern shared with the swine H1N1/N2 viruses and the characteristic "human" interaction pattern shared with A/H1N1 viruses. Thus, these three groups of viruses have a distinct propensity to interact with swine and human proteins which can be described by ISM analysis. These results also provide additional strong evidence that HA1 from swine viruses infecting humans in the US before 2008 were the likely precursors of A/H1N1.
Effect of HA1 polymorphisms on the amplitudes corresponding to IS frequencies F(0.055) and F(0.295).
Propensity for human interaction pattern
We further investigated which of these mutations or combinations thereof are most important for the switch between interaction patterns from swine H1N1/N2 to A/H1N1 strains. As shown above, the interaction between H1N1/N2 and swine protein(s), and the interaction between A/H1N1 and human protein(s) are characterized by the frequencies F(0.055) and F(0.295), respectively. According to the ISM concept [9, 16], mutations in HA1 which increase the amplitude at F(0.295) and decrease the amplitude at F(0.055) would potentially contribute to the switch of the viral host tropism from swine to human. Seven of the 14 mutations presented in Table 2 (R36K, F71S, T128S, T216I, S271P, E302K, M314L) increase the amplitude at F(0.295) and decrease the amplitude at F(0.055), suggesting that these mutations may be critical for the switch of H1N1/N2 from a swine to a human tropism. It is of note that three of these mutations (R36K, T216I, S271P) are also present in swine H1N1 viruses that infected humans in the US between 2005 and 2007 (Figure 2b). ISM analysis also showed that any combination of the mutations F71S, T128S, E302K and M314L, which are only present in A/H1N1, decreases the amplitude at F(0.055) and increases the amplitude at F(0.295). This suggests that these four mutations may play an important role in the efficient infection of humans by A/H1N1, and perhaps in the effective human-to-human transmission. Of the remaining 7 mutations in Table 2, 6 decrease the amplitudes at both frequencies or have no effect, and only one (N168D) increases the amplitude at F(0.055) and decreases the amplitude at F(0.295). This suggests that the mutations in A/H1N1 that predispose for the human interaction pattern are remarkably more prevalent than mutations that predispose for the swine interaction pattern.
The relative position of the receptor binding domain and the receptor targeting domain (VIN2) in the 3D structure of A/H1N1 HA1 is similar to the position of these two domains in seasonal flu H1N1 viruses but different from that in H1N1 1918 viruses. This suggests that the efficacy of interaction between A/H1N1 and its receptor is similar to that of seasonal flu H1N1 viruses but lower than that of 1918 viruses.
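The kind of per-mutation test described above can be sketched as follows. This is an illustrative Python example under stated assumptions, not the procedure used to produce Table 2: the amplitude is evaluated directly at the two frequencies with a discrete-time Fourier sum, positions in a substitution such as "F71S" are taken as 1-based indices into the supplied string, the EIIP values are those commonly tabulated in the ISM literature, and all function names and the toy sequence are hypothetical.

```python
import cmath

# EIIP values in Ry, as commonly tabulated in the ISM literature (assumed here).
EIIP = {
    "L": 0.0000, "I": 0.0000, "N": 0.0036, "G": 0.0050, "V": 0.0057,
    "E": 0.0058, "P": 0.0198, "H": 0.0242, "K": 0.0371, "A": 0.0373,
    "Y": 0.0516, "W": 0.0548, "Q": 0.0761, "M": 0.0823, "S": 0.0829,
    "C": 0.0829, "T": 0.0941, "F": 0.0946, "R": 0.0959, "D": 0.1263,
}


def amplitude_at(sequence, frequency):
    """Amplitude of the EIIP signal at one frequency (cycles per residue)."""
    coeff = sum(
        EIIP[aa] * cmath.exp(-2j * cmath.pi * frequency * m)
        for m, aa in enumerate(sequence)
    )
    return abs(coeff)


def substitution_effect(sequence, substitution, frequencies=(0.055, 0.295)):
    """Amplitude change at each frequency for a substitution written like 'F71S'."""
    wild_type = substitution[0]
    position = int(substitution[1:-1])  # 1-based position in the given string
    mutant = substitution[-1]
    if sequence[position - 1] != wild_type:
        raise ValueError(f"expected {wild_type} at position {position}")
    mutated = sequence[:position - 1] + mutant + sequence[position:]
    return {f: amplitude_at(mutated, f) - amplitude_at(sequence, f)
            for f in frequencies}


def favours_human_pattern(effect, swine_f=0.055, human_f=0.295):
    """True if amplitude rises at the 'human' and falls at the 'swine' frequency."""
    return effect[human_f] > 0 and effect[swine_f] < 0


# Toy sequence (not real HA1 data); 'L10D' replaces the leucine at position 10.
toy = "ACDEFGHIKLMNPQRSTVWY"
effect = substitution_effect(toy, "L10D")
shifts_toward_human = favours_human_pattern(effect)
```

Applied to a real HA1 sequence, the same loop over a list of candidate substitutions would sort them into the three groups discussed above: those that favour the human pattern, those that favour the swine pattern, and those that lower both amplitudes or have no effect.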
Analyses by the ISM bioinformatics platform of the HA1 protein of North American swine H1N1 and H1N2 viruses and the new A/H1N1 that emerged recently in Mexico and the USA showed that the two groups of viruses differed in characteristic parameters that reflect a distinct propensity of these viruses to undergo a specific interaction with swine or human host proteins or receptors. Using the same approach, amino acid substitutions F71S, T128S, E302K, M314L in the A/H1N1 HA1 essential for the human interaction pattern of these viruses were identified, and residues D94, D196 and D274 of A/H1N1 HA1 were predicted as "hot-spots" for mutations that may significantly increase the propensity of this virus to interact preferentially with human host proteins. At least one of these mutations (D274E) was already found in the A/H1N1 isolates from Spain, Italy and the US, suggesting that the virus is adapting further to the human host. In addition, our analysis suggests that the highly conserved domain 286-326 of HA1 plays an important role in A/H1N1-receptor interaction and represents a candidate target for diagnostics, vaccines and therapies.
This work was supported by the Ministry of Science and Technological Development of the Republic of Serbia (Grant no. 143001). COST Action B28 is gratefully acknowledged.
- Vincent AL, Ma W, Lager KM, Janke BH, Richt JA: Swine influenza viruses: a North American perspective. Adv Virus Res 2008, 72: 127–154. 10.1016/S0065-3527(08)00403-X
- Myers KP, Olsen CW, Gray GC: Cases of swine influenza in humans: a review of the literature. Clin Infect Dis 2007, 44: 1084–1088. 10.1086/512813
- Thompson RL, Sande MA, Wenzel RP, Hoke CH Jr, Gwaltney JM Jr: Swine-influenza infection in civilians: report of two cases. N Engl J Med 1976, 295: 714–715.
- Olsen CW: The emergence of novel swine influenza viruses in North America. Virus Res 2002, 85: 199–210. 10.1016/S0168-1702(02)00027-8
- Newman AP, Reisdorf E, Beinemann J, Uyeki TM, Balish A, Shu B, Lindstrom S, Achenbach J, Smith C, Davis JP: Human case of swine influenza A (H1N1) triple reassortant virus infection, Wisconsin. Emerg Infect Dis 2008, 14: 1470–1472. 10.3201/eid1409.080305
- Shinde V, Bridges CB, Uyeki TM, Shu B, Balish A, Xu X, Lindstrom S, Gubareva LV, Deyde V, Garten RJ, Harris M, Gerber S, Vagasky S, Smith F, Pascoe N, Martin K, Dufficy D, Ritger K, Conover C, Quinlisk P, Klimov A, Bresee JS, Finelli L: Triple-Reassortant Swine Influenza A (H1) in Humans in the United States, 2005–2009. N Engl J Med 2009, 360: 2616–2625. 10.1056/NEJMoa0903812
- Centers for Disease Control and Prevention (CDC): Swine influenza A (H1N1) infection in two children -- southern California, March-April 2009. MMWR Morb Mortal Wkly Rep 2009, 58: 400–402.
- WHO: Situation updates - Influenza A(H1N1). 2009. [http://www.who.int/csr/disease/swineflu/updates/]
- Veljkovic N, Glisic S, Prljic J, Perovic V, Botta M, Veljkovic V: Discovery of new therapeutic targets by the informational spectrum method. Curr Protein Pept Sci 2008, 9: 493–506. 10.2174/138920308785915245
- Veljkovic V, Veljkovic N, Muller CP, Muller S, Glisic S, Perovic V, Kohler H: Characterization of conserved properties of hemagglutinin of H5N1 and human influenza viruses: possible consequences for therapy and infection control. BMC Struct Biol 2009, 9: 21. 10.1186/1472-6807-9-21
- Veljkovic V: A theoretical approach to preselection of carcinogens and chemical carcinogenesis. New York: Gordon & Breach; 1980.
- Veljkovic V, Slavic I: Simple general-model pseudopotential. Phys Rev Lett 1972, 29: 105–107. 10.1103/PhysRevLett.29.105
- Veljkovic V: The dependence of the Fermi energy on the atomic number. Phys Lett 1973, 45A: 41–42.
- Prediction of efficacy of A/H1N1 - receptor interaction http://www.vinca.rs/180/; http://www.bioprotection.org
- Trifonov V, Khiabanian H, Greenbaum B, Rabadan R: The origin of the recent swine influenza A(H1N1) virus infecting humans. Euro Surveill 2009, 14: 3.
- Glisic S, Arrigo P, Alavantic D, Perovic V, Prljic J, Veljkovic N: Lipoprotein lipase: A bioinformatics criterion for assessment of mutations as a risk factor for cardiovascular disease. Proteins 2008, 70: 855–862. 10.1002/prot.21581