Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Fatal Cases of Influenza A(H3N2) in Children: Insights from Whole Genome Sequence Analysis

Abstract

During the Northern Hemisphere winter of 2003–2004 the emergence of a novel influenza antigenic variant, A/Fujian/411/2002-like(H3N2), was associated with an unusually high number of fatalities in children. Seventeen fatal cases in the UK were laboratory confirmed for Fujian/411-like viruses. To look for phylogenetic patterns and genetic markers that might be associated with increased virulence, sequencing and phylogenetic analysis of the whole genomes of 63 viruses isolated from fatal cases and non fatal “control” cases was undertaken. The analysis revealed the circulation of two main genetic groups, I and II, both of which contained viruses from fatal cases. No associated amino acid substitutions could be linked with an exclusive or higher occurrence in fatal cases. The Fujian/411-like viruses in genetic groups I and II completely displaced other A(H3N2) viruses, but they disappeared after 2004. This study shows that two A(H3N2) virus genotypes circulated exclusively during the winter of 2003–2004 in the UK and caused an unusually high number of deaths in children. Host factors related to immune state and differences in genetic background between patients may also play important roles in determining the outcome of an influenza infection.

Introduction

Influenza viruses are a common cause of human respiratory infections [1]. Epidemics occur every year during the winter seasons in the Northern and Southern Hemispheres and result in considerable morbidity and mortality. Disease severity is greatest in the elderly, in infants and in people with certain chronic diseases. An average of 12,554 deaths occurred in England and Wales during annual influenza epidemics between 1990–2000 [2].

Acquisition of point mutations in the haemagglutinin glycoprotein of influenza A virus leads to continuous antigenic change, a process called ‘antigenic drift’. This results in continuous replacement of circulating viruses with new variants which are able to re-infect hosts despite their immunity to antigenic variants that circulated previously. In humans, A(H3N2) viruses are considered to evolve faster than the A(H1N1) subtype [3], [4]. Every three to eight years, predominant A(H3N2) viruses are replaced by a novel antigenic variant, prompting an update of the recommended influenza vaccine strain [5]. During the 2002–2007 period, the A(H3N2) component of the vaccine was updated four times [6].

A(H3N2) viruses are associated with increased morbidity and mortality [7]. The Northern Hemisphere season of 2003–2004 was characterised by the emergence of an antigenic drift variant, A/Fujian/411/2002, which completely displaced the previously circulating variant, A/Panama/2007/99. Initial circulation of the Fujian/411-like variants in the UK and the US was accompanied by an unusually high number of influenza-associated fatalities in children [8], [9]. Seventeen such laboratory-confirmed influenza cases were reported in the UK during 2003–2004. Clinical and pathological findings identified no recognised pre-existing risk factors for severe influenza illness in 88% of the fatal cases and only 18% presented secondary bacterial infections. Serological and community morbidity studies showed increased susceptibility in the youngest age groups [8]. This posed the question of whether intrinsic virus virulence or underlying host susceptibility was more important in determining a fatal outcome. The aim of this study was to identify any genetic markers of virulence in Fujian/411-like influenza A viruses from 2003–2004 by sequencing whole genomes of viruses isolated from fatal and non-fatal paediatric infections in the UK. Genetic information was used to determine if virus mutations were associated with fatal outcome by comparison with genetic features of viruses from previous and subsequent influenza seasons and viruses from the same season elsewhere.

Results

Whole-genome analysis of influenza sequences from fatal and non-fatal cases

Sequences of the complete coding regions of influenza whole genomes were obtained from original respiratory material and/or viruses isolated from 12/17 fatal cases (Table S2). The remaining 5 fatal cases had HA1 fragment sequences available from original material only. Three viruses from non-fatal, adult contacts of fatal case #6 (A/England/740/2003) were also isolated and genome-sequenced. Further genomes were sequenced from 51 viruses obtained from non-fatal control cases. Where samples were available (14/63), sequencing was performed both from viral isolate and original material.

Given that the focus of our study was on detecting mutations that could be associated with more severe infections, we first sought to establish whether genetic changes could have been induced during the adaptation to cell culture isolation. Comparison of sequences showed no genetic changes generated through virus culture for any of the 14 original material/isolate pairs, except for virus A/England/484/03 (control case) which showed a mutation G720A in the haemagglutinin (HA) gene sequence from the viral isolate (corresponding to V242I in the deduced HA protein). Overlapping peaks were observed at that position in the HA of the isolate sequence (data not shown), suggesting the presence of a mixture of variants or quasispecies which may have been differentially selected through culture.

A phylogenetic tree was inferred using the concatenated sequences of the main coding regions of 70 viruses (Figure 1 new). For the purpose of comparison, we also included genome sequences representing the main clusters seen in A(H3N2) viruses from New York during the season of 2003–2004 [10]. Virus sequences from the adult contacts of fatal case #6 were not included in the trees to avoid statistical bias. The analysis revealed that UK A(H3N2) viruses from 2003–2004 clustered within two genetic groups, named I and II. Two UK viruses, A/England/567/2003 and A/Scotland/78/2003 grouped in a single branch, external to both genetic groups I and II. A third minor genetic group (III) outgrouping all the others included a single UK strain, A/England/558/2003 (control case), together with A/Fujian/411/2002 and two viruses from New York, All viruses isolated from fatal cases were interspersed with control viruses within genetic groups I and II, with no evidence for clustering that could suggest genetic differences specific to the fatal cases viruses.

thumbnail
Figure 1. Phylogenetic relationships of the concatenated main coding regions of influenza A(H3N2) viruses isolated in the UK during 2003–2004.

Sequences from reference viruses A/Panama/2007/99 and A/Fujian/411/2002 were included in the dendrograms. For the purposes of comparison, the dendrograms also included sequences from A/New York/38/2003, A/New York/11/2003, A/New York/32/2003, A/New York/199/2003 and A/New York/52/2004. All phylogenies were rooted using A/Panama/2007/99 as an outgroup. Branch lengths are drawn to scale. Viruses isolated from fatal cases are displayed in red type. Bootstrap values (>70%) are displayed on the nodes. Genetic groups I and II are indicated with brackets. Amino acid substitutions characterising these groups are annotated on the nodes.

https://doi.org/10.1371/journal.pone.0033166.g001

Next, we constructed phylogenetic trees of the main coding regions for each gene Figures 2 & 3). The topology of the individual gene trees was very similar to the concatenated tree, with distinctive separation of viruses into three genetic groups, except in the HA phylogeny where the branch corresponding to genetic group III was not evident.

thumbnail
Figure 2. Phylogenetic relationships of the main coding regions of PB2, PB1, PA & HA of influenza A(H3N2) viruses isolated in the UK during 2003–2004.

Details are similar to those described in Figure 1 footnote.

https://doi.org/10.1371/journal.pone.0033166.g002

thumbnail
Figure 3. Phylogenetic relationships of the main coding regions of NP, NA, M & NS of influenza A(H3N2) viruses isolated in the UK during 2003–2004.

Details are similar to those described in Figure 1 footnote.

https://doi.org/10.1371/journal.pone.0033166.g003

Evidence for gene reassortment

In general, there was a clear co-segregation of genes from each virus, as shown by similar phylogenetic topologies, with some exceptions: a) genes NA, NP and PA from fatal case A/England/431/2003 (fatal case #4) clustered within genetic group I, whereas the rest of its genes fell within genetic group II; b) genes from control case A/England/585/2003 fell into genetic group I, except for PB2 which fell within genetic group III; c) while most genes of A/England/558/2003 fell into genetic group III, the HA fell in a branch whose closest relationship was with genetic group II; d) genes NP, NA, NS and HA from control viruses A/England/567/2003 and A/Scotland/78/2003 clearly clustered within genetic group I, while PB2 and PA fell in a separate distinct branch that clustered with genetic group I (albeit with <70% bootstrap values) - sequences from their PB1 and M genes grouped in a single branch, external to both genetic groups I and II. These phylogenetic incongruencies suggest the occurrence of reassortment events for all these viruses. In addition, all the full genome concatenated sequences were tested for the presence of mosaicism using bootscanning and informative sites analysis in order to formally identify recombination breakpoints (Figure S1 & Text S1). The analysis confirmed the presence of reassortment (recombination in a concatenated genome) and the breakpoints detected were consistent with the genes involved in the reassortment for each in the above mentioned viruses.

Amino acid substitutions associated with a fatal outcome

Given the presence of these incongruencies in the phylogenetic topologies, the number of viruses clustering into each genetic group varied slightly for each dendrogram. Genetic group II contained 4–5 out of 12 fatal cases (33–42%), depending on the gene, but only 8 out of 51 non-fatal cases (16%). The difference between these percentages was not significant (Fisher's exact test values ranging from p = 0.107 to p = 0.234, depending on the gene segment).

We next looked at the inferred protein sequences (including PB1-F2, NS2 & M2) from the gene alignments to determine if specific amino acid substitutions could be consistently associated with a fatal outcome. None of the amino acid polymorphisms present within the whole genome was uniquely or consistently present in viruses from the fatal cases. Observed signature substitutions that differentiated genetic groups I and II, as annotated in Figures 2 & 3, were present in all viruses in these clusters irrespective of the clinical outcome.

An epidemiological cluster was defined for fatal case #6, A/England/740/2003: A/England/741/2003 was recovered from a family contact (sibling of unknown age), while A/England/752/2003 and A/England/753/2003 were isolated from adult healthcare workers who had been in contact with the index fatal case. The contact cases developed influenza-like illness with no severe complications and fully recovered. Comparison of the coding regions of the viruses recovered from the A/England/740/2003 case contacts showed that nucleotide coding sequences were identical for all genes despite the differences in disease severity observed.

Glycosylation of haemagglutinin

It has been postulated that, for A(H3N2) viruses, an increase in the number of potential sites of N-linked glycosylation in the HA protein resulted in decreased virulence [11]. Ten potential N-glycosylation sites were identified along the full length HA of both control and fatal case viruses, at the following residues: 8, 22, 63, 126, 133, 144, 165, 246, 285 (all in HA1) and 483 (in HA2). The potential glycosylation sites were conserved among the viruses with the exception of position 126, which was absent in UK viruses belonging to genetic group I. These findings were related to phylogenetic clustering and do not associate with a fatal outcome.

Worldwide prevalence of viruses belonging to UK genetic groups I and II

We next attempted to assess the frequency with which viruses isolated worldwide during 2003–2004 clustered with UK viruses belonging to genetic groups I and II. However, when trying to do so by using whole genome sequences available in GenBank, we found 413 genomes from A(H3N2) influenza viruses from 2003–2004, strongly biased towards viruses recovered in the US, Australia and New Zealand. We therefore performed this ‘prevalence’ phylogenetic analysis with a larger, worldwide-representative dataset of 1,893 HA1 sequences (positions 346 to 866 of the HA ORF, encompassing the main antigenic sites of the HA1 glycoprotein) including the UK viruses and all A(H3N2) viruses isolated in 2003–2004 as available in GenBank [12]. This allowed us to include those obtained from original material for the five fatal cases for which full genome sequences were not available.

The phylogenetic relationships in this dendrogram showed that there was clear evidence of geographical clustering, where most clusters showed predominance of viruses from the same region or neighbouring countries (Figure 4). However, in each cluster there were also sporadic viruses from other regions (not detailed in the figure). Genetic group I UK viruses formed part of a large cluster (highlighted in green) with a robust bootstrap value (80%) comprising 360 viruses from the rest of Europe (Austria, Finland and Denmark), North America (US) and Oceania (Australia and New Zealand). There were at least four subclusters within this group (condensed for clarity): i) US+UK viruses; ii) UK+Finland+Denmark; iii) Austria; iv) Australia+New Zealand. Genetic group I UK viruses were also represented in single branches within this large cluster.

thumbnail
Figure 4. Phylogenetic relationships of partial sequences (nucleotides 346 to 866) of the HA1 coding fragment for A(H3N2) viruses isolated in 2003–2004 in the UK and worldwide.

Genetic groups where viruses from the UK clustered are highlighted in green (I) and red (II). Some branches were condensed for clarity purposes; adjacent legends indicate the country/region where the majority of viruses within a cluster were isolated and the number of viruses included. These clusters also included sporadic viruses from other countries which were not detailed. Sequences from reference viruses A/Panama/2007/99 and A/Fujian/411/2002 were included in the phylogeny. The ML phylogeny was rooted using A/Panama/2007/99 as an outgroup. Branch lengths are drawn to scale. Bootstrap values (>70%) are displayed on the nodes.

https://doi.org/10.1371/journal.pone.0033166.g004

In contrast, genetic group II UK viruses clustered in a small branch of 29 viruses out of 1,893 with a 100% bootstrap (highlighted in red). While 15 of these were UK viruses (including 6 from fatal cases) the remaining 14 were from the US (n = 5) and some European countries (France (2), Romania (1), Italy (1), Sweden (1), Denmark (2) and Norway (2)). The prevalence of genetic group II viruses worldwide, based on the total number of HA1 sequences from 2003–2004 available in GenBank, was only 1.5%, compared to a higher prevalence of 22% (n = 15/68) in the UK.

Evolution of A(H3N2) influenza viruses after 2003–2004 in the UK

To determine the prevalence and/or survival of genetic groups I and II in subsequent influenza seasons, we analysed our initial set of viruses in a larger dataset of UK sequences encompassing influenza seasons from 2002–2003 to 2007–2008, using the same HA1 fragment selected to construct the phylogeny presented in Figure 4, again including the sequences from the five ‘HA1 only’ fatal cases.

A total of 159 HA1 sequences were used to build the phylogeny shown in Figure 5: the two major genetic groups from 2003–2004 are marked as I and II. For each season there was a predominant replacement of the older viruses by new ones, with some representatives of each season continuing to survive for one or two seasons further. There was an almost complete predominance of viruses falling within the Fujian/411 clade during 2003–2004 in the UK, all characterized by 12 amino acid substitutions compared to the UK 2002–2003 (Panama/2007-like) viruses (Figure 5 & 6). Of these amino acid changes, 10 were located on the surface of the HA trimer, although two were positioned in the stalk region; and 9 were located in antigenic sites, Within this clade, amino acid substitution N126D defines genetic group I and R142K and S189N, both in antigenic site B, define genetic group II. However, no viruses belonging to these two genetic groups were isolated after that season and the major branch spanning all isolates from 2004–2005 onwards completely diverged from the genetic group II branch. This suggests that viruses representing genetic groups I and II did not survive past the 2003–2004 season. In addition, all viruses characterised since 2004–2005 showed fixation of at least four amino acid substitutions: K145N, Y159F, V226I, S227P, relative to the Fujian/411 clade HA1 genetic groups I and II (Figure 5 & 6). All four changes are exposed on the surface of the globular head of the HA trimer; positions 145 and 159 lie on antigenic sites A and B, whereas positions 226 and 227 are located on the receptor binding site. A/England/558/2003, which clustered with post-2003–2004 viruses, exhibited two of these amino acid changes (Y159F and S227P).

thumbnail
Figure 5. Phylogenetic relationships of partial sequences (nucleotides 346 to 866) of the HA1 coding fragment for A(H3N2) viruses isolated between 2002 and 2008 in the UK.

The annotated box indicates the reference colours for viruses isolated in different influenza seasons, between 2002–2003 and 2007–2008. Genetic groups I and II, within the Fujian/411 clade, are indicated with brackets. Viruses from fatal cases are highlighted with #. Sequences from reference viruses A/Panama/2007/99 and A/Fujian/411/2002 were included in the phylogeny. The ML tree was rooted using A/Panama/2007/99 as an outgroup. Branch lengths are drawn to scale. Bootstrap values (>50%) and signature amino acid substitutions seen in viruses from 2003–2004 and 2004–05 are annotated on the nodes.

https://doi.org/10.1371/journal.pone.0033166.g005

thumbnail
Figure 6. Surface representation of the hemagglutinin hetero-trimer from a seasonal H3N2 strain [47].

Location of the amino acids substitutions marking the transition between Panama-like and Fujian-like viruses are highlighted in red; those changes which differentiate viruses isolated since 2004–2005 from Fujian/411-like viruses are coloured in blue; genetic group I-specific changes are highlighted in green and genetic group II-specific changes are marked with yellow.

https://doi.org/10.1371/journal.pone.0033166.g006

We then compared amino acid substitutions in whole genome products between Fujian/411-like viruses and earlier (Panama/2007-like) viruses (Table 1). Given the unavailability of full genome sequences from UK viruses isolated before 2003–2004, we used instead a set of 40 viruses from GenBank representative of seasons 1999–2000 to 2002–2003 for the comparison (listed in Table S3). Thirteen substitutions were found to distinguish the HA protein of Fujian/411-like viruses from earlier A(H3N2) viruses. An additional 26 substitutions differentiated other gene products, with 11 of them being located in the NA protein. Many of these mutations (marked with * in Table 1) appeared concomitantly in the genome of some viruses isolated in Australia and New York in 2001 and 2002. However, this observation did not include changes seen in the HA sequence.

thumbnail
Table 1. Amino acid substitutions in the inferred protein sequences that distinguish UK Fujian/411-like viruses from earlier (Panama/2007-like) viruses.

https://doi.org/10.1371/journal.pone.0033166.t001

Discussion

An essential question that remains unanswered is whether virus virulence or host-related susceptibility factors have the greater impact on determining the clinical outcome of an influenza infection. In this study we have analysed a set of viruses obtained from a series of paediatric fatal cases with laboratory confirmed A(H3N2) influenza infection, which occurred during the influenza season of 2003–2004 in the UK. Johnson et al [8] described clinical and epidemiological findings from these cases and established some significant facts: (1) 14 of 16 children (88%) for whom health records were available had no pre-existing risk factors; (2) none of the children had received the vaccine for winter 2003–2004; (3) three children (18%) had evidence of bacterial co-infections. This profile indicates a cohort of healthy children who were likely naïve to influenza infection in most cases (65% (11) were under 5 years old) for whom bacterial co-infections may not have contributed significantly to fatal outcome. Therefore we sought to determine whether there were patterns in the phylogenetic relationships or changes in the sequences of the genes and proteins from virus isolates that could be associated with a fatal outcome.

In our study we did not find any clear evidence that single genetic changes in the virus genes or proteins could be associated with a higher, significant occurrence of severe or fatal cases. The majority of substitutions observed correspond to signature amino acid substitutions that differentiated circulating A(H3N2) viruses in the UK during 2003–2004 into genetic groups I and II; a third minor genetic group (III) was also observed. These three genetic groups correlated well with those from 2003–2004 identified in the US [10], as shown by the inclusion of New York strains in the phylogeny presented in Figures 1 to 3. The slightly different topology of genetic group III in the HA phylogeny has been previously described as a consequence of a reassortment event that put an HA segment from a 2002–2003 virus into the genomic background of the Fujian/411-like viruses [10], [13]. The existence of similar phylogenetic patterns in our study and in New York reinforces the observation that global migration of influenza viruses is an important component of the seasonal diversity observed every year [14].

Comparison of sequences from 14 pairs of virus isolates vs. primary clinical samples revealed that only one pair differed by a nonsilent mutation in the HA. Host-mediated mutations are a well-known consequence of adaptation to laboratory culture and are a potential source of problems in evolutionary studies [15]. However, finding only one mutated nucleotide out of 183148 (13082 bases per whole genome) would indicate that the effect of cell-culture mediated mutations is irrelevant in our study. Overlapping peaks in the mutated sequence suggested the presence of a mixture of variants in that isolate. The use of deep sequencing would have added valuable information on the diversity of the virus population within each sample. However, as most viruses had only isolates available for sequencing, any attempt to apply deep sequencing to the analysis of intrasample diversity would have been invalidated by the potential selection of variants in tissue culture, therefore not reflecting the true proportion of quasiespecies, if present, in the primary clinical material.

Although the majority of individual gene segments showed identical clustering in phylogenies, there were at least four topological change events in the dendrograms, strongly suggesting evidence of reassortment (Figures 2 & 3), also confirmed by bootscanning and informative sites analysis. This adds further support to the notion that gene reassortment among same-subtype strains is a major driving force for evolution in influenza A virus [16]. When analyzing the worldwide occurrence of variants circulating during 2003–2004 we observed a low frequency of genetic group II viruses (1.5%, Figure 4), compared with a higher frequency in the UK, both for fatal (33 or 42%) and non-fatal (16%) cases, which suggest overrepresentation of genetic group II viruses in the UK in 2003–2004. Although we included the total number of HA1 sequences for influenza A(H3N2) viruses from 2003–2004 available in GenBank for this analysis, we acknowledge the limitations of using GenBank numbers to make epidemiological assumptions, as numbers of sequences are not representative of the population; also, some countries are overrepresented while some do not submit sequence data, so it is possible that genetic group II viruses could have circulated in high frequencies in other geographic locations with poor surveillance system.

Fujian/411-like viruses completely replaced the Panama/2007-like variants circulating in 2002–2003, as shown by analysis of genetic groups I and II in a set of 159 partial HA1 sequences spanning six consecutive influenza seasons from 2002–2003. However, while Fujian/411-like antigenic variants predominated during the 2003–2004 season in the UK, they did not circulate thereafter [17]. In our study, at least 12 amino acid substitutions in the HA1 glycoprotein mark the transition between Panama/2007- and Fujian/411-like viruses. The majority were located on the surface of the HA trimer and within one of the five antigenic sites proposed for H3 haemagglutinin [18], [19], therefore showing potential for antigenic variation, although only H155T and Q156H were actually responsible for the antigenic drift [20]. Four further amino acid substitutions in HA1 differentiate the Fujian/411-like viruses from viruses isolated after 2003–2004. All of them were also exposed on the surface of the globular head, either related to antigenic sites A or B, or located in the receptor binding site. In our study, the Fujian/411-like viruses exhibited substitutions W222R and G225D when compared to the previous Panama/2007-like variants. Recently, substitution D225G in the HA (D222G by H1 numbering) of pandemic influenza A(H1N1) 2009 viruses has been implicated in higher severity and fatal cases [21]. This substitution has been shown to induce alterations in the receptor binding site, increasing the affinity of these viruses for receptors expressed on ciliated bronchial epithelial cells and on epithelia within the lung [22]. Position 226 (H3 numbering) has also been implicated in changes of receptor binding specificity, cell tropism and host range restriction for several human and animal influenza subtypes [23], [24], [25], [26]. In summary, HA changes distinguishing the transition between Panama/2007-like, Fujian/411-like and later viruses have enough potential for altering biological properties such as antigenicity, receptor binding affinity or others which in turn might affect virus virulence.

An additional 26 substitutions were shown to distinguish the remaining gene products of Fujian/411-like viruses from earlier (Panama/2007-like) viruses. Eighteen of them were already detected concurrently in viruses isolated in 2001 and 2002, suggesting that the genomic background of the Fujian/411-like viruses was already circulating at that time. However, these viruses did not have the HA changes seen in the Fujian/411-like viruses, an observation that corresponds with other reports suggesting that the HA gene of Fujian/411-like viruses was only placed by reassortment into the genomic background of the Fujian/411-like viruses shortly before the 2003–2004 season [10], [13]. It has also been suggested that interaction among viral proteins may play a key role in the fitness and virulence of a virus [10]. Some of the genomic substitutions observed in the Fujian/411-like viruses were related to host range specificity. For instance, substitution I667V in PB2 represents a human-to-avian specific change [27]. Substitutions V421I(PB1) and T9N(PB2) were described as human-like changes in H5N1 viruses with increased pathogenicity in mice [28]. The biological relevance of substitutions I421V and N9T (characteristic of genetic group I) seen in the Fujian/411-like viruses is unknown. Substitutions at positions 27, 103, 146 and 197 in NP are located in known virus cytotoxic T lymphocyte (CTL) epitopes [29], [30], [31] and they may confer on Fujian/411-like viruses a higher efficiency of escape from CTL-mediated immune responses. Substitutions at positions 172, 199 and 370 in NA are located in antigenic sites [32], while the K93N substitution introduces an additional potential glycosylation site; all/some of these may contribute to the antigenic drift seen in Fujian/411-like viruses. Many of these substitutions (marked with * in Table 1) have been seen concomitantly in the genome of some A(H3N2) viruses from before 2003–2004. A search in the NCBI Influenza Virus Resource through all the available A(H3N2) genome sequences (>2,000) showed that viruses exhibiting these substitutions were seen from seasons 2001–2002 to 2003–2004 disappearing thereafter, consistent with the replacement of the Fujian/411-like viruses in 2004–2005 in a genome-wide sweep, as previously described [3].

While Fujian/411-like viruses exhibited a unique gene constellation and carried HA amino acid substitutions that caused significant antigenic drift [33] and replacement of all other A(H3N2) viruses circulating that season, no specific genetic changes and consequent amino acid substitutions could be related to severe or fatal clinical outcomes. On this basis, we conclude that host-related susceptibility was the major determinant of severe illness or death due to infection by Fujian/411-like viruses in 2003–2004. It is likely that, if any virus feature contributed to a more severe outcome, this occurred in the context of naïve children undergoing what was likely to be their first influenza infection.

Clinical incidences of influenza-like illness (ILI) in young age groups were shown to be at their highest in the years when novel antigenic drift variants emerged [8]. Similarly, studies of mortality in children associated with pandemic influenza A(H1N1) 2009 revealed the highest mortality rate in children aged less than 1 year and a substantial proportion of children with no pre-existing disorders (27%) who rapidly deteriorated and died [34], [35]. Seven out of 17 paediatric deaths in the UK in 2003–2004 occurred suddenly with no previous illness or within a day of the onset of symptoms, with no obvious link to a predominance of genetic groups I or II viruses among these cases. Overall, we found no consistent relationship between the genetic features of these viruses and the main clinical findings observed for the UK paediatric fatal cases (Table S2).

The contribution of host factors to the outcome of an influenza infection is supported by our finding that the same, genetically identical virus (represented by A/England/740/2003) could cause death in a child while causing uncomplicated respiratory illness in one family contact (a sibling of unknown age) and two non-related adult contacts. Factors related to the infecting-dose of virus could not be determined. Innate immune response, preexisting adaptive immunity, genetic background [36], [37] and even environmental conditions surrounding host and virus (reviewed in [38]) are all factors that can contribute to virus pathogenesis.

Limitations of the present study include the probable underestimate of the number of paediatric death cases during the 2003–2004 influenza season as we have studied only those reported to the HPA and laboratory-confirmed. Since this time, reporting mechanism of fatal cases has been strengthened. Also, the whole genome sequence data available outside the UK was biased to specific countries and this may have altered the interpretation. However, this study reinforces the conclusion that the virulence of an influenza virus results from a complex interplay between polygenic virus virulence factors, the host immune response and host genetic factors. Any difference in virulence between genetic groups I and II viruses could be tested in animal models by using recombinant viruses generated through reverse genetic systems. It is possible that the Fujian/411-like viruses (genetic groups I and II) were all more virulent than previously circulating viruses. Alternatively, if any virus factor was contributing to a higher virulence, its relative impact could have been substantial in the context of a highly susceptible host. Additional studies are needed to better identify host genetic determinants of severe influenza infection, perhaps focussing on human genes responsible for the development of an atypical antiviral innate immune response. This could be a critical factor that leads to severe illness and death in naïve hosts. The matching of virus genomics with clinical and epidemiological findings is an important tool for studying highly mutable pathogens, such as influenza A virus. The methodology carried out in this study will be useful in the analysis of seasonal epidemics of influenza and also in the study of the recently emerged pandemic influenza A(H1N1) 2009 virus.

Materials and Methods

Ethics statement

This study was conducted as part of a public health inquiry by the Health Protection Agency and Health Protection Scotland into paediatric deaths, where it was agreed that ethical approval for linking personal clinical outcome information to virus characterisation was not required.

Fatal and control cases

Seventeen fatal cases of laboratory-confirmed influenza A in children under eighteen years were reported to HPA between September and December 2003. The median age was two years (range 4 months – 17 years); eleven (64%) were under five years of age and ten (58%) were females. Further clinical findings are described in Johnson et al [8]. Three control cases were selected per fatal case, on the basis of matching age, week of infection and geographical location. HPA influenza surveillance relies on the timely collection of clinical and virological data across the UK. Information sources have been described previously [33]. Overall, over 80% of reported laboratory-confirmed cases have respiratory specimens referred to National Reference Laboratory.

Viruses

Respiratory samples and tissue biopsy samples were received at HPA for laboratory confirmation of A(H3N2) influenza virus infection and further characterisation of the infecting viruses. Original material was inoculated onto either Madin-Darby canine kidney cells (MDCK) or the SIAT1 derivative [39]. Virological data is described with more details elsewhere [33].

Whole genome sequencing

Virus RNA was extracted from either aliquots of original respiratory/biopsy material or from virus isolates using a Qiagen viral RNA mini kit (Qiagen Ltd, Crawley, West Sussex, England). Complete genomes were amplified using a one-step RT-PCR procedure employing Superscript III RT and Platinum Pfx, a proof-reading DNA polymerase (Invitrogen Ltd, Paisley, England). RT-PCR products were purified using QIAquick gel extraction and PCR purification kit (Qiagen). Sequences were obtained by a gene-walking approach using a commercial sequencing service (GATC Biotech Ltd, Cambridge, England). Specific primers for amplification and sequencing are described in Table S1. Genbank accession numbers range from CY087969 to CY088568 for virus isolate sequences and from CY107071 to CY107182 for original material sequences.

Phylogenetic analyses

Raw sequencing data was edited and assembled into continuous gene sequences (contigs) covering the entire ORF of each gene using Sequencher 4.9 (http://www.genecodes.com).

Nucleotide sequences for each gene segment were aligned using ClustalX 1.81 [40]. Alignments were edited with BioEdit 7.0.5.3 [41]. Maximum Likelihood (ML) phylogenetic trees of each gene segment were inferred using the PAUP 4.0 package (Swofford DL; Sinauer Associates). The best-fit model of nucleotide substitution was identified using Modeltest [42] as the general-time reversible GTR+Γ4 model for most genes, except for NP (GTR+I), NS (GTR) and M (HKY85+Γ4). Nucleotide frequencies, transition-transversion ratio, proportion of invariable sites and gamma distribution of among-site rate variation were estimated from the empirical data. In all cases, ML trees were determined through a heuristic search using the tree bisection-reconnection branch-swapping algorithm.

Bootstrap analysis was performed through the Neighbor Joining algorithm with 1,000 replicates, incorporating the ML substitution model previously determined for each gene with Modeltest.

Two large datasets, the set of 1,893 sequences used to generate the phylogeny presented in Figure 4 and the 70 sequences formed by alphabetic concatenation of influenza coding regions (Figure 1) were aligned using a parallel implementation of ClustalW, ClustalW-MPI [43]. Phylogenetic trees from these alignments were inferred using a rapid parallelised ML inference method; RAxML [44]. RAxML utilises the general-time reversible (GTR) model of nucleotide substitution and performs heuristic ML searches using maximum parsimony trees as starting points. The trees were created using 200 independent inferences and 1,000 bootstrap replicates.

Identification of recombination/reassortment breakpoints in the concatenated whole genome sequences through bootscanning and informative site analysis was performed using Simplot v3.2 [45]

Potential N-glycosylation sites in the deduced amino acid sequences were predicted using NetNGlyc version 1.0 [46].

Structural analysis

A model of the HA influenza trimer was generated using an X-ray crystallographic structure (PDB code: 2VIU [47]) downloaded from the Protein Data Bank website [48]. The structure was manipulated and presented using PyMOL. One monomer of HA trimer was highlighted to indicate the arrangement of monomers within the overall trimers. The structural position of amino acid variants discriminating between Panama/Fujian/later strains and the positions specific to genetic groups I and II were highlighted and labelled on the trimeric structure.

Supporting Information

Figure S1.

Bootscan analysis for reassortant UK viruses. A) A/England/431/2003; B) A/England/585/2003; C) A/England/558/2003; D) A/England/567/2003; E) A/Scotland/78/2003.

https://doi.org/10.1371/journal.pone.0033166.s001

(TIF)

Table S1.

Primers for whole genome sequencing of Influenza A(H3N2) viruses. Primers for gene fragment amplification and sequencing are highlighted in bold.

https://doi.org/10.1371/journal.pone.0033166.s002

(DOC)

Table S2.

Relationship between phylogenetic clustering of viruses from fatal cases and main clinical findings (described with more detail in [8] ).

https://doi.org/10.1371/journal.pone.0033166.s003

(DOC)

Table S3.

Set of A(H3N2) viruses selected from GenBank and used for the amino acid substitution comparison featured in Table 1 .

https://doi.org/10.1371/journal.pone.0033166.s004

(DOC)

Text S1.

FindSites analysis of the reassortant UK viruses.

https://doi.org/10.1371/journal.pone.0033166.s005

(RTF)

Acknowledgments

We wish to acknowledge the Respiratory Virus Unit staff for laboratory investigation of respiratory samples. We also acknowledge Louise Wilson, Alex Elliot, Wendy Barclay, Richard Pebody, Jim McMenamin and Douglas Fleming for their contribution to the clinical and epidemiological description of the paediatric fatal cases for which the present study is a follow-up.

Author Contributions

Conceived and designed the experiments: MG BJ MZ. Performed the experiments: MG BJ. Analyzed the data: MG BJ RM JE. Contributed reagents/materials/analysis tools: RM RD. Wrote the paper: MG BJ RM JE RD MZ.

References

  1. 1. Palese P, Shaw ML (2007) Orthomyxoviridae: The viruses and their replication. In: Fields BN, Knipe DM, Howley PM, et al., editors. Fields Virology. Fifth Edition ed. Philadelphia: Lippincott-Raven Publishers. pp. 1647–1689.
  2. 2. Fleming DM (2000) The contribution of influenza to combined acute respiratory infections, hospital admissions, and deaths in winter. Commun Dis Public Health 3: 32–38.
  3. 3. Rambaut A, Pybus OG, Nelson MI, Viboud C, Taubenberger JK, et al. (2008) The genomic and epidemiological dynamics of human influenza A virus. Nature 453: 615–619.
  4. 4. Ferguson NM, Galvani AP, Bush RM (2003) Ecological and immunological determinants of influenza evolution. Nature 422: 428–433.
  5. 5. Smith DJ, Lapedes AS, de Jong JC, Bestebroer TM, Rimmelzwaan GF, et al. (2004) Mapping the antigenic and genetic evolution of influenza virus. Science 305: 371–376.
  6. 6. Russell CA, Jones TC, Barr IG, Cox NJ, Garten RJ, et al. (2008) The global circulation of seasonal influenza A (H3N2) viruses. Science 320: 340–346.
  7. 7. Simonsen L (1999) The global impact of influenza on morbidity and mortality. Vaccine 17: S3–S10.
  8. 8. Johnson BF, Wilson LE, Ellis J, Elliot AJ, Barclay WS, et al. (2009) Fatal cases of influenza A in childhood. PLoS One 4: e7671.
  9. 9. Bhat N, Wright JG, Broder KR, Murray EL, Greenberg ME, et al. (2005) Influenza-associated deaths among children in the United States, 2003–2004. N Engl J Med 353: 2559–2567.
  10. 10. Holmes EC, Ghedin E, Miller N, Taylor J, Bao Y, et al. (2005) Whole-genome analysis of human influenza A virus reveals multiple persistent lineages and reassortment among recent H3N2 viruses. PLoS Biol 3: e300.
  11. 11. Vigerust DJ, Ulett KB, Boyd KL, Madsen J, Hawgood S, et al. (2007) N-linked glycosylation attenuates H3N2 influenza viruses. J Virol 81: 8593–8600.
  12. 12. Bao Y, Bolotov P, Dernovoy D, Kiryutin B, Zaslavsky L, et al. (2008) The influenza virus resource at the National Center for Biotechnology Information. J Virol 82: 596–601.
  13. 13. Barr IG, Komadina N, Hurt AC, Iannello P, Tomasov C, et al. (2005) An influenza A(H3) reassortant was epidemic in Australia and New Zealand in 2003. J Med Virol 76: 391–397.
  14. 14. Nelson MI, Simonsen L, Viboud C, Miller MA, Holmes EC (2007) Phylogenetic analysis reveals the global migration of seasonal influenza A viruses. PLoS Pathog 3: 1220–1228.
  15. 15. Bush RM, Smith CB, Cox NJ, Fitch WM (2000) Effects of passage history and sampling bias on phylogenetic reconstruction of human influenza A evolution. Proc Natl Acad Sci U S A 97: 6974–6980.
  16. 16. Nelson MI, Holmes EC (2007) The evolution of epidemic influenza. Nat Rev Genet 8: 196–205.
  17. 17. WHO (2005) Recommended composition of influenza virus vaccines for use in the 2005–2006 influenza season. WHO Weekly Epidemiological Report 80: 65–76.
  18. 18. Wiley DC, Wilson IA, Skehel JJ (1981) Structural identification of the antibody-binding sites of Hong Kong influenza haemagglutinin and their involvement in antigenic variation. Nature 289: 373–378.
  19. 19. Wilson IA, Skehel JJ, Wiley DC (1981) Structure of the haemagglutinin membrane glycoprotein of influenza virus at 3 Å resolution. Nature 289: 366–378.
  20. 20. Jin H, Zhou H, Liu H, Chan W, Adhikary L, et al. (2005) Two residues in the hemagglutinin of A/Fujian/411/02-like influenza viruses are responsible for antigenic drift from A/Panama/2007/99. Virology 336: 113–119.
  21. 21. Kilander A, Rykkvin R, Dudman SG, Hungnes O (2010) Observed association between the HA1 mutation D222G in the 2009 pandemic influenza A(H1N1) virus and severe clinical outcome, Norway 2009–2010. Euro Surveill 15 pii(9): 19498. Available from http://www.eurosurveillance.org/ViewArticle.aspx?ArticleId=19498.
  22. 22. Liu Y, Childs RA, Matrosovich T, Wharton S, Palma AS, et al. (2010) Altered receptor specificity and cell tropism of D222G hemagglutinin mutants isolated from fatal cases of pandemic A(H1N1) 2009 influenza virus. J Virol 84: 12069–12074.
  23. 23. Rogers GN, Paulson JC, Daniels RS, Skehel JJ, Wilson IA, et al. (1983) Single amino acid substitutions in influenza haemagglutinin change receptor binding specificity. Nature 304: 76–78.
  24. 24. Matrosovich M, Tuzikov A, Bovin N, Gambaryan A, Klimov A, et al. (2000) Early alterations of the receptor-binding properties of H1, H2, and H3 avian influenza virus hemagglutinins after their introduction into mammals. J Virol 74: 8502–8512.
  25. 25. Vines A, Wells K, Matrosovich M, Castrucci MR, Ito T, et al. (1998) The role of Influenza A virus hemagglutinin residues 226 and 228 in receptor specificity and host range restriction. J Virol 72: 7626–7631.
  26. 26. Wan H, Perez DR (2007) Amino Acid 226 in the Hemagglutinin of H9N2 Influenza Viruses Determines Cell Tropism and Replication in Human Airway Epithelial Cells. J Virol 81: 5181–5191.
  27. 27. Hiromoto Y, Saito T, Lindstrom S, Nerome K (2000) Characterization of low virulent strains of highly pathogenic A/Hong Kong/156/97 (H5N1) virus in mice after passage in embryonated hens' eggs. Virology 272: 429–437.
  28. 28. Kim JH, Hatta M, Watanabe S, Neumann G, Watanabe T, et al. (2010) Role of host-specific amino acids in the pathogenicity of avian H5N1 influenza viruses in mice. J Gen Virol 91: 1284–1289.
  29. 29. Berkhoff EGM, Geelhoed-Mieras MM, Fouchier RAM, Osterhaus ADME, Rimmelzwaan GF (2007) Assessment of the extent of variation in influenza A virus cytotoxic T-lymphocyte epitopes by using virus-specific CD8+ T-cell clones. J Gen Virol 88: 530–535.
  30. 30. DiBrino M, Parker KC, Margulies DH, Shiloach J, Turner RV, et al. (1994) The HLA-B14 peptide binding site can accommodate peptides with different combinations of anchor residues. J Biol Chem 269: 32426–32434.
  31. 31. Gianfrani C, Oseroff C, Sidney J, Chesnut RW, Sette A (2000) Human memory CTL response specific for influenza A virus is broad and multispecific. Hum Immunol 61: 438–452.
  32. 32. Colman PM, Varghese JN, Laver WG (1983) Structure of the catalytic and antigenic sites in influenza virus neuraminidase. Nature 303: 41–44.
  33. 33. Cooke MK, Crofts JP, Joseph CA, Goddard N, Ellis J, et al. (2005) Influenza and other respiratory viruses surveillance in the United Kingdom: October 2003 to May 2004. Commun Dis Rep CDR Suppl 15(1): 1–8. Available from http://www.hpa.org.uk/cdr/archives/2005/flu2004_5.pdf.
  34. 34. Pebody RG, McLean E, Zhao H, Cleary P, Bracebridge S, et al. (2010) Pandemic Influenza A (H1N1) 2009 and mortality in the United Kingdom: risk factors for death, April 2009 to March 2010. Euro Surveill 15:
  35. 35. Sachedina N, Donaldson LJ (2010) Paediatric mortality related to pandemic influenza A H1N1 infection in England: an observational population-based study. Lancet 376: 1846–1852.
  36. 36. Albright FS, Orlando P, Pavia AT, Jackson GG, Cannon Albright LA (2008) Evidence for a heritable predisposition to death due to influenza. J Infect Dis 197: 18–24.
  37. 37. Mubareka S, Palese P (2008) Human genes and influenza. J Infect Dis 197: 1–3.
  38. 38. Fukuyama S, Kawaoka Y (2011) The pathogenesis of influenza virus infections: the contributions of virus and host factors. Curr Opin Immunol 23: 481–486.
  39. 39. Matrosovich M, Matrosovich T, Carr J, Roberts NA, Klenk HD (2003) Overexpression of the alpha-2,6-sialyltransferase in MDCK cells increases influenza virus sensitivity to neuraminidase inhibitors. J Virol 77: 8418–8425.
  40. 40. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 4876–4882.
  41. 41. Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl Acids Symp Ser 41: 95–98.
  42. 42. Posada D, Crandall KA (1998) MODELTEST: testing the model of DNA substitution. Bioinformatics 14: 817–818.
  43. 43. Li KB (2003) ClustalW-MPI: ClustalW analysis using distributed and parallel computing. Bioinformatics 19: 1585–1586.
  44. 44. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
  45. 45. Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, et al. (1999) Full-Length Human Immunodeficiency Virus Type 1 Genomes from Subtype C-Infected Seroconverters in India, with Evidence of Intersubtype Recombination. J Virol 73: 152–160.
  46. 46. Gupta R, Jung E, Brunak S (2004) Prediction of N-glycosylation sites in human proteins. Available from http://www.cbs.dtu.dk/services/NetNGlyc/. Accessed 08/03/2010.
  47. 47. Fleury D, Wharton SA, Skehel JJ, Knossow M, Bizebard T (1998) Antigen distortion allows influenza virus to escape neutralization. Nat Struct Biol 5: 119–123.
  48. 48. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235–242.