Facile laboratory tools are needed to augment the identification of Salmonella enterica subsp. enterica serovar Enteritidis (S. Enteritidis) in contamination events and to trace the contamination back to its source (traceback). Understanding the evolution and diversity within and among outbreak strains is the first step towards this goal. To this end, we collected 106 new S. Enteritidis isolates within S. Enteritidis Pulsed-Field Gel Electrophoresis (PFGE) pattern JEGX01.0004 and close relatives, and determined their genome sequences. Sources for these isolates spanned food, clinical and environmental farm sources collected during the 2010 S. Enteritidis shell egg outbreak in the United States, along with closely related serovars S. Dublin, S. Gallinarum biovar Pullorum and S. Gallinarum. Despite the highly homogeneous structure of this population, S. Enteritidis isolates examined in this study revealed thousands of SNP differences and numerous variable genes (n = 366). Twenty-one of these genes from the lineages leading to outbreak-associated samples had nonsynonymous changes (changes that alter the encoded amino acid), and five genes are putatively involved in known Salmonella virulence pathways. While chromosome synteny and genome organization appeared to be stable among these isolates, genome size differences were observed due to variation in the presence or absence of several phages and plasmids, including phage RE-2010, phage P125109, plasmid pSEEE3072_19 (similar to pSENV), plasmid pOU1114 and two newly observed mobile plasmid elements, pSEEE1729_15 and pSEEE0956_35. These differences caused the number of assembled bases in the draft genomes to range from approximately 4.6 to 4.8 mbp, with S. Dublin being larger (~4.9 mbp) and S. Gallinarum smaller (4.55 mbp) when compared to S. Enteritidis. Finally, we identified variable S. Enteritidis genes associated with virulence pathways that may be useful markers for the development of rapid surveillance and typing methods, potentially aiding in traceback efforts during future outbreaks involving S. Enteritidis PFGE pattern JEGX01.0004.
Citation: Allard MW, Luo Y, Strain E, Pettengill J, Timme R, et al. (2013) On the Evolutionary History, Population Genetics and Diversity among Isolates of Salmonella Enteritidis PFGE Pattern JEGX01.0004. PLoS ONE 8(1): e55254. doi:10.1371/journal.pone.0055254
Editor: Jose Alejandro Chabalgoity, Facultad de Medicina, Uruguay
Received: July 9, 2012; Accepted: December 21, 2012; Published: January 30, 2013
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: Food and Drug Administration research funds were provided. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The accurate subtyping and subsequent clustering of bacterial isolates associated with a foodborne outbreak event is important for successful investigation and eventual traceback to a specific food or environmental source. However, clonally derived strains, common within Salmonella enterica subsp. enterica serovar Enteritidis (S. Enteritidis), confound epidemiological investigations because of the limited genetic differentiation of these strains. Existing approaches often lack the resolution for separating tightly linked bacterial isolates such as those originating from S. Enteritidis. In response to such events, federal public health, academic and industry food safety laboratories are exploring next-generation sequencing (NGS) technologies to investigate complex and challenging outbreak scenarios. Recent examples in the literature illustrate the ability of NGS to detect variation within otherwise indistinguishable isolates. These efforts have identified micro-evolutionary differences that genetically link clinical isolates, outbreak isolates found in foods, and their environmental counterparts in Salmonella, Escherichia coli, Vibrio, as well as numerous other bacteria. Our genomics laboratory and others have successfully applied these NGS approaches to a case study of S. Montevideo in spiced Italian-style meats, where it was determined that the methods and results were reproducible. Moreover, extensive data mining within these novel genomes should yield new genetic targets to augment investigations during outbreaks of highly clonal Salmonella pathogens.
S. Enteritidis remains a significant pathogen and a substantial threat to the food supply. It also represents one of the most genetically homogeneous serotypes of Salmonella, and certain clonal lineages remain intractable to differentiation by commonly used conventional subtyping methods. The unusual genetic homogeneity observed among certain lineages of S. Enteritidis strains remains intriguing. Recent population genetic studies suggest that most S. Enteritidis strains belong to a single multilocus genotype. A subpopulation of this clone was shown to associate more frequently with egg-related salmonellosis and clinical illness. Thus, specific requirements for colonization and survival in infected poultry may select for only a few genotypes of S. Enteritidis in the poultry environment. Random amplification of polymorphic DNA (RAPD), real-time polymerase chain reaction (RT-PCR), and phage typing (PT) applied to diverse isolates within S. Enteritidis have revealed only a limited amount of genetic variation. More recently, finer discrimination of these salmonellae has been reported using rapidly evolving CRISPR elements. In contrast to methods that target a subset or region of variation in the S. Enteritidis chromosome, whole genome sequencing (WGS) captures all of the genetic variation that exists among these highly clonal lineages. To date, only a few strains of S. Enteritidis are available as complete genomes, along with close relatives S. Gallinarum and S. Gallinarum biovar Pullorum. These isolates have genome sizes around 4.7 mbp. The basic pan genomes are described in these initial studies, but currently there are no published NCBI draft comparative genomes or associated manuscripts describing variation within S. Enteritidis. In this study, we describe the natural genetic variation within S. Enteritidis isolates associated with a widespread egg contamination event and retaining pulsed-field gel electrophoresis (PFGE) pattern JEGX01.0004, and we analyze the comparative evolutionary genetics within this important foodborne pathogen and several of its closest relatives.
In 2010, the Centers for Disease Control and Prevention (CDC) along with many state laboratories identified a nationwide increase in S. Enteritidis isolates submitted to PulseNet (http://www.cdc.gov/salmonella/enteritidis/). Epidemiological investigations suggested that shell eggs were the most likely source of this increase. FDA, CDC, and state partners conducted traceback investigations and found many of the restaurants involved received shell eggs from a single company (http://www.fda.gov/food/newsevents/whatsnewinfood/ucm222684.htm). As a result, on August 13, 2010, one egg producer initiated a nationwide voluntary recall of shell eggs that had been sold to distributors and wholesalers in 22 states and Mexico. A record 380 million shell eggs were recalled under many different brand names. On August 19, a second egg producer initiated an additional recall of eggs that went to grocery stores, distributors, and wholesalers in 14 states. The second producer shared a contaminated feed supply with the first and was geographically nearby. In all, more than 500 million eggs were involved during this nationwide recall.
The primary goal of this study was to examine the genetic variability of isolates collected during the 2010 S. Enteritidis shell egg outbreak within the PFGE pattern JEGX01.0004, a pattern comprising over 40% of all of the S. Enteritidis isolates submitted to the national database. We also included several other isolates with similar PFGE patterns to JEGX01.0004 found in the associated egg-farm environment. We went on to describe the genetic diversity and evolutionary history of 106 new draft genomes for this virulent pathogen within this narrow but important sampling of S. Enteritidis diversity. As a result, we were able to provide new genetic targets useful for distinguishing S. Enteritidis isolates otherwise indistinguishable by several current methodologies. Once validated, these new SNP targets can be interrogated using widely available DNA sequencing through capillary electrophoresis (CE), short-read pyrosequencing, real-time PCR, or mass spectrometry of PCR amplicons. Finally, this study evaluates the potential use of targeted genomic sequencing with next generation sequencing (NGS) for rapidly resolving future S. Enteritidis outbreaks in eggs.
Materials and Methods
Salmonella Enteritidis strains
A set of 67 food, environmental, and clinical S. Enteritidis isolates collected from farms and egg sources linked to the 2010 egg contamination event was included for whole genome sequencing. Specifically, 36 S. Enteritidis isolates, originating from environmental swabs, were collected directly from various farm sources implicated in the contamination event (e.g., egg wash water). Four S. Enteritidis isolates were obtained directly from shell eggs, liquid eggs, or other egg-containing food sources known to be contaminated during this time period. Two S. Enteritidis isolates were obtained directly from chicken feed or components thereof at the implicated farms. An additional 25 clinical isolates, collected during the time of the egg contamination event (2010) and sharing PFGE patterns with the egg S. Enteritidis isolates, were kindly provided by the Centers for Disease Control and Prevention and included for sequencing. In addition, 39 isolates, collected earlier and unrelated to the contamination event, were added as reference S. Enteritidis for the WGS analysis. These included 13 isolates with two-enzyme matching PFGE patterns; seven with single-enzyme matching patterns, indistinguishable in either the primary (XbaI, n = 3) or secondary (BlnI, n = 4) enzyme; and 19 isolates with no PFGE patterns in common with the contamination event. These isolates also were used to further investigate the phylogenetic utility of phage typing. Included in this group of 39 were 10 isolates of unknown PT and 14 historical PT8 isolates. The remaining 15 were isolates of S. Enteritidis from ten other diverged PTs: PT1, 21, 2, 4, 14b, 13, 13a, 23, 28 and 35. S. Enteritidis strains were phage-typed by previously described methods at the National Microbiology Laboratory, Canadian Science Centre for Human and Animal Health, Winnipeg, Manitoba, Canada.
Strains that reacted with phages but produced unrecognizable lytic patterns were designated atypical or RDNC (reacts but does not conform). Specific PFGE pattern names, PTs, and other metadata associated with the S. Enteritidis strains are listed in Table 1 (PTs are included in the tree label names).
Table 1. Metadata associated with the isolates examined in this study.doi:10.1371/journal.pone.0055254.t001
Growth of bacterial strains, and genomic and plasmid DNA isolation
Genomic DNA was isolated from overnight cultures as follows: each initial pure culture sample was taken from frozen stock, plated on Trypticase Soy Agar, and incubated overnight at 37°C. After incubation, cells were taken from the plate and inoculated into Trypticase Soy Broth culture for DNA extraction. All samples were representative cultures from a full-plate inoculation and were not single colonies. Genomic DNA was extracted using Qiagen DNeasy kits.
Library construction and genome sequencing
For this study, all S. Enteritidis isolates were shotgun sequenced using the Roche 454 GS Titanium NGS technology. This platform provided longer read lengths relative to other sequencing methods and a relatively short time to generate raw sequence information. Taxon sampling included one new isolate each of S. Gallinarum and S. Gallinarum biovar Pullorum, two isolates of S. Dublin, and 106 new isolates of S. Enteritidis, the majority sharing the same PFGE pattern and a few differing in PFGE pattern (Table 1). These Salmonella serotypes have traditionally been considered close relatives. Each isolate was run on a quarter of a Titanium plate, which produced roughly 250,000 reads per draft genome and an average genome coverage of about 20×.
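The coverage figure quoted above follows from simple arithmetic: mean coverage is the total number of sequenced bases divided by the genome size. The following is a minimal sketch in Python. The genome size of 4.7 mbp is taken from the sizes reported for S. Enteritidis; the mean read length of 376 bp is an assumption, not stated in the text, chosen only because it reproduces the stated coverage of about 20×.

```python
def mean_coverage(n_reads: int, mean_read_len: float, genome_size: int) -> float:
    """Expected mean depth: total sequenced bases divided by genome length."""
    return n_reads * mean_read_len / genome_size


# 250,000 reads per draft genome (from the text).
# ASSUMPTION: 376 bp mean read length and a 4.7 mbp genome.
coverage = mean_coverage(250_000, 376, 4_700_000)
print(f"{coverage:.1f}x")
```

Under these assumptions, a quarter plate yields roughly 94 million bases per isolate; a shorter mean read length or a larger genome would lower the estimate proportionally.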
Genome assembly and annotation
De novo assemblies were created for each S. Enteritidis isolate using the Roche Newbler runAssembly software (v. 2.6). All draft genomes were annotated using NCBI's Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP). Comparison of the de novo assemblies against the complete genome for S. Enteritidis strain 125109 (GenBank accession: AM933172) using Mauve identified several large contigs that did not map to the reference genome: phage RE-2010 (Accession: HM7700079), plasmid pOU1114 (Accession: DQ115387, strain SL909), a plasmid from strain CDC_2010K_1729 (pSEEE1729_15), a plasmid from strain CDC_2010K-0956 (pSEEE0956_35), and a plasmid from strain 607307-2 (pSEEE3072_19). The reference sequence used for mapping reads comprised the complete S. Enteritidis genome (AM933172, which includes the P125109 phage) plus the five additional elements described above.
Comparative genomic analysis
SNPs were identified by mapping the 454 reads to the reference genome using Roche Newbler runMapping software (v. 2.6). SNPs were defined as positions where one or more isolates differed from the reference sequence with coverage ≥4× and with ≥95% of the reads containing the SNP, excluding insertions and deletions (indels). The alignments were then screened to find non-gap phylogenetically informative nucleotide positions (i.e., minor allele count ≥2). The mapped consensus bases for each isolate at the reference SNP positions were then concatenated in a multiple FASTA file for phylogenetic analysis. The maximum likelihood tree was constructed using GARLI with 1000 bootstrap replicates. All GARLI analyses were performed with the default parameter settings and the GTR+Γ+I nucleotide substitution model. SNPs in single-copy protein-coding genes were identified using the same criteria by mapping the isolate reads to the annotated CDS regions in AM933172. Multiple alignments for genes with SNPs were created using the UCLUST software package. A total of 366 genes met the SNP criteria and were present in 95% or more of the 106 isolates. These 366 genes represent a conservative estimate of the set of variable genes, as we have eliminated indels and CDS regions that could not be reliably predicted and annotated. A phylogenetic tree also was built with TNT, and characters were optimized onto the tree to assess character evolution at several of the critical nodes associated with the outbreak-implicated farm isolates, as well as to identify SNPs specific to S. Enteritidis.
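The SNP-calling thresholds described above (coverage ≥4×, ≥95% of reads carrying the variant, indels excluded) can be expressed as a simple per-site filter. The sketch below is illustrative only: the `SiteCall` record and its field names are hypothetical and do not correspond to the output format of the Newbler runMapping software.

```python
from dataclasses import dataclass

MIN_DEPTH = 4      # coverage threshold from the text (>= 4x)
MIN_PERCENT = 95   # percent of reads that must carry the variant (>= 95%)


@dataclass
class SiteCall:
    """Hypothetical per-isolate summary of one reference position."""
    depth: int              # reads covering the position
    variant_reads: int      # reads supporting the non-reference base
    is_indel: bool = False  # insertions and deletions are excluded


def passes_snp_filter(call: SiteCall) -> bool:
    """Apply the depth and read-fraction thresholds; reject indels."""
    if call.is_indel or call.depth < MIN_DEPTH:
        return False
    # Integer arithmetic avoids floating-point edge cases at exactly 95%.
    return call.variant_reads * 100 >= MIN_PERCENT * call.depth


print(passes_snp_filter(SiteCall(depth=20, variant_reads=19)))  # exactly 95%
print(passes_snp_filter(SiteCall(depth=3, variant_reads=3)))    # too shallow
```

A site supported by 19 of 20 reads passes, whereas a site covered by only three reads is rejected regardless of how many of them carry the variant.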
Phylogenetic analyses of the clonal S. Enteritidis data set, including multiple outgroups, were performed on the concatenated informative SNP matrix described above. Approximately 99% of the sites in the ~5 mbp Salmonella genomes are phylogenetically uninformative (i.e., showing no differences that provide clustering information), and eliminating them dramatically reduces computation time and memory requirements. Additional phylogenetic analyses were performed on the set of 366 concatenated genes containing informative SNPs.
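The reduction to an informative SNP matrix can be sketched as a column filter over an alignment. This is a minimal illustration, not the pipeline used in the study: it reads "minor allele count ≥2" as requiring the second most common base in a column to occur in at least two isolates, and it treats any character other than A, C, G or T as a gap; both readings are assumptions. The function names are hypothetical.

```python
from collections import Counter


def informative_columns(seqs: list[str], min_minor: int = 2) -> list[int]:
    """Indices of gap-free columns whose second most common base
    occurs in at least `min_minor` sequences."""
    keep = []
    for i, column in enumerate(zip(*seqs)):
        if any(base not in "ACGT" for base in column):
            continue  # skip columns containing gaps or ambiguous calls
        counts = Counter(column).most_common()
        if len(counts) >= 2 and counts[1][1] >= min_minor:
            keep.append(i)
    return keep


def concatenate_sites(seqs: list[str], columns: list[int]) -> list[str]:
    """Build the reduced matrix from the selected columns."""
    return ["".join(s[i] for i in columns) for s in seqs]


alignment = ["ACGTAA", "ACGTCA", "ATGACA", "ATG-AC"]  # toy data
cols = informative_columns(alignment)
print(cols, concatenate_sites(alignment, cols))
```

In the toy alignment, invariant columns, the column with a gap, and the column where only one isolate differs (a singleton) are all dropped, because none of them can group isolates into clusters.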
Whole genome shotgun (WGS) accessions and BioProject accession numbers are listed in Table 1.
Genome size, order and conservation
New draft genomes are provided for 110 Salmonella isolates, including 106 S. Enteritidis and four closely related outgroups: two S. Dublin and one each of S. Gallinarum and S. Gallinarum biovar Pullorum (Table 1). While synteny and genome organization were largely stable among these isolates, genome size differences were observed due to variation in the presence or absence of several phages and plasmids, including phage RE-2010, phage P125109, plasmid pOU1114, and several newly observed plasmid mobile elements: pSEEE1729_15, pSEEE0956_35 and pSEEE3072_19 (Figures 1 and 2, Table 1). One of these, pOU1114, is a newly finished complete plasmid known from partial data to reside within S. Enteritidis and its close relative S. Dublin. pSEEE3072_19 is closely related to the previously characterized S. Enteritidis plasmid pSENV. Presence or absence of mobile elements in S. Enteritidis contributed to a genome size ranging from 4.6 to 4.9 mbp, with S. Dublin being relatively larger (~4.9 mbp) and S. Gallinarum smaller (4.55 mbp) when compared to the S. Enteritidis genomes collected here. A bimodal split centered on 4.7 mbp was noted, which largely corresponds to mobile elements that partition predictably between phylogenetic lineages (Table 1, Figures 1 and 3).
Figure 1. The number of assembled bases and N50 contig size listed for each of the sequenced isolates.
Points are colored according to the phages and plasmids that were found in the sequencing results.doi:10.1371/journal.pone.0055254.g001
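Figure 1 reports the N50 contig size for each assembly. For readers unfamiliar with the statistic, the sketch below computes it using the common definition: the contig length at which contigs of that size or larger contain at least half of all assembled bases. The contig lengths in the example are made up for illustration and are not taken from the study.

```python
def n50(contig_lengths: list[int]) -> int:
    """Contig length L such that contigs of length >= L together
    contain at least half of the assembled bases."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("no contigs given")


# Toy assembly with five contigs totalling 300 bases.
print(n50([100, 80, 60, 40, 20]))
```

A higher N50 for a similar number of assembled bases indicates a less fragmented draft assembly.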
Figure 2. Circle plot showing general conservation of synteny among PFGE pattern JEGX01.0004 of Salmonella Enteritidis, with phage and plasmid differences listed for 9 representative isolates.doi:10.1371/journal.pone.0055254.g002
Figure 3. Phylogenetic tree based on the maximum-likelihood method implemented in GARLI.
Numbers associated with branches represent the percent of 1000 bootstrap replicates supporting the major clades C1 through C9. Acquisition of ALFR00000000 putative plasmid pSEEE1729_15 is defined by a star at the base of C1.doi:10.1371/journal.pone.0055254.g003
Most clinical isolates are phylogenetically close to isolates from two egg farms
A set of 106 ecologically diverse food, environmental, and clinical S. Enteritidis strain isolates, associated with the time period surrounding the 2010 egg contamination event, were included for whole genome sequencing. Strains with expanding diversity and representing three important levels for comparison were included in the analysis. The first group of 60 strains represented a highly homogeneous set of environmental, farm, food, and clinical S. Enteritidis isolates sharing a common PFGE pattern and temporally associated with the 2010 egg contamination event. The second tier of 30 strains included a set of historical environmental, food, and clinical S. Enteritidis isolates that retained identical or highly similar PFGE patterns but were unassociated with the 2010 egg contamination event, unrelated in time, location or isolation source. Finally, the last group of 16 isolates was also unrelated to the 2010 egg event and included a series of S. Enteritidis strains with more diverged PFGE patterns and phage types away from the 2010 egg S. Enteritidis isolates. These strains served largely as genetic references, effectively allowing for a testing of the phylogenetic monophyly of the 2010 egg-associated S. Enteritidis isolates. As an example, these isolates include other phage types such as PT4, PT23, PT14b, and PT1 and date back over 50 years in time.
Phylogenetic analysis of these genomes revealed several interesting observations. First, the S. Enteritidis PFGE pattern JEGX01.0004 isolates, together with related strains and strains with similar PFGE patterns, formed a monophyletic group distinct from the neighboring serovars S. Dublin, S. Gallinarum, and S. Gallinarum biovar Pullorum. Previous comparative genomics studies have shown that S. Enteritidis, S. Dublin, S. Gallinarum biovar Pullorum and S. Gallinarum form a natural group, a finding supported by our results. Second, within S. Enteritidis, nine lineages were defined from the tree (Figure 3). Genetic diversity between different serovars included thousands of differences, while variability between the nine lineages of S. Enteritidis, labeled C1–C9, was only on the order of 100 to 600 nucleotide changes. Within-lineage variation was usually fewer than 100 nucleotide differences, with the exception of lineage C7, which had over 200 nucleotide differences of intra-clade variability (Table 2).
Table 2. Pairwise SNP distances+/−SD between major lineages identified in the phylogenetic tree (C = clade).doi:10.1371/journal.pone.0055254.t002
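Pairwise SNP distances of the kind summarized in Table 2 can be computed by counting mismatched positions between every pair of isolates drawn from two clades and then summarizing with a mean and standard deviation. The sketch below assumes equal-length, aligned SNP strings and uses the sample standard deviation; the text does not state which estimator was used, and the sequences shown are toy data.

```python
from itertools import product
from statistics import mean, stdev


def snp_distance(a: str, b: str) -> int:
    """Number of aligned positions at which two sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def between_clade_distance(clade_a: list[str], clade_b: list[str]) -> tuple[float, float]:
    """Mean and sample SD of SNP distances over all cross-clade pairs."""
    dists = [snp_distance(a, b) for a, b in product(clade_a, clade_b)]
    sd = stdev(dists) if len(dists) > 1 else 0.0
    return mean(dists), sd


# Toy SNP matrices for two hypothetical clades.
clade_1 = ["AAAA", "AAAT"]
clade_2 = ["TTAA", "TTAT"]
avg, sd = between_clade_distance(clade_1, clade_2)
print(f"{avg:.2f} +/- {sd:.2f}")
```

Within-clade variation can be obtained in the same way by comparing each isolate with every other isolate of its own clade.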
Clinical isolates sorted into several of the major lineages (clades C1, C2, C3 and C5; Figure 3), with most falling into clades C1 and C2. It is noteworthy that no apparent increase in substitutions was observed for the isolates that passed through patients compared to their environmental clones. If genetic diversity had expanded among the clinical isolates studied, relative to other food and environmental S. Enteritidis collected in relation to the 2010 egg event, one would expect to see it as longer terminal branches leading to the 2010 clinical isolates in the tree. In general, this was not observed. Admittedly, several clinical isolates (e.g., SEEE9845 and SEEE4647, both from Ohio) reflect the accumulation of a few additional SNPs, as their terminal branches project slightly from the base of the 2010 egg isolates in clade 1. However, comparably subtle genetic variation was also noted among environmental and egg isolates, indicating that no additional or overt pressure to change was applied in vivo to the clinical strains included here. For example, environmental isolates from Ohio (e.g., SEEE1117 and SEEE1618), also in clade 1, vary comparably in their branch lengths to the aforementioned clinical isolates.
Clades C7, C8 and C9 contained a diversity of isolates from unrelated and historical freezer stocks that were not connected to the large shell egg outbreak (Table 1). Additionally, environmental S. Enteritidis isolates taken from Farm 1 were found in clades C6 and C1, while environmental S. Enteritidis isolates from Farm 2 were observed in clades C4 and C2, with one isolate in C1. It is important to note that in the S. Enteritidis strain tree presented here, the phylogenomic data sort in a largely hierarchical fashion. That is, isolates associated with the 2010 S. Enteritidis egg event cluster most closely together, with additional SNP diversity providing higher resolution for related strains within the contamination event. Additionally, nearly all of the reference isolates retaining common PFGE patterns but unassociated with the egg event sort adjacent to but outside of the 2010 S. Enteritidis egg, clinical, and farm swarm of isolates. Surprisingly, however, several of these genetically similar S. Enteritidis reference strains lacking any temporal relatedness to the 2010 egg event do partition with other egg isolates. One S. Enteritidis isolate from 2004, for example, formed a sub-clade with two clinical isolates from Tennessee within the larger clade 2 in the genome tree (Figure 3). Also in clade 2, a historical S. Enteritidis isolate from California (1441) sorted closely with two S. Enteritidis clinical isolates from Minnesota collected in 2010 during the egg event. The substantial number of SNPs that partition strains within S. Enteritidis clades 1 and 2, together with these examples of phylogenetic homogeneity, may point to additional source reservoirs of S. Enteritidis contamination during the 2010 egg event.
It is important to note that many S. Enteritidis strains with common phage-types are polyphyletic (do not sort into a single group) in the whole-genome sequence tree. S. Enteritidis strains designated as PT8, for example, are phylogenetically distributed across clades 1, 2, 3, 5, 6, 7, and 8 suggesting that despite retaining this common phenotypic feature, phage types are phylogenetically distinct and diverged among their genome sequences. This observation is not unexpected  given the intrinsic horizontal movement of phage restriction across diverged strains of S. enterica.
Genetic variation defining S. Enteritidis
More than 50 genes vary with SNPs that define S. Enteritidis separately from the outgroups compared in this study (Table 3). For example, the multicopper oxidase gene, (cueO, locus tag SEN0173), represents one gene with numerous genetic signatures unique to S. Enteritidis strains. This gene and protein alignment show a dozen SNP differences and three amino acid differences that appear to be present in all S. Enteritidis examined. Serovar-defining signature amino acid differences include E to Q (position 132), P to L (position 337), and L to S changes (position 342). Other genes that vary with S. Enteritidis specific SNPs and amino acid changes include: the fimbrial usher protein (bcfC, locus tag SEN0022); fimbrial structural subunit (safD, locus tag SEN0284); 2-methylcitrate dehydratase (prpD, locus tag SEN0353); Trp operon repressor (trpR, locus tag SEN4339); tRNA(Ile)-lysidine synthetase gene (tilS, locus tag SeD_A0258); iron-hydroxamate transporter ATP-binding subunit (fhuC, locus tag STM0192); ABC transporter ATP-binding protein (locus tag SEN0716); electron transfer flavoprotein (fixA, locus tag SEN0076); and invasion-associated secreted effector protein (sopE2, locus tag SEN1182) to name a few (Table 3).
Table 3. Variable genes observed that may define the serotype Salmonella Enteritidis.doi:10.1371/journal.pone.0055254.t003
Genetic variation defining S. Enteritidis outbreak lineages
At least 366 genes varied among S. Enteritidis strains comprising the egg-associated foodborne isolates, the farm environmental samples, and temporally-associated clinical samples (Table S1). Of the 366 genes that varied, 21 had nonsynonymous changes that were optimized to one of the branches supporting egg-associated clades C1, C2 or the shared lineage leading to C1 and C2 collectively (Table 4). These variable genes represent micro-evolutionary changes that arose within this highly clonal lineage of Salmonella persisting in the food supply and chicken farm environment; thus they may play a role in the subsequent rapid subtyping of isolates in future food contamination events involving S. Enteritidis pattern JEGX01.0004.
Table 4. Variable genes observed for several critical outbreak clades.doi:10.1371/journal.pone.0055254.t004
Specific genes associated with implicated farm isolates
Nucleotide substitutions in 17 genes, 11 of which were nonsynonymous, were identified at the node uniting isolates from the two egg farms (Table 4). In addition, isolates obtained from Farm 1 shared nonsynonymous changes in two genes, SthB and YjjP. Farm 2 S. Enteritidis isolates shared substitutions in nine genes, eight of which were nonsynonymous.
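The distinction between synonymous and nonsynonymous substitutions used throughout these results can be illustrated by translating the reference and variant codons with the standard genetic code and comparing the encoded amino acids. The sketch below is a generic illustration, not the annotation procedure used in the study. The codons in the example are assumptions chosen to reproduce the E to Q and P to L amino acid changes reported for cueO; the actual codons involved are not given in the text.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_substitution(ref_codon: str, alt_codon: str) -> str:
    """Label a codon change by its effect on the encoded amino acid."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    if ref_aa == alt_aa:
        return "synonymous"
    return f"nonsynonymous ({ref_aa}->{alt_aa})"


# ASSUMPTION: illustrative codons only.
print(classify_substitution("GAA", "CAA"))  # glutamate to glutamine
print(classify_substitution("CTG", "CTA"))  # both codons encode leucine
```

Only nonsynonymous changes alter the protein sequence, which is why they are singled out when looking for variation with potential functional consequences.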
Like other molecular epidemiology studies of Salmonella employing genomic technologies –, this work demonstrates that comparative NGS methods can be employed to clearly augment food contamination investigations by genetically linking the implicated sources of contamination with farm and clinical isolates. The genomic evidence herein corroborates epidemiological conclusions from outbreak investigations based on statistical analysis and source tracking leads. However, with NGS, one can gain additional detailed micro-evolutionary knowledge within the associated outbreak and reference isolates; thus providing additional evidence linking implicated farms to some of the clinical isolates but not others initially associated with this foodborne contamination. Moreover, the level of genetic resolution obtained using NGS methods permits a delimiting of the scope of an outbreak in the context of an investigation even for the most genetically homogeneous salmonellae (e.g., S. Enteritidis). In this study, NGS data retrospectively supported the decision to recall a half a billion shell eggs by revealing numerous nucleotide and amino acid changes (SNPs) found in both eggs and from hen houses; the changes were also shared with some food and clinical isolates. It is noteworthy that the comparative NGS results reported here provided additional resolution, with new genomic data, that some clinical isolates collected during the time of the egg contamination event and with the same PFGE Pattern JEGX01.0004 may not be linked to the implicated farm isolates studied. That is, while most of the strains collected during this time period and sharing a common PFGE pattern fall into clades 1 and 2 (Figure 3) with the egg and farm isolates, several strains known to be unrelated to the outbreak, including historical isolates from 2004, interrupt these lineages, indicating additional potential sources of contamination.
Data mining of these novel genomes should provide new genetic targets for tool development in public health laboratories and will augment investigations during highly clonal outbreaks of Salmonella pathogens. Akin to earlier findings of NGS-based differentiation of S. Montevideo isolates associated with pepper and spiced meats, the signature genetic differences uncovered here will provide additional insight into what will likely remain a common pattern of S. Enteritidis associated with the food supply. The unique genetic identifiers yielded by whole-genome sequencing clearly earmark NGS as a valuable tool for augmenting future molecular epidemiology investigations, both for rapidly distinguishing serotypes and PFGE types and for providing markers that can differentiate highly clonal outbreak lineages into informative isolate sublineages.
By using a targeted comparative genomic approach that spanned nearly the entire genomic complement of the highly homogeneous S. Enteritidis variants included here (i.e., PFGE pattern JEGX01.0004), a robust genotyping SNP panel was compiled that not only discriminated this S. Enteritidis clone from other closely related strains but also fully resolved member isolates within this cluster. This is an important alternative to other methods that have been examined for surveying genomic diversity among foodborne pathogenic strains. One such approach uses NGS to examine diversity among a pooled isolate set instead of pure cultures but, as expected, this approach is far less robust. As an example, a recent genotyping panel for O157 STECs revealed lower diversity among the isolates using the selected NGS-based genotyping panel than a two-enzyme PFGE method. Specifically, the authors reported finding over 16,000 variable SNPs, but because STEC isolates were pooled and sequenced at low coverage, critical SNPs defining major lineages and sublineages went undetected in this analysis. This was likely due to the failure of the “pooling” approach to link signature SNPs back to a particular source genome. While strain “pooling” may be a faster way to collect SNP data, it may not be an optimal method when discriminating a specific lineage of strains or an isolate cluster of interest. In contrast, comparative genomics approaches rely on high-coverage draft genomes coupled with rigorous phylogenetic analyses and character optimization to resolve accurate evolutionary and genetic relatedness among closely related strains. With such information, individual SNPs can be evaluated in an evolutionary context (i.e., whether they define lineages or represent homoplasy due to convergent gains or character reversals).
Indeed, a targeted phylogenetic approach produces a robust genotyping panel because the resultant SNPs can be carefully chosen to represent diversity among targeted isolates while omitting uninformative SNPs. Conversely, “pooling” strategies might work better within clonal outbreak lineages where hundreds, not thousands, of SNPs are present.
Mobile elements, such as phages and plasmids, are often the most promiscuous portions of bacterial genomes, including that of Salmonella. The mobilome, as these elements are often collectively called, appears to be regularly rearranging among closely related clonal lineages of Salmonella. As expected, S. Enteritidis shows a similar susceptibility to loss and gain of these elements, as do other members of the Enterobacteriaceae. In addition to seeing variability among these elements, several new plasmids were discovered, suggesting that additional mobile elements remain undescribed across Salmonella genomes. New phages and plasmids are being reported regularly. It is becoming apparent that a renewed effort to describe and identify the complete mobilomes of newly sequenced isolates should be undertaken, especially for pathogenic strains that persist in and emanate from the environment. From these data, it would appear that mobility of these elements is not restricted to close relatives. At least one of the newly discovered Salmonella plasmids (pSEEE1729_15) had its closest BLAST match to E. coli O157:H7 strain EC4115, suggesting that parts of the mobilome may be transferred from other related enterobacterial species. Moreover, observations of this nature clearly broaden the possibility of new acquisitions into the S. Enteritidis pan-genome.
Natural selection has been reported in other Salmonella isolates and appears to be a major component of the evolution of this pathogen. Some of the genes that vary are found on the mobilome, such as the putative phage terminase gene, supporting the notion that there are actively evolving genes on some mobile elements. This evolutionary strategy could provide a scenario whereby genes under strong selection are shaped by natural selection and then easily distributed, through mobile genetic elements, among the various members of a serotype and other more distant lineages.
Some investigators are beginning to search for genetic determinants of survival and virulence of S. Enteritidis in chickens, mice, and cell culture models. By observing which genes varied among environmental farm and clinical isolates, we sought similar insight in the hope of identifying factors that may contribute to outbreaks. One study linked SNP variability in a stress response gene (rpoS) to isolates able to infect poultry. We observed nonsynonymous variability in a gene (phoP) that has been demonstrated to be a regulator of rpoS, and that gene varied uniquely in the lineage defining Clades 1 and 2 (Table 4). The phoP gene is also thought to be important to S. Enteritidis virulence based on evidence from a mouse model. This change was observed among the SNPs listed in Table 4, which are a conservative subset of the variable SNPs and genes; these SNPs were chosen for their potential diagnostic utility rather than to provide a full comparative genomic description of these isolates.
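For readers less familiar with the distinction between synonymous and nonsynonymous changes, the sketch below shows how a single-base substitution within a codon can be classified by translating the reference and altered codons. It is a minimal illustration, not the annotation procedure used in this study: the function name `classify_snp` and the example codons are hypothetical and are not taken from phoP or any gene in Table 4. The codon assignments are those of the standard genetic code, which the bacterial translation table shares for all non-start codons.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_snp(ref_codon, position, alt_base):
    """Classify a single-base change within one codon.

    ref_codon: three-letter reference codon (DNA, coding strand).
    position:  0, 1 or 2, the index of the changed base within the codon.
    alt_base:  the substituted base.
    Returns (kind, ref_amino_acid, alt_amino_acid), where kind is
    'synonymous' or 'nonsynonymous' ('*' denotes a stop codon).
    """
    ref_codon = ref_codon.upper()
    alt_base = alt_base.upper()
    if len(ref_codon) != 3 or position not in (0, 1, 2) or alt_base not in BASES:
        raise ValueError("expected a 3-base codon, position 0-2, and base in TCAG")

    alt_codon = ref_codon[:position] + alt_base + ref_codon[position + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    kind = "synonymous" if ref_aa == alt_aa else "nonsynonymous"
    return kind, ref_aa, alt_aa


# Hypothetical examples (not SNPs from this study).
print(classify_snp("CTG", 2, "A"))  # third-position change, Leu -> Leu
print(classify_snp("GAT", 1, "G"))  # second-position change, Asp -> Gly
```

Only changes of the second kind alter the encoded protein, which is why the nonsynonymous subset is the natural starting point when looking for candidate functional differences among closely related isolates.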
Another recent hypothesis about the genes involved in salmonellosis focuses on the ABC transporter genes and the ability of pathogens to acquire nutrients for survival during host infection. Our study shows variability in an ABC transporter for methionine that is specific to clades 1 and 2 (Table 4). The model that Osborne et al. tested in vivo, involving an ABC transporter for alanine, parallels the natural variability we observe in a similar gene in the implicated farm and associated clinical isolates. If this model, affirmed in cell culture studies, holds in chickens, then infections in chickens and eggs in 2010 may be related to the ability of S. Enteritidis to survive in a poultry host due to enhanced access to methionine. The ABC transporters have been hypothesized to be an important new acquisition for all of subspecies I Salmonella enterica. Perhaps the ABC transporter gene gave Salmonella subspecies I an overall enhanced ability to survive in a warm-blooded vertebrate host, and later mutations of the gene allowed some serotypes to develop a special affinity for one host over another. It is common to see serotype-specific Salmonella that are more common in one host, such as S. Kentucky in cattle and S. Enteritidis in poultry and eggs. Another nonsynonymous change was observed in the threonine/serine transporter gene tdcC (Table 4), demonstrating that several transporter genes are evolving within these critical isolates.
Salmonella's ability to gain access to another valuable resource, metals such as Fe, Mn, and Zn, may help give this foodborne pathogen a competitive edge in the vertebrate gut. Variability in genes related to metal acquisition may help Salmonella bypass a process called nutritional immunity. We see another nonsynonymous change unique to the outbreak-associated isolates in a ferrochelatase gene (hemH), lending support to this hypothesis. Another hypothesis holds that diversification within the Salmonella fimbrial gene clusters is a source of virulence through possible host-specific intestinal adhesion mechanisms. At least three genes from these gene complexes (bcfC, safD, and stbE) show unique amino acid changes that may define S. Enteritidis (Table 3), and one fimbrial gene (fimD) shows a unique amino acid change leading to clades 1 and 2 (Table 4).
The nonsynonymous changes that we see among genes that vary for clades 1 and 2 suggest that there may not be a single cause for increased risk of infection and outbreak stemming from chickens and shell eggs. Rather, a combination of several of these genetic factors that raise the risk of Salmonella invasion may be causing contamination in the food supply today. The fact that 5 of the 21 nonsynonymous changes varying among the outbreak isolates (Table 3) are putatively involved in virulence-based pathways strongly suggests that there may be multiple and potentially synergistic causes for the expansion of S. Enteritidis populations. This also suggests that the other genes (Tables 3 and 4) that vary in S. Enteritidis should be carefully examined and experimentally tested, as more of these are likely to be associated with an increase in virulence and infection.
Based on both PCR and sequencing evidence, numerous studies have found little genetic variation within S. Enteritidis. Our genomic diversity estimates for the S. Enteritidis PFGE pattern JEGX01.0004 isolates examined in this study are consistent with other diversity comparisons described between two S. Enteritidis isolates of phage type 13. This variation was observed both as SNP variation among 366 genes and as the presence and absence of numerous phages and plasmids among these close relatives. This genetic variability was used to define the most variable genes and to assess population and phylogenetic evolutionary patterns for these important foodborne pathogens. In this study, our comparative genomics approach allowed us to cluster clinical isolates within the context of their environmental source, farm isolates, many of which were associated with a large national shell egg recall. Numerous genetic changes clearly link some clinical and environmental isolates to the farms that were implicated in the recall of over half a billion eggs. One known plasmid in S. Enteritidis was completely sequenced, and three plasmids were reported. Several of the genes that varied with nonsynonymous changes had previously been associated with virulence pathways in prior in vitro experiments.
Availability of data and cultures
All NCBI S. Enteritidis isolates are linked to Bioproject and new accession numbers AHUJ00000000-AHUR00000000, ALEA00000000-ALEZ00000000, ALFA00000000-ALFZ00000000, ALGA00000000-ALGZ00000000, ALHA00000000-ALHZ00000000, and ALIA00000000-ALID00000000. Cultures included in this study are also available upon request. Please direct any queries to our strain curator Dwayne Roberson, at Dwayne.Roberson@fda.hhs.gov.
Variable genes observed within our sample of Salmonella Enteritidis.
We would like to thank the NCBI rapid annotation pipeline team, Bill Klimke, Dmitry Dernovoy, Stacy Ciufo, Kathleen O'Neill, Azat Badretdin and Tatiana Tatusova, for key genome annotation services. We would also like to acknowledge Donald Zink, John Guzewich, Sherri McGarry, Mickey Parrish, Kathy Gombas, Roberta Wagner, Donald Kraemer, and Michael Landa from CFSAN-FDA for program support and for important epidemiological and investigatory insights. We would also like to extend our sincere thanks to our FDA partners in the Division of Field Sciences and regional field laboratories including Rebecca Dreisch, Peggy Carter, Norma Duran, and Palmer Orlandi for providing environmental and egg isolates as well as Eija Trees, John Besser, Patti Fields, Robert Tauxes and Peter Gerner-Schmidt from the U.S. Centers for Disease Control, and Pat McDermott and Shaohua Zhao from the Center for Veterinary Medicine for sharing important clinical isolates associated, unassociated and unknown with the 2010 egg outbreak, and for manuscript review. Aaron Heifetz, Jose Alejandro Chabalgoity and two anonymous reviewers also provided comments that greatly improved our manuscript. No human subjects or animals were used in this study. All authors have read the manuscript and agree to its content, subject matter, and author line order. These data are novel and have not been previously published elsewhere. Disclosure forms provided by the authors will be available with the full text of this article.
Conceived and designed the experiments: MWA SMM CK JZ EWB. Performed the experiments: CW CL. Analyzed the data: ES JP RT YL. Contributed reagents/materials/analysis tools: RS. Wrote the paper: MWA ES JP RT MRW EWB.
- 1. Stanley J, Goldsworthy M, Threlfall EJ (1992) Molecular phylogenetic typing of pandemic isolates of Salmonella enteritidis. FEMS Microbiol Lett 69: 153–160. doi: 10.1111/j.1574-6968.1992.tb05143.x
- 2. Ward LR, de Sa JD, Rowe B (1987) A phage-typing scheme for Salmonella Enteritidis. Epidemiol Infect 99: 291–294. doi: 10.1017/S0950268800067765
- 3. Saeed AM, Walk ST, Arshad M, Whittam TS (2006) Clonal structure and variation in virulence of Salmonella Enteritidis isolated from mice, chickens, and humans. J AOAC Int 89: 504–511.
- 4. Botteldoorn N, Van Coillie E, Goris J, Werbrouck H, Piessens V, et al. (2010) Limited genetic diversity and gene expression differences between egg- and non-egg-related Salmonella Enteritidis strains. Zoonoses Public Health 57 (5) 345–57. doi: 10.1111/j.1863-2378.2008.01216.x
- 5. Liu F, Kariyawasam S, Jayarao BM, Barrangou R, Gerner-Smidt P, et al. (2011) Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs). Appl Environ Microbiol 77 (13) 4520–6. doi: 10.1128/AEM.00468-11
- 6. Olson AB, Andrysiak AK, Tracz DM, Guard-Bouldin J, Demczuk W, et al. (2007) Limited genetic diversity in Salmonella enterica serovar Enteritidis PT13. BMC Microbiol 1;7: 87.
- 7. Guard J, Morales CA, Fedorka-Cray P, Gast RK (2011) Single nucleotide polymorphisms that differentiate two subpopulations of Salmonella enteritidis within phage type. BMC Res Notes 26;4: 369. doi: 10.1186/1756-0500-4-369
- 8. Shah DH, Casavant C, Hawley Q, Addwebi T, Call DR, et al. (2012) Salmonella enteritidis strains from poultry exhibit differential responses to Acid stress, oxidative stress, and survival in the egg albumen. Foodborne Pathog Dis 9 (3) 258–264. doi: 10.1089/fpd.2011.1009
- 9. Tankouo-Sandjong B, Kinde H, Wallace I (2012) Development of a sequence typing scheme for differentiation of Salmonella Enteritidis strains. FEMS Microbiol Lett 331 (2) 165–175. doi: 10.1186/1756-0500-4-369
- 10. Betancor L, Yim L, Martínez A, Fookes M, Sasias S, et al. (2012) Genomic Comparison of the Closely Related Salmonella enterica Serovars Enteritidis and Dublin. Open Microbiol J 6: 5–13. doi: 10.2174/1874285801206010005
- 11. Thomson NR, Clayton DJ, Windhorst D, Vernikos G, Davidson S, et al. (2008) Comparative genome analysis of Salmonella Enteritidis PT4 and Salmonella Gallinarum 287/91 provides insights into evolutionary and host adaptation pathways. Genome Res 18 (10) 1624–37. doi: 10.1101/gr.077404.108
- 12. den Bakker HC, Moreno Switt AI, Govoni G, Cummings CA, Ranieri ML, et al. (2011) Genome sequencing reveals diversification of virulence factor content and possible host adaptation in distinct subpopulations of Salmonella enterica. BMC Genomics 22;12: 425. doi: 10.1186/1471-2164-12-425
- 13. Brankatschk K, Blom J, Goesmann A, Smits TH, Duffy B (2012) Comparative genomic analysis of Salmonella enterica subsp. enterica serovar Weltevreden foodborne strains with other serovars. Int J Food Microbiol 155 (3) 247–256. doi: 10.1016/j.ijfoodmicro.2012.01.024
- 14. Feng Y, Xu HF, Li QH, Zhang SY, Wang CX, et al. (2012) Complete genome sequence of Salmonella enterica serovar pullorum RKS5078. J Bacteriol 194 (3) 744. doi: 10.1128/JB.06507-11
- 15. Lienau EK, Wang C, Blazar JM, Brown EW, Stones R, et al. (2012) Phylogenomic Analyses Identifies Gene Gains that Define Salmonella enterica subspecies evolution, diagnostics and pathogenesis. Plos One In review. doi: 10.1186/1471-2164-12-425
- 16. Jacobsen A, Hendriksen RS, Aaresturp FM, Ussery DW, Friis C (2011) The Salmonella enterica Pan-genome. Microb Ecol 62: 487–504. doi: 10.1007/s00248-011-9880-1
- 17. Fricke WF, Mammel MK, McDermott PF, Tartera C, White DG, et al. (2011) Comparative genomics of 28 Salmonella enterica isolates: evidence for CRISPR-mediated adaptive sublineage evolution. J Bacteriol 193: 3556–3568. doi: 10.1128/JB.00297-11
- 18. Leekitcharoenphon P, Lukjancenko O, Friis C, Aarestrup FM, Ussery DW (2012) Genomic variation in Salmonella enterica core genes for epidemiological typing. BMC Genomics 12;13 (1) 88.
- 19. Lienau EK, Strain E, Wang C, Cao G, Zheng J, et al. (2011) Identification of a Salmonellosis Outbreak by Means of Molecular Sequencing. N Engl J Med 364: 981–982. doi: 10.1056/NEJMc1100443
- 20. den Bakker HC, Moreno Switt AI, Cummings CA, Hoelzer K, Degoricija L, et al. (2011) A whole genome SNP based approach to trace and identify outbreak linked to a common Salmonella enterica subsp. enterica serovar Montevideo Pulsed Field Gel Electrophoresis type. Appl Environ Microbiol 77 (24) 8648–8655. doi: 10.1128/AEM.06538-11
- 21. Allard MW, Luo Y, Strain E, Li C, Keys CE, et al. (2012) High Resolution Clustering of Salmonella enterica serovar Montevideo Strains Using a Next-Generation Sequencing Approach. BMC Genomics 13: 32. doi: 10.1186/1471-2164-13-32
- 22. Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, et al. (2008) High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nat Genet 40: 987–993. doi: 10.1038/ng.195
- 23. Okoro CK, Kingsley RA, Quail MA, Kankwatira AM, Feasey NA, et al. (2012) High-resolution single nucleotide polymorphism analysis distinguishes recrudescence and reinfection in recurrent invasive nontyphoidal salmonella typhimurium disease. Clin Infect Dis 54 (7) 955–963. doi: 10.1093/cid/cir1032
- 24. Okoro CK, Kingsley RA, Connor TR, Harris SR, Parry CM, et al. (2012) Intracontinental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan Africa. Nat Genet 44 (11) 1215–21. doi: 10.1038/ng.2423
- 25. Rasko DA, Webster DR, Sahl JW, Bashir A, Boisen N, et al. (2011) Origins of the E. coli strain causing an outbreak of hemolytic-uremic syndrome in Germany. N Engl J Med 365 (8) 709–717. doi: 10.1056/NEJMoa1106920
- 26. Eppinger M, Mammel MK, Leclerc JE, Ravel J, Cebula TA (2011) Genomic anatomy of Escherichia coli O157:H7 outbreak. Proc Natl Acad Sci USA 13;108 (50) 20142–7. doi: 10.1073/pnas.1107176108
- 27. Mellmann A, Harmsen D, Cummings CA, Zentz EB, Leopold SR, et al. (2011) Prospective Genomic Characterization of the German Enterohemorrhagic Escherichia coli O104:H4 Outbreak by Rapid Next Generation Sequencing Technology. PLoS ONE 6 (7) e22751. doi: 10.1371/journal.pone.0022751
- 28. Hendriksen RS, Price LB, Schupp JM, Gillece JD, Kaas RS, et al. (2010) Population Genetics of Vibrio cholerae from Nepal in 2010: Evidence on the Origin of the Haitian Outbreak. mBio 2 (4) e00157–11. doi: 10.1128/mBio.00157-11
- 29. Chin C-S, Sorenson J, Harris JB, Robins WP, Charles RC, et al. (2011) The Origin of the Haitian Cholera Outbreak strain. N Engl J Med 364: 33–42. doi: 10.1038/nrg2719
- 30. Frerichs RR, Keim PS, Barrais R, Piarroux R (2012) Nepalese origin of cholera epidemic in Haiti. Clin Microbiol Infect 18 (6) E158–E163. doi: 10.1111/j.1469-0691.2012.03841.x
- 31. Gardy JL, Johnston JC, Ho Sui SJ, Cook VJ, Shah L, et al. (2011) Whole-Genome Sequencing and Social-Network Analysis of a Tuberculosis Outbreak. N Engl J Med 364: 730–739. doi: 10.1056/NEJMoa1003176
- 32. Gillece JD, Schupp JM, Balajee SA, Harris J, Pearson T, et al. (2011) Whole Genome Sequence Analysis of Cryptococcus gattii from the Pacific Northwest Reveals Unexpected Diversity. PLoS ONE 6 (12) e28550. doi: 10.1371/journal.pone.0028550
- 33. Wright AM, Beres SB, Consamus EN, Long SW, Flores AR, et al. (2011) Rapidly Progressive, Fatal, Inhalation Anthrax-like Infection in a Human: Case Report, Pathogen Genome Sequencing, Pathology, and Coordinated Response. Arch Path Lab Med 135 (11) 1447–1459. doi: 10.5858/2011-0362-SAIR.1
- 34. Engelthaler DM, Bowers J, Schupp JA, Pearson T, Ginther J, et al. (2011) Molecular Investigations of a Locally Acquired Case of Melioidosis in Southern AZ, USA. PLoS Negl Trop Dis 5 (10) e1347. doi: 10.1371/journal.pntd.0001347
- 35. Harris SR, Feil EJ, Holden MT, Quail MA, Nickerson EK, et al. (2010) Evolution of MRSA during hospital transmission and intercontinental spread. Science 327: 469–74. doi: 10.1126/science.1182395
- 36. Snitkin ES, Zelazny AM, Thomas PJ, Stock F, Program NC, et al. (2012) Tracking a Hospital Outbreak of Carbapenem-Resistant Klebsiella pneumoniae with Whole-Genome Sequencing. Sci Transl Med 4 (148) 148ra116. doi: 10.1126/scitranslmed.3004129
- 37. Köser CU, Ellington MJ, Cartwright EJP, Gillespie SH, Brown NM, et al. (2012) Routine Use of Microbial Whole Genome Sequencing in Diagnostic and Public Health Microbiology. PLoS Pathogens 8 (8) e1002824. doi: 10.1371/journal.ppat.1002824
- 38. Fitzgerald C, Collins M, van Duyne S, Mikoleit M, Brown T, et al. (2007) Multiplex, bead-based suspension array for molecular determination of common Salmonella serogroups. J Clin Microbiol 45: 3323–3334. doi: 10.1128/JCM.00025-07
- 39. Sukhnanand S, Alcaine S, Warnick LD, Su W-L, Hof J, et al. (2005) DNA sequence-based subtyping and evolutionary analysis of selected Salmonella enterica serotypes. J Clin Microbiol 43: 3688–3698. doi: 10.1128/JCM.43.8.3688-3698.2005
- 40. McQuiston JR, Herrera-Leon S, Wertheim BC, Doyle J, Fields PI, et al. (2008) Molecular phylogeny of the Salmonellae: relationships among Salmonella species and subspecies determined from four housekeeping genes and evidence of lateral gene transfer events. J Bacteriol 190: 7060–7067. doi: 10.1128/JB.01552-07
- 41. Xi M, Zheng J, Zhao S, Brown EW, Meng J (2008) An enhanced discriminatory pulsed-field gel electrophoresis scheme for subtyping Salmonella serotypes Heidelberg, Kentucky, SaintPaul, and Hadar. J Food Prot 71: 2067–2072.
- 42. Hudson CR, Garcia M, Gast RK, Maurer JJ (2001) Determination of close genetic relatedness of the major Salmonella Enteritidis phage types by pulsed-field gel electrophoresis and DNA sequence analysis of several Salmonella virulence genes. Avian Dis 45: 875–886. doi: 10.2307/1592867
- 43. Olsen JE, Skov MN, Threlfall EJ, Brown DJ (1994) Clonal lines of Salmonella enterica serotype Enteritidis documented by IS200-, ribo-, pulsed-field gel electrophoresis and RFLP typing. J Med Microbiol 40: 15–22. doi: 10.1099/00222615-40-1-15
- 44. Zheng J, Keys CE, Zhao S, Meng J, Brown EW (2007) Enhanced subtyping scheme for Salmonella enteritidis. Emerg Infect Dis 13: 1932–1935. doi: 10.3201/eid1312.070185
- 45. Wise MG, Siragusa GR, Plumblee J, Healy M, Cray PJ, et al. (2009) Predicting Salmonella enterica serotypes by repetitive sequence-based PCR. J Microbiol Methods 76: 18–24. doi: 10.1016/j.mimet.2008.09.006
- 46. Cebula TA, Brown EW, Jackson SA, Mammel MK, Mukherjee A, et al. (2005) Molecular applications for identifying microbial pathogens in the post-9/11 era. Expert Rev Mol Diagn 5: 431–445. doi: 10.1586/14737159.5.3.431
- 47. Grépinet O, Rossignol A, Loux V, Chiapello H, Gendrault A, et al. (2012) Genome Sequence of the Invasive Salmonella enterica subsp. enterica Serotype Enteritidis Strain LA5. J Bacteriol 194: 2387–2388. doi: 10.1128/JB.00256-12
- 48. Timme RE, Allard MW, Luo Y, Strain E, Pettengill J, et al. (2012) Draft Genome Sequences of 21 Salmonella enterica Serovar Enteritidis Strains. J Bacteriol 194 (21) 5994–5995. doi: 10.1128/JB.01289-12
- 49. Feng Y, Xu H-F, Li Q-H, Zhang S-Y, Wang C-X, et al. (2012) Complete Genome Sequence of Salmonella enterica Serovar Pullorum RKS5078. J Bacteriol 194 (3) 744. doi: 10.1128/JB.06507-11
- 50. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, et al. (2005) Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380. doi: 10.1038/nature03959
- 51. Klimke W, Agarwala R, Badretdin A, Chetvernin S, Ciufo S, et al. (2009) The National Center for Biotechnology Information's Protein Clusters Database. Nucleic Acids Res 37: D216–223. doi: 10.1093/nar/gkn734
- 52. Darling ACE, Mau R, Blatter FR, Perna NT (2004) Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14 (7) 1394–1403. doi: 10.1101/gr.2289704
- 53. Zwickl DJ (2006) Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. Ph.D. dissertation, The University of Texas at Austin.
- 54. Quince C, Lanzen A, Davenport RJ, Turnbaugh PJ (2011) Removing noise from pyrosequenced amplicons. BMC Bioinformatics 12: 38. doi: 10.1186/1471-2105-12-38
- 55. Goloboff PA, Farris JS, Nixon KC (2008) TNT, a free program for phylogenetic analysis. Cladistics 24: 5. doi: 10.1111/j.1096-0031.2008.00217.x
- 56. Goloboff PA (1996) Methods for faster parsimony analysis. Cladistics 12 (3) 199–220. doi: 10.1111/j.1096-0031.2008.00217.x
- 57. Hanna LF, Matthews TD, Dinsdale EA, Hasty D, Edwards RA (2012) Characterization of the ELPhiS Prophage from Salmonella enterica Serovar Enteritidis Strain LK5. Appl Environ Microbiol 78 (6) 1785–1793. doi: 10.1128/AEM.07241-11
- 58. Chu C, Feng Y, Chien AC, Hu S, Chu CH, et al. (2008) Evolution of genes on the Salmonella Virulence plasmid phylogeny revealed from sequencing of the virulence plasmids of S. enterica serotype Dublin and comparative analysis. Genomics 92 (5) 339–343. doi: 10.1016/j.ygeno.2008.07.010
- 59. Feng Y, Liu J, Li YG, Cao FL, Johnston RN, et al. (2012) Inheritance of the Salmonella virulence plasmids: mostly vertical and rarely horizontal. Infect Genet Evol 12 (5) 1058–1061. doi: 10.1016/j.meegid.2012.03.004
- 60. Bono JL, Smith TPL, Keen JE, Harhay GP, McDaneld TG, et al. (2012) Phylogeny of Shiga Toxin-Producing Escherichia coli O157 Isolated from Cattle and Clinically Ill Humans. Mol Biol Evol 29 (8) 2047–2062. doi: 10.1093/molbev/mss072
- 61. Boyd EF (2012) Bacteriophage-encoded bacterial virulence factors and phage-pathogenicity island interactions. Adv Virus Res 82: 91–118. doi: 10.1016/B978-0-12-394621-8.00014-5
- 62. Karberg KA, Olsen GJ, Davis JJ (2011) Similarity of genes horizontally acquired by Escherichia coli and Salmonella enterica is evidence of a supraspecies pangenome. PNAS 108 (50) 20154–20159. doi: 10.1073/pnas.1109451108
- 63. Lee JH, Shin H, Ryu S (2012) Complete Genome Sequence of Salmonella enterica Serovar Typhimurium Bacteriophage SPN3UB. J Virol 86 (6) 3404–3405. doi: 10.1128/JVI.07226-11
- 64. Shin H, Lee JH, Lim JA, Kim H, Ryu S (2012) Complete genome sequence of Salmonella enterica serovar typhimurium bacteriophage SPN1S. J Virol 86 (2) 1284–1285. doi: 10.1128/JVI.06696-11
- 65. Battesti A, Tsegaye YM, Packer DG, Majdalani N, Gottesman S (2012) H-NS Regulation of IraD and IraM Anti-adaptors for Control of RpoS Degradation. J Bacteriol 194 (10) 2470–2478. doi: 10.1128/JB.00132-12
- 66. Tang YT, Gao R, Havranek JJ, Groisman EA, Stock AM, et al. (2012) Inhibition of Bacterial Virulence: Drug-Like Molecules Targeting the Salmonella enterica PhoP Response Regulator. Chem Biol Drug Des 79 (6) 1007–1017. doi: 10.1111/j.1747-0285.2012.01362.x
- 67. Silva CA, Blondel CJ, Quezada CP, Porwollik S, Andrews-Polymenis HL, et al. (2012) Infection of Mice by Salmonella enterica Serovar Enteritidis Involves Additional Genes That Are Absent in the Genome of Serovar Typhimurium. Infect Immun 80 (2) 839–849. doi: 10.1128/IAI.05497-11
- 68. Osborne SE, Tuinema BR, Mok MC, Lau PS, Bui NK, et al. (2012) Characterization of DalS, an ATP-binding cassette transporter for D-alanine, and its role in pathogenesis in Salmonella enterica. J Biol Chem 287 (19) 15242–15250. doi: 10.1074/jbc.M112.348227
- 69. Rohmer L, Hocquet D, Miller SI (2011) Are pathogenic bacteria just looking for food? Metabolism and microbial pathogenesis. Trends Microbiol 19 (7) 341–348. doi: 10.1016/j.tim.2011.04.003
- 70. David R (2012) Bacterial pathogenesis: A competitive edge for Salmonella. Nature Reviews Microbiology 10: 309. doi: 10.1016/j.tim.2011.04.003
- 71. Yue M, Rankin SC, Blanchet RT, Nulton JD, Edwards RA, et al. (2012) Diversification of the Salmonella Fimbriae: A Model of Macro- and Microevolution. PLoS ONE 7 (6) e38596. doi: 10.1371/journal.pone.0038596