Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Yersinia spp. Identification Using Copy Diversity in the Chromosomal 16S rRNA Gene Sequence

  • Huijing Hao ,

    Contributed equally to this work with: Huijing Hao, Junrong Liang, Ran Duan

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Junrong Liang ,

    Contributed equally to this work with: Huijing Hao, Junrong Liang, Ran Duan

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Ran Duan ,

    Contributed equally to this work with: Huijing Hao, Junrong Liang, Ran Duan

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Yuhuang Chen,

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Chang Liu,

    Affiliations National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China, Department of Pathogenic Biology, School of Medical Science, Jiangsu University, Zhenjiang, Jiangsu Province, China

  • Yuchun Xiao,

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Xu Li,

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Mingming Su,

    Affiliation Institute of Biophysics, Chinese Academy of Sciences, Beijing, China

  • Huaiqi Jing,

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

  • Xin Wang

    wangxin@icdc.cn

    Affiliation National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, State Key Laboratory of Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Beijing, China

Abstract

API 20E strip test, the standard for Enterobacteriaceae identification, is not sufficient to discriminate some Yersinia species for some unstable biochemical reactions and the same biochemical profile presented in some species, e.g. Yersinia ferderiksenii and Yersinia intermedia, which need a variety of molecular biology methods as auxiliaries for identification. The 16S rRNA gene is considered a valuable tool for assigning bacterial strains to species. However, the resolution of the 16S rRNA gene may be insufficient for discrimination because of the high similarity of sequences between some species and heterogeneity within copies at the intra-genomic level. In this study, for each strain we randomly selected five 16S rRNA gene clones from 768 Yersinia strains, and collected 3,840 sequences of the 16S rRNA gene from 10 species, which were divided into 439 patterns. The similarity among the five clones of 16S rRNA gene is over 99% for most strains. Identical sequences were found in strains of different species. A phylogenetic tree was constructed using the five 16S rRNA gene sequences for each strain where the phylogenetic classifications are consistent with biochemical tests; and species that are difficult to identify by biochemical phenotype can be differentiated. Most Yersinia strains form distinct groups within each species. However Yersinia kristensenii, a heterogeneous species, clusters with some Yersinia enterocolitica and Yersinia ferderiksenii/intermedia strains, while not affecting the overall efficiency of this species classification. In conclusion, through analysis derived from integrated information from multiple 16S rRNA gene sequences, the discrimination ability of Yersinia species is improved using our method.

Introduction

The genus Yersinia is widely distributed in nature and currently has 17 species [13], three of which are pathogenic (Y. enterocolitica, Y. pestis, and Y. pseudotuberculosis) and have been exhaustively researched. The remaining species are referred to as Y. enterocolitica-like bacteria where their etiology in disease is not understood [4, 5]. Traditionally, bacteria were classified according to similarities and differences in phenotypes, such as morphology and biochemical reactions. The API 20E strip test is the standard for identifying Enterobacteriaceae; however this test has limitations in identifying Yersinia species. Several Yersinia species can be identified using API 20E strip test where the accuracy is influenced by passage number, culture conditions, and instability of some biochemical reactions [6]. Therefore, sensitive molecular biology methods are needed to assist the traditional approach to identify the Yersinia.

Since the 1980s, the ribosomal RNA gene has been used for phylogenetic studies, and ribosomal RNA-based approaches have been increasingly applied to bacterial classification and identification, especially using the 16S rRNA. The 16S rRNA gene is generally accepted as the best molecular sequence to use for identification because it is functionally constant and shows a mosaic of structure having conserved and variable regions and exists in all organisms; and its length is easily sequenced [7]. These properties make it uniquely well suited for systematics applications. However, the presence of multiple copies of the rRNA operons and intra-genomic heterogeneity in the 16S rRNA gene is regarded as a limiting factor in species identification [810]. In bacteria, three rRNA genes (16S, 23S, and 5S) are organized into a gene cluster, which is expressed as a single operon. The total number of rRNA operons in prokaryotes range from one to 15[10].With more and more bacterial genomes being completely sequenced, the heterogeneity of 16S rRNA gene at the intra-genomic level has been discussed. Pei collected 883 prokaryotic genomes from the GenBank database, representing 568 unique species, 235 of which have copy diversity in the 16S rRNA gene within a genome (0.06% to 20.38%)[9]. The diversity in 24 species are close to or over the threshold of the 16S rRNA gene-based operational definition of a species (1% to 1.3% diversity)[9], so these species maybe misclassified into a new species if a different copy is used for identification. Recently, 2,013 genomes were analyzed and 22.5% were found divergence at over 1% in the 16S rRNA gene copies [11]. In addition, the 16S rRNA gene shows high similarity comparing some different species where identical sequences were found even[12]. This limits the use of the 16S rRNA gene in identifying bacterial species. Therefore, sequencing the 16S rRNA gene from multiple operons from isolates is recommended to achieve significant phylogenetic information for species identification [13].

Given the reasons above, we used the copy diversity in the 16S rRNA gene to identify Yersinia species by analyzing multiple copies of the 16S rRNA gene from 768 strains within the genus Yersinia.

Materials and Methods

Source of Strains

We used 768 strains of Yersinia including 10 species of Yersinia. Seven hundred and eleven-one isolated strains were widely distributed within 21 provinces of China; and thirty-three reference strains were from Japan, France and the National Institutes for Food and Drug Control (NIFDC) in China. The dates of strain isolation encompass 52 years (1962–2014). Biochemical data were determined using API 20E test strips (Biomerieux, France), showing 407 strains of Y. ferderiksenii/intermedia, 119 strains of Y. kristensenii, 123 strains of Y. enterocolitica (72 nonpathogenic and 51 pathogenic), 46 strains of Y. pseudotuberculosis and 49 strains of Y. pestis. The source and hosts of strains are shown in Table 1. Twenty-four complete-genome-sequenced strains were selected from the NCBI (http://www.ncbi.nlm.nih.gov/pubmed/) with the GenBank number listed in Table 2.

thumbnail
Table 1. Source and host distribution of Yersinia strains used.

https://doi.org/10.1371/journal.pone.0147639.t001

thumbnail
Table 2. The GenBank numbers of 24 complete-genome-sequenced strains.

https://doi.org/10.1371/journal.pone.0147639.t002

Culture and Identification of Strains

Enrichment was performed using phosphate-buffered saline with sorbitol and bile salts (PSB) at 4°C for 21 days. Then strains were inoculated onto Yersinia-selective agar (cefsulodin-irgasan-novobiocin [CIN] agar; Difco). Suspected colonies having a typical bull’s-eye appearance (deep red centers surrounded by an outer transparent zone) on CIN agar were selected and allowed to grow on brain heart infusion agar (Beijing Land Bridge Technology Co., Ltd., China) at 25°C for 24 to 48 hours to obtain pure strains[14, 15]. We used the commercial rapid identification system, API 20E strip test (bioMérieux, France), for the identification of presumptive isolates. The biotypes of Y. enterocolitica strains were identified using the scheme reviewed by Bottone [16].

T-vectors Cloning and Sequencing of the 16S rRNA Gene

The bacterial genome was extracted using a DNA nucleic acid extraction kit (Tiangen, China). The 16S rRNA gene was amplified using the universal primers 27F (5’-AGAGTTTGATCCTGGCTCAG-3’) and 1492R (5’-GGTTACCTTGTTACGACTT-3’)[17]with the Q5 Hot Start High-Fidelity 2×Master Mix(NEB, U.K.). After gel electrophoresis, specific PCR products were purified using a pEASY purification kit (Transgene, China). Connection and transformation were performed using the pEASYTM-Blunt Simple Cloning Kit (Transgene, China). The transformed bacteria were grown at 37°C for 12h on Luria-Bertani agar, containing X-gal (20 mg/ml), IPTG (24 mg/ml), and kanamycin (50 mg/ml). Five white single colonies were randomly selected for sequencing. All were sequenced with an ABI Prism BigDye Terminator cycle sequencing ready reaction kit using AmpliTaq DNA polymerase and an ABI Prism 377xl DNA sequencer (Applied Biosystems, Foster City, CA, USA) according to the instructions of the manufacturer at Tsingke BioTech Co., Ltd., sequencing the amplicons in both directions.

Sequence Analysis

Sequence alignment was performed using Seqman (Lersergene7.0). A phylogenetic tree based on a single copy of the 16S rRNA gene was constructed using Kimura’s 2-parameter distance and the neighbor-joining method (MEGA 6.0). All of the 16S rRNA gene copies from each complete-genome-sequenced strain were coded with a random number, and then five sequences were selected using a random number table. After elimination of redundancies, all the five sequences in the 16S rRNA gene of each strain were obtained corresponding to a unique pattern number. The minimum distance between species was calculated on the basis of a full array between every two strains and each of the five sequences. A phylogenetic tree based on the multi-copy of the diverse 16S rRNA gene was constructed using a distance matrix. The distance of the five sequences within each strain was also calculated.

Ethics

The sample collection and detection protocols were approved by the Ethics Review Committee [Institutional Review Board (IRB)] of National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention. The copy of ethical approval documents were provided comply with PLOS ONE requirements. The authors were involved in sample collections involving human subjects and authors can access to personal information before the authors had access to the data. Signed informed consent was obtained from all study participants. For all the patients under 18 years-old, a written consent form was signed by a parent or legal guardian. The field studies did not involve endangered or protected species, so the locations/activities for which specific permission was not required.

Results

16S rRNA Copies at the Intra-Genomic Level

In total, 3,840 sequences of 16S rRNA gene were obtained from 768 strains used in this study. One or more base difference between two sequences was defined as a different 16S rRNA gene type. The type number of intra-strain 16S rRNA genes is shown in Fig 1A. In general, 60% of the strains have two or three types, 17% have four, and 18% have one. Only 5% of the strains have five totally different 16S rRNA gene types. Fig 1B shows a comparison of the gene type number between the different Yersinia species. The number is primarily one or two in Y. pseudotuberculosis and Y. kristensenii, and two to four in Y. ferderiksenii/intermedia. The strains of pathogenic Y. enterocolitica usually have one to three types, whereas the non-pathogenic strains have two to three types where one strain having five identical copies is uncommon at about 8% of the strains. Y. pestis and Y. pseudotuberculosis do not have five different copy sequences.

thumbnail
Fig 1. A. Distribution of the type number of 16S rRNA genes in 768 Yersinia strains. The colours in different sections of the pie chart represent the type number of 16S rRNA gene in strains, one type, two types, three types, four types, five types, respectively. The number in the pie represents the number of strains that have each kind of copies of 16S rRNA gene, the percentage in parentheses represents the proportion of all strains. B. The proportion of each copy appearing in different Yersinia species. C. The identical 16S rRNA patterns that exist in different Yersinia species, except for Y. pestis and Y. pseudotuberculosis.

Numbers in the crossed circle represent the number of identical patterns in the corresponding Yersinia species. Numbers in parentheses represent the amount of total patterns in corresponding species. Specific patterns are not shown.

https://doi.org/10.1371/journal.pone.0147639.g001

Pairwise Comparison of the 16S rRNA Gene at the Intra-Genomic Level

The sequence similarity comparison between pairwise copies of 16S rRNA genes in each strain is shown in Table 3 classified by Yersinia species. The sequence similarity of pairwise copies in each Yersinia species strain exceeds 99% for most strains. There are only nine Y. ferderiksenii/intermedia (accounting for 2.2%) of the strains with similarity below 98.7%, and one strain of Y. kristensenii and non-pathogenic Y. enterocolitica of relatively low similarity 96.77% and 97.94%, respectively, whose sequence variation are 47 and 30 bases, respectively.

thumbnail
Table 3. Pairwise comparison of 16S rRNA gene at the intra-genomic level in each Yersinia species.

https://doi.org/10.1371/journal.pone.0147639.t003

Cluster Analysis Based on 16S rRNA Gene Sequence

A phylogenetic tree (Fig 2), constructed on the basis of the five copies of 16S rRNA gene presented in each strain and using the minimum evolution method, can be divided into six groups: 1a, 1b, 2, 3, 4 and 5.

thumbnail
Fig 2. A phylogenetic tree constructed on the basis of the five copies of 16S rRNA gene in each strain using the minimum evolution method.

A. Dots with different colors represent the corresponding Yersinia species; tree branch colors are consistent with triangles in B., which represent different clustering groups.

https://doi.org/10.1371/journal.pone.0147639.g002

Group 1a and group 1b comprise Y. ferderiksenii/intermedia strains, 220 and 90 strains, respectively. The similarity between strains of each group is over 90%. Group 3 primarily contains Y. enterocolitica strains; but clustered with three Y. kristenii strains and one Y. ferderiksenii strain. Y. enterocolitica strains of biotype 1b show a distinct sub-cluster (the dark blue and bold clade in Fig 1A) in group 3, three isolated strains and two complete-genome-sequenced strains WA and 8081. Two sub-clusters comprise group 4: one is a combination of Y. ferderiksenii/intermedia strains, represented by two reference strains of Y. ferderksenii, ATCC33641 and 52235; another one contains four Y. enterocolitica strains and one complete-genome-sequenced strain of Y. rohdei. Group 5 consists of all the strains of Y. pestis and Y. pseudotuberculosis involved in this study, and a complete-genome-sequenced strain of Y. similis, located outside of the group.

Group 2 contains multiple Yersinia species, such as Y. enterocolitica, Y. ferderiksenii/intermedia and Y. kristensenii, which can be divided into three clusters: a, b and c. Group 2a includes two reference strains of Y. kristensenii and one complete-genome-sequenced strain of Y. aleksiciae. Group 2b is complicated hence described in three parts. Group 2b-1 is primarily made up of Y. kristenii strains, next is two reference strains of Y. kristensenii, 52246 and 52232, and one complete-genome-sequenced strain of Y. ruckeri. Group 2b-2 has two sub-clusters, one consisting of an Y. enterocolitica strain and a whole-genome-sequenced strain of Y. aldovae, and another with 43 Y. ferderiksenii/intermedia strains of high similarity (>99.6%), two of which are reference strains of Y. intermedia. Group 2b-3 has three sub-clusters. The first one contains 13 Y. kristensenii strains, five Y. enterocolitica strains, and one complete-genome-sequenced strain of Y. ferderiksenii Y225 that is distant from the rest of the strains in this sub-cluster. The second contains 33 Y. kristensenii strains of high similarity (>99.6%), except for two Y. ferderiksenii/kristensenii strains. The third with high similarities between strains (>99.6%), presents diversity, containing 35 Y. kristensenii (two are complete-genome-sequenced) and 43 Y. ferderiksenii/intermedia strains (three reference strains of Y. intermedia and one complete-genome-sequenced strain of Y. intermedia). There is a single Y. ferderiksenii/intermedia strain alone between group 2–1 and group 2b-2. Group 2c contains nine Y. enterocolitica strains, two Y. kristensenii strains and one complete-genome-sequenced strain of Y. ferderiksenii.

The Dominant Patterns of the 16S rRNA Gene in Each Yersinia Species

Four hundred thirty-nine patterns of 16S rRNA gene sequences, coded as 0 to 438(S1 File and S1 Table), were obtained after removing the redundant sequences. Table 4 shows each Yersinia species has dominant patterns. Though non-dominant patterns are more than dominant patterns, they appear at a low frequency and usually appear only once. There exists the same dominant gene pattern in the strains of Y. kristensenii and Y. ferderiksenii/intermedia, and Y. pestis and Y. pseudotuberculosis, respectively.

thumbnail
Table 4. The dominant patterns of the 16S rRNA genes in each Yersinia species.

https://doi.org/10.1371/journal.pone.0147639.t004

Identical 16S rRNA Gene Sequences from Different Yersinia Species

Strains of different Yersinia species have identical 16S rRNA gene sequences (Fig 1C). Non-pathogenic Y. enterocolitica strains (biotype 1A) have the same 16S rRNA gene patterns with many kinds of other Yersinia species, with nine same gene patterns with pathogenic Y. enterocolitica, four with Y. kristensenii strains and only one with the other Yersinia species. Y. kristensenii strains have three identical gene patterns with Y. ferderiksenii/intermedia strains from group 2b, one of which is dominant, accounting 20% for Y. kristensenii and 31.9% for Y. ferderiksenii/intermedia respectively. There also exist identical 16S rRNA gene patterns among non-pathogenic Y. enterocolitica, Y. kristensenii, and Y. ferderiksenii/intermedia strains from group 2b, and also among non-pathogenic Y. enterocolitica, Y. kristensenii, and pathogenic Y. enterocolitica. Only group 1a and 1b exist three same 16S rRNA gene patterns among the four groups of Y. ferderiksenii/intermedia (1a, 1b, 2 and 4). Y. pestis strains have six identical patterns with Y. pseudotuberculosis (not shown in Fig 1C), one of which is dominant in both species, accounting for 34.2% and 64.1%, respectively.

Discussion

Compared to bacteria identification methods using phenotype, the approach based on genotype stands out for its consistency. One desirable candidate is the 16S rRNA gene, highly conserved and seldom variable within species, and is becoming an important technique for phylogeny research and species classification[7]. Specifically speaking, for Yersinia, the limitation of commercial tests for identification systems such as API 20E strip test has shown a deficiency in distinguishing some species[18, 19], which can be made up through the method based on 16S rRNA gene. However, the similarity of 16S rRNA gene sequence between species is as high as 96.9%-99.8%; it is easy therefore to misclassify species of high homology[20]. Recently, the rRNA operon is shown to have multiple copies with some heterogeneity[17]. In some cases, the diversity between multiple copies of a strain is so high as to be misclassified into different species if a different copy is used to identify a bacterial strain [21, 22]. This is a limiting factor for species identification using a single 16S rRNA gene. The discrimination ability can be improved by integrating multiple 16S rRNA gene copies. Therefore, here we first report species differentiation using five copies of the 16S rRNA gene from 768 strains of Yersinia.

Previously, the copy number of Yersinia16S rRNA gene was shown to be around seven [11]. According to whole genome sequence strain from NCBI, the number of variable copies is within three. Accordingly, five colonies was randomly selected for 16S rRNA gene sequencing for each strain. Our results show 60% of the strains have two to three genotypes and only 5% have all 5 types, therefore the selection of 5 colonies is reasonable. Though 84% of the strains have more than one sequence type of 16S rRNA gene, the comparison of the intra-genomic 16S rRNA gene showed that the similarity of 98.4% of the strains is above the threshold of species identification (98.7% to 99%)[9]. In other words, for most Yersinia strains, the heterogeneity among different copies does not affect species classification. Whereas for 12 strains, the similarity of intra-genomic 16S rRNA gene is below or close to the threshold of classification. When using a single copy for identification, a different result may occur. Hence in this study, with integrated information from multiple 16S rRNA genes and a comprehensive reflection of phylogeny, species identification is more accurate.

A phylogenetic tree (Fig 2) was constructed by combining five 16S rRNA gene sequences for each strain, most clustering of species is accordant with classical identification. Y. pestis and Y. pseudotuberculosis share a large number of identical 16S rRNA gene sequences and are close to each other in the phylogenetic tree, which is consistent with the hypothesis that Y. pestis evolved from O: 1b strains of Y. pseudotuberculosis about 1,500–20,000 years ago[23, 24], indirectly supporting our method. Y. enterocolitica strains are in group 3, where Y. enterocolitica subsp. enterocolitica (biotype 1b) and Y. enterocolitica subsp. Palearctica (biotype 1A, 2, 3, 4 and 5) diverge into two clusters, coinciding with previous work[2527].

The majority of Y. kristensenii strains are within group 2, spreading widely and having multiple branches. In group2b-3-3, Y. kristensenii and Y. ferderiksenii/intermedia strains are closely clustered with each other, sharing high similarity, due to the fact that the consensus sequence of the two species are also dominant in each species. Notably, very few Y. kristensenii strains cluster with Y. enterocolitica strains. Compared with other species, Y. kristensenii presents higher genetic variability. This may be owing to the uncertainty of biochemical identification that these ‘cross-over’ strains may be classified into other Yersinia species. Sequencing more than five random selected colonies may solve this problem and needs further investigation.

Y. ferderiksenii and Y. intermedia strains are indistinguishable using API 20E strip test where they are referred to as Y. ferderiksenii/intermedia. In this study, they are gathered within group 1a, 1b, 2b and 4. The reference strains of Y. intermedia were all in group 2b, so strains in this group should be classified to be Y. intermedia strains. Consequently, strains in group 1a, 1b and 4 belong to Y. ferderiksenii. Identified in 1980, Y. ferderiksenii included three geno-species without biochemical differences [28]. Clustering analysis showed sequence types of group 1a, 1b, and 4 clustered with geno-species 2, 3, 1 of Y. ferderiksenii strains, respectively (Fig 3). Therefore, the 16S rRNA gene sequence types can be used for unclassified strains from API 20E strip test, and also for subtyping strains.

thumbnail
Fig 3. Phylogenetic tree based on single 16S rRNA gene from Y. ferderiksenii/intermedia strains in group 1a, 1b, and 4 and strains of Y. ferderiksenii belonging to three geno-species.

Hollow circles represent all 16S rRNA gene types of Y. ferderiksenii /intermedia strains in group 1a, 1b, and 4; while solid circles represent Y. ferderikseniis trains of three geno-species [1]. Triangles represent identical 16S rRNA gene patterns of strains in group 1a and group 1b.

https://doi.org/10.1371/journal.pone.0147639.g003

Presently there is divergence in species classification for Yersinia. In addition to traditional biochemical reactions, refined molecular biology techniques are needed to compensate for this deficiency. Y. aleksiciae and Y. kristensenii have no difference in their biochemical phenotype but differ into distinct species using 16S rRNA gene sequences[29]. In group 2a, Y. kristensenii reference strains (52242 and 52247) share 99.9% similarity with the genome sequenced strain of Y. aleksiciae, a recent phylogenetic relationship. A previous study shows Y.similis is indistinguishable from Y. pseudotuberculosis through biochemical phenotyping that it was regarded as Y. pseudotuberculosis; however, their pathogenicity are significantly different and they differed using 16S rRNA gene sequencing [30]. In this study, genome sequenced strain of Y. similis were inseparably clustered with Y. pseudotuberculosis and Y. pestis in group 5, an outlier (black dot in group 5 Fig 2A) with an outermost evolutionary distance. Further, several comparable subspecies clustered respectively in group2b-1 and group 2b-2: genome sequenced Y. ruckeri strain and four Y. kristensenii strains formed a sub-branch; Y. aldovae and one Y. enterocolitica strain formed another, far away from the rest of the Y. enterocolitica strains. We suspect this Y. enterocolitica strain may belong to another undescribed Yersinia species. Similarly, in group 4, three Y. enterocolitica strains formed another sub-branch with one genome sequenced strain of Y. rohdei. All of these strains show Yersinia is a genetically diverse genus; the strains involved here may be classified into other species of more diversity.

Based on integrated information from multiple 16S rRNA gene copies and clustering analysis, this study shows comparable species clusters of Yersinia and reduces identification chaos derived from using a single copy of the 16S rRNA gene sequence which is sometimes similar or identical between species, and remarkably increases the discrimination ability of 16S rRNA gene sequence. Further study using our classification method would allow increased differentiation of Yersinia species identification.

Supporting Information

S1 File. The 439 Patterns (Coded as 0 to 438) of 16S rRNA Gene Sequences in this study.

https://doi.org/10.1371/journal.pone.0147639.s001

(PDF)

S1 Table. Patterns Code of Five 16S rRNA Gene Sequences for each Strain.

https://doi.org/10.1371/journal.pone.0147639.s002

(XLSX)

Acknowledgments

We thank Liuying Tang and Jim Nelson for critical reading and helpful comments on our manuscript.

Author Contributions

Conceived and designed the experiments: HJ XW. Performed the experiments: HH RD YC CL YX XL. Analyzed the data: HH JL MS. Contributed reagents/materials/analysis tools: HH JL MS. Wrote the paper: HH JL RD HJ.

References

  1. 1. Souza RA, Imori PF, Falcao JP. Multilocus sequence analysis and 16S rRNA gene sequencing reveal that Yersinia frederiksenii genospecies 2 is Yersinia massiliensis. International journal of systematic and evolutionary microbiology. 2013;63(Pt 8):3124–9. pmid:23908151.
  2. 2. Neubauer H, Sprague LD. Strains of Yersinia wautersii should continue to be classified as the 'Korean Group' of the Yersinia pseudotuberculosis complex and not as a separate species. International journal of systematic and evolutionary microbiology. 2015;65(Pt 2):732–3. pmid:25505347.
  3. 3. Savin C, Martin L, Bouchier C, Filali S, Chenau J, Zhou Z, et al. The Yersinia pseudotuberculosis complex: characterization and delineation of a new species, Yersinia wautersii. International journal of medical microbiology: IJMM. 2014;304(3–4):452–63. pmid:24598372.
  4. 4. Sulakvelidze A. Yersiniae other than Y. enterocolitica, Y. pseudotuberculosis, and Y. pestis: the ignored species. Microbes and infection / Institut Pasteur. 2000;2(5):497–513. pmid:10865195.
  5. 5. Mittal S, Mallik S, Sharma S, Virdi JS. Characteristics of beta-lactamases and their genes (blaA and blaB) in Yersinia intermedia and Y. frederiksenii. BMC microbiology. 2007;7:25. pmid:17407578; PubMed Central PMCID: PMC1853101.
  6. 6. Woo PC, Ng KH, Lau SK, Yip KT, Fung AM, Leung KW, et al. Usefulness of the MicroSeq 500 16S ribosomal DNA-based bacterial identification system for identification of clinically significant bacterial isolates with ambiguous biochemical profiles. Journal of clinical microbiology. 2003;41(5):1996–2001. pmid:12734240; PubMed Central PMCID: PMC154750.
  7. 7. Woese CR. Bacterial evolution. Microbiological reviews. 1987;51(2):221–71. pmid:2439888; PubMed Central PMCID: PMC373105.
  8. 8. Fox GE, Wisotzkey JD, Jurtshuk P Jr. How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity. International journal of systematic bacteriology. 1992;42(1):166–70. pmid:1371061.
  9. 9. Pei AY, Oberdorf WE, Nossa CW, Agarwal A, Chokshi P, Gerz EA, et al. Diversity of 16S rRNA genes within individual prokaryotic genomes. Applied and environmental microbiology. 2010;76(12):3886–97. pmid:20418441; PubMed Central PMCID: PMC2893482.
  10. 10. Klappenbach JA, Saxman PR, Cole JR, Schmidt TM. rrndb: the Ribosomal RNA Operon Copy Number Database. Nucleic acids research. 2001;29(1):181–4. pmid:11125085; PubMed Central PMCID: PMC29826.
  11. 11. Sun DL, Jiang X, Wu QL, Zhou NY. Intragenomic heterogeneity of 16S rRNA genes causes overestimation of prokaryotic diversity. Applied and environmental microbiology. 2013;79(19):5962–9. pmid:23872556; PubMed Central PMCID: PMC3811346.
  12. 12. Jaspers E, Overmann J. Ecological significance of microdiversity: identical 16S rRNA gene sequences can be found in bacteria with highly divergent genomes and ecophysiologies. Applied and environmental microbiology. 2004;70(8):4831–9. pmid:15294821; PubMed Central PMCID: PMC492463.
  13. 13. Clayton RA, Sutton G, Hinkle PS Jr., Bult C, Fields C. Intraspecific variation in small-subunit rRNA sequences in GenBank: why single sequences may not adequately represent prokaryotic taxa. International journal of systematic bacteriology. 1995;45(3):595–9. pmid:8590690.
  14. 14. Wang X, Cui Z, Jin D, Tang L, Xia S, Wang H, et al. Distribution of pathogenic Yersinia enterocolitica in China. European journal of clinical microbiology & infectious diseases: official publication of the European Society of Clinical Microbiology. 2009;28(10):1237–44. pmid:19575249.
  15. 15. Liang J, Wang X, Xiao Y, Cui Z, Xia S, Hao Q, et al. Prevalence of Yersinia enterocolitica in pigs slaughtered in Chinese abattoirs. Applied and environmental microbiology. 2012;78(8):2949–56. pmid:22327599; PubMed Central PMCID: PMC3318836.
  16. 16. Bottone EJ. Yersinia enterocolitica: a panoramic view of a charismatic microorganism. CRC critical reviews in microbiology. 1977;5(2):211–41. pmid:844324.
  17. 17. Moreno C, Romero J, Espejo RT. Polymorphism in repeated 16S rRNA genes is a common property of type strains and environmental isolates of the genus Vibrio. Microbiology. 2002;148(Pt 4):1233–9. pmid:11932467.
  18. 18. Neubauer H, Sauer T, Becker H, Aleksic S, Meyer H. Comparison of systems for identification and differentiation of species within the genus Yersinia. Journal of clinical microbiology. 1998;36(11):3366–8. pmid:9774596; PubMed Central PMCID: PMC105332.
  19. 19. Linde HJ, Neubauer H, Meyer H, Aleksic S, Lehn N. Identification of Yersinia species by the Vitek GNI card. Journal of clinical microbiology. 1999;37(1):211–4. pmid:9854094; PubMed Central PMCID: PMC84211.
  20. 20. Ibrahim A, Goebel BM, Liesack W, Griffiths M, Stackebrandt E. The phylogeny of the genus Yersinia based on 16S rDNA sequences. FEMS microbiology letters. 1993;114(2):173–7. pmid:7506685.
  21. 21. Wang Y, Zhang Z, Ramanan N. The actinomycete Thermobispora bispora contains two distinct types of transcriptionally active 16S rRNA genes. Journal of bacteriology. 1997;179(10):3270–6. pmid:9150223; PubMed Central PMCID: PMC179106.
  22. 22. Yap WH, Zhang Z, Wang Y. Distinct types of rRNA operons exist in the genome of the actinomycete Thermomonospora chromogena and evidence for horizontal transfer of an entire rRNA operon. Journal of bacteriology. 1999;181(17):5201–9. pmid:10464188; PubMed Central PMCID: PMC94023.
  23. 23. Achtman M, Zurth K, Morelli G, Torrea G, Guiyoule A, Carniel E. Yersinia pestis, the cause of plague, is a recently emerged clone of Yersinia pseudotuberculosis. Proceedings of the National Academy of Sciences of the United States of America. 1999;96(24):14043–8. pmid:10570195; PubMed Central PMCID: PMC24187.
  24. 24. Kim W, Song MO, Song W, Kim KJ, Chung SI, Choi CS, et al. Comparison of 16S rDNA analysis and rep-PCR genomic fingerprinting for molecular identification of Yersinia pseudotuberculosis. Antonie van Leeuwenhoek. 2003;83(2):125–33. pmid:12785306.
  25. 25. Neubauer H, Aleksic S, Hensel A, Finke EJ, Meyer H. Yersinia enterocolitica 16S rRNA gene types belong to the same genospecies but form three homology groups. International journal of medical microbiology: IJMM. 2000;290(1):61–4. pmid:11043982.
  26. 26. Sihvonen LM, Jalkanen K, Huovinen E, Toivonen S, Corander J, Kuusi M, et al. Clinical isolates of Yersinia enterocolitica biotype 1A represent two phylogenetic lineages with differing pathogenicity-related properties. BMC microbiology. 2012;12:208. pmid:22985268; PubMed Central PMCID: PMC3512526.
  27. 27. Batzilla J, Hoper D, Antonenka U, Heesemann J, Rakin A. Complete genome sequence of Yersinia enterocolitica subsp. palearctica serogroup O:3. Journal of bacteriology. 2011;193(8):2067. pmid:21296963; PubMed Central PMCID: PMC3133053.
  28. 28. Jan Ursing DJB, Bercovier Herv4,O. Richard Fanning, Arnold G. Steigerwalt, Jacqueline Brault, H. H. Mollaret. Yersinia frederiksenii: A New Species of Enterobacteriaceae Composed of Rhamnose-Positive Strains (Formerly Called Atypical Yersinia enterocolitica or Yersinia enterocolitica-Like). CURRENT MICROBIOLOGY. 1980;4:213–7.
  29. 29. Sprague LD, Neubauer H. Yersinia aleksiciae sp. nov. International journal of systematic and evolutionary microbiology. 2005;55(Pt 2):831–5. pmid:15774670.
  30. 30. Sprague LD, Scholz HC, Amann S, Busse HJ, Neubauer H. Yersinia similis sp. nov. International journal of systematic and evolutionary microbiology. 2008;58(Pt 4):952–8. pmid:18398201.