Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Transcriptome Characterization and Functional Marker Development in Sorghum Sudanense

  • Jieqin Li ,

    Contributed equally to this work with: Jieqin Li, Lihua Wang

    Affiliation College of Agriculture, Anhui Science and Technology University, Fengyang, China

  • Lihua Wang ,

    Contributed equally to this work with: Jieqin Li, Lihua Wang

    Affiliation College of Agriculture, Anhui Science and Technology University, Fengyang, China

  • Qiuwen Zhan ,

    qwzhan@163.com

    Affiliation College of Agriculture, Anhui Science and Technology University, Fengyang, China

  • Yanlong Liu,

    Affiliation College of Agriculture, Anhui Science and Technology University, Fengyang, China

  • Xiaocui Yang

    Affiliation College of Agriculture, Anhui Science and Technology University, Fengyang, China

Abstract

Sudangrass, Sorghum sudanense, is an important forage in warm regions. But little is known about its genome. In this study, the transcriptomes of sudangrass S722 and sorghum Tx623B were sequenced by Illumina sequencing. More than 4Gb bases were sequenced for each library. For Tx623B and S722, 88.79% and 83.88% reads, respectively were matched to the Sorghum bicolor genome. A total of 2,397 differentially expressed genes (DEGs) were detected by RNA-Seq between the two libraries, including 849 up-regulated genes and 1,548 down-regulated genes. These DEGs could be divided into three groups by annotation analysis. A total of 44,495 single nucleotide polymorphisms (SNPs) were discovered by aligning S722 reads to the sorghum reference genome. Of these SNPs, 61.37% were transition, and this value did not differ much between different chromosomes. In addition, 16,928 insertion and deletion (indel) loci were identified between the two genomes. A total of 5,344 indel markers were designed, 15 of which were selected to construct the genetic map derived from the cross of Tx623A and Sa. It was indicated that the indel markers were useful and versatile between sorghum and sudangrass. Comparison of synonymous base substitutions (Ks) and non-synonymous base substitutions (Ka) between the two libraries showed that 95% orthologous pairs exhibited Ka/Ks<1.0, indicating that these genes were influenced by purifying selection. The results from this study provide important information for molecular genetic research and a rich resource for marker development in sudangrass and other Sorghum species.

Introduction

Sorghum is a diverse genus consisting of cultivated and wild species, many of which are diploid, possessing considerable genetic and morphological diversity [1]. Sorghum bicolor (S. bicolor), commonly called sorghum, is an important cultivated species in the Sorghum genus and is used as food, forage and biofuel [2]. Sorghum sudanense (S. sudanense), commonly called sudangrass, is used as forage for ruminants. It was introduced into the United States in 1909 and rapidly became popular as forage [3]. Now, it is widely cultivated in Russia, Eastern Europe and South Asia [4]. Hybrids of these two species exhibit favorable forage yields, improved quality and disease resistance and therefore, are widely planted in China and the United States [5].

S. bicolor and S. sudanense exhibit big differences in morphology [6]. It is expected that genome-wide differences exist between these two species as well, but few studies have been focus on this. In addition, little genetic mechanism research on hybrids between S. sudanense and S. bicolor has been conducted due to lack of genomic information of S. sudanense. Therefore, development of genomic resources for S. sudanense is the basis of molecular biology studies on S. sudanense and hybrids of S. bicolor and S. sudanense.

RNA-sequencing (RNA-Seq) is a powerful tool for transcription profiling, providing a rapid access to a collection of expressed sequences (transcriptome). RNA-Seq offers novel opportunities in functional genomics and for developing molecular markers in non-model plants [7]. Up to now, RNA-Seq has been successfully applied in different domains of life from yeast to plants [8,9,10].

In this study, the leaf transcriptomes of two Sorghum species, S. bicolor and S. sudanense, were characterized. The main objective was to compare the transcriptomes of the two Sorghum species, and to develop DNA markers for genetic mapping of the genes for traits specific to S. sudanense. In addition, substitution rates of orthologous transcript pairs were estimated to calculate functional sequence diversity.

Materials and Methods

Plant materials

Sorghum Tx623B and sudangrass S722 were grown and maintained in a greenhouse under natural daylight condition. When the plants grew to the seven-leaf stage, all the visible leaves of three biological replicates were taken, frozen in liquid N2 and stored at -80°C until further use. Each replicate involved three plants.

RNA extraction and Illumina sequencing

Total RNA was extracted using an RNA Prep Pure Plant kit (LabKit Co., China) and analyzed for quality and quantity by an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Equal quantities of total RNA from the three biological replicates were pooled before used for cDNA library construction and sequencing. cDNA libraries were constructed using the Illumina RNAseq kit (Illumina, USA). Solexa adapters were then ligated to the ends of the cDNA fragments for Solexa sequencing. High-throughput sequencing was performed using Illumina Hiseq 2500 (Illumina, USA). Reads were 125 bases in length and generated from each end of the DNA fragments in paired-end sequencing.

Gene annotation and differential expression analysis

The adaptor sequences and low-quality sequence reads were removed from the data sets. Raw sequences were transformed into clean tags after data processing. These clean tags were then mapped to the reference genome sequence (http://phytozome.net/, Sorghum bicolor v2.1). GO (Gene Ontology, http://www.geneontology.org/) and COG (Clusters of Orthologous Groups of proteins, http://www.ncbi.nlm.nih.gov/COG/) analyses were subsequently performed.

The expression level of each gene was measured as the normalized number of matched clean tags. The normalization method of reads per kilobase per million mapped (RPKM) was used in this study [11]. The IDEG6 software was used to test the significance of gene expression. The false discovery rate (FDR) method was used to tune the threshold P value. An FDR <0.01 and an absolute log2 ratio ≥1 were used to judge the significance of differences in gene expression.

SNP and indel detection

All S722 reads were used to mine SNP markers by aligned to the sorghum reference genome using TopHat [12]. SNP loci were detected using SAMtools between the library and the reference genome [13]. For the nucleotides to be counted as a SNP, the following parameters were required: (1) The sequencing depth was more than 5X, but less than 100X; (2) the quality value of variant calling was larger than 20%; and (3) the distance between SNP loci was longer than 5 bases.

Indel loci were detected using GATK by aligning S722 reads to the reference genome using STAR. The parameters for indel loci detection were as follows: (1) The quality value was larger than 20; and (2) the distance between detected indel loci was longer than 35 bases.

Indel marker design and genetic map construction

All indel sequences (500 bases) were extracted from the sorghum reference genome. Then, indel markers were designed using Primer 3.0 [14]. The parameters were as follows: (1) Tm value was 55–60°C; and (2) PCR target product was 150–350 bases. A hundred indel markers (more than 3 bases of difference) were synthesized by the Springen Biotechnology Company (Nanjing, China). A hundred indel markers (10 markers for each chromosome) were screened for polymorphism between Tx623B and S722 by size detection using 8% non-denaturing polyacrylamide-based electrophoresis. To prove the value of these indel markers, a set of 92 F2 individuals derived from a cross between sorghum Tx623A and sudangrass Sa was used as the mapping population. Then, 15 indel markers were used to construct the genetic map. The primers are listed in S1 Table. The genetic map was constructed using Windows QTL IciMapping version 4.0 [15]. The Kosambi map function was used to calculate the genetic distances.

Analysis of non-synonymous and synonymous mutations

All reads were aligned to the reference genome for coding frame prediction using ClustalW2. The ratio of non-synonymous to synonymous mutations (Ka/Ks) was calculated for these coding reads using PAML_yn00 program. The related parameters for PAML_yn00 program were verbose = 0, icode = 0, weighting = 1, commonf3x4 = 0 and ndata = 1.

Results

Transcriptome sequencing output and aligning to the reference genome

High quality sequencing data is the base of transcriptome analysis. RNA-Seq of the two cDNA libraries, Tx623B and S722, was performed. The Illumina sequencing generated 48,155,337 sequence reads and 9.62 Gb of sequence data. Both GC contents of the sequences of the two libraries were larger than 50% (58.26% and 58.15% for Tx623B and S722, respectively), and CycleQ20 was 100%. All these values indicated that the accuracy and quality of the sequence data were high enough for further analysis.

All reads were aligned to the S. bicolor genome which contained 33,472 gene models. The results showed that most reads of the two libraries (88.79% and 83.88% for Tx623B and S722, respectively) matched to the S. bicolor genes (Table 1). The majority of the mapped reads were “perfectly mapped reads” without any base-pair mismatch or indel. Totally, 21,572 and 20,752 genes were identified in the reads of the Tx623B and S722 libraries, respectively. The results showed that more than 60% putative genes in the reference genome were expressed in the seven-leaf stage of the two Sorghum species.

thumbnail
Table 1. RNA-sequencing results for sorghum Tx623B and sudangrass S722.

https://doi.org/10.1371/journal.pone.0154947.t001

Differentially expressed gene analysis and new gene analysis

A total of 2,397 differentially expressed genes (DEGs) were detected by RNA-Seq between the two libraries, including 849 up-regulated ones and 1,548 down-regulated ones (S2 Table). The results showed that there were wide differences in gene expression between the two Sorghum species. And most of the DEGs were down-regulated in Sorghum sudanense. To get annotations, the DEGs of the two libraries were mapped using the GO database. After GO analysis, the 1,619 unigenes were divided into three groups: molecular function, cellular component and biological process (Fig 1). Then, gene set enrichment analysis was applied to unveil that common unigenes and DEG unigenes mainly differed in protein transcription factor activity, rhythmic process, viral reproduction and growth. These results explained the difference in growth rate between S. bicolor and S. sudanense in the seven-leaf stage.

thumbnail
Fig 1. Functional annotation of sequences based on Gene Ontology (GO) categorization.

https://doi.org/10.1371/journal.pone.0154947.g001

A total of 1,470 new genes were identified in S722 (S3 Table). These new genes, of different lengths, were annotated using different databases. Totally, 102 new genes were categorized into 25 functional groups according to the COG classification (Table 2). The group‘replication, recombination and repair’ contained the highest number of new genes, followed successively by the groups of ‘general function prediction only’, ‘transcription’, ‘cell cycle control, cell division, chromosome partitioning’ and ‘signal transduction mechanism’ (Fig 2). The percentage of annotated new genes was 60.6%. The results indicated that further analysis of the unannotated new genes may shed light on the genetic basis for the morphological differences between sorghum and sudangrass.

thumbnail
Table 2. The 102 new genes annotated using different annotation databases.

https://doi.org/10.1371/journal.pone.0154947.t002

thumbnail
Fig 2. Functional classification of the new genes using COG.

https://doi.org/10.1371/journal.pone.0154947.g002

SNP and indel discovery and validation

SNP and indel loci are important information to develop markers for molecular research of sorghum, sudangrass and hybrids of sorghum and sudangrass. A total of 44,495 SNPs were discovered by aligning S722 reads to the reference genome (S4 Table). The highest and lowest number of SNPs was found on chromosome 1 and 5, respectively (Table 3) with a mean of 4,415. This indicated that every 15.7 Kb of the sudangrass genome had 1 SNP between S. bicolor and S. sudanense, which was good enough for developing markers. Of these SNPs, 33,751 were located in genes, while 10,744 were in intergenic regions. It was found that more than 70% of the SNPs could be developed into functional markers. Further analysis showed that 61.37% of the SNPs were transition, and the value was almost the same among different chromosomes.

Indel markers are ubiquitous, co-dominant and locus-specific [16]. They are a type of useful marker in molecular genetics. In this research, 16,928 indel loci were identified by aligning S722 reads to the sorghum reference genome (S5 Table). On average, one indel was found in every 41.3 Kb of the sudangrass genome. Of these indels, most were only one-base different between S. bicolor and S. sudanense; 68.1% were 1- to 3-base different (Fig 3A). The highest and lowest number of indel was found on chromosome 1 and 5, respectively (Fig 3B).

thumbnail
Fig 3. The distribution of indel loci in sorghum (A: the distribution of base differences, X axis is base difference, and Y axis is number; B: the distribution of indel in different chromosomes, X axis is chromosome, and Y axis is number).

https://doi.org/10.1371/journal.pone.0154947.g003

SNP and indel validation and genetic map construction

To validate the SNPs, 30 SNP loci were randomly selected to be sequenced using the Sanger sequencing method. The amplification primers are listed in S6 Table. The results confirmed that all the SNP loci should be reliable.

A total of 5,344 indel primers were designed for the 16,928 indel loci using Primer 3.0 (S7 Table), 100 of which (more than 3 bases of difference) were synthesized and screened between Tx623B and S722. It turned out that 5 did not amplify PCR product. Ninety-five indel markers showed polymorphism between the two accessions. The percentage of polymorphism was 82% between Tx623A and Sa, further proving the universality of the 100 indel markers. Then, 15 out of the 100 indel markers were selected to construct the genetic map of the F2 population derived from the cross of Tx623A and Sa (Fig 4). All the results indicated that the SNPs and indels could provide important information for marker mining and applications in molecular genetic research and molecular marker-assisted breeding.

thumbnail
Fig 4. The genetic map of the F2 population derived from the cross of sorghum Tx623A and sudangrass Sa.

https://doi.org/10.1371/journal.pone.0154947.g004

Synonymous and non-synonymous mutation analysis

To detect functional sequence diversity, putative exon sequences were compared between S722 and the reference genome. Comparison between synonymous base substitutions (Ks) and non-synonymous base substitutions (Ka) can tell sequences are influenced by purifying or diversifying selection. The average Ka/Ks value was 0.39 for orthologous pairs. About 95% of the orthologous pairs exhibited a Ka/Ks<1.0, indicating that the genes were influenced by purifying selection (Fig 5). And 5% of the orthologous pairs showed a Ka/Ks>1.0, indicating that the genes were influenced by diversifying selection. Functional annotation was performed for the 15 pairs under diversifying selection. The largest numbers of annotated pairs were of catalytic activity and DNA/RNA binding molecular functions (S1 Fig).

Discussion

Transcriptome sequencing provides a new tool for genomic studies on model or non-model organisms [10,17]. In this study, the transcriptome of sudangrass, which is an important forage in temperate regions, was characterized. Compared to sorghum, sudangrass has a higher forage yield and a higher growth rate [5]. So, there is a need to understand the genetic mechanisms underlying the phenotypic differences between S. bicolor and S. sudanense so that S. sudanense genes can be utilized to develop related breeding strategies. Learning the expression profiles of S. bicolor and S. sudanense and the sequence polymorphisms provides an opportunity to understand the genetic basis of phenotypic differences. In this research, some up-regulated transcriptional factors involved in plant growth and development were also identified. For example, the up-regulated Sorghum_newGene_2588 in sudangrass was annotated as a Myb-related transcriptional factor involved in plant growth and development [18].

It is controversial about whether S. bicolor and S. sudanense are actually of a same species. Snowden treated sudangrass as S. sudanense, a species different from sorghum in spikelet, anthotaxy and plant traits [19]. De Wet suggested that sudangrass be placed as a subspecies, drummondii (steud), of S. bicolor (L.) Moench [20]. The results from this study showed that the GC contents of sorghum Tx623B and sudangrass S722 were just slightly different. The percentages of the mapped reads aligned to the reference genome also differed only slightly between the two transcriptomes. All these data indicated a high genomic similarity between the two transcriptomes. Although the results of this research could not support that sudangrass should be placed as a subspecies of S. sorghum, they provided a clue for further studies on the place of sudangrass in taxonomy.

The polymorphism of SSR markers developed from the sorghum genome was low (19.48%) between sorghum and sudangrass [21]. Therefore, it is important to develop high diversity markers for genetic research on sudangrass and hybrids of sorghum and sudangrass. As indel markers within genes are functional markers, they are superior to random DNA markers, such as RAPD, SSR, and AFLP, because functional markers are completely linked with trait locus alleles [22]. A total of 16,928 indel loci between S722 and the reference genome were detected, and 5,344 indel markers were developed. The percentage of polymorphism was 95% in the 100 selected indel markers between Tx623B and S722; the percentage of polymorphism was 82% between Tx623B and Sa. It was indicated that the indel loci were reliable and the indel markers were diverse between sorghum and sudangrass. To further verify the value of these indel markers, they were used to construct the genetic maps of chromosomes 1 and 3 derived from a cross between sorghum Tx623A and sudangrass Sa. The results demonstrated that these indel markers developed will be useful for QTL mapping and molecular-assisted breeding in sorghum, sudangrass, and hybrids of sorghum and sudangrass.

SNP detection is an important part of molecular genetic research because SNP loci can be exploited to construct high-density genetic maps and genome-wide association studies [8,23,24]. A total of 44,495 SNP loci between S722 and sorghum genome were detected in this work. The frequencies of transition and transversion were comparable to those observed in other plant species [25, 26, 27]. The density of SNP loci was 2.6 times that of indel loci. Therefore, SNP loci are an important supplement to indel markers in marker development for construction of high-density genetic maps or gene mapping. Additionally, it was found that 24.3% of the SNP loci were genic. These genic SNP loci will be more appropriate for developing phenotype-associated markers.

The ratio of Ka to Ks provides a measure of the selection pressure to which a gene pair is subjected and has been used to tell whether a gene is under positive/diversifying selection or under negative/purifying selection. Our results are consistent with the findings by other researchers that most orthologous genes are under purifying selection. It was also suggested that positive selection pressure results in the maintenance of divergence between the two species, which is also observed in other organisms [25, 28].

Conclusion

In this study, the transcriptome of Sorghum sudanense was sequenced and characterized using the next-generation sequencing platform. A total of 2,397 DEGs were detected by RNA-Seq between the two libraries, including 849 up-regulated ones and 1,548 down-regulated ones. A total of 44,495 SNPs were discovered by aligning S722 reads to the sorghum reference genome. A total of 16,928 indel loci were identified between the two genomes. A total of 5,344 indel markers were designed, 15 of which were selected to construct the genetic map derived from the cross of Tx623A and Sa. This work provides a set of functional markers and a resource for mapping of and molecular genetic research on sudangrass and other Sorghum species.

Supporting Information

S1 Fig. GO Classification of orthologous pairs.

https://doi.org/10.1371/journal.pone.0154947.s001

(DOCX)

S1 Table. The primers used to construct the genetic map.

https://doi.org/10.1371/journal.pone.0154947.s002

(DOCX)

S2 Table. Differentially expressed genes between the two libraries.

https://doi.org/10.1371/journal.pone.0154947.s003

(XLSX)

Acknowledgments

This work was supported by grants from the National Natural Science Foundation of China (31301383), the Key-construction Subject Plan of Anhui Province (No: WanJiaoMiKe[2014]28), the young talent project of Anhui province (gxyqZD2016217), the key project of natural science research of Anhui provincial education department (KJ2016A177), the Key-construction Subject Plan of Anhui Science and Technology University (AKZDXK2015A03).

Author Contributions

Conceived and designed the experiments: QWZ. Performed the experiments: LHW JQL YLL. Analyzed the data: JQL LHW. Contributed reagents/materials/analysis tools: XCY. Wrote the paper: JQL.

References

  1. 1. Menz MA, Klein RR, Mullet JE, Obert JA, Unruh NC, Klein PE. A high-density genetic map of Sorghum bicolor (L.) Moench based on 2926 AFLP, RFLP and SSR markers. Plant Mol Biol.2002; 48: 483–499. pmid:11999830
  2. 2. Carpita NC, McCann MC. Maize and sorghum: genetic resources for bioenergy grasses. Trends Plant Sci. 2008; 13: 415–420. pmid:18650120
  3. 3. Hacker JB. Sorghum drummondii (Steud) Millsp.& Chase.In:Mannetje LT, Jones RM. eds. Plant resources of South-East Asia 4 Forages. Wageningen: Pudoc Scientific Publishers; 1992.pp.206–208.
  4. 4. Beck PA, Hutchison S, Gunter SA, Losi TC, Stewart CB, Capps PK, et al. Chemical composition and in situ dry matter and fiber disappearance of sorghum x Sudangrass hybrids. J Anim Sci.2007; 85: 545–555. pmid:17235037
  5. 5. Zhan QW, Qian ZQ. Heterosis utilization of hybrid between sorghum [Sorghum bicolor (L.) Moench] and Sudangrass [Sorghum sudanense (Piper) Stapf]. Acta Agron Sin. 2004; 30: 73–77.
  6. 6. Zhan QW, Zhang TZ, Wang BH, Li JQ. Diversity comparison and phylogenetic relationships of S. bicolor and S. sudanense revealed by SSR markers. Plant Sci. 2008; 174: 9–16.
  7. 7. Chen XR, Xing YP, Li YP, Tong YH, Xu JY. RNA-Seq reveals infection-related gene expression changes in Phytophthora capsici. PLoS One.2013; 8: e74588. pmid:24019970
  8. 8. Novaes E, Drost DR, Farmerie WG, Pappas GJ, Grattapaglia D, Sederoff RR, et al. High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome. BMC Genomics. 2008; 9: 312. pmid:18590545
  9. 9. Kim MY, Lee S, Van K, Kim TH, Jeong SC, Choi IY, et al. Whole-genome sequencing and intensive analysis of the undomesticated soybean (Glycine sojaSieb. and Zucc.) genome. Proc Natl Acad Sci U S A. 2010; 107: 22032–22037. pmid:21131573
  10. 10. Zheng W, Ma L, Zhao J, Li Z, Sun F, Lu X. Comparative transcriptome analysis of two rice varieties in response to rice stripe virus and small brown plant hoppers during early interaction. PLoS One. 2013; 8: e82126. pmid:24358146
  11. 11. Wagner G, Kin K, Lynch V. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory in Biosciences. 2012; 131: 281–285. pmid:22872506
  12. 12. Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013; 14: R36. pmid:23618408
  13. 13. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25: 2078–2079. pmid:19505943
  14. 14. Untergasser A, Cutcutache I, Koressaar T, Ye J, Faircloth BC, et al. Primer3—new capabilities and interfaces. Nucleic Acids Res, 2012; 40: e115. pmid:22730293
  15. 15. Meng L, Li H, Zhang L, Wang J. QTL IciMapping: Integrated software for genetic linkage map construction and quantitative trait locus mapping in biparental populations. The Crop Journal. 2015; 3: 269–283.
  16. 16. Tan S, Cheng JW, Zhang L, Qin C, Nong DG, Li WP, et al. Construction of an interspecific genetic map based on InDel and SSR for mapping the QTLs affecting the initiation of flower primordia in pepper (Capsicum spp.). PLoS One. 2015; 10: e119389.
  17. 17. Muriira NG, Xu W, Muchugi A, Xu J, Liu A. De novo sequencing and assembly analysis of transcriptome in the Sodom apple (Calotropis gigantea). BMC Genomics. 2015; 16: 723. pmid:26395839
  18. 18. Jin H, Martin C. Multifunctionality and diversity within the plant MYB-gene family. Plant Mol Biol. 1999; 41: 577–585. pmid:10645718
  19. 19. Snowden JD. The Cultivated Races of Sorghum: Adlard& Son, Ltd.; 1936. pp. 274.
  20. 20. De Wet JMJ, Huckabay JP. The origin of Sorghum bicolor. II.Distribution and domestication. Evolution. 1967; 21: 787–802.
  21. 21. Liu YL, Wang LH, Li JQ, Zhan QW, Zhang Q, Li JF, et al. QTL mapping of forage yield and forage yield component traits in Sorghum bicolor x S. sudanense. Genet Mol Res. 2015; 14: 3854–3861. pmid:25966155
  22. 22. Liu J, Qu J, Yang C, Tang D, Li J, Lan H, et al. Development of genome-wide insertion and deletion markers for maize, based on next-generation sequencing data. BMC Genomics. 2015; 16: 601. pmid:26269146
  23. 23. Guo M, Rupe MA, Danilevskaya ON, Yang X, Hu Z. Genome-wide mRNA profiling reveals heterochronic allelic variation and a new imprinted gene in hybrid maize endosperm. Plant J. 2003; 36: 30–44. pmid:12974809
  24. 24. Duran C, Appleby N, Clark T, Wood D, Imelfort M, Batley J, et al. AutoSNPdb: an annotated single nucleotide polymorphism database for crop plants. Nucleic Acids Res. 2009; 37: 951–953.
  25. 25. Jhanwar S, Priya P, Garg R, Parida SK, Tyagi AK, Jain M. Transcriptome sequencing of wild chickpea as a rich resource for marker development. Plant Biotechnol J. 2012; 10: 690–702. pmid:22672127
  26. 26. Wu X, Ren C, Joshi T, Vuong T, Xu D, Nguyen HT. SNP discovery by high-throughput sequencing in soybean. BMC Genomics. 2010; 11: 469. pmid:20701770
  27. 27. Milano I, Babbucci M, Panitz F, Ogden R, Nielsen RO, Taylor MI, et al. Novel tools for conservation genomics: comparing two high-throughput approaches for SNP discovery in the transcriptome of the European hake. PLoS One. 2011; 6: e28008. pmid:22132191
  28. 28. Schlueter JA, Dixon P, Granger C, Grant D, Clark L, Doyle JJ, et al. Mining EST databases to resolve evolutionary events in major crop species. Genome. 2004; 47: 868–876. pmid:15499401