Effective Extraction and Assembly Methods for Simultaneously Obtaining Plastid and Mitochondrial Genomes

Wanjun Hao; Shihang Fan; Wei Hua; Hanzhong Wang

doi:10.1371/journal.pone.0108291

Abstract

Background

In conventional approaches to plastid and mitochondrial genome sequencing, the sequencing steps are performed separately; thus, plastid DNA (ptDNA) and mitochondrial DNA (mtDNA) should be prepared independently. However, it is difficult to extract pure ptDNA and mtDNA from plant tissue. Following the development of high-throughput sequencing technology, many researchers have attempted to obtain plastid genomes or mitochondrial genomes using high-throughput sequencing data from total DNA. Unfortunately, the huge datasets generated consume massive computing and storage resources and cost a great deal, and even more importantly, excessive pollution reads affect the accuracy of the assembly. Therefore, it is necessary to develop an effective method that can generate base sequences from plant tissue and that is suitable for all plant species. Here, we describe a highly effective, low-cost method for obtaining plastid and mitochondrial genomes simultaneously.

Results

First, we obtained high-quality DNA employing Partial Concentration Extraction. Second, we evaluated the purity of the DNA sample and determined the sequencing dataset size employing Vector Control Quantitative Analysis. Third, paired-end reads were obtained using a high-throughput sequencing platform. Fourth, we obtained scaffolds employing Two-step Assembly. Finally, we filled in gaps using specific methods and obtained complete plastid and mitochondrial genomes. To ensure the accuracy of plastid and mitochondrial genomes, we validated the assembly using PCR and Sanger sequencing. Using this method,we obtained complete plastid and mitochondrial genomes with lengths of 153,533 nt and 223,412 nt separately.

Conclusion

A simple method for extracting, evaluating, sequencing and assembling plastid and mitochondrial genomes was developed. This method has many advantages: it is timesaving, inexpensive and reproducible and produces high-quality sequence. Furthermore, this method can produce plastid and mitochondrial genomes simultaneously and be used for other plant species. Due to its simplicity and extensive applicability, this method will support research on plant cytoplasmic genomes.

Citation: Hao W, Fan S, Hua W, Wang H (2014) Effective Extraction and Assembly Methods for Simultaneously Obtaining Plastid and Mitochondrial Genomes. PLoS ONE 9(9): e108291. https://doi.org/10.1371/journal.pone.0108291

Editor: Steven M. Theg, University of California - Davis, United States of America

Received: May 29, 2014; Accepted: August 15, 2014; Published: September 24, 2014

Copyright: © 2014 Hao et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. The complete plastid sequence is available from the Genbank database (accession number KJ872515).

Funding: This study was supported by the National High Technology Research and Development Program of China (2013AA102602), the National Key Basic Research Program of China (2011CB109300) and the Core Research Budget of the Non-profit Governmental Research Institution (161017). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The majority of plant progenies inherit their plastid and mitochondrial DNA from the maternal parent, and in recent decades, plastid and mitochondrial genomes have been used widely in studies on diversity and evolution. Moreover, plastids and mitochondria are important energy and metabolism organelles in plant cells. Many anabolic and catabolic processes occur in these two organelles, such as photosynthesis, respiration, and fatty acid synthesis. Thus, plastid and mitochondrial DNA have recent particular attention in plant research, highlighting the need to obtain plastid and mitochondrial genomic sequences.

Conventional approaches to generating plastid and mitochondrial genome sequences use separate processes. Thus, plastid DNA (ptDNA) and mitochondrial DNA (mtDNA) are prepared independently. Typically, researchers purify ptDNA and mtDNA from green leaves and etiolated seedlings separately employing density-gradient ultracentrifugation (CsCl, sucrose, or percol) [1]–[4]. This demanding protocol is unsuitable for wide use for plant plastid and mitochondrial genome sequencing. An additional method uses Long-PCR to amplify ptDNA or mtDNA prior to sequencing. In recent years, high-throughput sequencing platforms have been used to capture sequence data from many individual PCR amplifications that cover the entire plastid or mitochondrial genome [5], [6]. Because this method requires a reference sequence, it can be used only for a few species; moreover, it is time consuming. Thus, many researchers attempted to obtain plastid genomes or mitochondrial genomes using high-throughput sequencing data from total DNA. Unfortunately, the huge datasets generated consume massive computing and storage resources and cost a great deal, and even more importantly, excessive false-positive reads affect the accuracy of the assembly.

Here, we report a simple method that extracts, evaluates, sequences and assembles plastid and mitochondrial genomes simultaneously. Using this method, first, crude plastids and mitochondria were isolated together employing differential centrifugation. Subsequently, ptDNA and mtDNA were extracted from this crude preparation of plastids and mitochondria. Following their evaluation, eligible DNA samples were used for high-throughput sequencing. Finally, the complete plastid and mitochondria genome sequences were obtained employing de novo assembly. This method is economical and timesaving and can be used for all species.

Materials and Methods

Plant material

Brassica napus L. line DH366, a fertile rapeseed line possessing Polima cytoplasm, was used for this study. Rapeseed seeds were surface sterilized using 70% ethanol for 2 min, treated with 10% sodium hypochlorite for 15 min, and subsequently washed 4–5 times using sterile water. The sterile seeds were inoculated in 150 ml Erlenmeyer flasks containing 1/2 MS media and incubated in the dark at 22°C and 70% relative humidity. Four-week-old etiolated seedlings were collected for DNA extraction.

DNA extraction

Reagents and solutions.

Homogenization medium: 0.4 M mannitol, 1 mM EDTA, 25 mM MOPS-KOH, 10 mM tricine, 8 mM cysteine, 0.1% BSA and 0.1% PVP-40, pH 7.8.

Extraction medium: 50 mM Tris-HCl, 10 mM EDTA, 2% sarkosyl and 0.012% Proteinase K, pH 8.0.

TE: 10 mM Tris-HCl and 1 mM EDTA, pH 8.0.

Protocols.

Step one to five were conducted at 0°C, and all equipment, consumables and reagents were cooled to 0°C.

Etiolated seedlings (5 g, fresh weight) were chopped up using a pair of scissors in 50 ml of pre-cooled homogenization medium.
The chopped tissue was then transferred into a Dounce tissue grinder and ground 80 times in an ice bath.
The homogenate was poured into a 50 ml centrifuge tube and centrifuged for 10 min at 500 g to remove nucleus and cell fragments.
The supernatant was transferred into a new 50 ml centrifuge tube and centrifuged for 5 min at 1000 g to remove residual contamination.
The supernatant was transferred into a new 50 ml centrifuge tube and centrifuged for 20 min at 12,000 g to precipitate the plastids and mitochondria.
The supernatant was discarded, and the pellet was composed of crude plastids and mitochondria.
Extraction medium (1.5 ml) was added to the crude pellets and mixed using pipette tips, and the mixture was then transferred into two 2 ml centrifuge tubes.
The tubes were capped and the suspensions incubated at 37°C for 1 hour to disrupt the organelles.
The tubes were cooled at room temperature for 5 min, and 85 µl of 2 M sodium acetate solution was then added to each tube.
Equilibrated phenol (850 µl) was added to each tube, and the solutions mixed well and then centrifuged at 13,200 rpm for 10 min.
The upper aqueous phase was transferred into new 1.5 ml centrifuge tubes, an equivalent volume of chloroform:isoamylalcohol (24∶1, v/v) was added to each tube, and the solutions were mixed well and then centrifuged at 13,200 rpm for 10 min.
The upper aqueous phase was transferred to new 1.5 ml centrifuge tubes, 2-fold volume of 100% ethanol was added, and the contents were mixed well and then incubated at −20°C for 30 min to dissociate the DNA.
The tube was centrifuged for 5 min at 13,200 rpm, and the supernatant was removed.
The precipitated DNA was washed twice with 70% ethanol.
The DNA was dried at room temperature and re-dissolved in 50 µl TE buffer.

DNA evaluation

In this study, a new method, Vector Control Quantitative Analysis (VCQA), was used for DNA evaluation. First, three genes–rpoB, ccmB and β-actin–were cloned from the rapeseed plastid, mitochondrial and nuclear genomes separately. Second, three genes were joined into one fragment using overlap extension PCR. Third, the synthetic fragment was cloned into the pMD-18 vector to obtain the control vector pMD18-T-VCQA (Figure 1). Fourth, the copy folds of rpoB/β-actin and ccmB/β-actin were determined employing qPCR. Finally, we used rpoB, ccmB and β-actin to represent ptDNA, mtDNA and ncDNA separately. Thus, the copy fold of organelle DNA to ncDNA in each DNA sample could be computed using the following equation:

Download:

Figure 1. The control pMD18-T-VCQA vector.

https://doi.org/10.1371/journal.pone.0108291.g001

where E = efficiency of amplification and CT = number of cycles. The detailed derivations of these equations are described in File S1.

DNA sequencing

Approximately 3 µg of purified DNA was used to prepare sequence libraries. A 500 bp DNA library was prepared following the manufacturer’s instructions (Illumina sample preparation protocol for paired-end sequencing). This DNA library was sequenced over 100×2 cycles on an Illumina HiSeq2000 sequencing platform (http://www.illumina.com). Base calling was performed using Illumina Pipeline 1.8 (Illumina, San Diego, CA, US). We then discarded read pairs in which the reads contained adaptor sequences, more than 10% unknown bases (N) or more than 50% low-quality (<5) bases. All of this work was performed by Novagene (Beijing, China).

De novo assembly

In this study, assembly was conducted using the Two-step Assembly (TSA) method. A detailed description of TSA follows.

During the first step, the plastid genome was assembled. The detailed pipeline is shown below (Figure 2A). 1) De novo assembly: de novo assembly was performed using SOAPdenovo2 software [7] using high kmer size and kmer frequency to eliminate the effects of plastid reads and nuclear reads. 2) Scaffold mapping: we mapped all the scaffolds to the reference plastid genome using BLAT software [8]. The mapped scaffolds were filtered and used to construct the draft genome. 4) Constructing the draft genome: all mapped scaffolds were ordered manually based on their position in the reference genome and were connected into a draft genome using overlapping information with gaps filled using N. 5) Gap closure: we filled gaps with all the reads employing the GapCloser software in SOAPdenovo2. 6) Assembly validation: we validated the order of the scaffolds and accuracy of the sequence employing PCR amplification and Sanger sequencing.

Download:

Figure 2. Flowchart showing the major steps of de novo assembly employing TSA.

A) First step: assembly of the plastid genome; B) second step: assembly of the mitochondrial genome.

https://doi.org/10.1371/journal.pone.0108291.g002

For the second step, we conducted de novo assembly of the mitochondrial genome. The detailed pipeline is shown below (Figure 2B). 1) Filtering plastid reads: we mapped the total clean reads to the plastid genome obtained above using SOAP2 software (http://soap.genomics.org.cn/soapaligner.html) with no mismatch. Only unmapped pair-end reads were used for the next step. 2) De novo assembly: the de novo assembly was performed using SOAPdenovo2 software. 3) Scaffold mapping: we mapped all the scaffolds to the reference mitochondrial genome using BLAT software. The mapped scaffolds were filtered and used to construct the draft genome. 4) Constructing the draft genome: all the mapped scaffolds were ordered manually and connected into a draft genome based on their position in the reference genome using overlap information with gaps filled with N. 5) Gap closure: because plastid reads were pre-removed prior to mitochondrial assembly, theoretically, any region similar to the plastid genome in a mitochondrial genome could create a gap. To ensure accuracy, we filled gaps employing a strict method. First, all reads were mapped to the draft genome. Then, paired-end reads, of which at least one end mapped to the draft genome exactly, were filtered. Finally, filtered reads were used to fill gaps employing GapCloser software. Read mapping, read filtering and gap filling continued until all gaps no longer extended. The remaining gaps were filled in using PCR amplification and Sanger sequencing. 6) Assembly validation: we validated the order of the scaffolds and accuracy of the sequence employing PCR amplification and Sanger sequencing.

Gene annotation and variation analyses

When we obtained the complete plastid genome and mitochondrial genome, genes were annotated using DOGMA (http://dogma.ccbb.utexas.edu/) [9]. Using their respective reference genomes, plastid and mitochondrial genomes were compared for variations using Crossmatch in Phrap (http://www.phrap.org/). All the variations were then annotated using snpEff software [10].

Results and Discussion

DNA extraction using PCE

The conventional approach used to prepare ptDNA or mtDNA samples for sequencing is quite complex. The method requires density gradient ultracentrifugation and large amounts of green leaves or etiolated seedlings [1]–[4]. In recent years, alternative methods have been developed, such as differential centrifugation combining DNase I digestion [1], [4], [11]–[19], Long-PCR [5], [6], multiply-primed rolling circle amplification (RCA) [20] and a probe enrichment strategy [21]. All of these methods have deficiencies, such as their complexity, instability or lack of suitability for wide use. Most importantly, no method can extract ptDNA and mtDNA together except when total DNA is used. In this study, we developed a method that can enrich ptDNA and mtDNA simultaneously. Because the method can only increase the proportion of ptDNA and mtDNA in a DNA sample, we named it Partial Concentration Extraction (PCE).

Using PCE, first, crude plastids and mitochondria were isolated employing differential centrifugation from etiolated seedlings. Then, ptDNA and mtDNA were extracted using phenol-chloroform extraction and ethanol precipitation. The ptDNA and mtDNA were enriched in the resulting DNA. The details of the DNA extraction are described in the Materials and Methods. Finally, a total of 7.17 µg DNA was obtained from 5 g of etiolated seedlings. The results in Table 1 show the DNA quality examined using a NanoDrop 1000. The quality of the DNA was further tested using electrophoresis of the DNA on a 1% agarose gel. As shown in Figure 3, no degradation of the DNA fragment extracted using PCE was observed. All the results indicate that we obtained a high-quality DNA sample suitable for sequencing.

Download:

Figure 3. Comparison of DNA extracted with CTAB versus PCE.

A) Total DNA extracted with CTAB and B) extracted with PCE; M denotes Lambda DNA/Hind III.

https://doi.org/10.1371/journal.pone.0108291.g003

Download:

Table 1. Evaluation of DNA quality using a NanoDrop 1000.

https://doi.org/10.1371/journal.pone.0108291.t001

During the entire procedure, no density gradient ultracentrifugation steps, DNase I digestion, PCR, special probes or reagents were used. Compared with the relatively simple method using differential centrifugation [17], this method saves time and requires less tissue (Table 2). However, to obtain sufficient and intact plastids and mitochondria, an essential precondition is the use of a Dounce tissue grinder and suitable homogenization medium. The etiolated seedlings can decrease the friction drag and facilitate tissue grinding. The simplicity of this method provides it with a higher success rate than others methods. Moreover, this method can be widely used in most laboratories and all plant species. More importantly, when combined with our assembly method, this method easily obtains plastid and mitochondrial genomes simultaneously at little cost.

Download:

Table 2. Comparison between partial concentration extraction and differential centrifugation.

https://doi.org/10.1371/journal.pone.0108291.t002

Purity evaluation using VCQA

Many studies indicate that sequence homology exists between the plastid, mitochondrial and nuclear genome. Thus, when de novo assembly is performed, the plastid genome should influence the mitochondrial genome and vice versa, and both the plastid and mitochondrial genomes should be affected by the nuclear genome. In the past, many studies used PCR or restriction digestion to perform qualitative analysis of ncDNA in pure ptDNA and mtDNA preparations; however, no method can perform quantitative analysis [12], [18], [19]. VCQA, a method based on qPCR, can perform quantitative analysis of DNA types in a DNA sample. For this method, we used a control vector, in which three genes–rpoB, ccmB and β-actin–belong to plastid, mitochondrial and nuclear genomes, respectively, and represent ptDNA, mtDNA and ncDNA separately. Thus, the copy folds of ptDNA/ncDNA and mtDNA/ncDNA in the DNA sample can be easily identified employing quantitative analysis of the three genes.

We evaluated the purity of the DNA sample using VCQA and predicted the sequencing data size for a given sequencing depth. The detailed results are listed in Table 3. In our DNA sample, the copy fold of mtDNA/ncDNA is 72, whereas the copy fold of ptDNA/ncDNA is, astonishingly, 1922. Thus, when the sequencing data cover the entire genome (approximately 1.2 GB) one layer, the mitochondrial genome is 72 layers and the plastid genome is 1922 layers. In theory, an ideal de novo assembly can be obtained when the sequencing depth is greater than 50 layers. Thus, if the sequencing dataset size reaches 1.2 GB, it is sufficient for de novo assembly of mitochondrial and plastid genomes in our DNA sample. These data suggest that our method produces high-purity DNA samples for sequencing plastid mitochondrial genomes.

Download:

Table 3. Quantitative analysis of DNA purity using VCQA employing qPCR.

https://doi.org/10.1371/journal.pone.0108291.t003

Because the method can be used for quantitative analysis, it can be used to compare the DNA of plastids and mitochondria from different materials when performing correlation analyses of ptDNA/mtDNA content and phenotypes.

Genome sequencing and sequencing depth statistics

In this study, a whole-genome shotgun strategy and HiSeq2000 sequencing platform were employed. Paired-end sequencing libraries with insert sizes of approximately 500 bp were constructed using 3 µg DNA samples. Sequencing reads containing adaptor sequences were cleaned, and the sequence data were filtered for low-quality reads. This process resulted in a total of 5,778,987 high-quality paired-end reads containing 1,155,797,400 bases with an average read size of 100 nt.

To obtain satisfactory assemblies, we computed the accurate sequencing depths of the mitochondrial and plastid genomes using the mitochondrial genome (Accession number FR715249) of 2H2A and the plastid genome (Accession number GQ861354) of ZY036 as references. First, we mapped the high-quality reads to the reference genome. Then we computed the sequencing depths of target genes (rpoB and ccmB) and unique regions excluding similar regions of plastid and mitochondrial genomes. The detailed results are listed in Table 4.

Download:

Table 4. Summary of sequence alignments and sequencing depths.

https://doi.org/10.1371/journal.pone.0108291.t004

The sequencing depths of unique regions of the plastid and mitochondrial genomes are similar to the sequencing depths obtained for rpoB and ccmB separately, with differences of less than 10% observed. The sequencing data were nearly 1.2 GB and covered the nuclear genome to approximately one layer. Thus, the sequencing depth of the plastid and mitochondrial genomes can be expected to be equivalent to the copy folds of ptDNA/ncDNA and mtDNA/ncDNA separately. Indeed, the sequencing depths of unique regions and target genes (rpoB and ccmB) are consistent with the copy folds, suggesting that this method of DNA evaluation is highly reliable.

We compared the sequencing depth of our DNA sequencing data and total DNA sequencing data. One sample was part of the whole genome sequencing data of total rapeseed DNA (unpublished). The additional two samples were from previous studies [22], [23]. When the nuclear genome was covered one layer, the sequencing depth of the plastid genome in our sample was 15–35-fold that of the three total DNA samples, and the sequencing depth of the mitochondrial genome in our sample was approximately 4–9-fold that of the three total DNA samples (Table 5). These data suggest that our DNA extraction method yielded a highly pure DNA sample for sequencing the plastid and mitochondrial genomes.

Download:

Table 5. Comparison of sequencing depth in different DNA samples.

https://doi.org/10.1371/journal.pone.0108291.t005

De novo assembly using TSA

With the use of the high-throughput sequencing platform, many researchers have attempted to use total DNA to obtain plastid genome or mitochondrial genome sequences; however, the assembly accuracy has never achieved a satisfactory resolution. In some cases, plastid reads were initially separated from total reads employing the published plastid genomes [24], [25] and then used for de novo assembly of plastid genomes. However, the sequence differences between different materials, in particular, the large insertions and deletions, affect the efficiency of the assembly and lead to more gaps. When assembling mitochondrial genomes, plastid reads were removed from total reads employing the existing plastid genomes [23]. Sequence differences in plastid genomes between different materials prevent the removal of partial plastid reads and lead to false assembly. In some studies [26], the contigs were filtered employing the different sequencing depths of plastids, mitochondria and nucleus. However, this approach cannot avoid false de novo assembly of contigs.

To avoid false assembly, we developed a new assembly method, TSA, that is used with the PCE DNA extraction method in combination. In our sequencing data, the sequencing depth of the nuclear genome is approximately one layer, which is far lower than the sequencing depths of plastid and mitochondrial genomes. Thus, the effect of nuclear reads can be ignored when performing de novo assembly of plastid and mitochondrial genomes. Despite the sequencing depth of the mitochondrial genome reaching almost 100 layers, it remains very low when compared with the 2000 layers obtained for the plastid genome. Thus, we can easily eliminate the effects of mitochondrial reads by employing the large kmer and kmer frequency when performing de novo assembly of the plastid genome.

Based on the depth evaluated above, de novo assembly was performed using a series of different parameters. Finally, we obtained an optimal result. Only six scaffolds (>1 kb) were assembled, and three scaffolds were aligned to the reference plastid genome (Accession number GQ861354; Table 6). Three aligned scaffolds were then ordered manually and connected into a draft sequence based on positions in the reference genome using overlap information with gaps filled with N. To obtain the finished sequence, all the reads was used to fill gaps employing GapCloser. Finally, we obtained a complete plastid genome with a length of 153,533 nt (Table 6) including a pair of inverted repeats (IRs) of 26,186 nt separated by one small and one large single-copy region (SSC and LSC) of 17,780 and 83,381 nt, respectively.

Download:

Table 6. Summary of de novo assembly results.

https://doi.org/10.1371/journal.pone.0108291.t006

The mitochondrial genome assembly was more complicated than that of the plastid genome. The sequencing depth of the plastid genome was far greater than for the mitochondrial genome, and we cannot eliminate effects of plastid reads employing sequencing depth. To reduce the effects of the plastid reads, we must remove the plastid reads from the total reads. We used the plastid genome obtained above as a reference to avoid sequence differences between the different materials. When at least one end mapped exactly to the plastid genome, we considered the paired-end read as a plastid read and removed it. Finally, 4,165,859 paired-end reads, accounting for 72% of the total reads, remained and were used for the de novo assembly of the mitochondrial genome.

After several attempts, we assembled the filtered reads using the following optimal parameters: kmer size 41, kmer frequency 20 and edge coverage 20. Finally, 44 scaffolds greater than 1 kb were obtained and mapped to the reference mitochondrial genome (Accession number FR715249), resulting in four mapped scaffolds (Table 6). The mapped scaffolds were ordered manually and connected into a draft genome based on their positions in the reference genome using overlap information with gaps filled with N. To ensure the accuracy of the sequence, we filled gaps employing a rigorous method. In the first cycle, we mapped the total reads to the draft genome, and 496,252 paired-end reads with at least one end were mapped and filtered. The filtered reads were used to fill gaps, and one gap was filled. During the second cycle, we mapped the total reads to the sequence obtained after the first cycle. Consequently, 594,260 paired-end reads were mapped and were used to fill the remaining gaps. After two cycles of read mapping, read filtering and gap filling, all gaps were filled. Ultimately, we obtained the complete mitochondrial genome sequence of 223,412 nt in length (Table 6).

We used PCR and Sanger sequencing to confirm the sequence accuracy of this novel method, which has been used for the first time to assemble plastid and mitochondrial genomes. We designed ten pairs of primers to amplify the sequence fragment, which contained several junction regions of two scaffolds (Table 7). Following PCR, we identified the sequence of the twelve PCR products using Sanger sequencing. The sequences obtained were compared with the complete genome. No false results were obtained in ten regions adding up to 8,071 bp (Table 7). These data indicate the high quality of the de novo assembly and the high accuracy of the order of scaffolds and gap filling.

Download:

Table 7. Primer pairs used for assembly validation by PCR using Sanger sequencing.

https://doi.org/10.1371/journal.pone.0108291.t007

Gene annotation and comparative analysis

The first plastid genome of the Polima cytoplasm was annotated using DOGMA. The detailed annotation information was submitted to GenBank (KJ872515). This genome contains the same gene number and order as the ZY036 plastid genome, which possesses a nap cytoplasm. However, when compared with ZY036, the plastid genome is 81 nt longer and there are 202 single-nucleotide polymorphisms (SNPs), 5 multi-nucleotide polymorphisms (MNPs), 106 insertions and deletions (indels) and 13 complex variations (CVs). We annotated all of these variations based on gene information from ZY036. Seventy-one SNPs were located in the coding region of 23 genes, and 31 SNPs changed the amino acid sequences of 10 genes. The indels ranged from 1 nt to 108 nt, with an average length of 5 nt. No indel, MNP or CV was located in coding regions (Table 8).

Download:

Table 8. Variations in plastid and mitochondrial genomes.

https://doi.org/10.1371/journal.pone.0108291.t008

For the first time, we report on the mitochondrial genome of a fertile rapeseed line that possesses a Polima cytoplasm. We compared it with the mitochondrial genome of a Polima cytoplasmic male-sterile line Shaan 2A [14]. Surprisingly, the mitochondrial genome of DH366 exhibited the same length as Shaan 2A. Only four SNPs were noted, of which only two were located in coding regions of two genes and only one SNP changed the amino acid sequence of ORF257 (Table 8). With respect to the exiguous variations, the mitochondrial genome of DH366 possesses the same gene number as Shaan 2A, including 34 protein encoding genes, three ribosomal RNA genes, 18 transfer RNA genes and 40 putative open reading frames (ORFs).

Similar to the plastid genome, enormous variations were observed in the mitochondrial genome when the nap cytoplasm and Polima cytoplasm were compared [14]. The numerous genome variations suggest different origins of the nap cytoplasm and Polima cytoplasm. Surprisingly, only four SNPs were noted when comparing the mitochondrial genomes of DH366 and Shaan 2A. This high congruency indicates the high quality of the mitochondrial genome obtained. For the far great sequencing depth of the plastid genome than the mitochondrial genome, we believe the plastid genome sequence have a high quality too.

Conclusion

In this study, we developed a method to obtain plastid and mitochondrial genomes from plant tissue. This method comprises DNA extraction, DNA evaluation, DNA sequencing and de novo assembly. This method has many advantages, such as simple management, high efficiency, low cost, high accuracy and general applicability; however, the most important benefit is that plastid and mitochondrial genomes can be obtained simultaneously. Using this method, we obtained a high yield of high-quality cytoplasmic DNA, and the method provides reliable purity. The DNA can be used for sequencing on high-throughput sequencing platforms. Finally, the sequencing data can be assembled into complete plastid and mitochondrial genomes. Indeed, we obtained complete plastid and mitochondrial genomes from two additional rapeseed cultivars employing this method (unpublished data). For the simply procedure, we believe this method will promote research on plant cytoplasmic genome and related investigations.

Supporting Information

File S1.

Details on the derivation of the VCQA equation.

https://doi.org/10.1371/journal.pone.0108291.s001

(DOC)

Author Contributions

Conceived and designed the experiments: WJH. Performed the experiments: WJH SHF. Analyzed the data: WJH. Contributed reagents/materials/analysis tools: HZW WH. Contributed to the writing of the manuscript: WJH. Revised the manuscript: HZW WH.

References

1. Alverson AJ, Wei X, Rice DW, Stern DB, Barry K, et al. (2010) Insights into the Evolution of Mitochondrial Genome Size from Complete Sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Molecular Biology and Evolution 27: 1436–1448.
- View Article
- Google Scholar
2. Alverson AJ, Zhuo S, Rice DW, Sloan DB, Palmer JD (2011) The Mitochondrial Genome of the Legume Vigna radiata and the Analysis of Recombination across Short Mitochondrial Repeats. PLoS ONE 6: e16404.
- View Article
- Google Scholar
3. Park J, Lee Y-P, Lee J, Choi B-S, Kim S, et al. (2013) Complete mitochondrial genome sequence and identification of a candidate gene responsible for cytoplasmic male sterility in radish (Raphanus sativus L.) containing DCGMS cytoplasm. Theoretical and Applied Genetics 126: 1763–1774.
- View Article
- Google Scholar
4. Sugiyama Y, Watase Y, Nagase M, Makita N, Yagura S, et al. (2005) The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants. Molecular Genetics and Genomics 272: 603–615.
- View Article
- Google Scholar
5. Handa H (2003) The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana. Nucleic Acids Research 31: 5907–5916.
- View Article
- Google Scholar
6. Yi X, Gao L, Wang B, Su Y-J, Wang T (2013) The Complete Chloroplast Genome Sequence of Cephalotaxus oliveri (Cephalotaxaceae): Evolutionary Comparison of Cephalotaxus Chloroplast DNAs and Insights into the Loss of Inverted Repeat Copies in Gymnosperms. Genome Biology and Evolution 5: 688–698.
- View Article
- Google Scholar
7. Luo R, Liu B, Xie Y, Li Z, Huang W, et al. (2012) SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience 1: 18.
- View Article
- Google Scholar
8. Kent WJ (2002) BLAT–The BLAST-Like Alignment Tool. Genome Research 12: 656–664.
- View Article
- Google Scholar
9. Wyman SK, Jansen RK, Boore JL (2004) Automatic annotation of organellar genomes with DOGMA. Bioinformatics 20: 3252–3255.
- View Article
- Google Scholar
10. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, et al. (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6: 80–92.
- View Article
- Google Scholar
11. Shi C, Hu N, Huang H, Gao J, Zhao Y-J, et al. (2012) An Improved Chloroplast DNA Extraction Procedure for Whole Plastid Genome Sequencing. PLoS ONE 7: e31468.
- View Article
- Google Scholar
12. Kemble RJ (1987) A rapid, single leaf, nucleic acid assay for determining the cytoplasmic organelle complement of rapeseed and related Brassica species. Theoretical and Applied Genetics 73: 364–370.
- View Article
- Google Scholar
13. Darracq A, Varré JS, Maréchal-Drouard L, Courseaux A, Castric V, et al. (2011) Structural and Content Diversity of Mitochondrial Genome in Beet: A Comparative Genomic Analysis. Genome Biology and Evolution 3: 723–736.
- View Article
- Google Scholar
14. Chen J, Guan R, Chang S, Du T, Zhang H, et al. (2011) Substoichiometrically Different Mitotypes Coexist in Mitochondrial Genomes of Brassica napus L. PLoS ONE. 6: e17662.
- View Article
- Google Scholar
15. Liu H, Cui P, Zhan K, Lin Q, Zhuo G, et al. (2012) Comparative analysis of mitochondrial genomes between a wheat K-type cytoplasmic male sterility (CMS) line and its maintainer line. BMC Genomics 12: 163.
- View Article
- Google Scholar
16. Chang S, Chen J, Wang Y, Gu B, He J, et al. (2013) The Mitochondrial Genome of Raphanus sativus and Gene Evolution of Cruciferous Mitochondrial Types. Journal of Genetics and Genomics 40: 117–126.
- View Article
- Google Scholar
17. Wang J, Jiang J, Li X, Li A, Zhang Y, et al. (2012) Complete sequence of heterogenous-composition mitochondrial genome (Brassica napus) and its exogenous source. BMC Genomics 13: 675.
- View Article
- Google Scholar
18. Triboush SO, Danilenko NG, Davydenko OG (1998) A Method for Isolation of Chloroplast DNA and Mitochondrial DNA from Sunflower. Plant Molecular Biology Reporter 16: 183–183.
- View Article
- Google Scholar
19. Hu Z-yZG-m, Wang H-z, Hua W (2012) A Simple Method for Isolating Chloroplast DNA and Mitochondria DNA from the Same Rapeseed Green Leaf Tissue. Journal of Integrative Agriculture 11: 1212–1215.
- View Article
- Google Scholar
20. Atherton R, McComish B, Shepherd L, Berry L, Albert N, et al. (2010) Whole genome sequencing of enriched chloroplast DNA using the Illumina GAII platform. Plant Methods 6: 22.
- View Article
- Google Scholar
21. Stull GW, Moore MJ, Mandala VS, Douglas NA, Kates HR, et al. (2013) A Targeted Enrichment Strategy for Massively Parallel Sequencing of Angiosperm Plastid Genomes. Applications in Plant Sciences 1: 1–7.
- View Article
- Google Scholar
22. Goremykin VV, Lockhart PJ, Viola R, Velasco R (2012) The mitochondrial genome of Malus domestica and the import-driven hypothesis of mitochondrial genome expansion in seed plants. The Plant Journal 71: 615–626.
- View Article
- Google Scholar
23. Wang W, Wu Y, Messing J (2012) The Mitochondrial Genome of an Aquatic Plant, Spirodela polyrhiza. PLoS ONE 7: e46747.
- View Article
- Google Scholar
24. Zhang T, Fang Y, Wang X, Deng X, Zhang X, et al. (2012) The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes. PLoS ONE 7: e30531.
- View Article
- Google Scholar
25. Yang M, Zhang X, Liu G, Yin Y, Chen K, et al. (2010) The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.). PLoS ONE 5: e12762.
- View Article
- Google Scholar
26. Tanaka Y, Tsuda M, Yasumoto K, Yamagishi H, Terachi T (2012) A complete mitochondrial genome sequence of Ogura-type male-sterile cytoplasm and its comparative analysis with that of normal cytoplasm in radish (Raphanus sativus L.). BMC Genomics 13: 352.
- View Article
- Google Scholar

[ref1] 1. Alverson AJ, Wei X, Rice DW, Stern DB, Barry K, et al. (2010) Insights into the Evolution of Mitochondrial Genome Size from Complete Sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Molecular Biology and Evolution 27: 1436–1448.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Alverson AJ, Zhuo S, Rice DW, Sloan DB, Palmer JD (2011) The Mitochondrial Genome of the Legume Vigna radiata and the Analysis of Recombination across Short Mitochondrial Repeats. PLoS ONE 6: e16404.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Park J, Lee Y-P, Lee J, Choi B-S, Kim S, et al. (2013) Complete mitochondrial genome sequence and identification of a candidate gene responsible for cytoplasmic male sterility in radish (Raphanus sativus L.) containing DCGMS cytoplasm. Theoretical and Applied Genetics 126: 1763–1774.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Sugiyama Y, Watase Y, Nagase M, Makita N, Yagura S, et al. (2005) The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants. Molecular Genetics and Genomics 272: 603–615.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Handa H (2003) The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana. Nucleic Acids Research 31: 5907–5916.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Yi X, Gao L, Wang B, Su Y-J, Wang T (2013) The Complete Chloroplast Genome Sequence of Cephalotaxus oliveri (Cephalotaxaceae): Evolutionary Comparison of Cephalotaxus Chloroplast DNAs and Insights into the Loss of Inverted Repeat Copies in Gymnosperms. Genome Biology and Evolution 5: 688–698.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Luo R, Liu B, Xie Y, Li Z, Huang W, et al. (2012) SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience 1: 18.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Kent WJ (2002) BLAT–The BLAST-Like Alignment Tool. Genome Research 12: 656–664.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Wyman SK, Jansen RK, Boore JL (2004) Automatic annotation of organellar genomes with DOGMA. Bioinformatics 20: 3252–3255.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, et al. (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6: 80–92.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref11] 11. Shi C, Hu N, Huang H, Gao J, Zhao Y-J, et al. (2012) An Improved Chloroplast DNA Extraction Procedure for Whole Plastid Genome Sequencing. PLoS ONE 7: e31468.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref12] 12. Kemble RJ (1987) A rapid, single leaf, nucleic acid assay for determining the cytoplasmic organelle complement of rapeseed and related Brassica species. Theoretical and Applied Genetics 73: 364–370.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. Darracq A, Varré JS, Maréchal-Drouard L, Courseaux A, Castric V, et al. (2011) Structural and Content Diversity of Mitochondrial Genome in Beet: A Comparative Genomic Analysis. Genome Biology and Evolution 3: 723–736.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Chen J, Guan R, Chang S, Du T, Zhang H, et al. (2011) Substoichiometrically Different Mitotypes Coexist in Mitochondrial Genomes of Brassica napus L. PLoS ONE. 6: e17662.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref15] 15. Liu H, Cui P, Zhan K, Lin Q, Zhuo G, et al. (2012) Comparative analysis of mitochondrial genomes between a wheat K-type cytoplasmic male sterility (CMS) line and its maintainer line. BMC Genomics 12: 163.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Chang S, Chen J, Wang Y, Gu B, He J, et al. (2013) The Mitochondrial Genome of Raphanus sativus and Gene Evolution of Cruciferous Mitochondrial Types. Journal of Genetics and Genomics 40: 117–126.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Wang J, Jiang J, Li X, Li A, Zhang Y, et al. (2012) Complete sequence of heterogenous-composition mitochondrial genome (Brassica napus) and its exogenous source. BMC Genomics 13: 675.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref18] 18. Triboush SO, Danilenko NG, Davydenko OG (1998) A Method for Isolation of Chloroplast DNA and Mitochondrial DNA from Sunflower. Plant Molecular Biology Reporter 16: 183–183.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref19] 19. Hu Z-yZG-m, Wang H-z, Hua W (2012) A Simple Method for Isolating Chloroplast DNA and Mitochondria DNA from the Same Rapeseed Green Leaf Tissue. Journal of Integrative Agriculture 11: 1212–1215.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Atherton R, McComish B, Shepherd L, Berry L, Albert N, et al. (2010) Whole genome sequencing of enriched chloroplast DNA using the Illumina GAII platform. Plant Methods 6: 22.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Stull GW, Moore MJ, Mandala VS, Douglas NA, Kates HR, et al. (2013) A Targeted Enrichment Strategy for Massively Parallel Sequencing of Angiosperm Plastid Genomes. Applications in Plant Sciences 1: 1–7.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Goremykin VV, Lockhart PJ, Viola R, Velasco R (2012) The mitochondrial genome of Malus domestica and the import-driven hypothesis of mitochondrial genome expansion in seed plants. The Plant Journal 71: 615–626.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Wang W, Wu Y, Messing J (2012) The Mitochondrial Genome of an Aquatic Plant, Spirodela polyrhiza. PLoS ONE 7: e46747.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref24] 24. Zhang T, Fang Y, Wang X, Deng X, Zhang X, et al. (2012) The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes. PLoS ONE 7: e30531.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref25] 25. Yang M, Zhang X, Liu G, Yin Y, Chen K, et al. (2010) The Complete Chloroplast Genome Sequence of Date Palm (Phoenix dactylifera L.). PLoS ONE 5: e12762.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref26] 26. Tanaka Y, Tsuda M, Yasumoto K, Yamagishi H, Terachi T (2012) A complete mitochondrial genome sequence of Ogura-type male-sterile cytoplasm and its comparative analysis with that of normal cytoplasm in radish (Raphanus sativus L.). BMC Genomics 13: 352.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

Figures

Abstract

Background

Results

Conclusion

Introduction

Materials and Methods

Plant material

DNA extraction

Reagents and solutions.

Protocols.

DNA evaluation

DNA sequencing

De novo assembly

Gene annotation and variation analyses

Results and Discussion

DNA extraction using PCE

Purity evaluation using VCQA

Genome sequencing and sequencing depth statistics

De novo assembly using TSA

Gene annotation and comparative analysis

Conclusion

Supporting Information

File S1.

Author Contributions

References