Assembly of a Marine Viral Metagenome after Physical Fractionation

Jennifer R. Brum; Alexander I. Culley; Grieg F. Steward

doi:10.1371/journal.pone.0060604

Abstract

Metagenomic analyses of marine viruses generate an overview of viral genes present in a sample, but the percentage of the resulting sequence fragments that can be reassembled is low and the phenotype of the virus from which a given sequence derives is usually unknown. In this study, we employed physical fractionation to characterize the morphological and genomic traits of a subset of uncultivated viruses from a natural marine assemblage. Viruses from Kāne‘ohe Bay, Hawai‘i were fractionated by equilibrium buoyant density centrifugation in a cesium chloride (CsCl) gradient, and one fraction from the CsCl gradient was then further fractionated by strong anion-exchange chromatography. One of the fractions resulting from this two-dimensional separation appeared to be dominated by only a few virus types based on genome sizes and morphology. Sequences generated from a shotgun clone library of the viruses in this fraction were assembled into significantly more numerous contigs than have been generated with previous metagenomic investigations of whole DNA viral assemblages with comparable sequencing effort. Analysis of the longer contigs (up to 6.5 kb) assembled from our metagenome allowed us to assess gene arrangement in this subset of marine viruses. Our results demonstrate the potential for physical fractionation to facilitate sequence assembly from viral metagenomes and permit linking of morphological and genomic data for uncultivated viruses.

Citation: Brum JR, Culley AI, Steward GF (2013) Assembly of a Marine Viral Metagenome after Physical Fractionation. PLoS ONE 8(4): e60604. https://doi.org/10.1371/journal.pone.0060604

Editor: Francisco Rodriguez-Valera, Universidad Miguel Hernandez, Spain

Received: December 13, 2012; Accepted: February 28, 2013; Published: April 8, 2013

Copyright: © 2013 Brum et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This research was supported by National Science Foundation grants to Grieg Steward (OCE 04-42664, OCE 08-26650) and the National Science Foundation-supported Center for Microbial Oceanography Research and Education (EF 04-24599). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Viruses are the most abundant biological entities in aquatic environments and have significant roles that include causing mortality, mediating genetic exchange, and altering the genetic potential of their hosts [1]. Investigations of the morphology (reviewed by [2]) and genome size distributions [3] of aquatic viruses have shown that they are a diverse component of aquatic ecosystems. However, investigating the genomic content of this diverse array of viruses has proven to be challenging.

Isolation of viruses from cultivated hosts allows for the sequencing of complete viral genomes which can be used to connect genomic with phenotypic information (e.g., [4], [5]) and to determine the gene organization and genetic capabilities of a given virus (e.g., [4], [6]). However, the ability to investigate viruses in this way is limited by the requirement of host cultivation. It has been estimated that >99% of environmental microorganisms are uncultivated [7] and that the groups of microorganisms that are in culture may not be representative of the environments from which they originate [8].

This cultivation bottleneck has led to the investigation of viral assemblages using metagenomics, in which random pieces of nucleic acid from viral samples are sequenced, resulting in a survey of viral genes within a sample (reviewed by [9]). Metagenomic analyses have supported the assessment that aquatic viruses are extraordinarily diverse, but the majority of sequences obtained from these investigations are not similar to known genes, indicating that much of the genomic information in aquatic viruses has yet to be characterized [10].

The high diversity of aquatic viral communities means that very few sequences from metagenomic analyses can be reassembled into larger stretches of sequence [11]–[13]. Without reassembly of the fragmented genomes, the genetic structure of individual viruses cannot be assessed and genes cannot be investigated within the context of whole genomes. The current methods used to construct these metagenomic libraries also eliminate any phenotypic information about viruses in the samples.

So far, with the exception of a small single-stranded DNA virus [14], reassembly of uncultivated prokaryotic and viral genomes from shotgun libraries of aquatic assemblages has only been achieved with samples that contain low diversity of bacteria or viruses [15]–[17]. This had led to the suggestion that, in addition to advances in sequencing technology and computational methods [18]–[20], there should also be a focus on improving upstream methods that are used to prepare samples for metagenomic analyses, specifically methods that reduce the diversity of the samples through physical fractionation [21]. In fact, computational models have shown that separating viruses from a sample into two or more fractions can increase the assembly of sequenced DNA fragments from the constituent viral assemblage [22].

Multi-dimensional physical fractionation of natural aquatic viral assemblages can be achieved by exploiting differences in the sizes, surface charges, and buoyant densities among different populations of viruses [23]. Here, we use two physical fractionation steps in series to enrich a limited number of viral consortia from a complex marine assemblage in order to test whether such a procedure would result in a high proportion of assembled sequences.

Materials and Methods

Ethics Statement

No specific permits were required for the described field studies. Samples were collected from public waters and no specific permissions were required. Samples consisted of microscopic plankton, which are not endangered or protected.

Sample Collection

A viral concentrate was collected on October 17, 2006 from a depth of 3 m approximately 25 m off the southeast shore of Coconut Island (Moku O Lo‘e) located in Kāne‘ohe Bay, Oahu, HI. Approximately 1800 l of water was filtered through 0.2 µm pore-size cartridge filters with polyethersulfone membranes (Polycap, Whatman). Viruses in the filtrate were concentrated with a tangential flow filtration cassette with 100 kDa nominal molecular weight cut-off (NMWCO) regenerated cellulose membrane (Pellicon 2, Millipore). The concentrate was stored at 4°C after addition of protease inhibitor (Sigma-Aldrich) at a final concentration of approximately 100 mg l⁻¹ in an attempt to decrease viral degradation. The sample was then further concentrated with 100 kDa NMWCO Centricon-80 centrifugal ultrafiltration devices (Millipore) and stored at 4°C until fractionation.

Viral Genome Size Distributions

Pulsed-field gel electrophoresis (PFGE) was used to monitor viral genome size distributions in the fractions collected from viral fractionation as an indicator of fractionation progress. Viruses in fractions were concentrated with 100 kDa NMWCO Nanosep centrifugal ultrafiltration devices (Pall) and processed for PFGE as previously described [24]. PFGE was carried out using a CHEF-DR II PFGE system (Bio-Rad) in Tris-Borate EDTA (TBE) buffer for 18 h with switch time ramping linearly from 1 to 12 s. DNA molecular weight markers (MidRange I and Lambda Ladder; New England Biolabs) and mass standards (High DNA Mass Ladder, Invitrogen) were run on all gels. Gels were stained overnight at 4°C with SYTO 60 (Invitrogen), then visualized and analyzed with the Odyssey Infrared Imaging System (Li-Cor Biosciences).

Viral Fractionation

Continuous cesium chloride (CsCl) gradients were used as the first fractionation step to separate viruses from one another based on their differing buoyant densities [23]. The density of the viral concentrate was adjusted to 1.45 g ml⁻¹ by the addition and dissolution of solid molecular grade CsCl (Fisher Scientific) and 10.5 ml of the resulting solution was deposited into a 12-ml polyallomer ultracentrifuge tube (Beckman Coulter). A 1-ml cushion of 1.52 g ml⁻¹ CsCl that had been prepared with ultrapure water (NANOPure DIamond, Barnstead) and filtered through a 0.02 µm pore-size syringe filter (Acrodisc, Pall) was deposited at the bottom of the tube with a Pasteur pipet to avoid pelleting of viruses more dense than the initial solution density before the gradient formed. The gradient was then centrifuged at 25000 rpm for 72 hrs at 4°C with a swinging bucket rotor (SW 41 Ti, Beckman Coulter) in an Optima XL-80K ultracentrifuge (Beckman Coulter). Fractions of ∼500 µl were collected top down from the gradient using a fraction collector (Auto Densi-Flow, Labconco) on low speed. Density of the fractions was determined gravimetrically and viruses were enumerated in each fraction using epifluorescence microscopy [25] with the stain SYBR Gold (Invitrogen). Assuming an average DNA content of 55 ag per virus [26], the volume of fraction required to obtain 100 ng of viral DNA was prepared for viral genome fingerprinting.

A viscous whitish substance was observed in the completed CsCl gradient at densities >1.4 g ml⁻¹. The distribution of genome sizes in fractions was the same in all fractions from this zone and similar to the unfractionated sample. Under the assumption that the viruses in this zone were aggregated or adsorbed to the unknown whitish substance, an attempt was made to desorb the viruses. The relevant fractions were pooled and Tween-80 (Fisher) was added at a final concentration of 1% followed by sonication of the sample for 3 minutes in a sonicator bath (Branson). The treated sample was then fractionated in a second continuous CsCl gradient.

A fraction from the continuous CsCl gradient was selected for further separation of viruses by strong anion-exchange chromatography [23]. A BioLogic HR Workstation (Bio-Rad) equipped with a 1-ml sample injector, gradient mixer, fraction collector, and UV and conductivity meters was used to run a step gradient through an UNO Q1 strong anion-exchange chromatography column (Bio-Rad). The starting buffer (20 mM Tris-HCl, pH 7.8) and elution buffer (20 mM Tris HCl, 1 M sodium chloride, pH 7.8) for chromatography were prepared with ultrapure water (NANOPure), autoclaved, and filtered through 0.22 µm pore-size filters. The remaining portion of the selected CsCl gradient fraction that had not been used for viral genome fingerprinting was exchanged into the chromatography starting buffer with a Centricon-20 centrifugal ultrafiltration device with a 100 kDa NMWCO filter (Millipore) and recovered at a final volume of ∼1.1 ml. The UNO Q1 chromatography column was equilibrated sequentially with 7 ml of starting buffer, 7 ml of elution buffer, and 7 ml of starting buffer at 1 ml min⁻¹. The sample was then loaded onto the column and a step gradient was run with 1% steps of increasing elution buffer between 26 and 42% elution buffer at 0.5 ml min⁻¹, with collection of 8 ml fractions per step. For each fraction, 300 µl was used for viral genome fingerprinting and the remaining volume was stored at 4°C. A fraction from this gradient was then selected for analysis with transmission electron microscopy (TEM), shotgun clone library construction, and sequencing.

Transmission Electron Microscopy

The morphological diversity of viruses in the selected fraction was investigated with TEM. An air-driven ultracentrifuge (Airfuge CLS, Beckman) was used to deposit viruses from 200 µl of the fraction on to copper grids (200 mesh) with carbon-stabilized formvar that had been rendered hydrophilic by UV irradiation (240 mJ). The grids were secured to the distal interior surface of the Airfuge rotor chambers (EM-90, Beckman) and the sample was centrifuged for 20 minutes at 118 000× g. Viruses on the grid were then stained with 10 µl of 0.02 µm-filtered 2% uranyl acetate for 45 s. The stain was then wicked away with absorbent filter paper (Whatman) and the grids were rinsed with 10 µl of ultrapure water (NANOPure DIamond, Barnstead) which was also wicked away with absorbent filter paper. The stained grids were then air dried and stored desiccated at room temperature (18–24°C) until analysis. Grids were examined at 100 000–125 000× magnification using a transmission electron microscope (LEO 912) with 100 kV accelerating voltage. Micrographs were taken of the first 50 observed viruses with a Proscan Slow-Scan Frame-Transfer cooled CCD camera with 1K ×1K resolution run with analySIS software (Soft Imaging Systems). Image-Pro Plus software (Media Cybernetics) was used to measure the capsid diameters and tail lengths of the first 50 observed viruses.

Library Construction and Sequencing

Viruses in the remaining portion of the fraction were concentrated with a 100 kDa NMWCO Nanosep centrifugal ultrafiltration device (Pall) and the DNA was extracted with a MasterPure Complete DNA and RNA Purification Kit (Epicentre). The extracted DNA was then split into four samples and separate clone libraries were constructed from three of the extracted samples. The DNA in those samples was amplified with three separate multiple displacement amplification (MDA) reactions (REPLI-g, Qiagen) in an effort to reduce amplification bias as a result of MDA [27]. After extracting the amplified DNA, one of the samples was then physically sheared to 3–5 kb using a HydroShear (Genomic Solutions) while the other two samples were sheared to 1–2 kb. The sheared samples were then purified with a MinElute PCR Purification Kit (Qiagen), the ends were made blunt with a DNA Terminator End Repair Kit (Lucigen), and gel electrophoresis was used to isolate the appropriate sizes of DNA from each sample. DNA was extracted from the first sample in the gel with a MinElute Gel Extraction Kit (Qiagen), but this resulted in low recovery of the DNA (∼5%), so the other two samples were extracted from the gel with a Centrilutor micro-eluter (Millipore), resulting in 35 to 52% recovery. A clone library was then constructed from each of the samples using the CloneSmart Blunt Cloning Kit (Lucigen). Plasmid sequencing of the clones from the three libraries was conducted with dye-terminator Sanger sequencing at the University of Hawai‘i Advanced Studies in Genomics, Proteomics, and Bioinformatics sequencing facility. Paired-end reads were obtained from 391 of the 1651 sequenced inserts for a total of 1942 sequences.

Analysis of Sequences

Sequences from the 3 libraries were pooled and analyzed as one library. Sequence trimming and assembly were performed with Sequencher 4.10.1 (Gene Codes Corp.). Vector sequence was removed using the automatic recognition function in the software. Assembly of all sequences to the vector sequence as a template revealed additional vector-only sequences, which were removed. Forward and reverse reads of the same clone were assembled using the “Assemble by Name” function. Some of these assemblies produced odd results, with forward and reverse reads in same direction. In some cases, the second strand assembled to the first immediately after a string of Ns in the middle of the first strand. These odd assemblies (11 contigs of 22 sequences) were removed. The remaining sequences were trimmed such that the first and last 99 base pairs (bp) contained <1 ambiguity and the first and last 20 bp contained <2 bp with a confidence value <40%. These conditions were applied repeatedly until all sequences met the criteria. The sequences were then trimmed further using the criteria that the first and last 20 bp had <1 bp with a confidence <20%. In some cases, sequences with poor quality regions (strings of Ns) in the middle of the sequence were not identified by these criteria and these were trimmed by hand to remove all sequence at and following the ambiguous bases. After trimming, sequences of <100 bp were removed leaving 1796 unassembled sequences. These sequences were deposited in GenBank (Accession Numbers JS807804–JS809599).

Sequences in the library were compared to the GenBank non-redundant protein database using BLASTx [28], [29], omitting sequences from uncultivated organisms. The sequences were classified based on the identity of the sequence with which it shared the greatest similarity, except when the most similar sequence was non-viral, but the sequence also displayed significant similarity (E-value ≤0.001) to a virus. In the latter case, the sequences were classified according to the most similar virus-derived sequence. Sequences classified as viral were further classified based on their family and protein type.

Phylogenetic Analysis

In an effort to assess phylogenetic diversity of viruses in our library, sequences that had any significant similarity (not just the highest similarity) to a viral DNA polymerase were used to construct a phylogram. These sequences were translated and aligned with other translated DNA polymerase gene sequences from viral genomes present in GenBank using custom scripts. A maximum-likelihood tree was then constructed based on this amino acid alignment as previously described [30] with RAxML [31] using the WAG substitution matrix with a subset estimation of invariable sites and gamma distribution in four discrete categories (WAG+Γ₄+ I).

Sequence Assembly and Contig Analysis

Sequencher was used to assemble forward and reverse reads using the “Assemble by Name” function. Those that assembled were merged into consensus sequences. The resulting 1723 sequences were then assembled using the criteria of a minimum overlap of 20 bp and a minimum of 98% identity according to Breitbart et al. [13]. Open reading frames (ORFs) were predicted in only the larger assembled contigs (>4 kb) using GeneMark.hmm 2.0 [32] and annotated by comparing the ORF sequences to the GenBank non-redundant protein database using BLASTx [28], [29] with the same criteria used as when analyzing the trimmed sequence library.

Results

Viral Fractionation

In the initial continuous cesium chloride (CsCl) gradient of the viral concentrate, a large portion of the viruses banded with little resolution over a broad range and at high densities (1.47–1.56 g ml⁻¹; data not shown), an atypical result for this method [23]. The presence of a viscous whitish matter in this region of the gradient suggested that the viruses could be adsorbed to an unknown substance. After treatment of all pooled fractions with Tween-80 and sonication, followed by separation in a second gradient, much of the material banded in the same position and remained unresolved (Figure 1A). There was also an aggregation of viruses that banded at the top of the gradient (1.300–1.322 g ml⁻¹). The remaining viruses were found in nine fractions between 1.389 and 1.456 g ml⁻¹ and most of these fractions showed distinct patterns of genome sizes.

Download:

Figure 1. Viral genome fingerprints of the fractions used in each fractionation step.

(A) Pulsed-field gel of the virus assemblages in each fraction collected from a continuous cesium chloride gradient of a viral concentrate from Kāne‘ohe Bay. The box around the fraction with a density of 1.44 g ml⁻¹ indicates that fraction was separated further using strong anion-exchange chromatography. (B) Pulsed-field gel of the virus assemblages in each fraction collected from the further separation of the indicated cesium chloride gradient fraction using strong anion-exchange chromatography. The box around the fraction that eluted with 38% elution buffer indicates the fraction selected for microscopy and sequencing. Arrows point to the three genome bands in the fraction. Marker lanes contain a Lambda Ladder (L) and a MidRange PFG Ladder (M). The unfractionated sample was also run for comparison (U).

https://doi.org/10.1371/journal.pone.0060604.g001

Viruses in the fraction having a density of 1.444 g ml⁻¹ were subjected to a second round of fractionation by anion-exchange chromatography. Most of the viruses eluted in 11 of the 21 fractions between 31% and 46% elution buffer with the gradient ending at 48% (Figure 1B). A final rinse out with 100% elution buffer resulted in the release of additional viruses, most likely those adsorbed to the unknown substance. The fraction that eluted with 38% elution buffer was selected for sequencing and included three visible viral genome bands. The dominant band was 62 kb and included 65% of the DNA in the fraction. The two minor bands were 31 kb and 139 kb and included 18% and 17% of the DNA in the fraction, respectively.

Transmission Electron Microscopy

Analysis of the viruses in the selected fraction with transmission electron microscopy (TEM) revealed that the fraction had four readily distinguishable morphotypes. The dominant morphotype, which comprised 44% of the population, had podovirus morphology with capsid diameters between 60 and 67 nm, and short (14–18 nm) tails or no visible tail (Figure 2A–C). The second group of viruses, which comprised 30% of the population, had myovirus morphology with capsid diameters between 76 and 103 nm, and long (109–118 nm) contractile tails (Figure 2D–F). The third group of viruses, which comprised 19% of the population, had podovirus morphology with capsid diameters between 44 and 50 nm, and short (15–17 nm) tails or no visible tail (Figure 2G–I). The fourth group of viruses, which comprised 7% of the population, had siphovirus morphology with capsid diameters between 52 and 60 nm, and long (100–102 nm) non-contractile tails (Figure 2J–L).

Download:

Figure 2. Transmission electron micrographs of viruses in the fraction selected for sequencing.

Representative viruses from the four morphological groups in the fraction are shown in A–C, D–F, G–I, and J–L. These groups comprised 44, 30, 19, and 7% of the population, respectively.

https://doi.org/10.1371/journal.pone.0060604.g002

Sequence Composition

After trimming, the average read length in the library was 609 (±130) bases and the average G+C content was 36 (±5)%. A search in the GenBank database using BLASTx revealed that the majority (55%) of sequences in the library had no significant similarity to other deposited sequences, 28% were similar to sequences from viruses, 13% to sequences from bacteria, and 4% to sequences from eukaryotes and archaea (Figure 3A). Of the virus-like sequences, 51% were similar to sequences derived from myoviruses, 25% to sequences from siphoviruses, and 13% to sequences from podoviruses (Figure 3B). The viruses from which nearly all of these most similar sequences derived were bacteriophages including three Synechococcus phages, three Pseudomonas phages, and two Prochlorococcus phages (Table 1). Matches to virus-derived genes included oxygenases, helicases, structural proteins, and DNA polymerases, but nearly half (47%) were to genes with unknown function (Table 2).

Download:

Figure 3. Taxonomic classification of the sequence library.

Classification of all sequences (A) and families represented in the virus sequences (B) based on significant hits (E-value ≤0.001) to the GenBank database using BLASTx. Numbers of sequences are in parentheses.

https://doi.org/10.1371/journal.pone.0060604.g003

Download:

Table 1. Viruses in the GenBank database with the highest number of significant similarities from the sequence library.

https://doi.org/10.1371/journal.pone.0060604.t001

Download:

Table 2. Categories of viral proteins in the sequence library.

https://doi.org/10.1371/journal.pone.0060604.t002

Phylogenetic Analysis

Fifty sequences in the library had significant similarity to viral DNA polymerases, with 34 of the sequences having the greatest similarity to the DNA polymerase of bacteriophage phi-JL001 [33]. An alignment of 9 of these sequences across 96 amino acid residues of the conserved DnaQ-like region of the polymerase, as determined using the Conserved Domain Database [34], was used to construct a phylogenetic tree (Figure 4). Although there was deep-branching support for clustering of the library sequences with the siphoviruses phi-JL001, YuA, and M6 (bootstrap value 100), the sequences from our Kāne‘ohe Bay library formed their own well-supported clade (bootstrap value 100) with five groups.

Download:

Figure 4. Phylogenetic evaluation of DNA polymerase sequences in the sequence library.

The unrooted phylogenetic tree was based on a 96 amino acid residue region of viral DNA polymerase sequences obtained from GenBank and putitive DNA polymerase sequences from this study. The letter designations P, S, and U correspond to Podoviridae, Siphoviridae, and unclassified viruses, respectively. All sequences from the Kāne‘ohe Bay library are designated with KB. Bootstrap values based on 100 resamplings are shown at the nodes if they were >50.

https://doi.org/10.1371/journal.pone.0060604.g004

Sequence Assembly and Contig Annotation

Assembly of the sequences resulted in 221 contigs comprised of 2 to 38 sequences each (Figure 5A) and ranging in size from 370 to 6536 bp in length (Figure 5B), with 65% of the sequences in the library comprising these contigs. Identification of ORFs in the largest contigs (>4 kb) revealed 47 complete ORFs with an average length of 640 bp (Figure 6). The majority of these contigs had larger ORFs, but the seventh contig was comprised entirely of short ORFs (111–513 bp) with no significant hits and the ninth contig contained a much larger ORF (3672 bp) with similarity to a viral tape measure protein. Annotation of the ORFs showed that they were primarily composed of viral sequences including repeated, highly significant hits (E-value <10⁻¹⁹) to ferrochelatase and 2OG-Fe(II) oxygenase genes from the Synechococcus phage S-SM1 [35].

Download:

Figure 5. Contig spectrum and length distribution of contigs assembled from the sequence library.

(A) Histogram of the number of sequences in each contig assembled with Sequencher using conditions of 98% minimum match and >20 bp overlap. (B) Histogram of the lengths of those contigs.

https://doi.org/10.1371/journal.pone.0060604.g005

Download:

Figure 6. Annotation of ORFs in large contigs (>4 kb) assembled from the sequence library.

Length and coverage of each contig are listed at right.

https://doi.org/10.1371/journal.pone.0060604.g006

Discussion

PFGE and morphological analyses supported the hypothesis that physical fractionation of a viral assemblage from Kāne‘ohe Bay could be used to enrich a limited number of viruses in a fraction. PFGE analysis indicated the presence of three distinct genome sizes, while TEM showed four distinct morphological groups. Both PFGE and TEM can underestimate actual diversity, since genetically distinct viruses can have indistinguishable genome sizes [24] or morphologies [36]. Given these caveats, we found that there was a minimum of four distinct groups of viruses in the sequenced fraction.

The sequence library did not contain matches to more than a few genes of any one virus, suggesting that the viral genomes represented in the library have not previously been sequenced. Most virus hits were to bacteriophages, consistent with the observed morphologies of the viruses in the sample, which mostly resembled tailed bacteriophages in the order Caudovirales.

The distant relationships of our library sequences to known viral DNA polymerase sequences suggest that the viruses in the sequenced fraction are not closely related to any previously sequenced virus, and thus information about their potential hosts cannot be inferred from the phylogenetic tree. However, the library sequences formed a well-supported clade, suggesting that the viruses in the fraction used to construct the library were relatively closely related with respect to the phylogeny of their putative DNA polymerase sequences. The phylogenetic results also show that there were viruses belonging to at least five operational taxonomic units in the sequenced fraction.

While we did not directly compare the fractionated viral assemblage to the whole, unfractionated viral community, assembly of the sequence library from the fractionated sample showed that there were many more contigs generated than from comparable metagenomic analyses of whole viral assemblages [11]–[13], [37], [38]. In the latter studies, only 0.3–3.5% of library sequences could be assembled into contigs with a maximum of 4 sequences per contig, whereas 65% of the sequences in our library were assembled into contigs with a maximum of 38 sequences in a contig. This supports the hypothesis that, by physically fractionating viral assemblages, there will be significantly greater reassembly of sequences from libraries constructed with the resulting fractions [21], [22].

The longer contigs assembled from this fractionated viral assemblage allowed for an assessment of genes within the context of genomic fragments from uncultivated viruses. ORFs with high similarity to 2OG-Fe(II) oxygenase were found in five out of the nine analyzed contigs. This gene has so far been found exclusively in T4-like cyanophages [35], suggesting that these five contigs came from the genome of the myovirus identified in the fraction. The fact that these genes occurred in multiple contigs, but in different locations relative to other genes, indicates that there could be several types of morphologically similar myoviruses with different genome arrangements in our sequenced fraction. Alternatively, these similar contigs could be chimeric assemblies resulting from low sequence coverage (2.0–4.3x), chimeras generated from MDA [39], or both.

Although we used a large volume concentrate for this study, this is not required to take advantage of the fractionation approach. Our motivation for using a large volume was to ensure that we had sufficient material to document separation at each stage using PFGE. We also anticipated that with sufficient starting volume, we might be able to avoid amplification of the material before cloning. Direct cloning would have been possible for some of the fractions, but the one we chose for analysis did not have sufficient material. The MDA amplification step we employed has been used in other marine viral metagenomes (e.g., [14]), but can result in biases [27], [40] and the formation of chimeras [39]. Such problems may explain some of the odd forward and reverse assemblies noted in the materials and methods and the repetition of genes within a contig. The increased assembly we achieved through fractionation and the long reads from Sanger sequencing make these problems more apparent. The use of improved amplification methods [41] or elimination of the amplification step [38], coupled with increases in sequencing power [20], should further improve our ability to accurately reassemble the genomes of uncultivated viruses isolated by physical fractionation. This is a worthwhile goal, because with accurate genome reassembly, one can move beyond metagenomic gene inventories and conduct comparative genomics of uncultivated viruses.

There are other methods for more efficiently assembling viral genomes from complex assemblages, such as the use of large-insert clone libraries [42], [43] or single-virus amplifications [44]. These methods are also fractionations, but rely on fractionation to the level of single genomes or virions. Bulk fractionation offers significant, complementary advantages. By fractionating populations of intact viruses en masse, it is possible to enrich for even rare populations of interest by screening with specific primers at each stage of the separation. Further, by narrowing the target populations while maintaining sufficient numbers of intact virions, it also becomes possible to more clearly link viral genomes with proteomes and with the physical properties of the virions (buoyant density, surface charge, morphology). Thus, we propose that an effective way to advance our understanding of uncultivated viral populations will be to combine the advantages of bulk fractionation with other methods that allow the assembly of discrete genomes. Initial bulk physical fractionation of a community will allow targeted separation and phenotypic characterization of populations, and subsequent single-virus genomics (whether by amplification, large-insert cloning, or direct sequencing) performed on a portion of the fractionated populations will allow accurate genome assemblies of the phenotypically characterized populations.

Acknowledgments

We thank J. Cesar Ignacio-Espinoza for construction of the phylogenetic tree and Tina Carvalho of the University of Hawaii Biological Electron Microscope Facility for her assistance with TEM.

Author Contributions

Conceived and designed the experiments: JRB AIC GFS. Performed the experiments: JRB. Analyzed the data: JRB GFS. Contributed reagents/materials/analysis tools: AIC GFS. Wrote the paper: JRB GFS.

References

1. Breitbart M (2012) Marine Viruses: Truth or Dare. Ann Rev Mar Sci 4: 425–448.
- View Article
- Google Scholar
2. Wommack KE, Colwell RR (2000) Virioplankton: viruses in aquatic ecosystems. Microbiol Mol Biol Rev 64: 69–114.
- View Article
- Google Scholar
3. Steward GF, Montiel JL, Azam F (2000) Genome size distributions indicate variability and similarities among marine viral assemblages from diverse environments. Limnol Oceanogr 45: 1697–1706.
- View Article
- Google Scholar
4. Sullivan MB, Coleman ML, Weigele P, Rohwer F, Chisholm SW (2005) Three Prochlorococcus cyanophage genomes: signature features and ecological interpretations. PLoS Biol 3: 790–806.
- View Article
- Google Scholar
5. Castberg T, Thyrhaug R, Larsen A, Sandaa R-A, Heldal M, et al. (2002) Isolation and characterization of a virus that infects Emiliania huxleyi (Haptophyta). J Phycol 38: 767–774.
- View Article
- Google Scholar
6. Mann NH, Clokie MRJ, Millard A, Cook A, Wilson WH, et al. (2005) The genome of S-PM2, a “photosynthetic” T4-type bacteriophage that infects marine Synechococcus strains. J Bacteriol 187: 3188–3200.
- View Article
- Google Scholar
7. Hugenholtz P (2002) Exploring prokaryotic diversity in the genomic era. Genome Biol 3: reviews0003.1–0003.8.
- View Article
- Google Scholar
8. Rappe MS, Giovannoni SJ (2003) The uncultured microbial majority. Annu Review Microbiol 57: 369–394.
- View Article
- Google Scholar
9. Edwards RA, Rohwer F (2005) Viral metagenomics. Nat Rev Microbiol 3: 504–510.
- View Article
- Google Scholar
10. Kristensen DM, Mushegian AR, Dolja VV, Koonin EV (2010) New dimensions of the virus world discovered through metagenomics. Trends Microbiol 18: 11–19.
- View Article
- Google Scholar
11. Bench SR, Hanson TE, Williamson KE, Ghosh D, Radosovich M, et al. (2007) Metagenomic characterization of Chesapeake Bay virioplankton. Appl Environ Microbiol 73: 7629–7641.
- View Article
- Google Scholar
12. Breitbart M, Felts B, Kelley S, Mahaffy JM, Nulton J, et al. (2004) Diversity and population structure of a near-shore marine-sediment viral community. Proc Roy Soc B 271: 565–574.
- View Article
- Google Scholar
13. Breitbart M, Salamon P, Andresen B, Mahaffy JM, Segall AM, et al. (2002) Genomic analysis of uncultured marine viral communities. Proc Natl Acad Sci USA 99: 14250–14255.
- View Article
- Google Scholar
14. Angly FE, Felts B, Breitbart M, Salamon P, Edwards RA, et al. (2006) The marine viromes of four oceanic regions. PLoS Biol 4: 2121–2131.
- View Article
- Google Scholar
15. Culley AI, Lang AS, Suttle CA (2006) Metagenomic analysis of coastal RNA virus communities. Science 312: 1795–1798.
- View Article
- Google Scholar
16. Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, et al. (2004) Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428: 37–43.
- View Article
- Google Scholar
17. Legault BA, Lopez-Lopez A, Alba-Casado JC, Doolittle WF, Bolhuis H, et al. (2006) Environmental genomics of “Haloquadratum walsbyi” in a saltern crystallizer indicates a large pool of accessory genes in an otherwise coherent species. BMC Genomics 7: 171.
- View Article
- Google Scholar
18. Eriksson N, Pachter L, Mitsuya Y, Rhee S-Y, Wang C, et al. (2008) Viral population estimation using pyrosequencing. PLoS Biol 4: 1–13.
- View Article
- Google Scholar
19. Chen K, Pachter L (2005) Bioinformatics for whole-genome shotgun sequencing of microbial communities. PLoS Comput Biol 1: 0106–0112.
- View Article
- Google Scholar
20. Metzker ML (2010) Sequencing technologies - the next generation. Nat Rev Genet 11: 31–46.
- View Article
- Google Scholar
21. Steward GF, Rappé MS (2007) What's the 'meta' with metagenomics? ISME J 1: 100–102.
- View Article
- Google Scholar
22. Bergeron A, Belcaid M, Steward GF, Poisson G (2007) Divide and conquer: enriching environmental sequencing data. PLoS ONE 2: e830.
- View Article
- Google Scholar
23. Brum JR, Steward GF (2011) Physical fractionation of aquatic viral assemblages. Limnol Oceanogr Methods 9: 150–163.
- View Article
- Google Scholar
24. Steward GF (2001) Fingerprinting viral assemblages by pulsed field gel electrophoresis (PFGE). In: Paul JH, editor. Methods in Microbiology, Volume 30. New York: Academic Press. 85–103.
25. Noble RT, Fuhrman JA (1998) Use of SYBR Green I for rapid epifluorescence counts of marine viruses and bacteria. Aquat Microb Ecol 14: 113–118.
- View Article
- Google Scholar
26. Brum JR (2005) Concentration, production, and turnover of viruses and dissolved DNA pools at Station ALOHA, North Pacific Subtropical Gyre. Aquat Microb Ecol 41: 103–113.
- View Article
- Google Scholar
27. Yilmaz S, Allgaier M, Hugenholtz P (2010) Multiple displacement amplification compromises quantitative analysis of metagenomes. Nat Methods 7: 943–944.
- View Article
- Google Scholar
28. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410.
- View Article
- Google Scholar
29. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
- View Article
- Google Scholar
30. Ignacio-Espinoza JC, Sullivan MB (2012) Phylogenomics of T4 cyanophages: lateral gene transfer in the 'core' and origins of host genes. Environ Microbiol 14: 2113–2126.
- View Article
- Google Scholar
31. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
- View Article
- Google Scholar
32. Besemer J, Borodovsky M (1999) Heuristic approach to deriving models for gene finding. Nucleic Acids Res 27: 3911–3920.
- View Article
- Google Scholar
33. Lohr JE, Chen F, Hill RT (2005) Genomic analysis of bacteriophage phi JL001: insights into its interaction with a sponge-associated alpha-proteobacterium. Appl Environ Microbiol 71: 1598–1609.
- View Article
- Google Scholar
34. Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, et al. (2011) CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res 39: D225–D229.
- View Article
- Google Scholar
35. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, et al. (2010) Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol 12: 3035–3056.
- View Article
- Google Scholar
36. Lawrence JG, Hatfull GF, Hendrix RW (2002) Imbroglios of viral taxonomy: genetic exchange and failings of phenetic approaches. J Bacteriol 184: 4891–4905.
- View Article
- Google Scholar
37. Fierer N, Breitbart M, Nulton J, Salamon P, Lozupone C, et al. (2007) Metagenomic and small-subunit rRNA analyses reveal the genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol 73: 7059–7066.
- View Article
- Google Scholar
38. Steward GF, Preston CM (2011) Analysis of a viral metagenomic library from 200 m depth in Monterey Bay, California constructed by direct shotgun cloning. Virol J 8: 287.
- View Article
- Google Scholar
39. Lasken R, Stockwell T (2007) Mechanism of chimera formation during the Multiple Displacement Amplification reaction. BMC Biotechnol 7: 19.
- View Article
- Google Scholar
40. Kim K-H, Bae J-W (2011) Amplification methods bias metagenomic libraries of uncultured single-stranded and double-stranded DNA viruses. Appl Environ Microbiol 77: 7663–7668.
- View Article
- Google Scholar
41. Duhaime MBD, Deng L, Poulos BT, Sullivan MB (2012) Towards quantitative metagenomics of wild viruses and other ultra-low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method. Environ Microbiol 14: 2526–2537.
- View Article
- Google Scholar
42. DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science 311: 496–503.
- View Article
- Google Scholar
43. Mizuno CM, Rodriguez-Valera F, Garcia-Heredia I, Martin-Cuadrado A-B, Ghai R (2013) Reconstruction of novel cyanobacterial siphovirus genomes from Mediterranean metagenomic fosmids. Appl Environ Microbiol 79: 688–695.
- View Article
- Google Scholar
44. Allen LZ, Ishoey T, Novotny MA, McLean JS, Lasken RS, et al. (2011) Single virus genomics: A new tool for virus discovery. PLoS ONE 6: e17722.
- View Article
- Google Scholar

[ref1] 1. Breitbart M (2012) Marine Viruses: Truth or Dare. Ann Rev Mar Sci 4: 425–448.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Wommack KE, Colwell RR (2000) Virioplankton: viruses in aquatic ecosystems. Microbiol Mol Biol Rev 64: 69–114.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Steward GF, Montiel JL, Azam F (2000) Genome size distributions indicate variability and similarities among marine viral assemblages from diverse environments. Limnol Oceanogr 45: 1697–1706.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Sullivan MB, Coleman ML, Weigele P, Rohwer F, Chisholm SW (2005) Three Prochlorococcus cyanophage genomes: signature features and ecological interpretations. PLoS Biol 3: 790–806.
View Article
Google Scholar

[11] View Article

[12] Google Scholar

[ref5] 5. Castberg T, Thyrhaug R, Larsen A, Sandaa R-A, Heldal M, et al. (2002) Isolation and characterization of a virus that infects Emiliania huxleyi (Haptophyta). J Phycol 38: 767–774.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Mann NH, Clokie MRJ, Millard A, Cook A, Wilson WH, et al. (2005) The genome of S-PM2, a “photosynthetic” T4-type bacteriophage that infects marine Synechococcus strains. J Bacteriol 187: 3188–3200.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Hugenholtz P (2002) Exploring prokaryotic diversity in the genomic era. Genome Biol 3: reviews0003.1–0003.8.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Rappe MS, Giovannoni SJ (2003) The uncultured microbial majority. Annu Review Microbiol 57: 369–394.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Edwards RA, Rohwer F (2005) Viral metagenomics. Nat Rev Microbiol 3: 504–510.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref10] 10. Kristensen DM, Mushegian AR, Dolja VV, Koonin EV (2010) New dimensions of the virus world discovered through metagenomics. Trends Microbiol 18: 11–19.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref11] 11. Bench SR, Hanson TE, Williamson KE, Ghosh D, Radosovich M, et al. (2007) Metagenomic characterization of Chesapeake Bay virioplankton. Appl Environ Microbiol 73: 7629–7641.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref12] 12. Breitbart M, Felts B, Kelley S, Mahaffy JM, Nulton J, et al. (2004) Diversity and population structure of a near-shore marine-sediment viral community. Proc Roy Soc B 271: 565–574.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. Breitbart M, Salamon P, Andresen B, Mahaffy JM, Segall AM, et al. (2002) Genomic analysis of uncultured marine viral communities. Proc Natl Acad Sci USA 99: 14250–14255.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Angly FE, Felts B, Breitbart M, Salamon P, Edwards RA, et al. (2006) The marine viromes of four oceanic regions. PLoS Biol 4: 2121–2131.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref15] 15. Culley AI, Lang AS, Suttle CA (2006) Metagenomic analysis of coastal RNA virus communities. Science 312: 1795–1798.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, et al. (2004) Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428: 37–43.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Legault BA, Lopez-Lopez A, Alba-Casado JC, Doolittle WF, Bolhuis H, et al. (2006) Environmental genomics of “Haloquadratum walsbyi” in a saltern crystallizer indicates a large pool of accessory genes in an otherwise coherent species. BMC Genomics 7: 171.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref18] 18. Eriksson N, Pachter L, Mitsuya Y, Rhee S-Y, Wang C, et al. (2008) Viral population estimation using pyrosequencing. PLoS Biol 4: 1–13.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref19] 19. Chen K, Pachter L (2005) Bioinformatics for whole-genome shotgun sequencing of microbial communities. PLoS Comput Biol 1: 0106–0112.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Metzker ML (2010) Sequencing technologies - the next generation. Nat Rev Genet 11: 31–46.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Steward GF, Rappé MS (2007) What's the 'meta' with metagenomics? ISME J 1: 100–102.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Bergeron A, Belcaid M, Steward GF, Poisson G (2007) Divide and conquer: enriching environmental sequencing data. PLoS ONE 2: e830.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Brum JR, Steward GF (2011) Physical fractionation of aquatic viral assemblages. Limnol Oceanogr Methods 9: 150–163.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref24] 24. Steward GF (2001) Fingerprinting viral assemblages by pulsed field gel electrophoresis (PFGE). In: Paul JH, editor. Methods in Microbiology, Volume 30. New York: Academic Press. 85–103.

[ref25] 25. Noble RT, Fuhrman JA (1998) Use of SYBR Green I for rapid epifluorescence counts of marine viruses and bacteria. Aquat Microb Ecol 14: 113–118.
View Article
Google Scholar

[72] View Article

[73] Google Scholar

[ref26] 26. Brum JR (2005) Concentration, production, and turnover of viruses and dissolved DNA pools at Station ALOHA, North Pacific Subtropical Gyre. Aquat Microb Ecol 41: 103–113.
View Article
Google Scholar

[75] View Article

[76] Google Scholar

[ref27] 27. Yilmaz S, Allgaier M, Hugenholtz P (2010) Multiple displacement amplification compromises quantitative analysis of metagenomes. Nat Methods 7: 943–944.
View Article
Google Scholar

[78] View Article

[79] Google Scholar

[ref28] 28. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref29] 29. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref30] 30. Ignacio-Espinoza JC, Sullivan MB (2012) Phylogenomics of T4 cyanophages: lateral gene transfer in the 'core' and origins of host genes. Environ Microbiol 14: 2113–2126.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref31] 31. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref32] 32. Besemer J, Borodovsky M (1999) Heuristic approach to deriving models for gene finding. Nucleic Acids Res 27: 3911–3920.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref33] 33. Lohr JE, Chen F, Hill RT (2005) Genomic analysis of bacteriophage phi JL001: insights into its interaction with a sponge-associated alpha-proteobacterium. Appl Environ Microbiol 71: 1598–1609.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref34] 34. Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, et al. (2011) CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res 39: D225–D229.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

[ref35] 35. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, et al. (2010) Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol 12: 3035–3056.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref36] 36. Lawrence JG, Hatfull GF, Hendrix RW (2002) Imbroglios of viral taxonomy: genetic exchange and failings of phenetic approaches. J Bacteriol 184: 4891–4905.
View Article
Google Scholar

[105] View Article

[106] Google Scholar

[ref37] 37. Fierer N, Breitbart M, Nulton J, Salamon P, Lozupone C, et al. (2007) Metagenomic and small-subunit rRNA analyses reveal the genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol 73: 7059–7066.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref38] 38. Steward GF, Preston CM (2011) Analysis of a viral metagenomic library from 200 m depth in Monterey Bay, California constructed by direct shotgun cloning. Virol J 8: 287.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref39] 39. Lasken R, Stockwell T (2007) Mechanism of chimera formation during the Multiple Displacement Amplification reaction. BMC Biotechnol 7: 19.
View Article
Google Scholar

[114] View Article

[115] Google Scholar

[ref40] 40. Kim K-H, Bae J-W (2011) Amplification methods bias metagenomic libraries of uncultured single-stranded and double-stranded DNA viruses. Appl Environ Microbiol 77: 7663–7668.
View Article
Google Scholar

[117] View Article

[118] Google Scholar

[ref41] 41. Duhaime MBD, Deng L, Poulos BT, Sullivan MB (2012) Towards quantitative metagenomics of wild viruses and other ultra-low concentration DNA samples: a rigorous assessment and optimization of the linker amplification method. Environ Microbiol 14: 2526–2537.
View Article
Google Scholar

[120] View Article

[121] Google Scholar

[ref42] 42. DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, et al. (2006) Community genomics among stratified microbial assemblages in the ocean's interior. Science 311: 496–503.
View Article
Google Scholar

[123] View Article

[124] Google Scholar

[ref43] 43. Mizuno CM, Rodriguez-Valera F, Garcia-Heredia I, Martin-Cuadrado A-B, Ghai R (2013) Reconstruction of novel cyanobacterial siphovirus genomes from Mediterranean metagenomic fosmids. Appl Environ Microbiol 79: 688–695.
View Article
Google Scholar

[126] View Article

[127] Google Scholar

[ref44] 44. Allen LZ, Ishoey T, Novotny MA, McLean JS, Lasken RS, et al. (2011) Single virus genomics: A new tool for virus discovery. PLoS ONE 6: e17722.
View Article
Google Scholar

[129] View Article

[130] Google Scholar

Figures

Abstract

Introduction

Materials and Methods

Ethics Statement

Sample Collection

Viral Genome Size Distributions

Viral Fractionation

Transmission Electron Microscopy

Library Construction and Sequencing

Analysis of Sequences

Phylogenetic Analysis

Sequence Assembly and Contig Analysis

Results

Viral Fractionation

Transmission Electron Microscopy

Sequence Composition

Phylogenetic Analysis

Sequence Assembly and Contig Annotation

Discussion

Acknowledgments

Author Contributions

References