The Illumina BovineLD BeadChip was designed to support imputation to higher density genotypes in dairy and beef breeds by including single-nucleotide polymorphisms (SNPs) that had a high minor allele frequency as well as uniform spacing across the genome except at the ends of the chromosome where densities were increased. The chip also includes SNPs on the Y chromosome and mitochondrial DNA loci that are useful for determining subspecies classification and certain paternal and maternal breed lineages. The total number of SNPs was 6,909. Accuracy of imputation to Illumina BovineSNP50 genotypes using the BovineLD chip was over 97% for most dairy and beef populations. The BovineLD imputations were about 3 percentage points more accurate than those from the Illumina GoldenGate Bovine3K BeadChip across multiple populations. The improvement was greatest when neither parent was genotyped. The minor allele frequencies were similar across taurine beef and dairy breeds as was the proportion of SNPs that were polymorphic. The new BovineLD chip should facilitate low-cost genomic selection in taurine beef and dairy cattle.
Citation: Boichard D, Chung H, Dassonneville R, David X, Eggen A, et al. (2012) Design of a Bovine Low-Density SNP Array Optimized for Imputation. PLoS ONE 7(3): e34130. doi:10.1371/journal.pone.0034130
Editor: Zhanjiang Liu, Auburn University, United States of America
Received: December 11, 2011; Accepted: February 22, 2012; Published: March 28, 2012
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: The National Research Agency (ANR) and Apisgene funded the French BovineSNP50 data. The Dairy Futures Cooperative Research Centre, Beef Genetic Technologies Cooperative Research Centre, Department of Primary Industries Victoria, and Dairy Australia funded the Australian dairy and beef genotyping. This project was also supported by Agriculture and Food Research Initiative Competitive Grant no. 2009-65205-05635 from the U.S. Department of Agriculture (USDA) National Institute of Food and Agriculture and by Projects 1265-31000-096-00D and 1265-31000-098-00D from the USDA Agricultural Research Service. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: André Eggen, Kimberly J. Gietzen, Cynthia T. Lawley, and Karine Viaud are employees of Illumina Inc. This study was partly funded by Apisgene. There are no patents, products in development, or marketed products to declare. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials.
Genetic improvement of several key agricultural species is accelerating with the adoption of genomic selection , , . With this method, animals or plants can be selected for breeding on the basis of their genetic merit predicted by markers spanning the entire genome. Particularly in dairy cattle, this method has been shown to be more efficient than conventional progeny testing of bulls (up to double the rate of genetic gain) as well as substantially less expensive . Moreover, genomic selection opens new opportunities for sustainable management of populations by more efficiently selecting for traits that have low heritability, e.g. fitness traits, or traits that are difficult to measure. This method is also useful for managing the accumulation of inbreeding within breeds with a small effective population size. In dairy cattle, genomic selection has been deployed at a rapid pace, and most countries with major dairy breeding programs now rely heavily on this new technology .
A major challenge in implementing genomic selection in most species is the cost of genotyping. The expected value of the information gained by genotyping must exceed the cost of obtaining the genotypes. During the early stages of genomic selection in the dairy industry, the cost of high-density genotyping could be justified. The primary application was to evaluate bulls that were potential candidates for production of commercial semen. Using SNP information for those evaluations resulted in more accurate selection of bulls to acquire and extensively market. Once increased accuracies of genome-enhanced breeding values had been demonstrated, breeders and buyers quickly adopted this technology to improve accuracy of selection . This example of a genomic-selection application has extreme value compared with other animal food production paradigms. In contrast, profit from genomic selection is likely to be much lower for beef bulls and dairy females , . An appealing approach in situations with much lower returns from genotyping is to use a more economical, reduced-density SNP chip with markers optimized for imputation.
Imputation is the process of predicting unknown genotypes for animals from observed genotypes and often uses information from a reference population with dense genotypes to predict missing genotypes for animals with lower density genotypes. It is also applied to merge genotypes of similar densities but different SNPs. Most imputation algorithms use information from relatives and population linkage disequilibrium. A number of software programs for imputation have been developed based originally on human genetics ,  and more recently on animal genetics , , , . The limited effective population sizes and population structures in livestock allow the possibility of imputation of high-density genotypes from quite low-density genotypes , , , .
In 2010, a low-density bovine SNP chip, the Illumina GoldenGate Bovine3K Genotyping Beadchip (http://www.illumina.com/documents/products/datasheets/datasheet_bovine3K.pdf), was developed and made commercially available. That product offered a significant advance toward low-cost genomic selection in cattle; however, imputation accuracy was highly dependent on the relationship of the individual genotyped with the Bovine3K chip to the reference population genotyped at a higher density . In addition, some samples failed to provide genotypes of adequate quality for use in genomic predictions. The SNP call rate performance of the Bovine3K chip was slightly reduced compared with the BovineSNP50 chip  because GoldenGate chemistry relies on two hybridization events for proper SNP detection as opposed to a single event for Infinium chemistry.
In this study, the Illumina Infinium BovineLD Genotyping Beadchip (http://www.illumina.com/documents/products/datasheets/datasheet_bovineLD.pdf) was developed to provide high imputation accuracy for higher density SNP genotypes in taurine dairy and beef populations. The main objective was to provide a tool that would enable genomic estimated breeding values to be calculated from accurately imputed genotype data from an Infinium-based SNP array with very low rates of failed samples. The main features of the new BovineLD chip are presented along with its imputation performance in a range of breeds and reference populations.
Materials and Methods
To provide highly accurate imputation to BovineSNP50 genotypes in global taurine breeds, SNPs were selected from validated assays from existing higher density chips and similar SNP detection technology, i.e. the Illumina BovineSNP50 and BovineHD (http://www.illumina.com/documents/products/datasheets/datasheet_bovineHD.pdf) SNP arrays, with priority given to BovineSNP50 content. From the known and validated SNPs, selection priority was 1) high minor allele frequencies (MAFs) in targeted breeds, 2) uniform spacing at a minimum of 2 SNPs per Mbp, with increased SNP density within 500 kbp of chromosomal ends, 3) inclusion of SNPs for determination of sex, parentage, Y haplotypes, and subspecies and maternal lineages, 4) SNP quality and fidelity criteria for robust reproducibility (>98% call rate and <0.01% Mendelian inconsistency), and 5) a target overlap of 2,000 SNPs with the Bovine3K chip to ensure backward compatibility. The anticipated SNP spacing (2 SNPs per Mbp) obviated the need to check for highly correlated SNPs.
The SNPs were selected to be highly informative with a high MAF over a large range of breeds from around the world (Table 1). The reference MAF estimates were from breeds in 10 countries from North America, Europe, and Oceania. Content selection was optimized using taurine allele frequencies. To achieve regular spacing, the UMD3 bovine genome assembly (http://www.cbcb.umd.edu/research/bos_taurus_assembly.shtml) was used to define 500-kbp segments over the 29 autosomes. A lack of flanking information at the end of each chromosome had resulted in lower imputation efficiency in preliminary tests. To correct that problem, the SNP density was doubled in the first and last segments of each chromosome. Reflecting the diverse membership of the Bovine LD Consortium, initial SNP selection was made by one member and updated by the others. The initial SNP selection was based on two independent criteria. First, SNPs with the highest mean MAF in each 500-kbp segment were selected over a broad range of European breeds including European Holstein, Montbéliarde, Normande, Jersey, Brown Swiss, Norwegian Red, Swedish Red and White, Finnish Ayrshire, Charolais, Limousine, Blonde d'Aquitaine, and Maine Anjou, with Holstein receiving double weight; the top two SNPs were selected in the segment at each end of the chromosome. Second, SNPs with the highest mean minimum MAF for six major European dairy breeds (European Holstein, Montbéliarde, Normande, Jersey, Brown Swiss, and Norwegian Red) were selected for each 500-kbp segment, with again 2 SNPs selected at each end of the chromosome. Selecting those SNPs with the highest mean of the two selection criteria within each 500-kbp segment (with doubling at the chromosome ends) resulted in 8,000 SNPs. Those 8,000 SNPs were subjected to a similar selection process using MAFs from North America and Oceania along with the European populations. For Holstein and Jersey breeds, the MAF used was the mean across the 3 populations; for Brown Swiss, only North America and Europe were included. The mean MAF was computed from Holstein, Jersey, Brown Swiss, Angus, and Brahman. The minimum MAF was from Jersey, Brown Swiss, and Angus. Again, the SNPs with the highest mean of the two selection criteria were selected with doubling at the chromosome ends.
Table 1. Number of DNA samples, minor allele frequencies (MAFs), and estimated frequency of loci that were polymorphic by breed and region.doi:10.1371/journal.pone.0034130.t001
Next, some of the selected SNPs were replaced by Bovine3K SNPs that were in nearby locations to ensure backward compatibility. In addition, SNPs used for breed determination and parentage testing that had not already been selected were included, and some SNPs were added to fill gaps generated by map inconsistencies.
For the X chromosome, Bovine3K SNPs with high MAFs were selected and supplemented with BovineSNP50 SNPs, with consideration given to spacing, MAF, and fidelity. Because large gaps remained after that initial selection, additional X- chromosome SNPs were chosen from the BovineHD assay.
For the Y chromosome and mitochondrial DNA (mtDNA), 9 Y-specific and 13 mtDNA SNP markers were identified from the BovineHD chip based on assay fidelity and performance across 27 breeds, MAF across those breeds, and ability of a SNP to discern subspecies and geographic locations of breed origins.
Imputation efficiency was assessed in 10 populations (North American, French, and Australian Holsteins; North American and Australian Jerseys; North American Brown Swiss; Australian Angus; French Montbéliarde; French Normande; and French Blonde d'Aquitaine). Beagle software (http://faculty.washington.edu/browning/beagle/beagle.html)  was used for the Australian and French populations and findhap.f90 (http://aipl.arsusda.gov/software/findhap/)  for the North American populations. These imputation programs have similar performance in large dairy cattle data sets . Using existing genotypes from the BovineSNP50 chip, imputation efficiency was determined by comparing imputed and obseved genotypes. Part of the population was retained as a “reference,” while target individuals for imputation had their genotypes reduced in silico to either BovineLD or Bovine3K genotypes. Results were assessed as the proportion of genotypes that were correct in the target population. For example, if the imputed genotype was a heterozygote and the BovineSNP50 genotype was a homozygote, that genotype was counted as incorrectly imputed. The count of correct genotypes included both observed and imputed genotypes to measure the overall success of a lower density genotype in approximating a BovineSNP50 genotype.
The SNP assays for 6,914 loci were validated using data from 290 samples that represented 26 global dairy and beef breeds (Table 2) and included Bovine Hapmap samples . The 290 samples (234 males, 56 females) included 286 unrelated samples, 2 trios, and 2 replicates. All markers were assessed for clustering of the genotypes using Illumina GenomeStudio genotyping software (version 2010.3; http://www.illumina.com/documents/products/datasheets/datasheet_genomestudio_software.pdf. A total of 6,909 clearly identifiable and scorable clusters were retained for robust utility of the panel. The cluster positions were defined with priority given first to data from dairy breeds and second to beef breeds. The purpose of the resulting cluster position file is to apply known robust cluster positions to future genotyping data for high throughput genotype calling. For phylogenetic analysis based on Y and mtDNA SNPs, individual sequences for each breed were clustered to construct consensus sequences using SNPs from 9 Y-chromosome loci and 13 mtDNA loci with the DNASTAR SeqMan program (version 6.1; http://www.dnastar.com/t-sub-products-lasergene-seqmanpro.aspx). There were 236 chromosome X SNP on the final Bovine LD chip. Flanking sequences and base calls for the 6,909 SNP are given in Table S1.
SNP call rates and accuracy
The BovineLD chip, consisting of 6,909 final loci, was validated for 290 individuals from 26 major dairy and beef breeds (Table 2). The mean call rate was 99.94% among dairy breeds, 99.90% among beef breeds, and 99.93% among all samples. For taurine breeds, discordant calls compared to BovineSNP50represented <0.01% of all genotyping calls (Table 2). Mendelian consistency was examined using two Holstein trios, which showed a single error on BTB-01149046 out of 13,797 total possible comparisons. Reproducibility was 100% across two Holstein replicated samples. Based on the nearly perfect concordance between the BovineLD and the BovineSNP50 genotypes reported in Table 2 and the similar concordance between BovineSNP50 and BovineHD genotypes, Mendelian consistency and reproducibility were also examined for the overlapping 6,844 SNPs from BovineHD genotypes. Those data included 8 parent-progeny, 24 parent-parent-progeny, and 10 replicate comparisons that represented 11 taurine, 2 indicine, and 1 hybrid breeds (Table 3). Mendelian consistency was 99.95%, and reproducibility was 99.99%.
Table 3. Mendelian consistency and reproducibility comparisons for a set of 6,844 SNPs in common for the BovineHD and BovineLD BeadChips.doi:10.1371/journal.pone.0034130.t003
The concordance rate for 2,088 SNPs in common between BovineLD and Bovine3K assays was 98.78% for 281 females genotyped with both chips. The most likely cause of the differential performance between the BovineLD and Bovine3K chips is the chemistry difference between the Infinium and GoldenGate assays.
Performance for MAF, mean spacing, and paternal and maternal lineages
Data for calculating mean MAF (Table 1) were primarily BovineLD markers extracted from BovineSNP50 data. However, if BovineSNP50 data were not available, BovineLD markers from the validation data were used. That method allowed MAFs to be calculated more accurately. Mean MAF for the 6,909 SNPs was ≥0.29 for all taurine breeds (Table 1). For Brahman (a Bos primigenius indicus breed), mean MAF was lower (0.18). Overall, >89% of the SNPs were polymorphic in Brahman, which suggested that the BovineLD chip may be useful for imputation in this breed.
For the 6,909 SNPs selected for the BovineLD chip, median spacing was 0.348 Mbp, with only 82 (1.1%) of intervals greater than 1 Mbp (Fig. 1). These gaps originate either from the X chromosome, or from regions not covered by the BovineSNP50. The strategy of increasing SNP density at chromosome ends substantially improved imputation accuracy for those regions compared with the Bovine3K array (Fig. 2).
Figure 1. BovineLD single-nucleotide polymorphism (SNP) gap distribution.doi:10.1371/journal.pone.0034130.g001
Figure 2. Imputation accuracy for Bovine3K and BovineLD genotypes.
Imputation was performed for A) Bovine3K and B) BovineLD genotypes using Beagle software (http://faculty.washington.edu/browning/beagle/beagle.html); imputation accuracy is reported by single-nucleotide polymorphism (SNP).doi:10.1371/journal.pone.0034130.g002
The sex-specific and lineage identification SNPs also appeared to perform well. The nine Y-chromosome SNPs had a 100% call rate across 230 males of different breeds and no genotype calls for the 55 females. We investigated the frequency of the haplotypes of the alleles from these 9 SNP both within and across breeds. Four unique haplotypes were observed, which differed dramatically in frequency across breeds, Table 4. One haplotype, CGCCGCAAC (haplotype 1) was observed only in cattle with indicine lineage (eg Brahmans, Beef Master, Santa Getrudis). The second haplotype (TCTCCTCAC) was associated with central European lineage, haplotype 3 (TCTCCTCAT) was 1 base different from haplotype 2 and probably appeared to be associated with breeds that came to the island of Jersey from France or Spain, and haplotype 4 (TCTTGTCGC) was associated with northern European lineage, including islands. Only a few breeds had more than one haplotype, e.g. Santa Gertrudis and Beefmaster, both of which are taurine–indicine hybrids. Common haplotypes across breeds appeared to reflect a common origin. Phylogenetic analysis separated the 26 breeds into four distinctive clades, which agrees with a previous report on the dual origins of dairy cattle breeds in Europe . For mtDNA SNPs (Table 5), seven unique mitochondrial haplotypes were found, however 259 of the animals sampled had the same mitochondrial haplotype. Haplotype 7 (AAGAGCAAAAAAG) was at highest frequency in indicine cattle. Most taurine×indicine cattle were derived from taurine cows. Therefore, the lack of haplotype 7 for taurine breeds in most regions is not unexpected. While more research is required, these preliminary results suggest the BovineLD markers could be useful in determining lineage origin between taurine and indicine breeds or identifying potential admixture within a population of locally adapted animals.
Table 4. Animal counts for Y-chromosome haplotypesa by breed.doi:10.1371/journal.pone.0034130.t004
Table 5. Animal counts for mtDNA-chromosome haplotypesa by breed.doi:10.1371/journal.pone.0034130.t005
Accuracy of imputation
Imputation accuracy was assessed in Australian, French, and North American cattle populations. In all cases, the accuracy of imputation to BovineSNP50 genotypes was ≥95% (Table 6). Most imputation results were >97%, particularly for dairy breeds. The results were lower for some breeds, likely because of the limited reference population size used. For example, the considerably larger size of the North American reference set of Holsteins compared with the Australian set could explain why the North American imputation accuracy was 1.1 percentage points higher than for Australia. The effect of a smaller reference set of genotypes on imputation accuracy was further demonstrated by imputation from BovineLD genotypes for Australian Angus, which had the smallest reference population in the data set. For French populations, imputation efficiency also varied, with the highest accuracy for Holsteins and the lowest for Blondes d'Aquitaine (Table 6); imputation accuracy for Normandes and Montbéliardes was slightly lower than for Holsteins. Again, much of the variation is likely explained by reference population size.
Table 6. Accuracy of imputation from BovineLD genotypes to BovineSNP50 genotypes for Australian, French, and North American breeds.doi:10.1371/journal.pone.0034130.t006
For Australian and North American Holsteins, accuracy of imputation to BovineSNP50 genotypes was better for BovineLD genotypes than for Bovine3K genotypes. For Australian Holsteins, imputation accuracies were up to almost 6 percentage points higher with the BovineLD chip than with the Bovine3K chip using the same data (Table 7). Mean imputation accuracy was 92.8% for Australian Holstein Bovine3K genotypes compared with 97.6% for BovineLD genotypes. For North American Holsteins, accuracies of imputation to BovineSNP50 genotypes from Bovine3K genotypes ranged from 93.0 to 96.7% (depending on number of parents genotyped) for 2,456 animals genotyped with both Bovine3K and BovineSNP50 chips . Corresponding values for BovineLD genotypes (Table 8) are 96.6 to 99.3%.
Table 7. Accuracy of imputationa from BovineLD or Bovine3K genotypes to BovineSNP50 genotypes for Australian Holsteins with and without a sire in the reference populationb.doi:10.1371/journal.pone.0034130.t007
Table 8. Accuracy of imputationa from BovineLD genotypes to BovineSNP50 genotypes for North American Brown Swiss, Holsteins, and Jerseys with and without parents in the reference populationb.doi:10.1371/journal.pone.0034130.t008
The greatest improvement in imputation for BovineLD genotypes compared with Bovine3K genotypes was for individuals with no genotyped parents. For Australian Holsteins, difference in mean imputation accuracy with and without a sire in the reference population was 2.9 percentage points for Bovine3K genotypes but only 1.3 percentage points for BovineLD genotypes. The improvement was smaller for North American Holsteins: a difference of 2.7 percentage points between both parents genotyped and no genotyped parents for Bovine LD genotypes (Table 6) compared with 3.7% for Bovine3K genotypes . Compared with North American Holsteins, BovineLD imputation accuracy for animals without a parent in the reference population was slightly poorer for North American Jersey and Brown Swiss populations (Table 8). However, the more than doubling of markers and the different SNP selection criteria  compared with the Bovine3K chip allowed high imputation accuracies across a wider range of dairy breeds as well as some beef breeds.
The Illumina BovineLD BeadChip includes 6,909 SNPs selected to provide optimized imputation to BovineSNP50 genotypes in dairy breeds. The SNPs have MAFs of >0.3 in most breeds, and nearly uniform spacing across the genome except at the ends of the chromosome where densities were increased. The chip also includes SNPs on the Y chromosome and mtDNA loci that are useful for gender checking, determining subspecies classification and identifying certain paternal and maternal breed lineages. Accuracy of imputation to BovineSNP50 genotypes using the BovineLD chip was >99% when both parents were genotyped in the North American BovineSNP50 reference population. That high accuracy suggests that the design criteria for the BovineLD chip would be useful to consider in other species for which an “imputation chip” could dramatically lower the cost of implementing genomic selection. BovineLD imputation was about 3 percentage points more accurate across multiple populations compared with Bovine3K imputation. The improvement was greatest when neither parent had been genotyped. The gain in imputation accuracy is attributed primarily to the increased overall density of the BovineLD chip compared with the Bovine3K chip and also to the even further increased density at the ends of chromosomes. The high MAFs also contribute to the improved imputation accuracy. The MAFs were similar across taurine beef and dairy breed as was the proportion of SNPs that were polymorphic. Although it would be expected that accuracies of imputation would be highest for those breeds which were included in the design of the chip, which was dominated by dairy breeds, the similar SNP characteristics (particularly the high MAF across many beef and dairy taurine breeds) suggest that the BovineLD chip will perform well in imputation of taurine beef cattle. Our results suggest that the imputation accuracy will also be quite dependent on the size of the population genotyped with a higher density SNP assay. Overall, the new BovineLD BeadChip should facilitate low cost genomic selection in Bos primigenius taurus beef and dairy cattle.
Genomic locations, flanking sequences and base calls for the 6,909 SNP on the bovineLD array.
The authors would like to acknowledge the Beef Cooperative Research Centre for providing data. Cécile Grohs (National Institute for Agricultural Research, Centre de Recherches de Jouy en Josas), Christian Bendixen and Rikke Vingborg (GenoSkan), Reiner Emmerling (Institute of Animal Breeding, Bavarian State Research Center for Agriculture), Ekkehard Schütz (Institute of Veterinary Medicine, Georg August Universität), John Flynn (Weatherby's DNA Laboratory, Irish Equine Centre), Jürg Jürg Moll (Qualitas), and Céline Chantry-Darmon (Labogena) are acknowledged for providing samples for SNP validation. Pfizer Animal Genetics contributed both BovineLD and Bovine3K genotypes for 221 females. The authors thank Suzanne Hubbard (Animal Improvement Programs Laboratory) for her invaluable assistance in manuscript review.
Disclaimers: Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. The U.S. Department of Agriculture is an equal opportunity provider and employer. TSS, CPV, PMV, and GRW were U.S. Government employees at the time they contributed to this work. The work of those individuals was prepared as part of official government duties. Title 17 U.S.C. §105 provides that “Copyright protection under this title is not available for any work of the United States Government.” Title 17 U.S.C. §101 defines a U.S. Government work as a work prepared by a military service member or employee of the U.S. Government as part of that person's official duties. The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the U.S. Department of Agriculture or the U.S. Government.
Conceived and designed the experiments: DB XD BJH TSS CPV. Performed the experiments: XD AE CTL TSS CPV HC SF GRW. Analyzed the data: RD BJH PMV GRW. Contributed reagents/materials/analysis tools: KJG KVM CTL. Wrote the paper: DB RD AE BJH CTL TSS CPV PMV GRW.
- 1. Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829.
- 2. Heffner EL, Jannink JL, Sorrells ME (2011) Genomic selection accuracy using multifamily prediction models in a wheat breeding program. The Plant Genome 4: 65–75.
- 3. Wiggans GR, VanRaden PM, Cooper TA (2011) The genomic evaluation system in the United States: past, present, future. J Dairy Sci 94: 3202–3211.
- 4. Schaeffer LR (2006) Strategy for applying genome-wide selection in dairy cattle. J Anim Breed Genet 123: 218–223.
- 5. Pryce JE, Goddard ME, Raadsma HW, Hayes BJ (2010) Deterministic models of breeding scheme designs that incorporate genomic selection. J Dairy Sci 93: 5455–5466.
- 6. Pryce JE, Daetwyler HD (2011) Designing dairy cattle breeding schemes under genomic selection—a review of international research. Anim Prod Sci 52(3): 107–114.
- 7. Van Eenennaam AL, van der Werf JHJ, Goddard ME (2011) The value of using DNA markers for beef bull selection in the seedstock sector. J Anim Sci 89: 307–320.
- 8. Scheet P, Stephens M (2006) A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet 78: 629–644.
- 9. Browning SR, Browning BL (2011) Haplotype phasing: existing methods and new developments. Nat Rev Genet 16: 703–714.
- 10. Druet T, Georges M (2010) A hidden Markov model combining linkage and linkage disequilibrium information for haplotype reconstruction and quantitative trait locus fine mapping. Genetics 184: 789–798.
- 11. Daetwyler HD, Wiggans GR, Hayes BJ, Woolliams JA, Goddard ME (2011) Imputation of missing genotypes from sparse to high density using long-range phasing. Genetics 189: 317–327.
- 12. Hickey JM, Kinghorn BP, Tier B, Wilson JF, Dunstan N, et al. (2011) A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes. Genet Sel Evol 43: 12.
- 13. VanRaden PM, O'Connell JR, Wiggans GR, Weigel KA (2011) Genomic evaluations with many more genotypes. Genet Sel Evol 43: 10.
- 14. Druet T, Schrooten C, de Roos AP (2010) Imputation of genotypes from different single nucleotide polymorphism panels in dairy cattle. J Dairy Sci 93: 5443–5454.
- 15. Weigel KA, Van Tassell CP, O'Connell JR, VanRaden PM, Wiggans GR (2010) Prediction of unobserved single nucleotide polymorphism genotypes of Jersey cattle using reference panels and population-based imputation algorithms. J Dairy Sci 93: 2229–2238.
- 16. Dassonneville R, Brøndum RF, Druet T, Fritz S, Guillaume F, et al. (2011) Effect of imputing markers from a low-density chip on the reliability of genomic breeding values in Holstein populations. J Dairy Sci 94: 3679–3686.
- 17. Wiggans GR, Cooper TA, VanRaden PM, Olson KM, Tooker ME (2012) Use of the Illumina Bovine3K BeadChip in dairy genomic evaluation. J Dairy Sci 95: 1552–1558.
- 18. Matukumalli LK, Lawley CT, Schnabel RD, Taylor JF, Allan MF, et al. (2009) Development and characterization of a high density SNP genotyping assay for cattle. PLoS One 4(4): e5350.
- 19. Johnston J, Kistemaker G, Sullivan PG (2011) Comparison of different imputation methods. Interbull Bulletin 44. Available: http://www.interbull.org/images/stories/Jarmila_copy.pdf. Accessed 2012 Mar 8.
- 20. The Bovine Hap Map Consortium (2009) Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324: 528–532.
- 21. Edwards CJ, Ginja C, Kantanen J, Pérez-Pardal L, Tresset A, et al. (2011) Dual origins of dairy cattle farming—evidence from a comprehensive survey of European Y-chromosomal variation. PLoS ONE 6: e15922.
- 22. Dassonneville R, Fritz S, Ducrocq V, Boichard D (2012) Short Communication: Imputation performances of three low density marker panels in beef and dairy cattle. J Dairy Sci. In press.