Estimating the genetic variance available for traits informs us about a population’s ability to evolve in response to novel selective challenges. In selfing species, theory predicts a loss of genetic diversity that could lead to an evolutionary dead-end, but empirical support remains scarce. Genetic variability in a trait is estimated by correlating the phenotypic resemblance with the proportion of the genome that two relatives share identical by descent (‘realized relatedness’). The latter is traditionally predicted from pedigrees (ΦA: expected value) but can also be estimated using molecular markers (average number of alleles shared). Nevertheless, evolutionary biologists, unlike animal breeders, remain cautious about using marker-based relatedness coefficients to study complex phenotypic traits in populations. In this paper, we review published results comparing five different pedigree-free methods and use simulations to test individual-based models (hereafter called animal models) using marker-based relatedness coefficients, with a special focus on the influence of mating systems. Our literature review confirms that Ritland’s regression method is unreliable, but suggests that animal models with marker-based estimates of relatedness and genomic selection are promising and that more testing is required. Our simulations show that using molecular markers instead of pedigrees in animal models seriously worsens the estimation of heritability in outcrossing populations, unless a very large number of loci is available. In selfing populations the results are less biased. More generally, populations with high identity disequilibrium (consanguineous or bottlenecked populations) could be propitious for using marker-based animal models, but are also more likely to deviate from the standard assumptions of quantitative genetics models (non-additive variance).
Citation: Gay L, Siol M, Ronfort J (2013) Pedigree-Free Estimates of Heritability in the Wild: Promising Prospects for Selfing Populations. PLoS ONE 8(6): e66983. doi:10.1371/journal.pone.0066983
Editor: Mark van Kleunen, University of Konstanz, Germany
Received: March 15, 2013; Accepted: May 14, 2013; Published: June 25, 2013
Copyright: © 2013 Gay et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was funded by the French Research Institute INRA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The genetic variance available for a trait in a population informs us about its potential ability to evolve in response to novel selective challenges . This corresponds to one of Brookfield’s definitions of the evolvability of a population: “a description of its current standing crop of genetic variability, and the consequence of the extent and nature of this variation for the population’s ability to respond to current selective pressures” . Additive genetic variance can be estimated from the resemblance between relatives by relating the phenotypic covariance of a quantitative trait with the proportion of the genome for which two relatives share genes identical by descent , . To achieve this, one would ideally like to know the actual proportion of loci controlling the trait that are identical by descent. This ‘realised relatedness’ is the outcome of a stochastic process (due to Mendelian segregation and linkage) with a variance that depends on genome size –. However, because causal loci are unknown, we traditionally use the expected value of identity by descent given the ancestry , . It can be deduced from a pedigree (hereafter called ΦA), either in an experiment using specific relatedness classes (e.g. full sib-half sib design) or in a population with pedigree data ranging over several generations . However, in wild populations, pedigree information is generally not available except for a few long term studies –. An alternative solution is to estimate the genome-wide average of the realised relatedness between individuals using molecular markers .
With the ongoing rise of next generation sequencing, high density SNP panels become available and the realised proportion of the genome that two individuals share identical by descent can be estimated with increasing accuracy . Several estimators of kinship (sometimes called coancestry) and relatedness (or relationship) coefficients have been proposed – and compared , –. From these reviews, it appears that the relative performance of each method depends on the set of loci used, on allele frequency distributions, in particular minor allele frequency spectrum  and on the average relatedness between individuals in the population. Provided we can estimate it precisely, using the genome-wide average of the realised relatedness rather than resorting to its expected value (ΦA) could improve the estimation of evolutionarily relevant parameters for quantitative traits (genetic variance, genetic correlations or selection gradients) .
Different methods are available to estimate quantitative genetic parameters from molecular marker data (hereafter called pedigree-free methods) . Their respective reliability and suitability depending on the marker used and/or the population genetic structure remain unclear. While such methods based on genome-wide molecular information have received substantial attention by animal breeders and human biologists , , evolutionary biologists remain very cautious about how useful marker-based relatedness coefficients could be for studying complex phenotypic traits in populations lacking pedigree information .
Being able to accurately estimate the genetic variance of a trait is particularly important for inbred or selfing species that have been described as evolutionary dead ends due to the potential loss of genetic diversity , . Self-fertilization is common in angiosperms ,  and also occurs in hermaphrodite animals at a lower frequency . Reduced genetic variation in highly selfing populations is frequently observed using molecular markers –. It can be explained by reduced effective population sizes accompanying increased homozygosity , enhanced genetic hitchhiking with selective sweeps and background selection caused by a reduced effective recombination rate ,  and frequent bottlenecks following the recurrent extinction - recolonisation events . However, the effect of selfing on the genetic variation relevant for most adaptive change, i.e. quantitative genetic variation, is less clear. For a population with constant allelic frequencies, inbreeding is expected to increase the genetic variance of a trait to the point that when inbreeding is complete, the genetic variance in the population as a whole is doubled and appears as the between-lines component . Simultaneously, inbreeding should reduce quantitative variation due to the fixation of alleles and to a reduced efficiency of selection in maintaining variants (see  and  for a review of theoretical arguments). Simulations have shown that if non-additive effects are important (dominance or epistasis), additive genetic variance increases with inbreeding, reaching a maximum for intermediate inbreeding coefficients (F) and then a declining towards zero at F = 1 , . In agreement with these theoretical expectations, some evidence for reduced within population genetic variance in highly inbreeding populations compared to outcrossing populations has been found , . More data are needed to compare the level and components of quantitative genetic variation between selfers and outcrossers and to further understand the consequences of self-fertilization on the adaptive potential of natural populations.
Besides direct effects on genetic variability, selfing generates a correlation in heterozygosity and/or homozygosity across loci, called identity disequilibrium . This will broadly influence both the relatedness between individuals and the variance in relatedness in the population ,  and could improve marker-based estimates of the expected proportion of identity by descent . Thereby, selfing may provide favourable conditions for pedigree-free quantitative genetics.
In this paper we review published results comparing several pedigree-free methods used to estimate quantitative genetics parameters for complex phenotypic traits in wild populations. We then report results from simulations aimed at further comparing the performance of individual-based models (hereafter called animal models) using pairwise relatedness predicted from the pedigree versus molecular markers, with a special focus on how mating systems affects the efficiency of these methods.
Materials and Methods
As a preliminary step, we searched the literature (using keywords in Web of Science) looking for studies comparing methods for estimating quantitative genetic parameters using molecular markers, for example a pedigree-based animal model and another method, or a simulation study. Our review is based on the classification published by Garant & Kruuk , who identified three categories of methods that rely on molecular markers. (1) The ‘Ritland’ method estimates heritability as the covariance between pairwise phenotypic similarity and pairwise relatedness . Alternatively (2), using a maximum likelihood approach, individuals can be classified into known classes of relatedness (for example sibs vs. unrelated) and analyzed in a mixture model . Sibling groups can also be identified within one generation and analyzed in a classic quantitative genetics framework (analysis of variance) or using a more complex model (animal model, ). Finally (3), parentage assignment methods can help reconstruct a complete pedigree spanning several generations ,  and quantitative genetics parameters are then derived from an animal model . We expanded Garant & Kruuk’s classification to include two additional methods that are currently available: (4) an animal model method , ,  directly using the full pairwise relatedness matrix estimated using molecular markers ,  and (5) a multilocus association method derived from genomic selection . This last method was originally aimed at predicting individual breeding values using molecular markers, in order to accelerate and improve the response to artificial selection. Technically, it relies on multiple regressions with shrinkage, where the phenotype is explained by a set of markers (e.g. ).
In order to compare additive genetic variances among traits or taxa, it is common practice to scale it with the total phenotypic variance (heritability) or with the trait mean (coefficient of additive genetic variation). As proposed by Houle  and recently confirmed by Hansen et al.  the additive genetic coefficient of variation is a better predictor of a population’s ability to respond to selection and can be viewed as an accurate measure of evolvability. Nevertheless, heritability remains the most commonly reported measure of evolutionary potential (in particular in all studies using the Ritland method). In our review, we therefore compared heritabilities rather than coefficients of additive genetic variation. This can be misleading in the presence of a positive correlation between the additive variance and other components of phenotypic variance . We also used heritabilities in our simulations in order to remain consistent. We argue that it is not problematic in our simulations as there is no inherent correlation between variance components.
The performance of a statistical inference can be evaluated by the bias, defined as E( – h2), where is the estimator and h2 the parameter and the sampling error, E( –E())2. When reviewing the literature for pedigree- free methods, we would ideally like to compare the bias and sampling error of heritability estimates obtained using pedigrees (ped) or one of the marker-based methods (marker). However, the review of empirical results only provides us with single estimates ped and marker to compare and we cannot say anything about the extent to which either or both are biased and which has smaller sampling error. We therefore only tested whether ped and marker significantly differed for each of the marker-based methods using linear models and also compared their standard errors. Most studies considered several traits that would be wrongly considered as independent. We took this into account by adding a random effect for study. For simulation studies, we report the bias and standard error of the estimators ped or marker. Again, we tested for an influence of the method used (pedigree or marker-based methods) on the bias using linear mixed models with study as a random effect followed by posthoc tests (with Bonferroni correction) to identify significant pairwise differences. Studies using genomic selection methods generally evaluate the accuracy of the model as the correlation between the estimated breeding value and the real breeding value. We report these accuracies.
We tested whether the absolute value of the bias increases with heritability (measured by pedigree methods or simulated) and assessed the relationship between the bias and the number of markers available (or simulated) using a mixed linear model for each method, with study as a random effect.
We simulate a population of constant size N = 500 individuals evolving for a number of generations. Individuals are diploid and unrelated at generation 0 (but they can share alleles identical by state). Generations are non-overlapping. We simulate the genotype of each individual at LM+LQTL unlinked loci. The LQTL loci are causal loci whereas the LM loci are biallelic non-coding marker loci. Initial allele frequencies are drawn from the distribution expected at mutation – drift equilibrium using a Dirichlet distribution . The phenotype of each individual is controlled by 500 loci (LQTL) with five alleles each. Allelic effects are randomly drawn from a normal distribution for each allele. The phenotype of an individual is the sum of the two allelic effects at each of the LQTL loci (αi,j is the allelic effect of allele i at locus j) plus a random factor (εij) drawn from a normal distribution representing the environmental effect:(1)
The variances of the distribution of allelic and environmental effects are adjusted to simulate heritabilities of 0.15, 0.3 and 0.6. The next generation is built either by random mating of pairs of individuals or with a set proportion of selfing (S = 0.9) to simulate inbred populations. The simulation program is written in C++ and runs in batch using a custom python script, with 20 replicates per parameter set. The code is available as supplementary material (Zipfile S1).
Estimation of Genetic Variance Using a Marker-based Animal Model–influence of Mating Regime and Manipulations on Relatedness Matrices
We simulated populations evolving for 10 generations and performed analyses at generation 10, using pedigree and marker-based animal models (method 4 in the review section). Every generation, the pedigree information (mother and father of each individual) was recorded and used to calculate the relatedness coefficient (ΦA) between pairs of individuals. Genotypes at the LM marker loci were used to estimate pairwise relatedness using the coefficient introduced by Loiselle et al.  (thereafter named Ki,j). It does not assume Hardy-Weinberg equilibrium and performs well, even in the presence of rare alleles , . To estimate heritability, we fitted a very simple linear model to the simulated phenotype: y = Za+e where y is the phenotype, Z is a design matrix and a the vector of additive genetic effects; e is the vector of residual effect. The pedigree or the marker loci information were then used to specify a variance–covariance structure for the vector of additive genetic effects a, shaped as 2.ΦA.σA2 when using an animal model including the pedigree or as 2.Ki,j.σA2 when replacing the ΦA matrix by a marker-based relatedness matrix. For selfing populations, we used 2.ΦA.σA2/(1+ F) and 2.Ki,j.σA2/(1+ F) to account for inbreeding , where F was the average inbreeding coefficient. F was either estimated using pedigrees or approximated as S/(2–S)  for the marker-based method, where the pedigree is supposed unknown. We preliminarily ascertained that both values were highly similar (F = 0.82 for S = 0.9). We used restricted maximum likelihood to estimate the additive genetic variance and the standard errors with the program ASReml v3.0 . We examined the effect of the number of marker loci by letting LM vary between 384, 1500, 3000 or 5000 SNPs. We also manipulated the marker-based relatedness matrix to summarize the information and examined how this influenced the bias in genetic variance estimates. It has become common practice in association studies (GWAS) to truncate the marker-based relatedness matrix by replacing any negative value by zero. We transformed our matrix accordingly.
As in the review section for simulation studies, we measured the bias in heritability as E( – h2). We estimated the precision of the estimation using sampling errors, defined as E( – E())2. We compared the bias and sampling errors in heritability estimators (pedigree versus molecular markers) using Wilcoxon signed rank tests. We tested for the effect of the mating regime, trait heritability and number of markers on the bias in heritability estimates using linear models.
The identity disequilibrium created by consanguineous matings could improve marker-based estimates of the expected proportion of identity by descent . We verified this in our simulations by estimating the relatedness at the causal variants (500 QTLs) and examined how accurately it is predicted by the relatedness at the LM marker loci, in selfing or outcrossing populations, using linear regressions. A slope close to one would indicate that the set of observed SNPs accurately predicts the relatedness at causal loci. Any deviation to one could be caused either by sampling error (due to the limited number of observed SNPs) or by rare alleles in the causal variants. All analyses were run in R version 2.15.1.
Review of Published Results
We collected 39 papers comparing heritability estimates based on molecular data or pedigree information. Among those, 24 reported empirical results (Table 1) while 15 were based on simulations only (Table 2). Only seven studies reported heritability estimates for plant species, 16 for animals and one for a protist. Using linear mixed models, we found a significant difference between pedigree-based estimates and marker-based estimates for the Ritland method (χ2 = 22.2; p = 2.5×10−06), the relatedness classes (χ2 = 5.9; p = 0.015) and the reconstructed pedigrees (χ2 = 29.8; p = 4.7×10−08) but not for the animal model (χ2 = 0.04; p = 0.836) or genomic selection (χ2 = 0.001; p = 0.977). As shown by Figure 1.A, the difference between the heritability estimated using pedigree or molecular markers was lowest with method 4 (marker-based animal model, see also Figure S1). When a dataset was analyzed using Ritland’s method and an animal model in parallel, the latter gave results closest to the estimates obtained using the pedigree . In addition, pedigree-free methods seem to improve the precision of the estimation, except for the Ritland method and the genomic selection method (with Ritland: standard error increased by +0.23; p = 0.062 and +0.13 with genomic selection; p = 0.001).
Figure 1. Accuracy of five different marker-based methods to estimate heritability– review of empirical and simulation studies.
The efficiency was assessed in a review of 24 empirical studies (A) or 15 simulation studies (B), comparing heritability estimates using pedigree or one of the following methods: 1 - Ritland; 2 - relatedness classes; 3 - reconstructed pedigrees; 4 - marker-based animal model or 5 - genomic selection. Details of the number of studies for each method are given in Table 1. The bias was measured as marker - pedigree in A and as E( – h2) in B, where h2 is the simulated parameter. The horizontal line shows the median bias for each method. The bottom and top of the box show the 25th and 75th percentiles. The vertical dashed lines show the maximum and minimum biases and the circles are outliers.doi:10.1371/journal.pone.0066983.g001
Table 1. Summary of studies comparing estimates of quantitative genetics parameters using pedigree-free methods.doi:10.1371/journal.pone.0066983.t001
Table 2. Summary of simulation studies comparing estimates of quantitative genetics parameters using pedigree-free methods.doi:10.1371/journal.pone.0066983.t002
Using the 15 simulation studies, we calculated the bias as E(– h2) (Table 2). The mixed model highlighted a significant effect of the method on the bias (p = 0.005). Posthoc tests showed a significantly higher bias for the Ritland method (average 0.30; n = 10) than for the pedigree method (0.04; n = 13; p<0.001), the marker-based animal models (0.03; n = 15; p = 0.049) or the genomic selection (−0.11; n = 12; p = 0.015). There was no significant difference between these three methods (p>0.750). The number of simulation results testing methods 2 and 3 in the dataset we collected was insufficient to compare their biases. The average biases for each method are shown in Figure 1.B. We found no significant difference in standard error between these methods (p = 0.129).
The average value of the bias in heritability tended to increase for traits with low heritabilities (Figure S2.A and S2.B, p<0.0001) and this effect was significant for all methods except the marker-based animal model (p = 0.743). We detected a significant but small negative effect of the number of microsatellites or SNP used on the average value of the bias with the animal model method (p = 0.012 and p = 0.048) (Figure S2.C and S2.D).
Simulation Results - Heritability Estimation Using a Marker-based Animal Model
Simulation results confirmed that using pedigree information in an animal model provides accurate estimates of heritability for both outcrossing and selfing populations, even with low heritabilities (Figure 2.A). We analysed the bias in heritability estimates using Wilcoxon signed rank tests and found that replacing the pedigree-based relatedness matrix by a marker-based relatedness matrix strongly worsens the estimation of heritability for outcrossing populations (average bias = −0.014 with pedigree and 0.174 with markers; VWilcoxon = 9; p = 4.1×10−10; Figure 2.B) but not for selfing populations (average bias = −0.016 with pedigree and −0.015 with markers; VWilcoxon = 738; p = 0.270). The bias in outcrossing populations was reduced when using truncated marker-based relatedness coefficients (0.072; Figure 2.C). Results were the same when testing the effect of the method on the bias using linear models. In addition, if the marker-based method seemed more biased than the pedigree method, the later had higher sampling error (0.15 for the pedigree method and 0.08 for marker-based method) and the difference was significant in outcrossing populations (0.16 versus 0.01; VWilcoxon = 1422; p = 4.1×10−10) but not in selfing populations (0.14 for both methods; VWilcoxon = 1028; p = 0.408).
Figure 2. Simulation results testing the accuracy of pedigree or marker-based methods to estimate heritability.
This figure shows the correlation between the heritability simulated and heritability estimates obtained using pedigree-based animal models (A), marker-based animal models (B) or marker-based relatedness coefficients truncated before the analysis (C). Each dot stands for a simulated population, with 90% selfing (in grey) or complete outcrossing (in black). Circles stand for means across 20 replicates and solid lines show the 95% confidence intervals, as estimated by Asreml (and averaged across replicates). The dashed lines represent y = x.doi:10.1371/journal.pone.0066983.g002
In outcrossing populations, the absolute value of the bias with the marker-based method increased with higher simulated heritabilities (F53,1 = 155.0; p<10−16), but it should be pointed out that with low heritabilities (0.15) the model sometimes failed to converge or estimated an additive variance not significantly different from zero.
Selfing decreased the bias in marker-based heritability estimates. With a heritability of 0.3, for example, the bias was more than seven times larger in outcrossing compared to selfing populations (F36,1 = 57.2; p = 6.10−9). The distribution of pairwise relatedness in selfing populations may explain such a better performance in estimating heritability , . Indeed, as shown on Figure 3, pairwise relatedness coefficients have a higher mean and a larger variance in selfing compared to outcrossing populations. This effect extends beyond the simple influence of population size (Figure S3). The higher performance of the marker based method under selfing can also be related to the fact that the relatedness at causal loci is more closely correlated with the relatedness at a set of marker loci in selfing than in outcrossing populations (Figure 4). This is in agreement with results by Szulkin et al.  on inbreeding coefficients and is caused by high identity disequilibrium in selfing populations.
Figure 3. Higher mean and larger variance in pairwise relatedness coefficients in selfing compared to outcrossing populations.
Regression between pairwise Loiselle coefficients estimated using 1500 SNP and ΦA. The population comprised 500 individuals with 90% selfing (grey crosses) or complete outcrossing (black circles). The legend indicates the slope of the regression of ΦA against Loiselle and the correlation coefficient r. The variance in relatedness was 0.0026 in the outcrossing population and 0.0108 in the selfing population (within the range of variances observed in wild populations, see Table 1 and ).doi:10.1371/journal.pone.0066983.g003
Figure 4. Relatednesses at causal and marker loci are more closely correlated in selfing than in outcrossing populations.
Regression between pairwise Loiselle coefficients estimated using 1500 SNPs and pairwise Loiselle coefficients estimated using the allele frequency at QTLs determining the phenotypic trait. Outcrossing populations are shown in black and selfing populations (selfing rate 90%) in grey. The legend indicates the slope of the regression and the correlation coefficient r. The slope is expected to be close to one if the relatedness at causal loci is accurately predicted by the relatedness at observed SNPs.doi:10.1371/journal.pone.0066983.g004
Figure 5 highlights that when a larger number of loci is used to estimate the matrix of pairwise relatedness, heritability estimates become more accurate, most probably because the prediction of ΦA is improved. The linear model showed that the absolute value of the bias decreases significantly with the number of markers (F78,1 = 11.9; p = 0.001). If marker-based relatedness coefficients are truncated, the effect becomes non-significant (F78,1 = 0.7; p = 0.389) and a lower number of markers is required to get an accurate estimate of heritability. In selfing populations, there is no relationship between the bias and the number of marker used (F78,1 = 0.8; p = 0.371). 384 markers seem already sufficient to estimate heritability reasonably well. Surprisingly, the sampling error of the estimates did not decrease with the number of loci.
Figure 5. Heritability estimates become more accurate with the number of marker loci used to estimate relatedness.
Influence of the number of loci used to estimate pairwise relatedness coefficients (Loiselle coefficients) on the bias in heritability estimates, when using a marker-based animal model. Each dot stands for a simulated population of 500 individuals, with complete outcrossing (panel A, in black) or 90% selfing (panel B, in grey). Panel C shows the results when marker-based relatedness coefficients are truncated before the analysis. Large circles stands for the average heritability over the 20 replicated simulations. The confidence intervals estimated in Asreml for each replicate were averaged over the 20 replicates and are shown as solid lines. The dashed line stands for the simulated heritability.doi:10.1371/journal.pone.0066983.g005
Pedigree-free Methods to Estimate Quantitative Genetics Parameters in the Wild: What have we Learnt in Nearly 20 Years?
Reviewing empirical and simulation studies of quantitative genetics in wild populations using marker-based estimates of relatedness confirms that it is extremely difficult to derive reliable estimates for quantitative genetic parameters in wild populations using Ritland’s pairwise regression model, as suggested by several authors , , . Nevertheless, being a pioneer, the Ritland method played a significant role in stimulating the development of further marker-based methods to estimate quantitative genetics parameters in wild populations. Despite performing slightly better than the Ritland method, the relationship classes method (method 2) requires a known family structure with only two classes of relatedness and is therefore of restricted use . Methods 3 and 4 both use the statistical machinery of the animal model (mixed model) after using molecular markers to reconstruct the pedigree (method 3) or the relatedness matrix (method 4). But empirical and simulation results suggest that method 4 performs best. Finally, genomic selection (method 5) has become extremely popular among breeders, and even though their main purpose is to predict breeding values with the highest accuracy, some of these studies report estimates for additive genetic variance . Nevertheless, the lowest biases are found in studies using samples with family structure ,  and simulation studies seem more encouraging than actual empirical results (Tables 1 and 2). Besides, despite using a colossal number of SNPs for genomic selection (and not only those showing significant association with the phenotype – GWAS ), the SNPs still often explain only a small proportion of heritability (missing heritability) , , . Insufficient linkage disequilibrium between causal variants and genotyped SNPs and low minor allele frequency of causal variants might be involved . Currently, models combining pedigree and molecular markers are being developed to partition the additive genetic variance into genomic and “remaining polygenic” components , , . These results highlight that for traits with highly polygenic determinism, the classic infinitesimal model , , as implemented in the animal model, might still perform best .
Promising Prospects for Marker-based Animal Models?
Our review showed that animal models including marker-based relatedness matrices (method 4) offer promising prospects. Nevertheless, our simulations show that using molecular markers instead of pedigrees seriously worsens the estimation of quantitative genetics parameters in outcrossing populations, even if an increased number of loci and truncated relatedness coefficients improved the result. Conversely, our simulations in selfing populations suggest that pedigree-free methods are successful. We discuss several arguments that could help explain this contrast, in the light of our simulation results and the literature available about inbreeding.
Firstly, we sampled individuals from a single generation in our simulations (non-overlapping generations, i.e. annual populations). Such a sampling design constrains the variance in pairwise relatedness in outcrossing populations (no parent-offspring relatedness and a probability of 1/N to be full-sibs) but not so much in selfing populations (full-sibs from selfing events have ΦA >0.5). More generally, the variance in relatedness in a population is strongly affected by its size and mating system (see Figure 3, Figure S3 and ). Variances in relatedness are generally low in large outcrossing populations (see  and the column “variance in relatedness” in Table 1) but this constrains pedigree methods just as much as marker-based methods, and require collecting data over a large number of generations.
A more serious issue is that uncertainty in relatedness estimates is ignored when estimating heritability using this pedigree-free method and this could be a problem if uncertainty is high. Previous studies have shown that marker-based estimates of inbreeding coefficients are improved by non-random mating because it generates identity disequilibrium . We observe the same thing here for pairwise relatednesses (i.e. the inbreeding coefficient of a hypothetical offspring) (Figure 3 and 4). In outcrossing populations, it remains that a very large set of markers is required to estimate the ‘realised relatedness’ without too much uncertainty ,  and to capture the variance within relatedness classes more accurately than when using the pedigree (ΦA) , , , .
It is worth pointing out that in our simulations, we used the complete pedigree (no missing individual), exempt from errors. Even in domestic animals, pedigree errors have been estimated to range between 1% and 10% . In wild populations, the requirement of large sampling efforts and the occurrence of extra-pair paternity will inevitably scale down the pedigree information available for the analysis and might favour the use of alternative methods for a wide range of populations.
Finally, our simulations do not consider non-genetic sources of resemblance between relatives (common environment, maternal effects…). Such non-genetic effects are expected to overestimate heritability quite markedly . Nevertheless, without transplantation or cross-fostering experiment, these effects are difficult to account for with any of the methods currently available (including pedigree methods, see ), in particular if there are few different classes of relatedness in the dataset .
Promising Prospects for Selfing Populations
Stebbins  suggested that extreme selfing is an “evolutionary dead-end”, because it reduces genetic diversity within populations and may thereby lower their adaptive potential compared to outcrossing populations. Until recently, accurate comparisons of the levels of additive variance between selfers and outcossers were impaired by methodology because most studies on quantitative genetic variation in plants involved analyses of variance between and within families , and the pollinations required to derive paternal families were rarely performed (e.g. in 13 out of 37 studies reported by Charlesworth & Charlesworth, ), particularly in selfing species. Animal models coupled with marker-based relatedness information offer an inclusive, conceptually simple and flexible framework to quantify additive variance in plant and animal populations by taking advantage of the recombined genotypes produced by rare outcrossing events in their wild environment .
Yet, in selfing populations, inbreeding generates additional variance components specifically associated with dominance effects of alleles when autozygous (identical by descent) (detailed in equation (4) in Shaw et al. ). The covariance between the genotypic values of two individuals X and Y becomes , :(2)
where VA and VD are the additive and dominance variances, cov(A,D) is the covariance between the additive effect of alleles and their autozygous dominance deviation, VDI is the total variance due to autozygous dominance effects, H* is the inbreeding depression and VAA the variance due to additive-additive epistatic effects (other epistatic variances are neglected here). Each variance component is preceded by a probability measure (), a function of the identity of alleles by descent (described in further details by Cockerham  and Harris ). This equation highlights the complexity of quantitative genetics in partially inbred populations. In our simulations, we only focused on VA and neglected the effect of directional dominance because we were interested in methods providing comparable estimates of additive variance in selfing and outcrossing populations. These additional variance components could be included in the mixed model (y = Za +…+ e), keeping in mind that their estimation is not straightforward without a complex experimental design . Besides, dominance effects due to heterozygote effects (dij in Harris ) are expected to contribute little to genetic variance in highly selfing populations where homozygosity is very high . Therefore, neglecting VD in selfing populations is not as inaccurate as in outcrossing populations, where it is rarely examined. Theoretical models also predict that inbreeding depression resulting from deleterious recessive alleles should be purged with selfing , . Some empirical data support this prediction , , but the opposite has also been reported , in agreement with the prediction that mildly deleterious mutations could accumulate in selfing populations . Finally, with partial selfing the estimate of VA using the animal model can be inflated by variance due to additive-additive epistasis, directly proportional to ΦA2. Importantly, this is also true for outcrossing species, even if we expect lower effects . We are therefore confident that additive variance estimated using marker-based animal models in selfing populations should be comparable with estimates in outcrossing populations, in spite of inbreeding.
It remains that such estimates of additive variance may not be sufficient to predict the response to selection (R) when there is directional dominance and epistasis. Simulations suggest that evolution in partially selfing populations can strongly differ from the predictions obtained using the breeders’ equation (R = h2 S where S is the selection differential), even if the later accounts for inbreeding depression . Methods have been suggested to extend the breeder’s equation to selfing populations –, but predictions are difficult when all individuals do not share the same level of inbreeding, as expected in most natural populations.
Our literature review highlighted that more testing is required for the most promising marker-based methods: animal models including a marker-based matrix of relatedness and genomic selection. Our simulations of the animal model showed that estimates in selfing populations are as accurate when using molecular markers or pedigrees, thanks to their high identity disequilibrium. It is undeniable that a very large set of molecular markers is required in large random mating populations, but recent advances in next generation sequencing technologies provide encouraging prospects, even for non-model species , . More generally, populations with high identity disequilibrium (consanguineous or bottlenecked populations) could promote the use of marker-based animal models, but at the same time are more likely to deviate from the standard assumptions of quantitative genetics models (e.g. non-additive variance).
Correlation between heritability estimates obtained using pedigree-based animal models or one of the five marker-based method. Each dot stands for an empirical result and the colour indicates the method (Ritland in black; relatedness classes in grey; pedigree reconstruction in pink; animal model in green and genomic selection in blue). The dashed lines represent y = x.
Slight increase in the bias with lower heritabilities or lower number of markers. The panels A and B show the relationship between the bias in heritability estimate and the value of heritability for the empirical data (in A – bias expressed as marker - pedigree) or the simulation data (in B – bias expressed as E( – h2)). The panels C and D show the relationship between the bias in heritability estimate and the number of microsatellites or SNPs used. As explained in the text, the bias did not systematically increase for traits with low heritabilities but was more variable. Surprisingly, we found no overall relationship between the bias and the number of markers used in empirical data.
Influence of population size and mating regime on the variance in pairwise relatedness in a population. Selfing could improve marker-based estimation of heritability because it affects the structure and the variance of pairwise relatedness, as has been shown for the inbreeding coefficient. We assessed the effect of the reduced effective population size in selfing populations (Ne = N/(1+ F)) by simulating populations with varying census size (N = 50, 100, 250, 500) and different mating regimes (outcrossing in black and 90% selfing in grey). Error bars stand for the standard error estimated from 10 replicated simulations. This figure confirms that pairwise relatedness coefficients have a higher mean and a larger variance in selfing compared to outcrossing populations and that this effect extends beyond the simple influence of population size (e.g. large excess of variance in a selfed population of N = 50 compared to an outcrossed population of N = 100). The identity disequilibrium created by selfing might explain such higher variance in relatedness.
Full data literature review.
Simulation program in C++.
The authors thank M. Morrissey for valuable discussions and for sharing his experience about the use of animal models with marker-based relatedness matrices. We are also thankful to S. De Mita for helping solve problems with C++ or python code. M.H. Muller, A. Charmantier and two anonymous reviewers made useful comments on previous versions of this manuscript.
Conceived and designed the experiments: LG MS JR. Performed the experiments: LG MS. Analyzed the data: LG. Contributed reagents/materials/analysis tools: LG MS JR. Wrote the paper: LG MS JR.
- 1. Falconer DS, Mackay TFC (1996) Introduction to Quantitative Genetics. Harlow, Essex, UK: Longmans Green.
- 2. Brookfield JFY (2009) Evolution and evolvability: celebrating Darwin 200. Biology Letters 5: 44–46. doi: 10.1098/rsbl.2008.0639
- 3. Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sunderland, MA: Sinauer.
- 4. Visscher PM (2009) Whole genome approaches to quantitative genetics. Genetica 136: 351–358. doi: 10.1007/s10709-008-9301-7
- 5. Hill WG (1993) Variation in genetic identity within kinships. Heredity 71: 652–653.
- 6. Cockerham CC, Weir BS (1983) Variance of actual inbreeding. Theoretical Population Biology 23: 85–109. doi: 10.1016/0040-5809(83)90006-0
- 7. Visscher PM, Medland SE, Ferreira MAR, Morley KI, Zhu G, et al. (2006) Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLOS Genetics 2: 316–325.
- 8. Guo S-W (1998) Variation in genetic identity among relatives. Human Heredity 46: 61–70. doi: 10.1159/000154328
- 9. Wright S (1922) Coefficients of inbreeding and relationship. The American Naturalist 56: 330–338. doi: 10.1086/279872
- 10. Nussey DH, Wilson AJ, Brommer JE (2007) The evolutionary ecology of individual phenotypic plasticity in wild populations. Journal of Evolutionary Biology 20: 831–844. doi: 10.1111/j.1420-9101.2007.01300.x
- 11. Kruuk LEB, Hill WG (2008) Introduction. Evolutionary dynamics of wild populations: the use of long-term pedigree data. Proceedings of the Royal Society B: Biological Sciences 275: 593–596. doi: 10.1086/279872
- 12. Kruuk LEB (2004) Estimating genetic parameters in natural populations using the 'Animal Model'. Philosophical Transactions: Biological Sciences 359: 873–890. doi: 10.1098/rstb.2003.1437
- 13. Blouin MS (2003) DNA-based methods for pedigree reconstruction and kinship analysis in natural populations. Trends in Ecology & Evolution 18: 503–511. doi: 10.1016/s0169-5347(03)00225-8
- 14. Powell JE, Visscher PM, Goddard ME (2010) Reconciling the analysis of IBD and IBS in complex trait studies. Nature Review Genetics 11: 800–805. doi: 10.1016/s0169-5347(03)00225-8
- 15. Queller DC, Goodnight KF (1989) Estimating relatedness using molecular markers. Evolution 43: 258–275. doi: 10.1016/s0169-5347(03)00225-8
- 16. Lynch M, Ritland K (1999) Estimation of pairwise relatedness with molecular markers. Genetics 152: 1753–1766. doi: 10.1016/s0169-5347(03)00225-8
- 17. Wang J (2002) An estimator for pairwise relatedness using molecular markers. Genetics 160: 1203–1215. doi: 10.1016/s0169-5347(03)00225-8
- 18. Li CC, Weeks DE, Chakravarti A (1993) Similarity of DNA fingerprints due to chance and relatedness. Human Heredity 43: 45–52. doi: 10.1159/000154113
- 19. Hardy OJ, Vekemans X (1999) Isolation by distance in a continuous population: reconciliation between spatial autocorrelation analysis and population genetics models. Heredity 83: 145–154. doi: 10.1046/j.1365-2540.1999.00558.x
- 20. Loiselle BA, Sork VL, Nason J, Graham C (1995) Spatial genetic structure of a tropical understory shrub, Psychotria officinalis (Rubiaceae). American Journal of Botany 82: 1420–1425. doi: 10.2307/2445869
- 21. Ritland K (1996) Estimators for pairwise relatedness and individual inbreeding coefficients. Genetical Research 67: 175–185. doi: 10.2307/2445869
- 22. Van De Casteele T, Galbusera P, Matthysen E (2001) A comparison of microsatellite-based pairwise relatedness estimators. Molecular Ecology 10: 1539–1549. doi: 10.1046/j.1365-294X.2001.01288.x
- 23. Csillery K, Johnson T, Beraldi D, Clutton-Brock T, Coltman D, et al. (2006) Performance of marker-based relatedness estimators in natural populations of outbred vertebrates. Genetics 173: 2091–2101. doi: 10.1534/genetics.106.057331
- 24. Vekemans X, Hardy OJ (2004) New insights from fine-scale spatial genetic structure analyses in plant populations. Molecular Ecology 13: 921–935. doi: 10.1046/j.1365-294X.2004.02076.x
- 25. Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nature Genetics 42: 565–569. doi: 10.1038/ng.608
- 26. Garant D, Kruuk LEB (2005) How to use molecular marker data to measure evolutionary parameters in wild populations. Molecular Ecology 14: 1843–1859. doi: 10.1111/j.1365-294X.2005.02561.x
- 27. Sillanpää MJ (2011) On statistical methods for estimating heritability in wild populations. Molecular Ecology 20: 1324–1332. doi: 10.1111/j.1365-294X.2011.05021.x
- 28. Visscher PM, Hill WG, Wray NR (2008) Heritability in the genomics era - concepts and misconceptions. Nature Review Genetics 9: 255–266. doi: 10.1038/nrg2322
- 29. Kruuk LEB, Slate J, Wilson AJ (2008) New answers for old questions: The evolutionary quantitative genetics of wild animal populations. Annual Review of Ecology, Evolution, and Systematics 39: 525–548. doi: 10.1038/nrg2322
- 30. Stebbins GL (1957) Self fertilization and population variability in the higher plants. American Naturalist 91: 337–354. doi: 10.1038/nrg2322
- 31. Takebayashi N, Morrell PL (2001) Is self-fertilization an evolutionary dead end? Revisiting an old hypothesis with genetic theories and a macroevolutionary approach. American Journal of Botany 88: 1143–1150. doi: 10.2307/3558325
- 32. Igic B, Kohn JR (2006) The distribution of plant mating systems: study bias against obligately outcrossing species. Evolution 60: 1098–1103. doi: 10.1038/nrg2322
- 33. Vogler DW, Kalisz S (2001) Sex among the flowers: The distribution of plant mating systems. Evolution 55: 202–204.
- 34. Jarne P, Auld JR (2006) Animals mix it up too: The distribution of self-fertilization among hermaphroditic animals. Evolution 60: 1816–1824.
- 35. Charlesworth D (2003) Effects of inbreeding on the genetic diversity of populations. Philosophical Transactions of the Royal Society of London Series B-Biological Sciences 358: 1051–1070.
- 36. Hamrick JL, Godt MJW (1997) Allozyme diversity in cultivated crops. Crop Science 37: 26–30.
- 37. Glemin S, Bazin E, Charlesworth D (2006) Impact of mating systems on patterns of sequence polymorphism in flowering plants. Proceedings of the Royal Society B-Biological Sciences 273: 3011–3019.
- 38. Pollak E (1987) On the theory of partially inbreeding finite populations.1. Partial selfing. Genetics 117: 353–360.
- 39. Hedrick PW (1980) Hitchhiking - a comparison of linkage and partial selfing. Genetics 94: 791–808.
- 40. Charlesworth B, Morgan MT, Charlesworth D (1993) The effect of deleterious mutations on neutral molecular variation. Genetics 134: 1289–1303.
- 41. Schoen DJ, Brown AHD (1991) Intraspecific variation in population gene diversity and effective population size correlates with the mating system in plants. Proceedings of the National Academy of Sciences of the United States of America 88: 4494–4497. doi: 10.1073/pnas.88.10.4494
- 42. Crow J, Kimura M (1970) Introduction to Theoretical Population Genetics. New York: Harper and Row.
- 43. Charlesworth D, Charlesworth B (1995) Quantitative genetics in plants - the effect of the breeding system on genetic variability. Evolution 49: 911–920. doi: 10.2307/2410413
- 44. Van Buskirk J, Willi Y (2006) The change in quantitative genetic variation with inbreeding. Evolution 60: 2428–2434. doi: 10.1554/06-485.1
- 45. Barton NH, Turelli M (2004) Effects of genetic drift on variance components under a general model of epistasis. Evolution 58: 2111–2132. doi: 10.2307/2410413
- 46. Geber MA, Griffen LR (2003) Inheritance and natural selection on functional traits. International Journal of Plant Sciences 164: S21–S42. doi: 10.2307/2410413
- 47. Weir BS, Cockerham CC (1973) Mixed self and random mating at two loci. Genetical Research 21: 247–262. doi: 10.1017/S0016672300013446
- 48. Szulkin M, Bierne N, David P (2010) Heterozygosity-fitness correlations: a time for reappraisal. Evolution 64: 1202–1217. doi: 10.1111/j.1558-5646.2010.00966.x
- 49. Reid JM, Keller LF (2010) Correlated inbreeding among relatives: occurence, magniture and implications. Evolution 64: 973–985. doi: 10.1111/j.1558-5646.2009.00865.x
- 50. Ritland K (2000) Detecting Inheritance with Inferred Relatedness in Nature. Adaptive genetic variation in the wild. USA: Oxford University Press.
- 51. Mousseau TA, Ritland K, Heath DD (1998) A novel method for estimating heritability using molecular markers. Heredity 80: 218–224. doi: 10.1046/j.1365-2540.1998.00269.x
- 52. Thomas SC, Hill WG (2000) Estimating quantitative genetic parameters using sibships reconstructed from marker data. Genetics 155: 1961–1972. doi: 10.1046/j.1365-2540.1998.00269.x
- 53. Jones AG, Small CM, Paczolt KA, Ratterman NL (2010) A practical guide to methods of parentage analysis. Molecular Ecology Resources 10: 6–30. doi: 10.1111/j.1755-0998.2009.02778.x
- 54. Milner JM, Pemberton JM, Brotherstone S, Albon SD (2000) Estimating variance components and heritabilities in the wild: a case study using the ‘animal model’ approach. Journal of Evolutionary Biology 13: 804–813. doi: 10.1046/j.1420-9101.2000.00222.x
- 55. Henderson CR (1984) Applications of Linear Models in Animal Breeding. Guelph, Canada: University of Guelph.
- 56. Goddard M (2009) Genomic selection: prediction of accuracy and maximisation of long term response. Genetica 136: 245–257. doi: 10.1007/s10709-008-9308-0
- 57. Strandén I, Garrick DJ (2009) Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit. Journal of Dairy Science 92: 2971–2975. doi: 10.3168/jds.2008-1929
- 58. Crossa J, Campos Gdl, Perez P, Gianola D, Burgueno J, et al. (2010) Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186: 713–724. doi: 10.1534/genetics.110.118521
- 59. Houle D (1992) Comparing evolvability and variability of quantitative traits. Genetics 130: 195–204.
- 60. Hansen T, Pélabon C, Houle D (2011) Heritability is not evolvability. Evolutionary Biology 38: 258–277.
- 61. Ewens WJ (2004) Mathematical Population Genetics. New York: Springer-Verlag.
- 62. Hardy OJ (2003) Estimation of pairwise relatedness between individuals and characterization of isolation-by-distance processes using dominant genetic markers. Molecular Ecology 12: 1577–1588. doi: 10.1046/j.1365-294X.2003.01835.x
- 63. Nordborg M, Donnelly P (1997) The coalescent process with selfing. Genetics 146: 1185–1195.
- 64. Gilmore AR, Gogel BJ, Cullis BR, Thompson R (2006) Asreml User Guide Release 2.0. Hemel Hempsted: VSN International Ltd.
- 65. Frentiu FD, Clegg SM, Chittock J, Burke T, Blows MW, et al. (2008) Pedigree-free animal models: the relatedness matrix reloaded. Proceedings of the Royal Society B: Biological Sciences 275: 639–647.
- 66. Pemberton JM (2008) Wild pedigrees: the way forward. Proceedings of the Royal Society B: Biological Sciences 275: 613–621.
- 67. Lee S, Goddard M, Visscher P, van der Werf J (2010) Using the realized relationship matrix to disentangle confounding factors for the estimation of genetic variance components of complex traits. Genetics Selection Evolution 42: 22.
- 68. Jensen J, Su G, Madsen P (2012) Partitioning additive genetic variance into genomic and remaining polygenic components for complex traits in dairy cattle. BMC Genetics 13: 44. doi: 10.1186/1471-2156-13-44
- 69. Hill WG (2012 ) Quantitative genetics in the genomics era. Current genomics 13: 196–206. doi: 10.2174/138920212800543110
- 70. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753. doi: 10.1038/nature08494
- 71. Maher B (2008) The case of the missing heritability. Nature 456: 18–21. doi: 10.1038/456018a
- 72. Christensen O, Lund M (2010) Genomic prediction when some animals are not genotyped. Genetics Selection Evolution 42: 2. doi: 10.1186/1297-9686-42-2
- 73. Legarra A, Robert-Granié C, Manfredi E, Elsen J-M (2008) Performance of genomic selection in mice. Genetics 180: 611–618. doi: 10.1534/genetics.108.088575
- 74. Bulmer MG (1980) The mathematical theory of quantitative genetics. Oxford, UK: Oxford University Press.
- 75. Fisher RA (1918) The correlation between relatives on the supposition of mendelian inheritance. Transactions of the Royal Society of Edinburgh: Earth Sciences 52: 399–433. doi: 10.1017/s0080456800012163
- 76. Leroy G, Danchin-Burge C, Palhiere I, Baumung R, Fritz S, et al. (2012) An ABC estimate of pedigree error rate: application in dog, sheep and cattle breeds. Animal Genetics 43: 309–314. doi: 10.1111/j.1365-2052.2011.02253.x
- 77. Stopher KV, Walling CA, Morris A, Guinness FE, Clutton-Brock TH, et al. (2012) Shared spatial effects on quantitative genetic parameters: accounting for spatial autocorrelation and home range overlap reduces estimates of heritability in red deer. Evolution 66: 2411–2426. doi: 10.1111/j.1558-5646.2012.01620.x
- 78. Kruuk LEB, Hadfield JD (2007) How to separate genetic and environmental causes of similarity between relatives. Journal of Evolutionary Biology 20: 1890–1903. doi: 10.1111/j.1420-9101.2007.01377.x
- 79. Thomas SC, Coltman DW, Pemberton JM (2002) The use of marker-based relationship information to estimate the heritability of body weight in a natural population: a cautionary tale. Journal of Evolutionary Biology 15: 92–99. doi: 10.1046/j.1420-9101.2002.00372.x
- 80. Holland JB, Nyquist WE, Cervantes-Martínez CT (2010) Estimating and Interpreting Heritability for Plant Breeding: An Update. Plant Breeding Reviews: John Wiley & Sons, Inc. pp. 9–112.
- 81. Shaw RG, Byers DL, Shaw FH (1998) Genetic components of variation in Nemophila menziesii undergoing inbreeding: Morphology and flowering time. Genetics 150: 1649–1661. doi: 10.1046/j.1420-9101.2002.00372.x
- 82. Harris DL (1964) Genotypic covariances between inbred relatives. Genetics 50: 1319–1348. doi: 10.1046/j.1420-9101.2002.00372.x
- 83. Cockerham CC (1971) Higher order probability functions of identity of alleles by descent. Genetics 69: 235–246.
- 84. Goldringer I, Brabant P, Gallais A (1997) Estimation of additive and epistatic genetic variances for agronomic traits in a population of doubled-haploid lines of wheat. Heredity 79: 60–71.
- 85. Lande R, Schemske DW (1985) The evolution of self- fertilization and inbreeding depression in plants. 1. Genetics models. Evolution 39: 24–40.
- 86. Wright S (1969) Evolution and the Genetics of Populations. Vol. II. The Theory of Gene Frequencies. Chicago: University of Chicago Press.
- 87. Charlesworth D, Willis JH (2009) The genetics of inbreeding depression. Nature Reviews Genetics 10: 783–796.
- 88. Husband BC, Schemske DW (1996) Evolution of the magnitude and timing of inbreeding depression in plants. Evolution 50: 54–70. doi: 10.2307/2410780
- 89. Byers DL, Waller DM (1999) Do plant populations purge their genetic load? Effects of population size and mating history on inbreeding depression. Annual Review of Ecology and Systematics 30: 479–513. doi: 10.2307/2410780
- 90. Glémin S, Ronfort J, Bataillon T (2003) Patterns of inbreeding depression and architecture of the load in subdivided populations. Genetics 165: 2193–2212. doi: 10.2307/2410780
- 91. Holland JB (2010) Epistasis and Plant Breeding. Plant Breeding Reviews: John Wiley & Sons, Inc. pp. 27–92.
- 92. Kelly JK, Williamson S (2000) Predicting response to selection on a quantitative trait: A comparison between models for mixed-mating populations. Journal of Theoretical Biology 207: 37–56. doi: 10.1006/jtbi.2000.2154
- 93. Pederson DG (1969) The prediction of selection response in a self-fertilizing species. Australian Journal of Biological Sciences 22: 117–130.
- 94. Cockerham CC, Matzinger DF (1985) Selection response based on selfed progenies. Crop Science 25: 483–488.
- 95. Wright, A J, Cockerham, C C (1985) Selection with partial selfing. I: Mass selection. Bethesda, MD, USA: Genetics Society of America.
- 96. Kelly JK (1999) Response to selection in partially self-fertilizing populations. I. Selection on a single trait. Evolution 53: 336–349.
- 97. Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE (2012) Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non-model species. Plos One 7.
- 98. Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, et al.. (2008) Rapid SNP discovery and genetic mapping using sequenced RAD markers. Plos One 3.
- 99. Coltman DW (2005) Testing marker-based estimates of heritability in the wild. Molecular Ecology 14: 2593–2599. doi: 10.1111/j.1365-294X.2005.02600.x
- 100. Ritland K, Ritland C (1996) Inferences about quantitative inheritance based on natural population structure in the yellow monkeyflower, Mimulus guttatus. Evolution 50: 1074–1082. doi: 10.2307/2410648
- 101. van Kleunen M, Ritland K (2005) Estimating heritabilities and genetic correlations with marker-based methods: An experimental test in Mimulus guttatus. Journal of Heredity 96: 368–375. doi: 10.1093/jhered/esi039
- 102. Klaper R, Ritland K, Mousseau TA, Hunter MD (2001) Heritability of phenolics in Quercus laevis inferred using molecular markers. Journal of Heredity 92: 421–426. doi: 10.1093/jhered/92.5.421
- 103. Bouvet JM, Kelly B, Sanou H, Allal F (2008) Comparison of marker- and pedigree-based methods for estimating heritability in an agroforestry population of Vitellaria paradoxa CF Gaertn. (shea tree). Genetic Resources and Crop Evolution 55: 1291–1301.
- 104. Doran JC, Matheson AC (1994) Genetic parameters and expected gains from selection for monoterpene yields in Petford Eucalyptus camaldulensis. New Forests 8: 155–167.
- 105. Andrew RL, Peakall R, Wallis IR, Wood JT, Knight EJ, et al. (2005) Marker-based quantitative genetics in the wild?: The heritability and genetic correlation of chemical defenses in eucalyptus. Genetics 171: 1989–1998. doi: 10.1534/genetics.105.042952
- 106. Bessega C, Saidman BO, Darquier MR, Ewens M, Sanchez L, et al. (2009) Consistency between marker- and genealogy-based heritability estimates in an experimental stand of Prosopis alba (Leguminosae). American Journal of Botany 96: 458–465. doi: 10.3732/ajb.0800074
- 107. Kumar S, Richardson TE (2005) Inferring relatedness and heritability using molecular markers in radiata pine. Molecular Breeding 15: 55–64.
- 108. Shikano T (2008) Estimation of quantitative genetic parameters using marker-inferred relatedness in Japanese flounder: A case study of upward bias. Journal of Heredity 99: 94–104. doi: 10.1093/jhered/esm105
- 109. Wilson AJ, McDonald G, Moghadam HK, Herbinger CM, Ferguson MM (2003) Marker-assisted estimation of quantitative genetic parameters in rainbow trout, Oncorhynchus mykiss. Genetics Research 81: 145–156. doi: 10.1017/s0016672302006055
- 110. Heath DD, Devlin RH, Heath JW, Iwama GK (1994) Genetic, environmental and interaction effects on the incidence of jacking in Oncorhynchus tshawytscha (chinook salmon). Heredity 72: 146–154. doi: 10.1017/s0016672302006055
- 111. Blonk RJW, Komen H, Kamstra A, van Arendonk JAM (2010) Estimating breeding values with molecular relatedness and reconstructed pedigrees in natural mating populations of common sole, Solea solea. Genetics 184: 213–219. doi: 10.1534/genetics.109.110536
- 112. Hayes BJ, Goddard ME (2008) Technical note: Prediction of breeding values using marker-derived relationship matrices. Journal of Animal Sciences 86: 2089–2092. doi: 10.1017/s0016672302006055
- 113. Anderson TJC, Williams JT, Nair S, Sudimack D, Barends M, et al. (2010) Inferred relatedness and heritability in malaria parasites. Proceedings of the Royal Society B: Biological Sciences 277: 2531–2540. doi: 10.1098/rspb.2010.0196
- 114. Veerkamp RF, Mulder HA, Thompson R, Calus MPL (2011) Genomic and pedigree-based genetic parameters for scarcely recorded traits when some animals are genotyped. Journal of Dairy Science 94: 4189–4197. doi: 10.3168/jds.2011-4223
- 115. Rolf M, Taylor J, Schnabel R, McKay S, McClure M, et al. (2010) Impact of reduced marker set estimation of genomic relationship matrices on genomic selection for feed efficiency in Angus cattle. BMC Genetics 11: 24. doi: 10.1186/1471-2156-11-24
- 116. Usai MG, Goddard ME, Hayes BJ (2009) LASSO with cross-validation for genomic selection. Genetics Research 91: 427–436. doi: 10.1017/S0016672309990334
- 117. Goddard ME, Hayes BJ, Meuwissen THE (2011) Using the genomic relationship matrix to predict the accuracy of genomic selection. Journal of Animal Breeding and Genetics 128: 409–421. doi: 10.1111/j.1439-0388.2011.00964.x
- 118. Toro M, Barragán C, Óvilo C, Rodrigañez J, Rodriguez C, et al. (2002) Estimation of coancestry in Iberian pigs using molecular markers. Conservation Genetics 3: 309–320. doi: 10.1023/a:1019921131171
- 119. VanRaden PM (2008) Efficient methods to compute genomic predictions. Journal of Dairy Science 91: 4414–4423. doi: 10.3168/jds.2007-0980
- 120. Oliehoek PA, Windig JJ, van Arendonk JAM, Bijma P (2006) Estimating relatedness between individuals in general populations with a focus on their use in conservation programs. Genetics 173: 483–496. doi: 10.1534/genetics.105.049940
- 121. Smith BR, Herbinger CM, Merry HR (2001) Accurate partition of individuals into full-sib families from genetic data without parental information. Genetics 158: 1329–1338. doi: 10.1023/a:1019921131171
- 122. Marshall TC, Slate J, Kruuk LEB, Pemberton JM (1998) Statistical confidence for likelihood-based paternity inference in natural populations. Molecular Ecology 7: 639–655. doi: 10.1046/j.1365-294x.1998.00374.x
- 123. Wang J (2004) Sibship reconstruction from genetic data with typing errors. Genetics 166: 1963–1979. doi: 10.1534/genetics.166.4.1963
- 124. Duchesne P, Godbout M-H, Bernatchez L (2002) PAPA (package for the analysis of parental allocation): a computer program for simulated and real parental allocation. Molecular Ecology Notes 2: 191–193. doi: 10.1046/j.1471-8286.2002.00164.x
- 125. Ritland K (1996) A marker-based method for inferences about quantitative inheritance in natural populations. Evolution 50: 1062–1073. doi: 10.1046/j.1471-8286.2002.00164.x
- 126. Thomas SC, Pemberton JM, Hill WG (2000) Estimating variance components in natural populations using inferred relationships. Heredity 84: 427–436. doi: 10.1046/j.1365-2540.2000.00681.x
- 127. Rodriguez-Ramilo ST, Toro MA, Caballero A, Fernandez J (2007) The accuracy of a heritability estimator using molecular information. Conservation Genetics 8: 1189–1198. doi: 10.1046/j.1471-8286.2002.00164.x
- 128. DiBattista JD, Feldheim KA, Garant D, Gruber SH, Hendry AP (2009) Evolutionary potential of a large marine vertebrate: quantitative genetic parameters in a wild population. Evolution 63: 1051–1067. doi: 10.1111/j.1558-5646.2008.00605.x
- 129. Yu J, Zhang Z, Zhu C, Tabanao DA, Pressoir G, et al. (2009) Simulation appraisal of the adequacy of number of background markers for relationship estimation in association mapping. The Plant Genome 2: 63–77. doi: 10.3835/plantgenome2008.09.0009
- 130. Nejati-Javaremi A, Smith C, Gibson JP (1997) Effect of total allelic relationship on accuracy of evaluation and response to selection. Journal of Animal Science 75: 1738–1745. doi: 10.3835/plantgenome2008.09.0009
- 131. Villanueva B, Pong-Wong R, Fernández J, Toro MA (2005) Benefits from marker-assisted selection under an additive polygenic genetic model. Journal of Animal Science 83: 1747–1752. doi: 10.3835/plantgenome2008.09.0009
- 132. Meuwissen THE, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157: 1819–1829. doi: 10.3835/plantgenome2008.09.0009
- 133. Calus MPL, Veerkamp RF (2007) Accuracy of breeding values when using and ignoring the polygenic effect in genomic breeding value estimation with a marker density of one SNP per cM. Journal of Animal Breeding and Genetics 124: 362–368. doi: 10.1111/j.1439-0388.2007.00691.x
- 134. Zhang Z, Liu J, Ding X, Bijma P, de Koning D-J, et al. (2010) Best linear unbiased prediction of genomic breeding values using a trait-specific marker-derived relationship matrix. PLoS ONE 5: e12648. doi: 10.1371/journal.pone.0012648
- 135. Sørensen A, Pong-Wong R, Windig J, Woolliams J (2002) Precision of methods for calculating identity-by-descent matrices using multiple markers. Genetics Selection Evolution 34: 1–23. doi: 10.1186/1297-9686-34-5-557