Independent genome-wide association studies (GWAS) showed an obesogenic effect of two single nucleotide polymorphisms (SNP; rs12970134 and rs17782313) more than 150 kb downstream of the melanocortin 4 receptor gene (MC4R). It is unclear if the SNPs directly influence MC4R function or expression, or if the SNPs are on a haplotype that predisposes to obesity or includes functionally relevant genetic variation (synthetic association). As both exist, functionally relevant mutations and polymorphisms in the MC4R coding region and a robust association downstream of the gene, MC4R is an ideal model to explore synthetic association.
We analyzed a genomic region (364.9 kb) encompassing the MC4R in GWAS data of 424 obesity trios (extremely obese child/adolescent and both parents). SNP rs12970134 showed the lowest p-value (p = 0.004; relative risk for the obesity effect allele: 1.37); conditional analyses on this SNP revealed that 7 of 78 analyzed SNPs provided independent signals (p≤0.05). These 8 SNPs were used to derive two-marker haplotypes. The three best (according to p-value) haplotype combinations were chosen for confirmation in 363 independent obesity trios. The confirmed obesity effect haplotype includes SNPs 3′ and 5′ of the MC4R. Including MC4R coding variants in a joint model had almost no impact on the effect size estimators expected under synthetic association.
A haplotype reaching from a region 5′ of the MC4R to a region at least 150 kb from the 3′ end of the gene showed a stronger association to obesity than single SNPs. Synthetic association analyses revealed that MC4R coding variants had almost no impact on the association signal. Carriers of the haplotype should be enriched for relevant mutations outside the MC4R coding region and could thus be used for re-sequencing approaches. Our data also underscore the problems underlying the identification of relevant mutations depicted by GWAS derived SNPs.
Citation: Scherag A, Jarick I, Grothe J, Biebermann H, Scherag S, et al. (2010) Investigation of a Genome Wide Association Signal for Obesity: Synthetic Association and Haplotype Analyses at the Melanocortin 4 Receptor Gene Locus. PLoS ONE 5(11): e13967. doi:10.1371/journal.pone.0013967
Editor: Thorkild I. A. Sorensen, Institute of Preventive Medicine, Denmark
Received: July 16, 2010; Accepted: October 24, 2010; Published: November 15, 2010
Copyright: © 2010 Scherag et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by grants from the Bundesministerium fuer Bildung und Forschung (01GS0903; 01KU0903; NGFNplus 01GS0820, 01GS0830, 01GS0825), the Deutsche Forschungsgemeinschaft (HE 1446/4-1) and the European Union (FP6 LSHMCT-2003-503041). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Alleles of common single nucleotide polymorphisms (SNPs) rs12970134 and rs17782313 located downstream of the MC4R (154 kb and 188 kb, respectively) were shown to be associated with obesity and related traits , .
SNP rs17782313 had been identified in a large-scale international cooperation encompassing more than 90,000 individuals . Initially, GWAS data obtained in seven studies (16,876 Europeans) had been meta-analyzed. The second strongest association signal mapped 188 kb downstream of the MC4R. This location of rs17782313 indicates that its relevance for weight regulation might be mediated via effects on MC4R expression. The association result was confirmed in independent samples of a total of 60,352 adults and 5,988 children. Amongst the adults, each copy of the rs17782313 effect (C) allele was associated with a change in BMI of ~0.22 kg/m2. A copy of the allele resulted in an odds ratio of 1.08 and 1.12 for overweight and obesity, respectively .
This small effect size was confirmed by three subsequent GWAS meta-analyses for human obesity genes –. All of these GWAS picked up rs17782313 as a polygenic variant for obesity; a total of more than 150,000 individuals was analyzed , –.
Additionally, a GWAS for insulin resistance and related phenotypes in 2,684 Indian Asians (including subsequent analyses in 11,955 individuals of Indian Asian or European ancestry) detected an association of rs12970134 located 154 kb downstream of MC4R with waist circumference (p = 1.7×10−9). The effect was also present in normal weight individuals. Hence, it was suggested that the association of rs12970134 with insulin resistance is partially independent of obesity . Association of rs12970134 with BMI was also found in the whole study group (N = 11,955; p = 6.4×10−5; 0.25 kg/m2, 95% CI 0.13–0.38 kg/m2); in the 4,561 Europeans alone the effect size was similar (0.28 kg/m2, 95% CI 0.05–0.51 kg/m2).
Currently, it is unclear if and how the association signals of the two common non-coding GWAS-derived SNPs (rs17782313 and rs12970134, with pairwise linkage disequilibrium of D′ = 0.92 and r2 = 0.75 according to Ensembl version 57; CEU; Genome Assembly GRCh37) 154 and 188 kb of the 3′ end of the MC4R relate to the receptor gene.
On the other hand, approximately 130 mutations have been detected in the human MC4R gene; those leading to a reduced function result in a dominantly inherited form of obesity . 2–6% of extremely obese children and adolescents harbour such mutations. Most of these lead to a total or partial loss of function as shown by in vitro assays . Quantitative trait analyses in pedigrees revealed that individuals with these mutations had a significantly higher current BMI (4.5 and 9.5 kg/m2 in males and females, respectively) than their relatives without the mutations . Phenotypical effects of MC4R mutations other than obesity have been shown to encompass hyperinsulinemia, elevated growth rates and higher bone density .
In addition to these rare mutations, the minor alleles of two MC4R non-synonymous polymorphisms (Val103Ile, Ile251Leu) have convincingly been shown to be protective against obesity –. Heterozygotes for the 103Ile variant of Val103Ile were found in 2–9% of subjects in different populations. An effect size estimate of −0.48 kg/m2 for Ile103 carriers was calculated  and subsequently confirmed in an extended single large epidemiological study of approximately 8,000 individuals . Additional meta-analyses encompassing a total of 53,343 individuals again confirmed the initial finding , . In a GWAS with a focus on early-onset extreme obesity, candidate genes for obesity had specifically been tested. The negative association with obesity for the 103Ile variant of Val103Ile was again supported (p = 4.2×10−4 ). In the GWAS detecting the association of obesity with the 3′ MC4R-SNP , the respective finding could not be explained by the two non-synonymous polymorphisms (Val103Ile, Ile251Leu). This conclusion was based on both imputed data with a high confidence score for Val103Ile (posterior probability >0.99 using IMPUTE) and on directly genotyped data for both polymorphisms on a subsample of the study group (Val103Ile, n = 5,516 and Ile251Leu: n = 5,039) . Functional studies showed that the MC4R-103Ile reveals a modest, but significant decrease in antagonist potency . The effect of potent endogenous pro-opiomelanocortin-derived agonists at the MC4R seemed to be increased for MC4R-103Ile . Hence, both the lower antagonistic and the increased agonistic potencies are compatible with an elevated MC4R function, which could explain the weight-reducing effect of the variant.
Finally, a consistent negative association of the Leu251 variant with body weight (minor allele frequency approx. 1 percent) of the second non-synonymous coding polymorphism (Ile251Leu; rs52820871) was detected in about 17,000 individuals of European origin. The effect was shown for extremely obese children and adults (odds ratios ranging from 0.25 to 0.76) and within population-based samples. A meta-analysis supported the evidence of the obesity-protective effect of MC4R Leu251 (odds ratio = 0.52) . An increased basal activity was described for the Leu251 variant .
We deem the MC4R coding region and the distant SNPs as an excellent example for the recently proposed synthetic association , . Accordingly, GWAS signals of common non-functional SNPs outside of coding regions may be the result of a combination of rare coding/functional variants with stronger effects given that these rare variants arose on a haplotype which is tagged by the common SNP. Here we investigate synthetic association by analyzing if common genetic variation (SNPs and haplotypes thereof) genotyped in the non-coding genomic region of the MC4R locus is related to coding (functional) variants of the MC4R. In the first step, we analyzed the genomic region covering both 3′ SNPs (rs17782313 and rs12970134), the MC4R and SNPs in its 5′ flanking region to extract all evidence for common variation related to obesity at the MC4R locus. Transmission disequilibrium was analyzed in GWAS data (78 SNPs) of 424 obesity trios. Conditioning on the strongest single-marker signal in our sample, we screened the remaining 77 SNPs for independent (secondary signals with a p-value≤0.05 in the conditional test) signals. Subsequently, we determined the transmission patterns of all possible two-marker haplotypes of the primary and the independent signals. We searched for over-transmission of ‘obesity haplotypes’ to determine if the distant non-coding 3′ SNPs (rs17782313 and rs12970134) belong to haplotypes which extend into the MC4R locus. In the second step, we incorporated information on mutations and polymorphisms within the MC4R coding region to analyze if synthetic association explains the GWAS signals of the distant SNPs.
Extended family-based haplotype analyses of common variants
We analyzed 78 SNPs in the genomic region comprising the MC4R gene and reaching from the recombination hotspot at 55,879.013 bps (recombination rate: 43.3 cM/Mb), 310.5 kb 3′ of the MC4R, to the recombination hotspot at 56,243,921 bps (recombination rate: 89.5 cM/Mb) 52.9 kb 5′ of the MC4R coding region. Analyses were performed in 424 extremely obese German children and adolescents and both of their parents (Affymetrix Genome-Wide Human SNP Array 6.0). The strongest single-marker association signal (according to nominal p-values in the genomic region) was observed for rs12970134 (two-sided exact p = 0.004, relative risk (RR) = 1.37, 95% confidence interval (CI): 1.11–1.69) which had previously been reported by Chambers et al. . SNP rs12970134 was in moderate to strong linkage disequilibrium (LD) to the other previously reported GWAS SNP rs17782313  with D′ = 0.931 and r2 = 0.770 (thus, our data were similar to the Ensembl 57 data base entry). A search for secondary signals when conditioning on the strongest signal (rs12970134) revealed that 7 additional SNPs provided independent signals (p≤0.05). Expectedly, rs17782313 did not provide an independent contribution compared to rs12970134 due to the aforementioned LD. The total of 8 SNPs with independent contributions was used to derive the three best two-marker haplotype combinations according to their p-value in the haplotype transmission disequilibrium test (Figure 1).
Figure 1. Genomic region encompassing the melanocortin 4 receptor gene (MC4R; location indicated by the vertical bar).
Displayed are results of transmission disequilibrium tests of single markers and selected two-marker haplotypes in the detection sample of 424 obesity trios (two-sided exact p-values). The black dots indicate single marker TDT results for all 78 SNPs whereas the dots surrounded by a circle highlight the 8 SNPs used for two-marker haplotype construction (selected based on conditional analyses). The region covered by each two-marker haplotype combination is displayed as a line (only those haplotypes with a p-value≤0.1 and whose p-value was below the p-value of each single-marker test are displayed). Surrounding double circles indicate those SNPs (single-marker TDT) of the best supported haplotype result; Haplo 3 spans MC4R. In addition, grey lines indicate recombination rates according to HAPMAP CEU.doi:10.1371/journal.pone.0013967.g001
For replication, all 4 SNPs (rs17700028, rs12970134, rs1943226 and rs1943229) involved in the three best haplotype combinations (p≤0.05 after Bonferroni correction for the 28 haplotypic tests performed) were genotyped in 363 independent obesity trios (Table 1). For one haplotype combination, Haplo 1 (rs17700028 and rs12970134), the haplotype TDT was nominally significant (p≤0.05) in the detection as well as in the replication sample. However, the corresponding risk haplotype [G; A] of Haplo 1 is rare (estimated frequency in the joint sample of 787 parents≈0.2%) and does not provide evidence for an association in the joint sample (haplotype RR (hRR) = 3.24 with 95% CI: 0.36–29.12, p-value = 0.29). Analysis of the full set of 787 obesity trios showed a higher hRR for the risk haplotype combination Haplo 3 than for any other single marker or haplotype combination (Tables 1 and 2). Interestingly, Haplo 3 covers the MC4R (Figure 1). In more detail, the combination of the risk alleles of the SNPs included in Haplo 3 resulted in a hRR for [A; G] = 1.61, (95% CI: 1.30–2.00) for obesity, which was descriptively stronger than the relative risks for each of the single markers effect alleles (rs12970134; A is obesity effect allele: RR = 1.33, 95% CI: 1.14–1.55; rs1943229; G is obesity effect allele: RR = 1.03, 95% CI: 0.86–1.23). Note that the single-marker association signal for rs1943229 resulted in p-value = 0.86 in the joint sample (Table 2).
Table 1. Global transmission disequilibrium tests (HAP-TDT) of all three best two-marker haplotype combinations (according to p-values) in the genomic region covering the melanocortin 4 receptor gene (MC4R) in 424 obesity trios of the detection sample, the confirmation sample of 363 obesity trios and in the joint sample of 787 obesity trios.doi:10.1371/journal.pone.0013967.t001
Table 2. Single-marker transmission disequilibrium tests in the detection sample (424 obesity trios), the confirmation sample (363 obesity trios) and in the joint sample (787 obesity trios).doi:10.1371/journal.pone.0013967.t002
Synthetic association – assessment of the relationship of MC4R coding variants to common obesity-associated SNPs outside of the MC4R coding region
As both, common distant SNPs and functionally relevant low frequency genetic variation within MC4R, are associated with obesity, the MC4R might be an example for ‘synthetic association’. Accordingly, we evaluated the impact of MC4R coding variants on the association signals of the distant SNPs or their respective two-marker haplotype. First, we explored the impact of the non-synonymous polymorphisms Val103Ile and Ile251Leu. Secondly, we assessed the impact of mutations in the MC4R coding region that lead to a reduced receptor function.
The obesity effect alleles of the non-synonymous polymorphisms Val103Ile and Ile251Leu (the wild-type alleles G and T) were frequently detected on the common obesity haplotype comprising the obesity effect alleles at rs12970134 and rs1943229 (Table 3; left panel). As expected, pairwise LD measures between the two non-synonymous MC4R polymorphisms and the two SNPs of the obesity haplotype are larger for D′ (range 0.475–1), whereas the respective r2 values were low (range 0.001–0.007) due to the low allele frequencies of these polymorphisms.
Table 3. MC4R coding variants and their relationship to haplotype frequencies and transmission ratios of the two-marker haplotype comprising the SNPs rs12970134 (3′end) and rs1943229 (5′end) in the MC4R region.doi:10.1371/journal.pone.0013967.t003
We observed a slightly stronger transmission disequilibrium of the obesity haplotype [A; G] of Haplo 3 (transmission-ratio = 1.56; Table 3) if it included the wild-type, ‘obesogenic’ alleles of the two non-synonymous MC4R polymorphisms as compared to the situation of an un-stratified assessment of omitting the two polymorphisms (transmission-ratio = 1.55; Table 4). Accordingly, the obesity alleles of the common obesity-associated SNPs outside of the MC4R coding region (rs12970134 and rs1943229) were less frequently transmitted to obese offspring in the presence of the weight-lowering variants of the two MC4R polymorphisms (Val103Ile and Ile251Leu; Table 3; left panel) as indicated by transmission-ratios below 1. Subsequently, we removed the heterozygous carriers (total n = 108) at Val103Ile (n = 85) and Ile251Leu (n = 23) from the full sample to explore their impact on the TDT findings for the two distant SNPs (rs12970134, rs1943229). While both the obesity effects for the one distant SNP rs12970134 and the risk haplotype of Haplo 3 associated to obesity (TDT p-value = 0.02 after Bonferroni correction for the 78 tests performed) increased slightly (A allele: RR 1.33 to 1.35; [A; G] allele: hRR 1.61 to 1.64) the point estimate for the other distant SNP rs1943229 did not change (uncorrected TDT p-value = 0.86; G allele: RR 1.03). Similarly, joint modelling of the effects of the two distant SNPs and the non-synonymous MC4R polymorphisms collapsed into one covariate  resulted in hardly any change of the obesity effects of the two distant SNPs (Table 5; left panel; rs12970134, A allele: RR 1.45 to 1.43; rs1943229, G allele: RR 1.23 to 1.24), while the effect for the collapsed wild-type alleles of the two non-synonymous MC4R polymorphisms was slightly reduced (RR 1.66 to 1.54 ).
Table 4. Analyses for the two-marker haplotype comprising the SNPs rs12970134 (3′end) and rs1943229 (5′end) in the MC4R region.doi:10.1371/journal.pone.0013967.t004
Table 5. Conditional logistic regression analysis including effects of the two-marker haplotype SNPs (rs12970134, rs1943229), the two coding polymorphisms (weight lowering effect: Val103Ile, Ile251Leu) and any other functionally relevant obesity MC4R mutations.doi:10.1371/journal.pone.0013967.t005
In a subgroup (n = 525, see , ) of the 787 trios we also analyzed if mutations in the MC4R coding region that lead to a reduced receptor function can in part explain the effect observed for the common obesity-associated SNPs outside of the MC4R coding region. We had previously screened the MC4R coding region for mutations (dHPLC ). We observed heterozygous carriers of the following MC4R mutations that lead to a reduced receptor function as shown by at least one functional assay: Ser30Phe , [Tyr35Stop; 110A>T] , Pro78Leu , Ser94Arg , , Thr112Met , Ile121Thr , , Ser127Leu , , Arg165Trp , , Ala175Thr , Gly181Asp , , , Ala244Glu , deletion of 4 base pairs at codon 211 (L211fsX216 , ; Ile317Thr . For one mutation (Met200Val), two functional assays (cAMP response and cell surface expression) revealed a function similar to the wild-type MC4R , . However, for some of the other mutations, there was only one amongst a variety of assays that showed a deviation from the wild-type receptor (e.g. Thr112Met; wild-type function in , , ; only Nijenhuis et al.  described a reduced receptor function). Hence, we decided to also rate Met200Val as a functionally relevant mutation; rating as a wild-type receptor did not alter the results (data not shown).
Again, removal of the mutation carriers from the analysis did not alter the results for the distant SNPs or the two-marker haplotype strongly (rs12970134-A: RR 1.38 to 1.34; rs1943229-G: RR 1.05 to 1.03; hRR for [A; G] of Haplo 3: 1.82 to 1.70). For the more distant marker rs12970134 and for the obesity effect haplotype [A; G] of Haplo 3, the effect was slightly reduced in comparison to the model of not removing the mutation carriers. For rs1943229 located closer to the coding region of the MC4R the association signal remained nearly the same upon removal of the mutation carriers. Similar observations were made for the joint modelling of the distant SNPs, the non-synonymous MC4R polymorphisms and the mutations (Table 5; right panel). For the joint modelling, we collapsed the wild-type alleles of the non-synonymous MC4R polymorphisms into one covariate and all mutations into another  as each single MC4R mutation is almost a private event. Here, in contrast to the observation for the non-synonymous MC4R polymorphisms Val103Ile and Ile251Leu, the combination of all functional mutations (n = 15) resulted in an additional independent obesity-association signal for the mutations (p-values≤0.05 in model 2 analyzing the mutations only and model 3 joint modelling of distant SNPs, the non-synonymous MC4R polymorphisms and the mutations; Table 5 right panel) which was even strengthened (RR 2.57 to 2.91) upon inclusion of the polymorphisms.
At the MC4R locus, common SNPs are associated with polygenic forms of obesity and variants leading to a reduced MC4R function entail a major gene effect for obesity. Thus this gene locus is ideally suited for the analysis of synthetic association. We performed conditional as well as haplotype analyses for a genomic region approximately 310 kb downstream of the MC4R to 53 kb upstream of the MC4R coding region (between two recombination hotspots). In contrast to prior analyses , we had the advantage of being able to use directly genotyped data from nuclear families. Thus, we did not have to rely on imputed genotypes; estimation of haplotypes is simplified and transmission patterns can be explored.
Haplotype analyses are known to be more powerful than single-marker analyses for the detection of a genomic region that is enriched for phenotype-relevant mutation(s)/causal variant(s). Recently, the analysis of WTCCC (Wellcome Trust Case Control Consortium) data sets of seven complex disorders suggested that haplotype analyses in current GWAS can guide future re-sequencing approaches to identify underlying rare functional variants . Therefore, we analyzed haplotypes flanking the MC4R. We observed a stronger association signal upon combining the information of SNPs from the 3′ and 5′ regions than for each single SNP in two independent TDT-based association studies. Whether the constructed obesity haplotype points to an ancestral obesity haplotype or if it reflects the impact of two relatively independent loci in the vicinity of the MC4R is beyond the scope of our analyses. However, as we have limited our analysis to the region between the recombination hotspots derived from 30 CEU Hapmap trios  3′ and 5′ to the MC4R we attempted to increase the chance to detect an ancestral haplotype.
As we identified a relatively common risk haplotype (frequency of [A; G] of Haplo 3 equals to 15.7% in the 787 parents) covering the MC4R coding region, we also analyzed if the associations of obesity to the distal 3′ common SNPs represent an example of synthetic association , . Synthetic association implies that GWAS signals of common non-functional SNPs outside of coding regions may be the result of a combination of rare coding/functional variants with stronger effects. This is in contrast to the more widespread idea that association signals of common non-functional SNPs outside of coding regions point to a common causal variant.
To further explore this idea, we analyzed the influence of coding variants (two coding polymorphisms and several functionally relevant rare mutations) in MC4R on the transmission of the obesity effect haplotype as well as on the obesity effect alleles of the two SNPs at the 3′ and 5′ ends forming the obesity effect haplotype. Both the removal of individuals with MC4R coding variants and the inclusion of the information on MC4R coding variant status in a regression model had basically no impact on the estimators of the obesity effect alleles of two SNPs at the 3′ and 5′ ends of the MC4R, as well as for the obesity effect haplotype. This observation did not support the model of synthetic association  for the MC4R coding region. Thus, we conclude that the investigated genetic variation within the coding region cannot explain the obesity association effect of the 3′ SNP rs12970134 or that of its haplotype combinations with rs12970134 (Haplo 3) observed in our samples. However, we cannot exclude that other functional variants – outside of the coding region of the MC4R contribute to the GWAS obesity association signal. Moreover, we cannot exclude that other less frequent non-coding variants not properly tagged by the current GWAS technology may be correlated to the MC4R coding variants. Finally, our analyses are based on a relatively small sample size, which results in low power to detect potentially small synthetic association effects.
In sum, we detected a haplotype covering the MC4R coding region; or at least a secondary independent signal 5′ of the MC4R which is associated with extreme obesity. Recently, a meta-analysis comprising GWAS data of 123,865 individuals of European ancestry followed by confirmatory analyses in up to 125,931 independent individuals also described a secondary independent signal at the MC4R locus at a position similar to the one we detected .
We observed a stronger association signal for an obesity effect haplotype as compared to single-marker signals in both our detection and our confirmatory samples. Accounting for genetic variation in the MC4R coding region following the idea of synthetic association had hardly any impact on the association signal in our sample. Consequently, the genomic region of Haplo 3 (201,708 base pairs) could be a focus of a deep sequencing approach aiming at the detection of additional obesity mutations/polymorphisms outside of the MC4R coding region. We aim at re-sequencing this region in the extremely obese individuals harbouring the risk haplotype ([A; G] of Haplo 3); subsequently we will analyze  newly identified potentially causal rare variants in independent well-powered case-control samples.
Materials and Methods
A total of 787 obesity trios comprising an extremely obese child or adolescent (index patients, see ) and both of their biological parents (for details see Table 6) was analyzed. Written informed consent was given by all participants and in case of minors by their parents. The study was approved by the Ethics Committees of the Universities of Marburg and Essen and conducted in accordance with the guidelines of The Declaration of Helsinki.
Table 6. Description of the investigated two independent German family-based data sets (detection and confirmation sample).doi:10.1371/journal.pone.0013967.t006
Genotyping in 424 obesity trios was performed on the Genome-Wide Human SNP Array 6.0 (http://www.affymetrix.com) by ATLAS Biolabs GmbH (Berlin, Germany). Birdseed V-2 algorithm was applied for calling. For genotyping of rs12970134 and rs1943229 we used available TaqMan assays (C___3058722_10 and C__11962333_10, Applied Biosystems, Germany). The variants rs17700028 and rs1943226 were genotyped using ARMS-PCR , primers are available upon request. The non-synonymous MC4R polymorphisms Val103Ile (rs2229616 – also available on the Genome-Wide Human SNP Array 6.0) and Ile251Leu (rs52820871) were genotyped as described previously . To validate the genotypes, allele determination was made independently by at least two experienced individuals. Discrepancies were solved unambiguously either by reaching consensus or by repeating.
For the functionally relevant obesity MC4R mutations, we used the previously published mutation screen data (525 trios) .
For the GWAS data on 424 obesity trios, SNPs with a call rate <95%, departure from Hardy-Weinberg equilibrium in the parents (two-sided exact test <0.001), or with minor allele frequency below 1 percent in the parents were excluded from the analysis (20 SNPs in the ±200 kb region as described below). In addition, all SNPs were set to “missing” in case of Mendelian inconsistent calls within a family. From the genome-wide data set, 78 SNPs were selected that met the quality control criteria and which were within a genomic region defined as 200 kb 3′ of the most distant previously reported GWAS SNP (rs17782313)  to 200 kb 5′ of the MC4R coding region. On this marker set, we performed single-marker transmission disequilibrium tests (TDTs)  using PLINK v1.07. Relative Risks were estimated by use of conditional logistic regression. Subsequently, we aimed to screen for a set of SNPs which most likely contributed secondary independent association signals. We performed conditional analyses on the SNP (rs12970134) with the strongest primary signal using FAMHAP (version 18) . This test is based on the idea that transmission of haplotypes of closely linked SNPs involving rs12970134 would not depend on allelic status at another SNP, if rs12970134 is the only causal variant. For screening purposes, we selected those SNPs with a permutation-based p-value below 0.05 in the conditional test. Afterwards, we performed family-based association tests of all two-marker haplotypes for the set of selected SNPs (primary and secondary signals) using the weighted haplotype test as provided in FAMHAP (HAP-TDT) . Again, permutation-based p-values were derived. In addition, to overcome the problem of haplotype phasing in the possible presence of some recombinations, we estimated effect sizes such as relative risks with conditional logistic regression under a log-additive mode of inheritance applying the R-package ‘DGCgenetics’ provided by David Clayton. To address the inherent multiplicity and over-fitting problem in the GWAS detection sample, we limited the analyses in the independent confirmation sample (363 obesity case-parents trios) to those models including the two SNPs of Haplo 3 (rs12970134, rs1943229). Using the observed data from the detection data set and simplified adapting it to the bi-allelic case to derive a power estimate for the confirmatory sample using Quanto 1.2.4, we estimated that 363 trios would have a power of 97% to detect an RR effect of RR = 1.7 under a log-additive mode of inheritance (minor allele frequency 15%, αtwo-sided = 0.05).
Finally, we jointly analyzed all 787 trios. For the exploration of the genomic region in order to support the idea of ‘synthetic association’, we phased our data sets using FAMHAP (version 18) . Additionally, we applied conditional logistic regression analyses as implemented in R v.2.9.0 incorporating the SNPs rs12970134, and rs1943229 as well as the variants in the MC4R coding region as independent covariants hierarchically. To enable estimation, we adopted the idea of Li and Leal  and collapsed effects of similar variant classes (non-synonymous MC4R polymorphisms or functionally relevant obesity MC4R mutations). Linkage disequilibrium measurements for the polymorphisms rs12970134, rs1943229, Val103Ile (rs2229616) and Ile251Leu (rs52820871) were based on haplotype phasing incorporating all four variants.
We thank Inga Drachenberg, M.A. Anglistics for proof-reading our manuscript. We thank the probands for participation in our study.
Conceived and designed the experiments: AS IJ JH AH. Performed the experiments: IJ ALV CIGV JH. Analyzed the data: AS IJ JG HB ALV CIGV BG AH. Contributed reagents/materials/analysis tools: JG HB SS BG JH AH. Wrote the paper: AS IJ JH AH.
- 1. Loos RJF, Lindgren CM, Li S, Wheeler E, Zhao JH, et al. (2008) Common variants near MC4R are associated with fat mass, weight and risk of obesity. Nat Genet 40: 768–775.
- 2. Chambers JC, Elliott P, Zabaneh D, Zhang W, Li Y, et al. (2008) Common genetic variation near MC4R is associated with waist circumference and insulin resistance. Nat Genet 40: 716–718.
- 3. Willer CJ, Speliotes EK, Loos RJ, Li S, Lindgren CM, et al. (2009) Six new loci associated with body mass index highlight a neuronal influence on body weight regulation. Nat Genet 41: 25–34.
- 4. Thorleifsson G, Walters GB, Gudbjartsson DF, Steinthorsdottir V, Sulem P, et al. (2009) Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity. Nat Genet 41: 18–24.
- 5. Meyre D, Delplanque J, Chèvre JC, Lecoeur C, Lobbens S, et al. (2009) Genome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populations. Nat Genet 41: 157–159.
- 6. Hinney A, Hebebrand J (2009) Three at One Swoop! Obes Facts 2: 3–8.
- 7. Hinney A, Vogel CI, Hebebrand J (2010) From monogenic to polygenic obesity: recent advances. Eur Child Adolesc Psychiatry 19: 297–310.
- 8. Fan ZC, Tao YX (2009) Functional characterization and pharmacological rescue of melanocortin-4 receptor mutations identified from obese patients. J Cell Mol Med 13: 3268–3282.
- 9. Hinney A, Bettecken T, Tarnow P, Brumm H, Reichwald K, et al. (2006) Prevalence, spectrum, and functional characterization of melanocortin-4 receptor gene mutations in a representative populationbased sample and obese adults from Germany. J Clin Endocrin Metab 91: 1761–1769.
- 10. Dempfle A, Hinney A, Heinzel-Gutenbrunner M, Raab M, Geller F, et al. (2004) Large quantitative effect of melanocortin-4 receptor gene mutations on body mass index. J Med Genet 41: 795–800.
- 11. Farooqi IS, Keogh JM, Yeo GS, Lank EJ, Cheetham T, et al. (2003) Clinical spectrum of obesity and mutations in the melanocortin 4 receptor gene. N Engl J Med 348: 1085–1095.
- 12. Geller F, Reichwald K, Dempfle A, Illig T, Vollmert C, et al. (2004) Melanocortin-4 receptor gene variant I103 is negatively associated with obesity. American Journal of Human Genetics 74: 572–581.
- 13. Heid IM, Vollmert C, Hinney A, Döring A, Geller F, et al. (2005) Association of the 103I MC4R allele with decreased body mass in 7937 participants of two population based surveys. Journal of Medical Genetics 42: e21.
- 14. Stutzmann F, Vatin V, Cauchi S, Morandi A, Jouret B, et al. (2007) Non-synonymous polymorphisms in melanocortin-4 receptor protect against obesity: the two facets of a Janus obesity gene. Hum MolGenet 16: 1837–1844.
- 15. Young EH, Wareham NJ, Farooqi S, Hinney A, Hebebrand J, et al. (2007) The V103I polymorphism of the MC4R gene and obesity: population based studies and meta-analysis of 29 563 individuals. Int J Obesity (London) 31: 1437–1441.
- 16. Wang D, Ma J, Zhang S, Hinney A, Hebebrand J, et al. (2010) Association of the MC4R V103I polymorphism with obesity: a Chinese Case-control study and meta-analysis in 55,195 individuals. Obesity (Silver Spring) 18: 573–579.
- 17. Xiang Z, Litherland SA, Sorensen NB, Proneth B, Wood MS, et al. (2006) Pharmacological characterization of 40 human melanocortin-4 receptor polymorphisms with the endogenous proopiomelanocortin-derived agonists and the agouti-related protein (AGRP) antagonist. Biochem 45: 7277–7288.
- 18. Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB (2010) Rare variants create synthetic genome-wide associations. PLoS Biol 8: e1000294.
- 19. Cirulli ET, Goldstein DB (2010) Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Genet Rev 11: 415–425.
- 20. Li B, Leal SM (2008) Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am J Hum Genet 83: 311–321.
- 21. Hinney A, Hohmann S, Geller F, Vogel C, Hess C, et al. (2003) Melanocortin-4 receptor gene: case-control study and transmission disequilibrium test confirm that functionally relevant mutations are compatible with a major gene effect for extreme obesity. J Clin Endocrin Metab 88: 4258–4267.
- 22. Hinney A, Schmidt A, Nottebom K, Heibült O, Becker I, et al. (1999) Several mutations in the melanocortin-4 receptor gene including a nonsense and a frameshift mutation associated with dominantly inherited obesity in humans. J Clin Endocrinol Metab 84: 1483–1486.
- 23. Xiang Z, Proneth B, Dirain ML, Litherland SA, Haskell-Luevano C (2010) Pharmacological Characterization of 30 Human Melanocortin-4 Receptor Polymorphisms with the Endogenous Proopiomelanocortin-Derived Agonists, Synthetic Agonists, and the Endogenous Agouti-Related Protein Antagonist. Biochemistry 49: 4583–4600.
- 24. Nijenhuis WA, Garner KM, van Rozen RJ, Adan RA (2003) Poor cell surface expression of human melanocortin-4 receptor mutations associated with obesity. J Biol Chem 278: 22939–22945.
- 25. Vaisse C, Clement K, Durand E, Hercberg S, Guy-Grand B, et al. (2000) Melanocortin-4 receptor mutations are a frequent and heterogeneous cause of morbid obesity. J Clin Invest 106: 253–262.
- 26. Yeo GS, Lank EJ, Farooqi IS, Keogh J, Challis BG, et al. (2003) Mutations in the human melanocortin-4 receptor gene associated with severe familial obesity disrupts receptor function through multiple molecular mechanisms. Hum Mol Genet 12: 561–574.
- 27. Larsen LH, Echwald SM, Sørensen TI, Andersen T, Wulff BS, et al. (2004) Prevalence of mutations and functional analyses of melanocortin 4 receptor variants identified among 750 men with juvenile-onset obesity. J Clin Endocrinol Metab 90: 219–224.
- 28. Tao YX, Segaloff DL (2003) Functional characterization of melanocortin-4 receptor mutations associated with childhood obesity. Endocrinol 144: 4544–4551.
- 29. Feng T, Zhu X (2010) Genome-wide searching of rare genetic variants in WTCCC data. Hum Genet. Jun 13. [Epub ahead of print].
- 30. Chen JM, Férec C, Cooper DN (2006) A systematic analysis of disease-associated variants in the 3′ regulatory regions of human protein-coding genes I: general principles and overview. Hum Genet 120: 1–21.
- 31. Chen JM, Férec C, Cooper DN (2006) A systematic analysis of disease-associated variants in the 3′ regulatory regions of human protein-coding genes II: the importance of mRNA secondary structure in assessing the functionality of 3′ UTR variants. Hum Genet 120: 301–333.
- 32. Kleinjan DA, van Heyningen V (2005) Long-range control of gene expression: emerging mechanisms and disruption in disease. Am J Hum Genet 76: 8–32.
- 33. Speliotes EK, Willer CJ, Berndt SI, Monda KL, et al. (2010) Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet. Oct 10. In press.
- 34. Wang K, Dickson SP, Stolle CA, Krantz ID, Goldstein DB, Hakonarson H (2010) Interpretation of association signals and identification of causal variants from genome-wide association studies. Am J Hum Genet 86: 730–742.
- 35. Hebebrand J, Heseker H, Himmelmann GW, Schäfer H, Remschmidt H (1994) Altersperzentilen für den Body Mass Index aus Daten der Nationalen Verzehrstudie einschließlich einer Übersicht zu relevanten Einflußfaktoren. Aktuelle Ernährungsmedizin 19: 259–265.
- 36. Ye S, Dhillon S, Ke X, Collins AR, Day IN (2001) An efficient procedure for genotyping single nucleotide polymorphisms. Nucleic Acids Res 29: E88–8.
- 37. Spielman RS, McGinnis RE, Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am J Hum Genet 52: 506–516.
- 38. Herold C, Becker T (2009) Genetic association analysis with FAMHAP: a major program update. Bioinformatics 25: 134–136.