Asbestos exposure is the main risk factor for malignant pleural mesothelioma (MPM), a rare aggressive tumor. Nevertheless, only 5–17% of those exposed to asbestos develop MPM, suggesting the involvement of other environmental and genetic risk factors.
To identify the genetic risk factors that may contribute to the development of MPM, we conducted a genome-wide association study (GWAS; 370,000 genotyped SNPs, 5 million imputed SNPs) in Italy, among 407 MPM cases and 389 controls with a complete history of asbestos exposure. A replication study was also undertaken and included 428 MPM cases and 1269 controls from Australia.
Although no single marker reached the genome-wide significance threshold, several associations were supported by haplotype-, chromosomal region-, gene- and gene-ontology process-based analyses. Most of these SNPs were located in regions reported to harbor aberrant alterations in mesothelioma (SLC7A14, THRB, CEBP350, ADAMTS2, ETV1, PVT1 and MMP14 genes), causing at most a 2–3-fold increase in MPM risk. The Australian replication study showed significant associations in five of these chromosomal regions (3q26.2, 4q32.1, 7p22.2, 14q11.2, 15q14).
Multivariate analysis suggested an independent contribution of 10 genetic variants, with an Area Under the ROC Curve (AUC) of 0.76 when only exposure and covariates were included in the model, and of 0.86 when the genetic component was also included, with a substantial increase of asbestos exposure risk estimation (odds ratio, OR: 45.28, 95% confidence interval, CI: 21.52–95.28).
These results showed that genetic risk factors may play an additional role in the development of MPM, and that these should be taken into account to better estimate individual MPM risk in individuals who have been exposed to asbestos.
Citation: Matullo G, Guarrera S, Betti M, Fiorito G, Ferrante D, et al. (2013) Genetic Variants Associated with Increased Risk of Malignant Pleural Mesothelioma: A Genome-Wide Association Study. PLoS ONE 8(4): e61253. doi:10.1371/journal.pone.0061253
Editor: Xiao-Ping Miao, MOE Key Laboratory of Environment and Health, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, China
Received: January 7, 2013; Accepted: March 6, 2013; Published: April 23, 2013
Copyright: © 2013 Matullo et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was partially supported by the Regione Piemonte Ricerca Sanitaria Finalizzata 2007, 2008, 2009 (to I.D.), Fondazione Buzzi Unicem Onlus 2007 (to I.D., S.B), CIPE (to I.D.), AIRC (to I.D., D.U., S.B) and Human Genetics Foundation - HuGeF (to G.M.). The Turin case-control study was supported by a grant from Regione Piemonte, Ricerca Scientifica Applicata 2003 (to D.M.). The Casale case-control study was supported by a grant from Regione Piemonte, Ricerca Sanitaria Finalizzata 2004 (to C.M.). The Australian studies have been supported by the Australian National Health and Medical Research Council, the Sir Charles Gairdner Hospital and PathWest laboratory Medicine of WA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors declare no competing financial interest. In fact, the “PathWest Laboratory Medicine WA” is not a commercial funder of this research. The authors Jennie Hui and John Beilby are employed by PathWest and do not have any additional consultancy, patents, products in development or marketed products with competing interests relating to this research. Thus, PathWest affiliation does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials.
Malignant pleural mesothelioma (MPM) is a rare, aggressive tumor that generally causes death within 2 years. The only clearly established risk factors for MPM are asbestos exposure, and exposure to erionite, other mineral fibers and x-ray for medical purposes . Asbestos fibers retained in the lung and pleura may be carcinogenic, either through direct mechanical or biochemical effects, or through the activation of inflammatory cells. Persistent inflammation can induce chronic oxidative stress, genotoxic lesions, chromosomal aberrations and epigenetic alterations , . Asbestos fibers may also interfere with chromosome segregation and mitosis .
Although asbestos has been banned in many Western countries, it is still used in several parts of the world, and some developing countries are actually increasing the industrial use of asbestos, as well as its production and importation , , . In Western Europe, over 5,000 people with MPM die each year , , , . Considering the long median latency period between initial asbestos exposure and MPM diagnosis , , MPM incidence is expected to peak around 2020 in Western countries , , .
Only 5%–17% of individuals heavily exposed to asbestos develop MPM , suggesting a genetic component in the etiology of the disease, which is also supported by reports of familial clustering , , ,  and candidate-gene association studies , . Dominant mutations in the BAP1 (BRCA1-associated protein 1) gene were recently reported to cause a new, rare cancer-prone syndrome that renders the individual susceptible to mesothelioma and melanoma, among others .
The aim of this study was to identify genetic risk factors that might contribute to the development of MPM. To this end, we performed a GWAS in an Italian study sample of 407 MPM cases and 389 healthy controls, and a replication study in an Australian study sample of 428 MPM cases and 1269 controls.
The general characteristics of the Italian study sample, after quality controls (QC), are reported in Table 1 (392 MPM cases and 367 controls; 540 males, 219 females). A total of 330,879 SNPs were included in the analyses. The principal component analysis (PCA) (Figure S1) showed population stratification with two distinct clusters, which was further confirmed by K-mean analysis (data not shown). After correction of the regression analyses by PCA-cluster, the λ inflation factor was <1.03 for both the overall and the exposed-only samples (Quantile- Quantile, QQ plots, Figure S2). Manhattan plots of the two-sided logistic regression analyses (per allele additive model) are also reported (Figure 1).
Figure 1. Manhattan plot of genotyped SNPs from logistic additive model.
A) all samples, B) exposed samples.doi:10.1371/journal.pone.0061253.g001
Table 1. Summary statistics of all the subjects included in the Italian GWAS.doi:10.1371/journal.pone.0061253.t001
The genotyped SNPs with the highest significance levels are listed in Table 2. The imputed SNPs with the highest significance levels are listed in Table S1. Nine intragenic SNPs (7 genotyped and 2 imputed) were located in genes. When analyzing these nine genes in a Gene Set Enrichment Analysis (GSEA, File S1), significant enrichment involving MMP14 and ADAMTS2 was shown for gene-ontology (GO, File S1) biological processes including lung development (P = 0.0087), respiratory tube development (P = 0.0087), respiratory system development (P = 0.0087), metalloendopeptidase activity (P = 0.0140), and metallopeptidase activity (P = 0.0210) (Table S2).
Table 2. Italian top 12 genotyped SNP list (2-tailed logistic regression, n = 759 overall, n = 593 exposed only).doi:10.1371/journal.pone.0061253.t002
When the GSEA (File S1) was extended to SNPs with a significance level of P≤10−4 in the regression analysis (additive model, 201 genes), another metallopeptidase, namely MMP8, was included in the gene list, further reinforcing the putative role of the metalloendopeptidase pathway in MPM.
Haplotype association was investigated in the Italian study sample for the 20 genes/chromosomal regions with the highest significance levels. The most significant haplotype associations were found in the chromosomal region 3p24.2, where the THRB gene is located (P = 2.04×10−7), and in 19q13.42 (P = 7.02×1 0−7) (Table S3), strengthening the importance of these chromosomal regions.
Seven chromosomal regions were significantly associated with MPM in the region-based analysis (P<0.0025, Table 3, Figure 2, Figure S3) . The gene-based analysis confirmed the significance of the THRB gene (P = 2.29×10−5) and showed a borderline significance for the PVT1 gene (P = 0.02) (Table 3). Finally, the regional GO (File S1) process-based analysis supported the involvement of the metalloendopeptidase and metallopeptidase GO (File S1) processes (Table 3, P = 0.0005 and 0.0039, respectively).
Figure 2. Regional association plots for 4 of the most consistent chromosome regions.
a. 3p24.2, b. 8q24.21, c. 14q11.2, d. 7p22.2. Consistency was based on haplotype, gene-, region- and pathway analysis. Each SNP is plotted with respect to its chromosomal location (x axis) and its log10 transformed P value (y axis on the left) for associations with MPM. The tall blue spikes indicate the recombination rate (y axis on the right) at that region of the chromosome. The red-outlined diamond indicate the index SNP and other diamond indicate the genotyped SNPs, the squares indicate imputed SNPs using as reference 1000 Genomes Pilot 1 CEU population. LD values were calculated only on our control population.doi:10.1371/journal.pone.0061253.g002
Table 3. Region-, Gene- and GO process-based analysis on top SNPs (1-tailed binomial test, n = 759, alpha 0.0025, alpha = 0.01, alpha = 0.025, respectively).doi:10.1371/journal.pone.0061253.t003
We detected a substantial improvement in accuracy comparing the first multivariate model, which used asbestos exposure as a predictor and adjusted for demographic covariates, with the second one, which also included 10 selected SNPs with independent effects (Table 4). The average Akaike Information Criterion (AIC) and area under ROC curve (AUC) across 10,000 random splits of the entire Italian study sample were 871.34 and 0.76 for the first model, and 730.27 and 0.86 for the second model, respectively (Figure 3, Table 4). The analysis stratified by center (Casale Monferrato versus Turin-Genoa) confirmed the stability of the risk estimates and 95% CIs (data not shown).
Figure 3. Receiver Operating Curves (ROC) for the two multivariate models including asbestos exposure 1) without and 2) with the 10 most robust and significant genetic variants.doi:10.1371/journal.pone.0061253.g003
Table 4. Nested multivariate logistic regression models: 1) model 1, without genetic component; 2) model 2, with genetic component.doi:10.1371/journal.pone.0061253.t004
The first multivariate model confirmed asbestos exposure as the main risk factor for MPM (high exposure: OR 17.33, 95% CI 9.28–32.37, P<2×10−16; low exposure: OR 8.01, 95% CI 4.41–14.54, P = 8.52×10−12) (Table 4). The second model, which included the genetic component, showed that the 10 selected SNPs had an independent contribution to MPM risk (Table 4), and also increased the estimate for the effect of asbestos exposure (high exposure: OR 45.28, 95% CI 21.52–95.28, P<2×10−16; low exposure: OR 15.31, 95% CI 7.78–30.14, P = 2×10−15).
SNP validation and replication
The Italian and Australian study samples showed a marked degree of heterogeneity (I2 statistics, range 0.62–0.97)  (Table S5). None of the 12 genotyped SNPs with the highest significance levels in the Italian study were found in the Australian replication study (Table S4), and nor of these were confirmed by the meta-analysis (Table S5). Nevertheless, when a regional analysis was performed in the Australian study sample, we found significant associations in five chromosomal regions (3q26.2, 4q32.1, 7p22.2, 14q11.2, 15q14) that have reported to be altered in mesothelioma (Table 5) .
Table 5. Regional replication of Italian top signals in the Australian study for 5 out of the 20 regions.doi:10.1371/journal.pone.0061253.t005
Gene expression analysis in blood and in normal pleural tissue
Gene expression analysis on lymphocytes from Italian healthy subjects (Text S1) showed a possible expression Quantitative Trait Locus (eQTL) for the PVT1 (rs7841347) gene (non-parametric Kruskal-Wallis test P<0.001) (Figure 4). However, expression analysis from Italian healthy subjects pleural tissue stratified by PVT1 rs7841347 genotypes did not show any gradient, although a statistically significant difference (P = 0.01) was found (Figure S4). Published expression data  (Text S1) confirmed the dysregulation of MMP14, THRB and MYC genes in MPM, supporting our results.
Figure 4. eQTL: PVT1 and MYC gene-expression levels in blood cells across rs78941347 genotypes.doi:10.1371/journal.pone.0061253.g004
SNP predictive functional analysis
Using the GenomePipe tool, none of the SNPs with the highest significance levels included in the present analysis might predict damage, nor were they located in a regulatory or splicing site. Even when SNPs in Linkage Disequilibrium (LD) with our top SNPs (LD≥0.8 as measured by pairwise r2) were included in the analysis no evidence of functional properties of the proxy SNPs was found. LD refers to two different populations, i.e. HapMap TSI from Tuscany (Italy) and CEU (HapMap3, File S1), for a total of 33 and 72 SNPs respectively.
In order to identify genetic risk factors that might contribute to the development of MPM, we performed a GWAS on 407 Italian MPM cases and 389 controls.
We performed an independent replication study in an Australian sample, which included 428 MPM cases (Genetic Understanding of Asbestos-Related Disease, GUARD, study) and 1,269 controls (Busselton Health Study, BHS).
Among the top SNPs identified in our Italian study sample, there were several genes previously reported to be involved in MPM or other cancer types, as well as chromosomal regions reported to be altered in MPM .
Although no single SNP replicated in the Australian sample, probably due to the high genetic heterogeneity between the two studies, regional analyses showed significant signals in 5 of the chromosomal regions where the Italian top SNPs are located. The chromosomal region 7p22.2 found in the replication study includes the SDK1  and FOXK1  genes. Interestingly, FOXK1 has been reported to interact with BAP1 , which was recently found to be mutated in mesothelioma . Chromosomal region 7p22 is located in a fragile sequence (FRA7B) containing two miRNA genes (mir589 and mir339) and three large genes (SDK1, THSD7A, MAD1L1), and is highly prone to gaps and breaks in several cancers .
Another Italian genotyped top-signal (rs7632718) is located in the SLC7A14 (solute carrier family 7 member 14) gene, which lies on 3q26.2, which was one of the replicating regions in the Australian study. Although no link with MPM has been previously reported for SLC7A14, a chromosomal gain has been described in this region , suggesting a possible involvement of other genes in MPM.
The PVT1 (Pvt1 oncogene (non-protein coding)) gene is involved in several types of cancer , , , , . It is located in a large (>300 kb) locus downstream of MYC (53 Kb apart) on chromosomal region 8q24. The PVT1 locus produces a wide variety of spliced non-coding RNAs as well as a cluster of six annotated miRNAs: miR-1204, miR-1205, miR-1206, miR-1207-5p, miR-1207-3p, and miR-1208 , . PVT1 was proposed to regulate c-Myc gene transcription over a long distance . A functional variant (rs378854) in chromosomal region 8q24 that modulates PVT1 expression has been associated with prostate cancer . In vitro, the rs378854-G allele has been associated with reduced binding of the transcription factor YY1, a putative tumor suppressor, and with repressed global transcription in prostate cancer . The regulation of this chromosomal region is very complex, as is suggested by the association of several SNPs with different cancer types , and involves miRNA, lincRNA and other epigenetic regulations .
The gene-expression analysis on lymphocytes from Italian healthy subjects showed a possible eQTL for PVT1. Functional studies are needed to clarify the link between PVT1-associated SNPs, gene expression regulation and cancer risk taking into account that in our study PVT1 seems to act only at an early stage of carcinogenesis as its deregulation has not been observed at later stages in tumor tissue .
Two other genes that have been reported to be dysregulated in MPM, are THRB and MMP14 , . THRB encodes for thyroid hormone receptor beta (TRβ), which could function as a tumor suppressor. Cell-based studies and xenograft models have demonstrated that TRβ is a suppressor of ras-mediated cell proliferation, transformation, and tumorigenesis . Moreover, TRβ disrupts mitogenic growth factors by suppressing the activation of extracellular signal-regulated kinases and phosphatidylinositol 3-kinase signaling pathways to suppress tumor cell invasiveness and metastasis , . THRB is located about 28 Mb telomeric to the BAP1 gene, which is mutated in MPM . A down-regulation of THRB has been documented in MPM versus parietal pleura  and it is frequently methylated/deleted in non-squamous-cell lung cancer .
MMP14 (matrix metallopeptidase 14) has been reported to influence overall survival in MPM cases , and was significantly highlighted in our enrichment analysis, together with ADAMTS2, because of their metalloendopeptidase and metallopeptidase activities. The matrix metalloproteinases are a family of zinc-containing enzymes with proteolytic activity against a wide range of extracellular proteins. Extracellular matrix proteases are involved in several steps of cancer development and progression, including angiogenesis and metastasis.
Some of the SNPs with highest significance levels were located in the genes: CEP350, ETV1 and SHC4. Although they have not been directly associated with MPM, their involvement in several cancer types has been described , , , suggesting the necessity to further investigate their possible role in MPM pathogenesis. Considering the closest flanking genes of intergenic SNPs, the following are noteworthy and could contribute to the carcinogenic process, as has been reported for other cancer types: PRDM1 , ATG5 , MYC , EID , RLN1 , CD274 .
Although our sample size is clearly a limitation for a GWAS, the Italian and the Australian study samples are, to the best of our knowledge, the largest MPM series with available DNA, as mesothelioma is a very rare cancer. A further limitation of GWAS is that they do not take into account rare variants. The availability of methods for complete genome sequencing (and the decrease of the sequencing costs) will allow to circumvent the problem linked to the identification of rare variants, whose involvement should be better investigated in future studies.
The negative replication of the Italian top SNPs in the Australian study should be revised on the basis of the following considerations: i) the two studies had a marked degree of heterogeneity as shown by the I2 statistics; ii) no exposure assessment was available for the Australian control group. Notwithstanding these discrepancies, we observed an intriguing significant regional replication in the Australian study for 5 out of 20 Italian top signals.
Most of the top-signals we identified were located in chromosomal regions reported to harbor aberrant alterations in mesothelioma, and cause an at most 2–3 fold increase in MPM risk.
Moreover, asbestos exposure in our study was associated with a remarkable increase in MPM risk, which became even more evident when the contribution of genetic factors was taken into account, with a significant improvement of asbestos exposure risk estimation.
In conclusion, our results support the complementary role of genetic background in asbestos-related carcinogenesis of the pleura, indicating that genetic risk factors should be taken into account to understand MPM physiopathology, and to better define the MPM risk profile of people with a high exposure to asbestos.
All MPM cases reported on in the present report gave written informed consent. This study was performed according to the principles of the Declaration of Helsinki and in agreement with ethical requirements. Approval was obtained from the Istituto Nazionale per la Ricerca sul Cancro Ethics Committee for the studies in Genoa and La Spezia, and from the Human Genetics Foundation (HuGeF) Ethics Committee for the studies in Casale Monferrato and Turin. The Australian replication study was specifically approved by the Human Research Ethics Committee of the University of Western Australia.
Italian study sample
The Italian study sample is composed of MPM cases and controls from cities located in Northern Italy: Casale Monferrato and Turin in the Piedmont Region, and Genoa and La Spezia in the Liguria Region (Table 1; details in Text S1). The study in Casale Monferrato was a population-based MPM case-control study , and included 241 MPM patients and 252 population controls of Italian nationality and Caucasian ethnicity. The study in Turin was a hospital-based MPM case-control study , and consisted of 91 MPM patients and 56 controls of Italian nationality and Caucasian ethnicity. The hospital-based study in Genoa and La Spezia included 75 incident MPM cases . Controls are 81 healthy subjects or patients hospitalized for non-neoplastic/non-respiratory conditions.
All the three of the above-mentioned Italian studies were registry-based and therefore no selection criteria were applied to MPM cases; they needed only to be residing in the study area at the time of diagnosis. Only cases with a pathological diagnosis (based on histology or cytology with confirmatory immunohistochemical staining) were eligible for inclusion in the present analysis. Study periods in the Italian studies were different (Casale Monferrato: January 2001 to December 2006; Turin: January 2004 to October 2008; Genoa and La Spezia: April 1996 to February 2006 for cases and February 1997 and November 2006 for controls). For practical reasons, the study in Turin was limited to cases admitted to the main metropolitan hospitals.
Asbestos exposure was carefully assessed in all the Italian cases and controls. After reviewing individual occupational histories, asbestos exposure was reclassified for the overall sample by the same expert (D.M.) as “no/unlikely” (no acknowledged occupational or environmental exposure), “low” (low exposure probability, or definite low exposure), and “high” (definite and high exposure; asbestos-cement and asbestos-textile workers, insulators, shipyard workers and dockers).
Australian replication study
Australian MPM cases were part of the GUARD study, which consisted of individuals who had been exposed to asbestos and diagnosed with MPM (n = 428) and who attended a hospital clinic in Perth, Western Australia between 1988 and 2010 . DNA samples and clinical data from these individuals were obtained and MPM diagnosis was confirmed after pathological, radiological and clinical review with confirmation from respective cancer registries in Western Australia (Western Australia Mesothelioma Registry) and Queensland.
The GUARD study subjects are primarily male (88.8%) with an average age of 67±10.3 years. Most BHS study subjects are female (57.4%) and the average age is 54±17.2 years. Control samples (n = 1,269), with no information on asbestos exposure, were obtained from the population-based BHS . MPM cases were excluded after genotyping if they were: related to another individual, had a low call GWAS rate (<97%), were not Caucasian/European based on principal component analysis, had ambiguous sex, or had low heterozygosity compared to the rest of the sample.
Whole-genome genotyping was done on a HumanCNV370-Quad BeadChip (Illumina Inc., San Diego, CA, USA) for 716 samples. The remaining 80 samples were tested on a Human610-Quad (which includes 100% of the HumanCNV370 BeadChip SNPs) as the HumanCNV370-Quad had been discontinued. Genotypes were assessed by GenomeStudio V2011.1(Illumina Inc., San Diego, CA). The 12 most significant SNPs from the Italian studyS were individually genotyped in the Australian replication study with a 5′-nuclease assay (AppliedBiosystems, CA, USA).
Genotyping quality controls.
A cut-off a genotyping call rate of 0.98 was set, leading to the exclusion of 18 study subjects. SIdentity By Descent (IBD) estimation using the Identity By State (IBS) distance was used to check genotypic identity or relatedness among subjects (PLINK software , File S1). Subjects with IBD≥0.05 (n = 16) were considered consanguineous and excluded from further analyses. We additionally excluded three samples with an X chromosome inbreeding homozygosity estimate of about 0.5. Thirty-seven subjects (4.64%) were removed from the analysis, leaving 759 subjects (392 cases and 367 controls).
SNPs with minor allele frequency <1% (n = 15,252), those having >0.05 missing genotypes (n = 11,535) and those deviating from Hardy-Weinberg equilibrium (HWE) in the control population (P<0.001, n = 1,157) were excluded from the analysis, for a final study data-set of 330,879 SNPs, which were analyzed for their potential association with mesothelioma.
Population structure and association analysis.
The population structure was investigated by PCA (PLINK Software, File S1, Covariance Method ). A new discrete covariate was defined by the two principal components (Figure S1), and was included in the following logistic regression analysis. PCA results were further confirmed by the K-means clustering analysis  (data not shown). The effective removal of any population structure bias was checked by the λ-inflation factor parameter  (Figure S2).
We tested for 330,879 SNPs for their association with mesothelioma by 2-sided logistic regression analysis on a per-allele additive model after adjusting for age, gender, PCA cluster, center of recruitment and exposure level, both in the overall Italian sample (n = 759) and among exposed-only Italian subjects (n = 593) (high and low exposure). After Bonferroni correction, we considered alpha = 1.51×10−7 (0.05/330879) as a threshold of significance. The analyses were performed with PLINKv1.07 (File S1)  and Rv2.10.1  software. The software Impute.v2 ,  was used to impute 5,333,982 SNPs, using the 1000 genomes (http://www.1000genomes.org/) and HapMap3 (File S1) genotype panels as reference datasets.
Haplotypes (Table S3) within the chromosomal regions where the most significant SNPs were located (considering sliding windows from 2 to 10 SNPs; PLINK Software, File S1) were also tested for any association with MPM in the overall Italian sample.
Meta-analysis and replication.
A meta-analysis of the Italian-study top 12 genotyped SNPs was done on data from the whole genome genotyping (Human610-Quad BeadChip, Illumina) of 428 cases and 1269 Australian controls of European descent (GWAMA software, File S1 ). A random-effects model was used due to the presence of genetic heterogeneity (I2 statistic  >50%; Table S5).
The cumulative effect of the SNPs with highest significance levels was investigated by two-sided multivariate logistic regression analysis, comparing the prediction accuracy of two models: the first considering asbestos exposure as a predictor and adjusting for demographic covariates (recruitment center, gender, age, geographical cluster), and the second identical to the first, but also including the genetic component (genotypes). SNPs included in the second multivariate model were selected among the top 20 markers (12 genotyped and 8 imputed), excluding 4 SNPs (rs4290865, rs1354252, rs1072577, rs10519201) because of negative internal replication between Casale Monferrato and pooled Turin-Genoa studies, and 6 SNPs (rs742109, rs1508805, rs9536579, rs5756444, rs6897549, rs71365421) because they did not replicate in the Australian study on the regional analysis and were not intragenic.
An internal validation of the two models was done by randomly splitting the overall Italian sample in two groups 10,000 times, each time performing a two-sided logistic regression in the first group and verifying the accuracy of estimation in the second group. The average AIC under 10,000 permutations and AUC were used as measures of the fit and the prediction power of the two models.
Gene-region enrichment and SNP functional prediction analyses.
A GSEA (File S1)  was performed on the genes in which the top SNPs are located (9 genes out of 20 signals): PVT1 (gene ID 5820), CEP350 (ID 9857), THRB (ID 7068), ETV1 (ID 2115), C9orf46 (also known as PLGRKT; ID 55848), MMP14 (ID 4323), ADAMTS2 (ID 9509), SLC7A14 (ID 57709), SHC4 (ID 399694). The list was tested for over-representation using the curated Molecular Signatures Database (MSigDB) 7, specifically i) KEGG 8 (File S1), REACTOME and BioCarta pathway databases, ii) the GO (File S1) gene set 9. Gene set enrichment significance was tested by a hyper-geometric test that evaluates the distribution of overlapping genes over all genes in the gene set (Table S2).
Region-, gene- and GO (File S1) process-based analyses were also performed . We investigated the occurrence of multiple signals in those genes and chromosomal regions, where the significant SNPs from the single SNP analysis are located, as well as those from genes belonging to the pathways identified by the GO (File S1) process-based analysis (Table 3).
We tested 20 candidate chromosomal regions, and five genes (CEP350, THRB, SLC7A14, SDK1 and PVT1) for which there were enough representative SNPs genotyped, and two GSEA significant GO processes (File S1) (metalloendopeptidase activity and metallopeptidase activity). After Bonferroni correction, we adopted the following significance thresholds: alpha = 0.0025, alpha = 0.01, alpha = 0.025, for region-based, gene-based and GO (File S1) process-based analysis respectively.
Prediction of functional SNPs has been carried out with several softwares, including GenomePipe software, which is freely available at website of the National Institute of Environmental Health Sciences (http://snpinfo.niehs.nih.gov/seleGWAs.htm) and the Pupasuite3.1 software (http://pupasuite.bioinfo.cipf.es/).
The expression levels of the nine genes corresponding to the most common intragenic SNPs (Table 2) and of MYC, which is neighbor to PVT1, were examined using data from the HapMap (File S1) CEU gene-expression database, and the GenoPheno database , an internal database which includes genotypic, phenotypic, and gene-expression data from the peripheral blood of 120 healthy Italian volunteers (Text S1). We considered the average expression levels of probes and, when feasible, tested for differential expression among the three genotypes (Kruskal-Wallis test).
In addition, the mRNA levels of the PVT1, MYC and THRB genes were measured by quantitative real-time PCR in 79 normal pleural tissues from donors that underwent thoracoscopy for conditions other than MPM, who signed an informed consent form (Text S1).
Principal Component Analysis (PCA) plots: first vs second PC. A) Cases and controls are plotted for the overall study and for each of the three study samples (Turin, Casale Monferrato and Genoa); B) birth places (Northern, Central, Southern Italy, Sardinians and Other Caucasians) are plotted for the overall study and for each of the three study samples.
Supplementary figure 1:Q-Q plots for GWAS of mesothelioma in the Italian population. This Q-Q plots are based on logistic regression allelic P after standard quality control. The estimated λ inflation factor was <1.03. Plot A shows the Q-Q plot for the overall Italian population, whereas Plot B refers to the exposed-only population.
Regional association plots for additional 4 regions (a. 3q26.2, b. 4q32.1, c. 7p21.2, d. 15q14) replicating in the Australian study. Each SNP is plotted with respect to its chromosomal location (x axis) and its log10 transformed P value (y axis on the left) for associations with MPM. The tall blue spikes indicate the recombination rate (y axis on the right) at that region of the chromosome. The red-outlined diamond indicate the index SNP and other diamond indicate the genotyped SNPs, the squares indicate imputed SNPs using as reference 1000 Genomes Pilot 1 CEU population. LD values were calculated only on our control population
RT-PCR of PVT1 and MYC genes-expression levels in 79 normal pleural tissues expression levels across rs78941347 genotypes.
Italian top 8 imputed SNP list.
Gene Set Enrichment Analysis.
Significant Haplotype Results for 3p24 and 19q13.42 regions.
Replication of the 12 genotyped Italian top SNPs on GUARD-BHS Study.
Meta-analysis of Italian and Australian studies for the top 12 genotyped Italian SNPs.
We wish to thank all the patients and healthy controls that generously participated in the study, the attending MDs that supported it, Dr Silvia Polidoro for her contribution to our the first technical issues with the Illumina platform, and Progetto RoPHS, Regione Piemonte - Bando Scienze umane e sociali (I.D.).
Contributed to writing the manuscript: SG CDG AR GF FR. Evaluated asbestos exposure: DM. Obtained and/or supervised clinical information: RG EP AH FA ER CC SM. Obtained funding for sample collection and genotyping: ID GM CM LJP ALJ SB DU DM PGB. Participated in critical review of the manuscript for intellectual content: GM SG MB GF DF FV GC CDG FR AR AH EC ST MP MG AA CC FA ER PGB RL RG EP MN AWBM NHdK JH JB ALJ JC BWR SM LJP DM DU SB CM ID. Conceived and designed the experiments: ID GM SB CM. Performed the experiments: SG AR MB AA EC FR ID GM JH. Analyzed the data: GF CM DF ST FV GC CDG FR MP. Contributed reagents/materials/analysis tools: MB MN DU RL MG PGB SB AH JC BWR AWBM LJP ALJ NHdK JB CC FA. Wrote the paper: GM ID CM.
- 1. IARC Working Group on the Evaluation of Carcinogenic Risks to Humans (2011) A review of Human carcinogens: Metals, arsenic, dusts, and fibres. IARC Monographs on the Evaluation of Carcinogenic Risks to Humans Twelfth Edition 2011 2012 ed. Lyon: WHO, IARC.
- 2. Mossman BT, Lippmann M, Hesterberg TW, Kelsey KT, Barchowsky A, et al. (2011) Pulmonary endpoints (lung carcinomas and asbestosis) following inhalation exposure to asbestos. Journal of toxicology and environmental health Part B, Critical reviews 14: 76–121. doi: 10.1080/10937404.2011.556047
- 3. Achilli A, Olivieri A, Pala M, Metspalu E, Fornarino S, et al. (2007) Mitochondrial DNA variation of modern Tuscans supports the Near Eastern origin of Etruscans. American Journal of Human Genetics 80: 759–768. doi: 10.1086/512822
- 4. Robinson BWS, Lake RA (2005) Advances in malignant mesothelioma. New England Journal of Medicine 353: 1591–1603. doi: 10.1056/nejmra050152
- 5. Azari MR, Nasermoaddeli A, Movahadi M, Mehrabi Y, Hatami H, et al. (2010) Risk assessment of lung cancer and asbestosis in workers exposed to asbestos fibers in brake shoe factory in Iran. Ind Health 48: 38–42. doi: 10.2486/indhealth.48.38
- 6. Below JE, Cox NJ, Fukagawa NK, Hirvonen A, Testa JR (2011) Factors That Impact Susceptibility to Fiber-Induced Health Effects. Journal of Toxicology and Environmental Health-Part B-Critical Reviews 14: 246–266. doi: 10.1080/10937404.2011.556052
- 7. Brims FJ (2009) Asbestos–a legacy and a persistent problem. J R Nav Med Serv 95: 4–11.
- 8. Neri M, Ugolini D, Dianzani I, Gemignani F, Landi S, et al. (2008) Genetic susceptibility to malignant pleural mesothelioma and other asbestos-associated diseases. Mutation Research - Reviews in Mutation Research 659: 126–136. doi: 10.1016/j.mrrev.2008.02.002
- 9. Peto J, Decarli A, La Vecchia C, Levi F, Negri E (1999) The European mesothelioma epidemic. British Journal of Cancer 79: 666–672.
- 10. Peto J, Hodgson JT, Matthews FE, Jones JR (1995) Continuing increase in mesothelioma mortality in Britain. Lancet 345: 535–539. doi: 10.1016/s0140-6736(95)90462-x
- 11. Betti M, Ferrante D, Padoan M, Guarrera S, Giordano M, et al. (2011) XRCC1 and ERCC1 variants modify malignant mesothelioma risk: A case-control study. Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis 708: 11–20. doi: 10.1016/j.mrfmmm.2011.01.001
- 12. Montanaro F, Rosato R, Gangemi M, Roberti S, Ricceri F, et al. (2009) Survival of pleural malignant mesothelioma in Italy: A population-based study. International Journal of Cancer 124: 201–207. doi: 10.1002/ijc.23874
- 13. Marinaccio A, Binazzi A, Cauzillo G, Cavone D, Zotti RD, et al. (2007) Analysis of latency time and its determinants in asbestos related malignant mesothelioma cases of the Italian register. European Journal of Cancer 43: 2722–2728. doi: 10.1016/j.ejca.2007.09.018
- 14. Ismail-Khan R, Robinson LA, Williams CC Jr, Garrett CR, Bepler G, et al. (2006) Malignant pleural mesothelioma: a comprehensive review. Cancer Control 13: 255–263.
- 15. Pelucchi C, Malvezzi M, La Vecchia C, Levi F, Decarli A, et al. (2004) The Mesothelioma epidemic in Western Europe: An update. British Journal of Cancer 90: 1022–1024.
- 16. Ascoli V, Cavone D, Merler E, Barbieri PG, Romeo L, et al. (2007) Mesothelioma in blood related subjects: Report of 11 clusters among 1954 Italy cases and review of the literature. American Journal of Industrial Medicine 50: 357–369. doi: 10.1002/ajim.20451
- 17. Ugolini D, Neri M, Ceppi M, Cesario A, Dianzani I, et al. (2008) Genetic susceptibility to malignant mesothelioma and exposure to asbestos: The influence of the familial factor. Mutation Research - Reviews in Mutation Research 658: 162–171. doi: 10.1016/j.mrrev.2007.08.001
- 18. de Klerk N, Alfonso H, Olsen N, Reid A, Sleith J, et al. (2012) Familial aggregation of malignant mesothelioma in former workers and residents of Wittenoom, Western Australia. International Journal of Cancer (In press). doi: 10.1002/ijc.27758
- 19. Testa JR, Cheung M, Pei J, Below JE, Tan Y, et al. (2011) Germline BAP1 mutations predispose to malignant mesothelioma. Nat Genet 43: 1022–1025. doi: 10.1038/ng.912
- 20. Gray SG, Fennell DA, Mutti L, O'Byrne KJ (2009) In arrayed ranks: array technology in the study of mesothelioma. Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer 4: 411–425. doi: 10.1097/jto.0b013e3181951ce8
- 21. Higgins JP, Thompson SG, Deeks JJ, Altman DG (2003) Measuring inconsistency in meta-analyses. BMJ 327: 557–560. doi: 10.1136/bmj.327.7414.557
- 22. Melaiu O, Cristaudo A, Melissari E, Di Russo M, Bonotti A, et al. (2011) A review of transcriptome studies combined with data mining reveals novel potential markers of malignant pleural mesothelioma. Mutation research doi: 10.1016/j.mrrev.2011.12.003
- 23. Bosco N, Pelliccia F, Rocchi A (2010) Characterization of FRA7B, a human common fragile site mapped at the 7p chromosome terminal region. Cancer genetics and cytogenetics 202: 47–52. doi: 10.1016/j.cancergencyto.2010.06.008
- 24. Komorek J, Kuppuswamy M, Subramanian T, Vijayalingam S, Lomonosova E, et al. (2010) Adenovirus type 5 E1A and E6 proteins of low-risk cutaneous beta-human papillomaviruses suppress cell transformation through interaction with FOXK1/K2 transcription factors. Journal of virology 84: 2719–2731. doi: 10.1128/jvi.02119-09
- 25. Yu H, Mashtalir N, Daou S, Hammond-Martel I, Ross J, et al. (2010) The ubiquitin carboxyl hydrolase BAP1 forms a ternary complex with YY1 and HCF-1 and is a critical regulator of gene expression. Molecular and cellular biology 30: 5071–5085. doi: 10.1128/mcb.00396-10
- 26. Zeidler R, Joos S, Delecluse HJ, Klobeck G, Vuillaume M, et al. (1994) Breakpoints of Burkitt's lymphoma t(8;22) translocations map within a distance of 300 kb downstream of MYC. Genes, chromosomes & cancer 9: 282–287. doi: 10.1002/gcc.2870090408
- 27. Lennon PA, Abruzzo LV, Medeiros LJ, Cromwell C, Zhang X, et al. (2007) Aberrant EVI1 expression in acute myeloid leukemias associated with the t(3;8)(q26;q24). Cancer genetics and cytogenetics 177: 37–42. doi: 10.1016/j.cancergencyto.2007.05.007
- 28. Storlazzi CT, Fioretos T, Paulsson K, Strombeck B, Lassen C, et al. (2004) Identification of a commonly amplified 4.3 Mb region with overexpression of C8FW, but not MYC in MYC-containing double minutes in myeloid malignancies. Human molecular genetics 13: 1479–1485. doi: 10.1093/hmg/ddh164
- 29. Kamath A, Tara H, Xiang B, Bajaj R, He W, et al. (2008) Double-minute MYC amplification and deletion of MTAP, CDKN2A, CDKN2B, and ELAVL2 in an acute myeloid leukemia characterized by oligonucleotide-array comparative genomic hybridization. Cancer genetics and cytogenetics 183: 117–120. doi: 10.1016/j.cancergencyto.2008.02.011
- 30. Guan Y, Kuo WL, Stilwell JL, Takano H, Lapuk AV, et al. (2007) Amplification of PVT1 contributes to the pathophysiology of ovarian and breast cancer. Clinical cancer research : an official journal of the American Association for Cancer Research 13: 5745–5755. doi: 10.1158/1078-0432.ccr-06-2882
- 31. Beck-Engeser GB, Lum AM, Huppi K, Caplen NJ, Wang BB, et al. (2008) Pvt1-encoded microRNAs in oncogenesis. Retrovirology 5: 4. doi: 10.1186/1742-4690-5-4
- 32. Huppi K, Volfovsky N, Runfola T, Jones TL, Mackiewicz M, et al. (2008) The identification of microRNAs in a genomically unstable region of human chromosome 8q24. Mol Cancer Res 6: 212–221. doi: 10.1158/1541-7786.mcr-07-0105
- 33. Carramusa L, Contino F, Ferro A, Minafra L, Perconti G, et al. (2007) The PVT-1 oncogene is a Myc protein target that is overexpressed in transformed cells. J Cell Physiol 213: 511–518. doi: 10.1002/jcp.21133
- 34. Meyer KB, Maia AT, O'Reilly M, Ghoussaini M, Prathalingam R, et al. (2011) A functional variant at a prostate cancer predisposition locus at 8q24 is associated with PVT1 expression. PLoS genetics 7: e1002165. doi: 10.1371/journal.pgen.1002165
- 35. Guarrera S, Ricceri F, Polidoro S, Sacerdote C, Allione A, et al. (2012) Association between total number of deaths, diabetes mellitus, incident cancers, and haplotypes in chromosomal region 8q24 in a prospective study. American journal of epidemiology 175: 479–487. doi: 10.1093/aje/kwr430
- 36. Ahmadiyeh N, Pomerantz MM, Grisanzio C, Herman P, Jia L, et al. (2010) 8q24 prostate, breast, and colon cancer risk loci show tissue-specific long-range interaction with MYC. Proceedings of the National Academy of Sciences of the United States of America 107: 9742–9746. doi: 10.1073/pnas.0910668107
- 37. Crispi S, Calogero RA, Santini M, Mellone P, Vincenzi B, et al. (2009) Global gene expression profiling of human pleural mesotheliomas: identification of matrix metalloproteinase 14 (MMP-14) as potential tumour target. PLoS One 4: e7016. doi: 10.1371/journal.pone.0007016
- 38. Garcia-Silva S, Aranda A (2004) The thyroid hormone receptor is a suppressor of ras-mediated transcription, proliferation, and transformation. Mol Cell Biol 24: 7514–7523. doi: 10.1128/mcb.24.17.7514-7523.2004
- 39. Martinez-Iglesias O, Garcia-Silva S, Tenbaum SP, Regadera J, Larcher F, et al. (2009) Thyroid hormone receptor beta1 acts as a potent suppressor of tumor invasiveness and metastasis. Cancer Res 69: 501–509. doi: 10.1158/0008-5472.can-08-2198
- 40. Lu C, Mishra A, Zhu YJ, Meltzer P, Cheng SY (2011) Genomic profiling of genes contributing to metastasis in a mouse model of thyroid follicular carcinoma. American journal of cancer research 1: 1–13.
- 41. Roe OD, Anderssen E, Helge E, Pettersen CH, Olsen KS, et al. (2009) Genome-wide profile of pleural mesothelioma versus parietal and visceral pleura: the emerging gene portrait of the mesothelioma phenotype. PloS one 4: e6554. doi: 10.1371/journal.pone.0006554
- 42. Dmitriev AA, Kashuba VI, Haraldson K, Senchenko VN, Pavlova TV, et al. (2012) Genetic and epigenetic analysis of non-small cell lung cancer with NotI-microarrays. Epigenetics 7: 502–513. doi: 10.4161/epi.19801
- 43. Oh S, Shin S, Janknecht R (2012) ETV1, 4 and 5: An oncogenic subfamily of ETS transcription factors. Biochimica et biophysica acta 1826: 1–12. doi: 10.1016/j.bbcan.2012.02.002
- 44. Fagiani E, Giardina G, Luzi L, Cesaroni M, Quarto M, et al. (2007) RaLP, a new member of the Src homology and collagen family, regulates cell migration and tumor growth of metastatic melanomas. Cancer research 67: 3064–3073. doi: 10.1158/0008-5472.can-06-2301
- 45. Korzeniewski N, Cuevas R, Duensing A, Duensing S (2010) Daughter centriole elongation is controlled by proteolysis. Molecular biology of the cell 21: 3942–3951. doi: 10.1091/mbc.e09-12-1049
- 46. Kucuk C, Iqbal J, Hu X, Gaulard P, De Leval L, et al. (2011) PRDM1 is a tumor suppressor gene in natural killer cell malignancies. Proceedings of the National Academy of Sciences of the United States of America 108: 20119–20124. doi: 10.1073/pnas.1115128108
- 47. Wojtkowiak J, Rothberg JM, Kumar V, Schramm KJ, Haller E, et al. (2012) Chronic autophagy is a cellular adaptation to tumor acidic pH microenvironments. Cancer research doi: 10.1158/0008-5472.can-11-3881
- 48. Dang CV (2012) MYC on the path to cancer. Cell 149: 22–35. doi: 10.1016/j.cell.2012.03.003
- 49. Kamio Y, Maeda K, Moriya T, Takasu N, Takeshita A, et al. (2010) Clinicopathological significance of cell cycle regulatory factors and differentiation-related factors in pancreatic neoplasms. Pancreas 39: 345–352. doi: 10.1097/mpa.0b013e3181bb9204
- 50. Feng S, Agoulnik IU, Bogatcheva NV, Kamat AA, Kwabi-Addo B, et al. (2007) Relaxin promotes prostate cancer progression. Clinical cancer research : an official journal of the American Association for Cancer Research 13: 1695–1702. doi: 10.1158/1078-0432.ccr-06-2492
- 51. Topalian SL, Drake CG, Pardoll DM (2012) Targeting the PD-1/B7-H1(PD-L1) pathway to activate anti-tumor immunity. Current opinion in immunology 24: 207–212. doi: 10.1016/j.coi.2011.12.009
- 52. Dianzani I, Gibello L, Biava A, Giordano M, Bertolotti M, et al. (2006) Polymorphisms in DNA repair genes as risk factors for asbestos-related malignant mesothelioma in a general population study. Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis 599: 124–134. doi: 10.1016/j.mrfmmm.2006.02.005
- 53. Ugolini D, Neri M, Canessa PA, Casilli C, Catrambone G, et al. (2008) The CREST biorepository: a tool for molecular epidemiology and translational studies on malignant mesothelioma, lung cancer, and other respiratory tract diseases. Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology 17: 3013–3019. doi: 10.1158/1055-9965.epi-08-0524
- 54. de Klerk NH, Armstrong BK, Musk AW, Hobbs MS (1989) Cancer mortality in relation to measures of occupational exposure to crocidolite at Wittenoom Gorge in Western Australia. British journal of industrial medicine 46: 529–536. doi: 10.1136/oem.46.8.529
- 55. Creaney J, Olsen NJ, Brims F, Dick IM, Musk AW, et al. (2010) Serum mesothelin for early detection of asbestos-induced cancer malignant mesothelioma. Cancer Epidemiol Biomarkers Prev 19: 2238–2246. doi: 10.1158/1055-9965.epi-10-0346
- 56. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, et al. (2007) PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics 81: 559–575. doi: 10.1086/519795
- 57. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, et al. (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nature Genetics 38: 904–909. doi: 10.1038/ng1847
- 58. Hartigan JA, Wong MA (1979) Algorithm AS 136: A K-Means Clustering Algorithm. Journal of the Royal Statistical Society Series C (Applied Statistics) 28: 100–108. doi: 10.2307/2346830
- 59. Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55: 997–1004. doi: 10.1111/j.0006-341x.1999.00997.x
- 60. R Development Core Team R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing,Vienna, Austria. Available: http://www.R-project.org, 2009
- 61. Marchini J, Howie B, Myers S, McVean G, Donnelly P (2007) A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39: 906–913. doi: 10.1038/ng2088
- 62. Howie BN, Donnelly P, Marchini J (2009) A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5: e1000529. doi: 10.1371/journal.pgen.1000529
- 63. Magi R, Morris AP (2010) GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11: 288. doi: 10.1186/1471-2105-11-288
- 64. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545–15550. doi: 10.1073/pnas.0506580102
- 65. Wang X, Liu X, Sim X, Xu H, Khor CC, et al. (2012) A statistical method for region-based meta-analysis of genome-wide association studies in genetically diverse populations. Eur J Hum Genet 20: 469–475. doi: 10.1038/ejhg.2011.219
- 66. Ricceri F, Porcedda P, Allione A, Turinetto V, Polidoro S, et al. (2011) Involvement of MRE11A and XPA gene polymorphisms in the modulation of DNA double-strand break repair activity: a genotype-phenotype correlation study. DNA repair 10: 1044–1050. doi: 10.1016/j.dnarep.2011.08.003