Common genetic variation contributes significantly to the risk of developing chronic lymphocytic leukemia

Maria Chiara Di Bernardo; Peter Broderick; Daniel Catovsky; Richard S. Houlston

doi:10.3324/haematol.2012.072140

Letters to the Editor

Common genetic variation contributes significantly to the risk of developing chronic lymphocytic leukemia

Maria Chiara Di Bernardo
Peter Broderick
Daniel Catovsky
Richard S. Houlston

Molecular and Population Genetics, Division of Genetics and Epidemiology, Sutton, Surrey, UK

Section of Haemato-Oncology, Division of Pathology, Institute of Cancer Research, Sutton, Surrey, UK

Molecular and Population Genetics, Division of Genetics and Epidemiology, Sutton, Surrey, UK

Vol. 98 No. 3 (2013): March, 2013 https://doi.org/10.3324/haematol.2012.072140

Recent genome-wide association studies (GWAS) have identified common genetic risk variants for chronic lymphocytic leukemia (CLL).3 1 Testing SNPs individually for an association in GWAS necessitates the imposition of a very stringent P value to address multiple testing. While this reduces false positives, it may result in true associations being missed. Thus, any overall estimate of the total heritability that is, the proportion of the CLL risk ascribable to genetic variation, will be negatively biased. An alternative approach is to fit all the SNPs simultaneously providing an unbiased estimate of the heritability explained by all SNPs.4

We have applied this methodology to a GWAS of CLL. Briefly, 517 CLL cases were genotyped using HumanCNV370-Duo BeadChips (Illumina).2 1 For controls, we made use of Hap1.2M-Duo Custom array data generated on 2,930 individuals from Wellcome Trust Case-Control Consortium 2 (WTCCC2).5 We excluded samples with call rates below 90%, non-European background and cryptic relatedness assessed by estimation of identity by descent, along with SNPs having call rate below 95%, minor allele frequency (MAF) less than 1% in cases and controls, and evidence of departure from Hardy-Weinberg equilibrium (P<10 cases; P<0.05 controls). Performing a differential missingness test between cases and controls we excluded those SNPs with P<0.05. In addition, using PLINK6 we excluded individuals having a relatedness score over 0.05. This filtering resulted in 238,870 SNPs used for the analysis. A total of 63 samples were removed during quality control.

We estimated heritability using the methodology of Yang et al.⁷ and Lee et al.⁴ Briefly, the method fits a linear mixed model of the form: y=μ+g+e where y is the vector of disease status, μ is the mean vector, g is a vector of random additive genetic effects obtained from SNP data, and e is a vector of residual effects. The covariance structure fitted in the data is the individual relationship estimated from the SNPs, defined by: cov(yj,yk)=Ajkσg2+σe2 where A_jk is the genetic relationship between individuals, j and k derived from the SNPs, σ2_g is the additive genetic variance and σ2_e is the residual variance. Under this model, disease heritability, h2₀ is defined by: σg2/(σg2+σe2). The estimate of variance explained by the SNPs on the observed 0–1 scale is linearly transformed to that on the unobserved continuous liability scale such that where K is the prevalence of the disease and z is the value of the standard normal probability density function at the threshold t. Using data from the SEER registry we set the prevalence of CLL to be 1 in 2,700. Estimation of the additive genetic variance was performed using restricted maximum likelihood via genome-wide complex trait analysis (GCTA) software.8 We followed the procedure of Yang et al.⁷ to adjust the crude heritability estimate,₁, to account for missing LD between the genotyped SNPs and unknown causal variants. SNPs were randomly assigned into two groups with one of the groups being treated as representing ‘true’ causal variants. As advocated, we calibrated the prediction error using data on SNPs representing causal variants having MAF below 0.1.7

After transforming the data to account for disease prevalence, incomplete LD and ascertainment on the liability scale, the variance explained by all SNPs was 0.59 (95% CI: 0.35–0.83) (Table 1). The familial risk associated with CLL is amongst the highest of any cancer9 and our findings are compatible with polygenic susceptibility to CLL mediated through common SNPs in strong LD, with functional variants making a significant contribution to the heritable risk.

The heritability we estimated is simply the additive variance as a proportion of the phenotypic variance and does not include non-additive genetic variance or gene-environment interactions. Although it is entirely possible that highly penetrant mutations for CLL may exist, linkage analysis of CLL families and mutational analysis of selected genes has so far not provided robust evidence for their existence. Similarly, part of the genetic variance could be mediated by a large number of rare disease-causing risk variants, although to date there is no reason to believe that the majority of the apparent missing genetic risk is solely explained by a restricted number of high-risk variants.

The receiver operator characteristic curve associated with the known common risk variants at 2q13, 2q37.1, 2q37.3, 6p25.3, 8q24.21, 11q24.1, 15q21.3, 15q23, 15q25.2, 16q24.1 and 19q13.32 is 0.67, thereby accounting for only approximately 5% of the total genetic variance.10 Predicated on the assumption of a polygenic basis to CLL, our heritability estimate suggests most of the genetic risk remains unexplained. While the existing SNPs have little diagnostic value given the probable polygenic basis to the familial risk of CLL, the harvesting of additional risk variants theoretically offers prospects for risk prediction based on profiling. The power of existing GWASs to identify common alleles conferring relative risks of 1.3 or greater (such as the 6p25.3 variant) is high. Hence, there may not be many additional SNPs with similar effects for alleles with frequencies greater than 0.3 in populations of European ancestry. In contrast, studies have had low power to detect alleles with smaller effects and/or MAF below 0.1. Evidence for the existence of additional risk variants for CLL is provided by Quantile-Quantile plots of observed and expected association test statistics from case-control analysis of our dataset (Figure 1). This shows that there is inflation of the test statistics at the upper tail of the distribution (P<10), even after exclusion of SNPs mapping to known loci (Figure 1). It is, therefore, likely that additional common low risk variants remain to be discovered and should be eminently harvestable in new larger GWAS or through further pooling of additional existing datasets. How much of the unaccounted heritable risk is truly embodied in a long tail of association is currently unknown but will impact on the ability to fully understand the genetic, and ultimately biological basis of CLL predisposition.

In conclusion, our findings provide evidence for a polygenic basis to susceptibility to CLL and a strong rationale for continuing to search for new risk variants through GWAS-based strategies.

References

Di Bernardo MC, Crowther-Swanepoel D, Broderick P, Webb E, Sellick G, Wild R. A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia. Nat Genet. 2008; 40((10)):1204-10. PubMed https://doi.org/10.1038/ng.219 Google Scholar
Crowther-Swanepoel D, Broderick P, Di Bernardo MC, Dobbins SE, Torres M, Mansouri M. Common variants at 2q37.3, 8q24.21, 15q21.3 and 16q24.1 influence chronic lymphocytic leukemia risk. Nat Genet. 2010; 42((2)):132-6. PubMed https://doi.org/10.1038/ng.510 Google Scholar
Crowther-Swanepoel D, Di Bernardo MC, Jamroziak K, Karabon L, Frydecka I, Deaglio S. Common genetic variation at 15q25.2 impacts on chronic lymphocytic leukaemia risk. Br J Haematol. 2011; 154((2)):229-33. PubMed https://doi.org/10.1111/j.1365-2141.2011.08706.x Google Scholar
Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet. 2011; 88((3)):294-305. PubMed https://doi.org/10.1016/j.ajhg.2011.02.002 Google Scholar
Power C, Elliott J. Cohort profile: 1958 British birth cohort (National Child Development Study). Int J Epidemiol. 2006; 35((1)):34-41. PubMed https://doi.org/10.1093/ije/dyi183 Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007; 81((3)):559-75. PubMed https://doi.org/10.1086/519795 Google Scholar
Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010; 42((7)):565-9. PubMed https://doi.org/10.1038/ng.608 Google Scholar
Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011; 88((1)):76-82. PubMed https://doi.org/10.1016/j.ajhg.2010.11.011 Google Scholar
Sellick GS, Catovsky D, Houlston RS. Familial chronic lymphocytic leukemia. Semin Oncol. 2006; 33((2)):195-201. PubMed https://doi.org/10.1053/j.seminoncol.2006.01.013 Google Scholar
Wray NR, Yang J, Goddard ME, Visscher PM. The genetic interpretation of area under the ROC curve in genomic profiling. PLoS Genetics. 2010; 6((2)):e1000864. https://doi.org/10.1371/journal.pgen.1000864 Google Scholar

Data Supplements

Figures & Tables

Article Information

Vol. 98 No. 3 (2013): March, 2013 : Letters to the Editor

DOI

https://doi.org/10.3324/haematol.2012.072140

Pubmed

22899579

Pubmed Central

PMC3659921

Published

2013-03-01

Published By

Ferrata Storti Foundation, Pavia, Italy

Print ISSN

0390-6078

Online ISSN

1592-8721

Article Usage

Online Views

1099

PDF Downloads

291

No Data

PlumX

[bib1] Di Bernardo MC, Crowther-Swanepoel D, Broderick P, Webb E, Sellick G, Wild R. A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia. Nat Genet. 2008; 40((10)):1204-10. PubMed https://doi.org/10.1038/ng.219 Google Scholar

[bib2] Crowther-Swanepoel D, Broderick P, Di Bernardo MC, Dobbins SE, Torres M, Mansouri M. Common variants at 2q37.3, 8q24.21, 15q21.3 and 16q24.1 influence chronic lymphocytic leukemia risk. Nat Genet. 2010; 42((2)):132-6. PubMed https://doi.org/10.1038/ng.510 Google Scholar

[bib3] Crowther-Swanepoel D, Di Bernardo MC, Jamroziak K, Karabon L, Frydecka I, Deaglio S. Common genetic variation at 15q25.2 impacts on chronic lymphocytic leukaemia risk. Br J Haematol. 2011; 154((2)):229-33. PubMed https://doi.org/10.1111/j.1365-2141.2011.08706.x Google Scholar

[bib4] Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. Am J Hum Genet. 2011; 88((3)):294-305. PubMed https://doi.org/10.1016/j.ajhg.2011.02.002 Google Scholar

[bib5] Power C, Elliott J. Cohort profile: 1958 British birth cohort (National Child Development Study). Int J Epidemiol. 2006; 35((1)):34-41. PubMed https://doi.org/10.1093/ije/dyi183 Google Scholar

[bib6] Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007; 81((3)):559-75. PubMed https://doi.org/10.1086/519795 Google Scholar

[bib7] Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010; 42((7)):565-9. PubMed https://doi.org/10.1038/ng.608 Google Scholar

[bib8] Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011; 88((1)):76-82. PubMed https://doi.org/10.1016/j.ajhg.2010.11.011 Google Scholar

[bib9] Sellick GS, Catovsky D, Houlston RS. Familial chronic lymphocytic leukemia. Semin Oncol. 2006; 33((2)):195-201. PubMed https://doi.org/10.1053/j.seminoncol.2006.01.013 Google Scholar

[bib10] Wray NR, Yang J, Goddard ME, Visscher PM. The genetic interpretation of area under the ROC curve in genomic profiling. PLoS Genetics. 2010; 6((2)):e1000864. https://doi.org/10.1371/journal.pgen.1000864 Google Scholar

Common genetic variation contributes significantly to the risk of developing chronic lymphocytic leukemia

References

Data Supplements

Figures & Tables

Article Information

Article Usage

Download Citation

Navigate

For Authors

For Reviewers

For Advertisers

Education

Privacy

More