Investigation of twins has enabled estimations of the germline and somatic genetic contribution to many pediatric cancers.1 Twin concordance for childhood acute leukemias is estimated at approximately 10%2 for lymphoblastic leukemia (ALL) and myeloid leukemia (AML) phenotypes. While purpose-built twin cohorts allow for large enrollments of twin cases with leukemia, these studies are at risk of inaccuracies resulting from ascertainment bias.3 In contrast, records-based population-based studies allow accrual in an unbiased fashion. Here, we identified substantially lower than expected pediatric and adolescent/ young adult (AYA) twin leukemia concordance rates through a population-based registry assessment in California from 1982-2022. We further demonstrated a unique association showing first-born twins are more likely to develop ALL than second-born. We show the difference in risk of ALL development by birth order may be reflected in epigenome-wide alterations in DNA methylation, in which first-born twins show significant hypomethylation compared to second-born siblings. Furthermore, we show significant alterations in DNA methylation at sites with known association to birth order and cortisol responsiveness, implying a potential biological explanation for this finding.
This research was performed under review by a California State Institutional Review Board which permits health research on de-identified Biobank specimens. We merged registry data from the California Birth Master Statistical File (BMSF) from 1982 to 2017 and the California Cancer Registry (CCR) from 1988 to 2022 to identify a total of 1,213 twins in which at least one sibling was diagnosed with leukemia from 0-39 years of age. Of these, there were 255 twin pairs in which at least one sibling in the pair had a diagnosis of leukemia, along with 265 distinct cancer diagnoses and 258 leukemia diagnoses within these 255 twin pairs (Table 1). Concordant cases were defined as both siblings in a twin pair having diagnoses in the same subgroup, and cross-cancer concordance as both having diagnoses in separate subgroups. Two concordant twin pairs were identified for both lymphoid leukemia and AML (Table 2). Concordance rates were defined as the total number of concordant twin pairs divided by the sum of concordant and discordant twin pairs. The twin concordance rate was 0.8% (2 of 255 twin pairs) for all leukemias, 0.5% for lymphoid leukemias (1 of 199), and 2.9% for AML (1 of 34). The percentage of monozygotic (MZ) twins was estimated assuming dizygotic twins are equally likely to be represented as same sex or opposite sex twins.3 The estimated MZ concordance rate was 2.6% for all leukemias, 1.4% for lymphoid leukemias, and 16.7% for AML. By diagnosis age, the rate of lymphoid leukemia concordance for 1-10 years was 0.7% (MZ: 2.1%). The rate of AML development for ages <1 year was 25% (MZ: 100%). Standardized incidence ratios (SIR) were calculated based on previously published methods.4 The SIR for developing a leukemia of the same subgroup as the proband in twin siblings was 15.6 (95% Confidence Interval [CI]: 1.9-56.4), and 2.9 for development of any pediatric/AYA cancer (0.4-10.5) (Online Supplementary Table S1).
We next assessed associations between twin plurality order (being born first or second within the twin pair) and development of leukemia. Across the full twin cohort, there was a significant increased risk of leukemia development associated with being the first-born twin (Fisher’s exact test, Odds Ratio [OR]: 1.80; P=0.001) (Figure 1A). Lymphoblastic leukemias were the most highly associated with being first-born (OR: 2.26; P<0.001) and this did not vary by age at cancer diagnosis or mode of birth (data not shown), while AML showed a non-significant decreased risk (OR: 0.55; P=0.127). To evaluate how plurality order may be reflected by DNA methylation at birth, we investigated 41 pediatric ALL-discordant twin pairs who had previously undergone genome-wide DNA methylation profiling at birth5 for differences in DNA methylation based on twin plurality order. Deconvolution analysis showed no significant differences in nucleated cell proportions by plurality order. We used linear regression controling for ALL cases’ status, birthweight, sex, twin pair number, nucleated cell proportions, and array batch effect to identify 1,394 significant differentially methylated probes (False Discovery Rate [FDR]: <0.05: 710,010 total CpG assessed), with a moderately elevated genomic inflation value of 2.21 (Figure 1B, C and Online Supplementary Table S2). There was a total of 534 significant differentially methylated regions assessed through Comb-P (Šidák, P<0.05) (Figure 1D and Online Supplementary Table S3).
We then evaluated CpG with known association to birth order6 across subsequent gestations. Of 341 birth-order CpG, 305 overlapped with the 710,010 probes from the EPIC array ALL-discordant twin cohort. Mean delta-beta values (the first minus second-born absolute DNA methylation value) were significantly lower (hypomethylated) across the 24 twin pairs in which the ALL case was first-born compared to the 17 in which the case was second-born (Wilcoxon rank-sum, P=0.037) (Figure 1E). We further investigated 22,100 CpG sites nominally associated with cortisol levels,7 of which 19,193 CpG overlapped with the EPIC array ALL twin cohort. Mean delta-beta values are significantly different in instances in which the ALL case sibling was born first compared to the case sibling being born second (P=0.011). We identified an overall concordance rate of 0.8% for twins with leukemia, which is notably lower than prior studies which range from 5-25%.2 Our estimation of concordance in MZ twins for lymphoid leukemias remains low at 1.4%, while that of AML is within the previously reported range at 16.7%. Accounting for approximately 70% of MZ being monochorionic,8 concordance increases to just 2.4% for ALL twins, and 3.4% specifically for the 1-10 year old age group. Remarkably, none of the 6 instances of lymphoid leukemia in twins <1 year of age demonstrated sibling concordance, despite prior estimated rates nearing 100%.2 Twin studies previously helped to identify that the initiating somatic genetic translocation event in childhood leukemia development can occur in the intrauterine period and subsequently pass between siblings through shared placental circulation9 ,10 in instances of MZ, monochorionic twins. The low rate of twin concordance in this study indicates that acquired genetic, epigenetic and discordant responses to environmental exposure events may contribute more heavily to pediatric and AYA cancer etiology than previously appreciated in pediatric acute leukemias,11 or at least among those in recent decades in California. We identified a twin pair concordant for lymphoid leukemias with specific diagnoses of B-cell lymphoblastic leukemia/ lymphoma and Burkitt’s cell leukemia which do not share known driver mutations. This combination suggests an alternative disease mechanism of concordance than that of shared fetal circulation and may point toward shared genetic predisposition to hematologic malignancies, which we were not able to evaluate.
These results show that concordant cases of twin acute leukemia are rarer than previously reported, which may result from biases in active twin recruitment leading to oversampling of rare concordant twins relative to the true population background rate.2 It is also notable that this study examined twins in a multiethnic birth population in which Latino births represent the majority population; most prior population-based studies had been performed in non-Latino Whites. While these results do not directly alter clinical recommendations,12 additional nuance should be considered in how concordance risk is discussed with families after an initial diagnosis of ALL in a twin sibling. This study was limited by the lack of zygosity information available in the evaluated registry data; however, our estimation of MZ twin pairs allows for broad comparison to previously identified concordance rates. In a subset of twins with ALL from this cohort investigated with a DNA single nucleotide polymorphism array, approximately 50% of same sex pairs were MZ,5 which is consistent with the estimated results displayed here. Furthermore, we are limited by the right censoring of registry data (to the year 2022), raising potential for lead-time bias in twin siblings born in the latest birth years assessed, though the inclusion of over three decades of cancer and birth registry data limits this impact.
Table 1.Twin leukemia cohort characteristics.
These results indicate first-born twins are more likely to develop ALL compared to second-born twins, a finding which had not previously been described. We conducted linear regression modeling in twins discordant for ALL to investigate how this relationship might be reflected in DNA methylation at birth in which we found evidence of genomic inflation indicative of higher-than-expected P values across most CpG probes on the EPIC array. This finding suggests the underlying biological mechanism responsible for the difference in leukemia risk between first- and second-born twins is reflected in a broad alteration (generally in the hypo-direction) of DNA methylation across the epigenome. Interestingly, first-born singleton births (compared to subsequent gestations) are at increased risk of ALL,13 while the inverse is seen for pediatric AML,14 which mirrors our results here with twins. The relationship of birth order with DNA methylation6 also mirrors our results, suggesting the biological mechanism leading to an increase in ALL risk in first-born singletons may have similarities to that of first-born twins. We further found a significant association between the development of ALL and DNA methylation at cortisol-associated sites, indicating lower exposure to cortisol during birth may contribute to the significantly increased risk of ALL in first-born twins. Increased, early exposure to cortisol is hypothesized to reduce risk of ALL.15 Further investigation in a broader cohort of twins with acute leukemia is warranted.
Table 2.Twin pairs with concordant and cross-cancer concordant leukemia diagnoses.
Figure 1.Birth plurality order is significantly associated with development of leukemia in leukemia-discordant twin pairs. (A) Forest plot demonstrating Odds Ratio (OR) from Fisher’s exact test for the association between twin plurality order (first- or second-born of a set of twin siblings) and pediatric leukemia in leukemia-discordant twin pairs. A significantly higher odds of developing leukemia of any type is seen in first-born twins. By disease group, twins discordant for the development of lymphoid leukemia had significantly higher odds of being first-born. Twins discordant for development of acute myeloid leukemia, however, had lower odds of being first-born, but this difference was not statistically significant. (B) Quantile-quantile plot of linear regression model for the relationship between DNA methylation status and twin plurality order. Lambda value represents genomic inflation. (C) Volcano plot from the linear regression model showing significance (-log10P) by regression coefficient. The 1,394 significant differentially methylated probes (False Discovery Rate <0.05) are shown in red. (D) Volcano plot showing distribution of -log10P values by genomic position. Probes found within the 534 significant differentially methylated regions are highlighted in red. (E) Distribution of mean delta-beta (first-born sibling DNA methylation beta value minus second-born within a twin pair) values across 305 CpG with known association to birth order found within the discordant acute lymphoblastic leukemia (ALL) twin DNA methylation array dataset. Mean delta-beta values are significantly different in instances in which the ALL case sibling was born first (N=24) compared to the case sibling being born second (N=17; Wilcoxon rank-sum, P=0.037). (F) Distribution of mean delta-beta values across 19,193 CpG with known association to post-partum cortisol levels found within the discordant ALL twin DNA methylation array dataset. Mean delta-beta values are significantly different in instances in which the ALL case sibling was born first (N=24) compared to the case sibling being born second (P=0.011). CI: Confidence interval; N: number.
Footnotes
- Received August 6, 2024
- Accepted December 31, 2024
Correspondence
Disclosures
No conflicts of interest to disclose.
Contributions
Funding
Funding was provided by the V Foundation (FP067172), the USC Norris Comprehensive Cancer Center SPORE development award (to JLW), and the Shirley McKernan Courage Foundation (CHLA) (to JLW).
Acknowledgments
The collection of cancer incidence data used in this study was supported by the California Department of Public Health as part of the statewide cancer reporting program mandated by California Health and Safety Code Section 103885; the National Cancer Institute’s Surveillance, Epidemiology, and End Results Program under contract HHSN261201000140C awarded to the Cancer Prevention Institute of California, contract HHSN261201000035C awarded to the University of Southern California, and contract HHSN261201000034C awarded to the Public Health Institute; and the Centers for Disease Control and Prevention’s National Program of Cancer Registries, under agreement U58DP003862-01 awarded to the California Department of Public Health. The ideas and opinions expressed herein are those of the author(s), and no endorsement by the California Department of Public Health, the National Cancer Institute, or the Centers for Disease Control and Prevention or their contractors and subcontractors is intended or should be inferred.
References
- Kadan-Lottick NS, Kawashima T, Tomlinson G. The risk of cancer in twins: a report from the childhood cancer survivor study. Pediatr Blood Cancer. 2006; 46(4):476-481. Google Scholar
- Greaves MF, Maia AT, Wiemels JL, Ford AM. Leukemia in twins: lessons in natural history. Blood. 2003; 102(7):2321-2333. Google Scholar
- Buckley JD, Buckley CM, Breslow NE, Draper GJ, Roberson PK, Mack TM. Concordance for childhood cancer in twins. Med Pediatr Oncol. 1996; 26(4):223-229. Google Scholar
- Feng Q, Nickels E, Muskens IS. Increased burden of familial-associated early-onset cancer risk among minority Americans compared to non-Latino Whites. Elife. 2021; 10:e64793. Google Scholar
- Nickels EM, Li S, Myint SS. DNA methylation at birth in monozygotic twins discordant for pediatric acute lymphoblastic leukemia. Nat Commun. 2022; 13(1):6077. Google Scholar
- Li S, Spitz N, Ghantous A. A Pregnancy and Childhood Epigenetics Consortium (PACE) meta-analysis highlights potential relationships between birth order and neonatal blood DNA methylation. Commun Biol. 2024; 7(1):66. Google Scholar
- Houtepen LC, Vinkers CH, Carrillo-Roa T. Genome-wide DNA methylation levels and altered cortisol stress reactivity following childhood trauma in humans. Nat Commun. 2016; 7:10967. Google Scholar
- Hall JG. Twinning. Lancet. 2003; 362(9385):735-743. Google Scholar
- Greaves M. In utero origins of childhood leukaemia. Early Hum Dev. 2005; 81(1):123-129. Google Scholar
- Wiemels JL, Xiao Z, Buffler PA. In utero origin of t(8;21) AML1-ETO translocations in childhood acute myeloid leukemia. Blood. 2002; 99(10):3801-3805. Google Scholar
- Hong D, Gupta R, Ancliff P. Initiating and cancer-propagating cells in TEL-AML1-associated childhood leukemia. Science. 2008; 319(5861):336-339. Google Scholar
- Ford AM, Colman S, Greaves M. Covert pre-leukaemic clones in healthy co-twins of patients with childhood acute lymphoblastic leukaemia. Leukemia. 2023; 37(1):47-52. Google Scholar
- Paltiel O, Lemeshow S, Phillips GS. The association between birth order and childhood leukemia may be modified by paternal age and birth weight. Pooled results from the International Childhood Cancer Cohort Consortium (I4C). Int J Cancer. 2019; 144(1):26-33. Google Scholar
- Von Behren J, Spector LG, Mueller BA. Birth order and risk of childhood cancer: a pooled analysis from five US States. Int J Cancer. 2011; 128(11):2709-2716. Google Scholar
- Schmiegelow K, Vestergaard T, Nielsen SM, Hjalgrim H. Etiology of common childhood acute lymphoblastic leukemia: the adrenal hypothesis. Leukemia. 2008; 22(12):2137-2141. Google Scholar
Data Supplements
Figures & Tables
Article Information

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.