Acute myeloid leukemia (AML) is a heterogeneous disease characterized by multiple genetic mutations and cytogenetic abnormalities.31 Our understanding of AML pathobiology has improved significantly in recent decades, including the ability to predict which patients have a high probability of relapse after chemotherapy. The European LeukemiaNet (ELN) developed guidelines,4 which were revised in 2017,5 to guide physicians’ decisions as to how to treat and manage AML. However, these guidelines require molecular tests that are not routinely performed at all medical centers,76 limiting their broad applicability, and the complex pattern of co-occurrence among prognostic abnormalities is poorly understood. We created an automated decision tree to stratify patients into prognostic risk categories according to the ELN guidelines. We evaluated 1682 patient records from three datasets, conducting a detailed analysis of the co-occurrence of mutations and karyotypic abnormalities and determining the prognostic impact of each test result on accurately categorizing patients. Our results have implications not only in understanding the real-world distribution of ELN-defined prognostic abnormalities, but also in categorizing patients based on incomplete data.
The ELN guidelines stratify patients into three prognostic risk categories (Favorable, Intermediate, and Adverse) by combining the presence of karyotypic abnormalities with genetic mutations (in FLT3, NPM1, CEBPA, TP53, RUNX1, and ASXL1). To understand the impact of genetic abnormality co-occurrence on prognostic risk, we combined data from three large AML patient datasets (n=1682 patients): the German-Austrian AML Study Group (AMLSG) cohort (n=1384),83 The Cancer Genome Atlas (n=193),1 and the Beat AML Wave One dataset (n=105) (Figure 1). A complete description of the methods of data analysis, including a comprehensive R Markdown file, is available on GitHub (https://github.com/WatanabeSmith/AML_ELN_PrognosticRiskClassification) and in Online Supplementary Methods. We filtered out entries without complete karyotype and mutation data, including karyotypes that did not follow standardized nomenclature.9 The presence or absence of mutations were extracted from these datasets. We extracted prognostically significant cytogenetic abnormalities, along with complex and monosomal karyotypes, using regular expressions and an automated abnormality calculator. The complete dataset is available in Online Supplementary Table S1.
To evaluate the pattern of mutational co-occurrence in these patients, we divided the prognostic risk factors by those originating either from cytogenetic testing or gene mutation testing. We subdivided the patient karyotypes into four groups: Favorable, Intermediate, and Adverse (all based on the presence of prognostic cytogenetic abnormalities alone), and Normal (which contain no abnormalities). We sorted patients within each karyotype subdivision by the presence of adverse TP53/RUNX1/ASXL1 mutations, and the combined presence or absence of NPM1, FLT3-ITD, and CEBPA mutations (Figure 2A). We confirmed that the co-occurrence patterns from each dataset closely matched one another (Online Supplementary Figure S1A-C).
Overall, we observed that NPM1, FLT3-ITD, and CEBPA mutations are enriched in a Normal karyotype background, especially NPM1 and CEBPA, highlighting their value in providing prognostic information in cytogenetically normal AML. In addition, TP53 mutations, in contrast to mutations in RUNX1 and ASXL1, more frequently co-occur with complex karyotypes and other adverse-risk markers (Figure 2B). Therefore, within the context of broadly assigning prognostic risk to AML patients, TP53 mutation status offers little additional prognostic information besides that already provided by cytogenetic testing. Notably, there are very few patients with adverse/intermediate risk mutations and favorable karyotypic abnormalities, who, according to the ELN conventional care regimens for AML, would normally go forward for transplant.5
Next, to categorize the patients into prognostic risk categories, we created an automated decision tree based on the newly published ELN guidelines (Figure 3A). It sequentially processes mutation and cytogenetic data, and returns the corresponding Favorable, Intermediate, and Adverse prognostic risk category. (Cases of acute promyelocytic leukemia were classified as Favorable risk based on separate ELN guidelines.10) Notably, the decision tree does not consider FLT3-ITD allele ratio (FLT3-ITD vs. FLT3-ITD) or biallelic CEBPA mutations because those data were not available for many of the patient records (see Online Supplementary Methods).
To understand the significance of each test (NPM1, FLT3-ITD, CEBPA, TP53, RUNX1, ASXL1, and cytogenetics) in assigning prognostic risk category, we analyzed our dataset using the decision tree while selectively censoring the presence of individual mutations or all karyotypic abnormalities. Censored test results were assumed to be wild-type or cytogenetically normal, simulating an unknown test result. Patients were assigned ELN risk categories based on incomplete data from all possible combinations of genetic test results. These risk categories were compared to the actual risk categories based on complete information to determine the accuracy of categorizing with partial information (Figure 3B). We similarly censored test results in groups that might have more clinical relevance (e.g. all adverse mutations considered at once, simulating results from a mutation panel) (Online Supplementary Figure S2).
Karyotype is the most informative single test, as expected,11 although 72% of patients can still be correctly categorized without cytogenetic information, using only the mutation tests. In most cases, a combination of NPM1 mutational and FLT3-ITD status are the next most impactful tests. Omitting NPM1 or FLT3-ITD results has a similar prognostic impact, but the type of misclassification shifts from incorrectly categorizing favorable-risk patients to incorrectly categorizing intermediate-risk patients, respectively. In addition, CEBPA and RUNX1 mutations each provide significant improvements in classifying favorable- and adverse-risk patients, respectively, while ASXL1 and TP53 mutations rarely impact prognostic categorization. Although most genetic mutation tests in these guidelines can be combined in a single mutational panel (greatly decreasing the cost of detecting mutations in additional genes), NPM1 mutations and FLT3-ITDs are identified by PCR amplification and electropherogram, respectively, requiring separate tests and equipment.
Finally, we determined the optimal sequence of the seven tests required for ELN risk classification, in which the next test in the sequence results in the greatest increase in classification accuracy. Each genetic test was evaluated on its ability to classify patients relative to a single risk category (e.g. call patients correctly as Favorable or simply Not Favorable) or in all categories using balanced accuracy (to account for differences in population size) or overall accuracy, respectively. We displayed all 5040 possible sequences for the overall classification and for each risk category, highlighting the optimum sequence (Figure 3C–F and Online Supplementary Table S2).
While the test with the greatest impact on prognosis is consistently karyotype, the test with the second biggest impact varies widely depending on risk categories. Favorable-risk patients can be identified with 95% accuracy with the addition of NPM1 and FLT3-ITD, and 99.9% accuracy with the further addition of CEBPA, suggesting that significant prognostic impact can be achieved from relatively little information or despite missing information. For intermediate prognostic risk, there were two optimized sequences: one (Intermediate_R) with RUNX1 as the second test, resulting in more correctly classified patients only after that second test, and another (Intermediate) with FLT3-ITD and NPM1 as the second and third tests, resulting in more correctly classified patients after that third test (since NPM1 mutations only confer Favorable risk in patients without FLT3-ITD). Adverse risk is 95% determined by karyotype and RUNX1 mutation status, demonstrating the significance and prognostic impact of RUNX1 over other mutational tests. The optimal overall sequence closely resembles that of intermediate risk and highlights the minimal impact of ASXL1 and TP53 on overall ELN risk classification. Because the incidence of TP53 and ASXL1 mutations increases with age,12 we separated our dataset into younger (<60 years old; n=1311) and older (≥60; n=371) patients and evaluated the difference in the prognostic significance of those mutations between both subgroups. We found that the difference in optimized sequences between subgroups was only minimal, and the proportion of patients incorrectly classified as a result of missing TP53 and ASXL1 mutation data was only nominally increased in older patients (Online Supplementary Figure S3 and Online Supplementary Methods).
Overall, our automated decision tree has potential applications outside of this investigation. While there are a multitude of clinical or genetic factors outside of the ELN guidelines that have prognostic significance, we focused only on the genetic abnormalities described in the guidelines. The decision tree, however, does allow other researchers to interrogate how additional factors are distributed across the prognostic risk categories.
Moreover, while novel AML treatment strategies will probably include comprehensive mutation testing upon diagnosis, current treatment guidelines remain hampered by financial and technical challenges. This is especially true in low- and middle-income countries (LMIC), where prognostic classification is often made in the presence of incomplete data. A recent Brazilian study assessing the relevance of the 2008 ELN guidelines in LMIC found that 26% of AML patients in their cohort were randomly missing cytogenetic or complete mutational data.13 This approach could also enable clinicians and healthcare providers in LMIC to evaluate and customize their diagnostic testing depending on the availability of resources and the individual circumstances of each patient. Our analysis enables data-driven decision-making by providing a framework for understanding which genetic features hold the most weight in prognostication given their relative prevalence and co-occurrence in real-world AML datasets. Accordingly, we have made all of our programming scripts and datasets publicly available at https://github.com/WatanabeSmith/AML_ELN_PrognosticRiskClassification to encourage further research.
Ultimately, AML prognostic risk involves a constellation of abnormalities, each one carrying more or less weight depending on the abnormalities with which it co-occurs. This analysis provides an important first step in applying this understanding to the newly revised ELN guidelines for AML.
References
- Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N Engl J Med. 2013; 368(22):2059-2074. PubMedhttps://doi.org/10.1056/NEJMoa1301689Google Scholar
- Kihara R, Nagata Y, Kiyoi H. Comprehensive analysis of genetic alterations and their prognostic impacts in adult acute myeloid leukemia patients. Leukemia. 2014; 28(8):1586-1595. PubMedhttps://doi.org/10.1038/leu.2014.55Google Scholar
- Papaemmanuil E, Gerstung M, Bullinger L. Genomic Classification and Prognosis in Acute Myeloid Leukemia. N Engl J Med. 2016; 374(23):2209-2221. PubMedhttps://doi.org/10.1056/NEJMoa1516192Google Scholar
- Döhner H, Estey EH, Amadori S. Diagnosis and management of acute myeloid leukemia in adults: recommendations from an international expert panel, on behalf of the European LeukemiaNet. Blood. 2010; 115(3):453-474. PubMedhttps://doi.org/10.1182/blood-2009-07-235358Google Scholar
- Döhner H, Estey E, Grimwade D. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood. 2017; 129(4):424-447. PubMedhttps://doi.org/10.1182/blood-2016-08-733196Google Scholar
- Lin TL, Williams T, He J. Rates of complete diagnostic testing for patients with acute myeloid leukemia. Cancer Med. 2015; 4(4):519-522. Google Scholar
- Pollyea DA, George T, Foucar KM. Molecular genetic testing patterns for patients with newly diagnosed acute myeloid leukemia (AML) enrolled in the CONNECT MDS/AML disease registry. J Clin Oncol. 2017; 35(15_suppl):7022-7022. Google Scholar
- Gerstung M, Papaemmanuil E, Martincorena I. Precision oncology for acute myeloid leukemia using a knowledge bank approach. Nat Genet. 2017; 49(3):332-340. Google Scholar
- McGowan-Jordan J, Simons A, Schmid M. ISCN 2016: An International System for Human Cytogenomic Nomenclature. Karger: Basel, New York; 2016. Google Scholar
- Sanz MA, Grimwade D, Tallman MS. Management of acute promyelocytic leukemia: recommendations from an expert panel on behalf of the European LeukemiaNet. Blood. 2009; 113(9):1875-1891. PubMedhttps://doi.org/10.1182/blood-2008-04-150250Google Scholar
- O’Donnell MR, Abboud CN, Altman J. Acute myeloid leukemia. J Natl Compr Canc Netw. 2011; 9(3):280-317. PubMedhttps://doi.org/10.6004/jnccn.2011.0027Google Scholar
- Tsai C-H, Hou HA, Tang JL. Genetic alterations and their clinical implications in older patients with acute myeloid leukemia. Leukemia. 2016; 30(7):1485-1492. Google Scholar
- Benicio MT de L, Ribeiro AFT, Américo AD. Evaluation of the European LeukemiaNet recommendations for predicting outcomes of patients with acute myeloid leukemia treated in low- and middle-income countries (LMIC): A Brazilian experience. Leuk Res. 2017; 60:109-114. Google Scholar