Abstract
Introduction. Acute myeloid leukemia (AML) shows remarkable biological heterogeneity, with diverse genomic lesions driving variable clinical trajectories. Although the Sanger AML multistage model by Gerstung et al. (Nature Genetics, 2017) predicts disease transitions through multiple clinical states, its real-world performance has not been externally validated. We aimed to assess its predictive accuracy in a contemporary AML cohort and to explore the clinical-molecular determinants of treatment response.
Methods. We retrospectively analyzed 73 AML patients (median age 61 years, range 19–81) diagnosed and treated between 2019 and 2024. Genomic profiling was performed by NGS panels covering recurrently mutated AML genes. Clinical variables, cytogenetic risk, and mutational profiles were used to compute individualized predictions through the Sanger AML multistage online tool. Observed clinical outcomes were classified at 12 and 36 months into three major groups: patients in continuous complete remission (CR), relapsed, and refractory. Agreement between predicted and observed outcomes was measured by weighted Cohen’s kappa.
Results. Among 73 patients, 23 (32%) achieved durable CR, 21 (29%) relapsed, and 29 (39%) were refractory or died without remission. CR patients were significantly younger (median 51 years) and mostly treated with intensive chemotherapy (78%), while refractory cases were older (median 63 years) and frequently harbored TP53 mutations (24%) or complex karyotypes (43%). Relapsed patients showed intermediate profiles, with partial enrichment of FLT3-ITD (19%) and adverse-risk cytogenetics (40%), as reported in Table 1.
TP53 mutation emerged as the strongest independent adverse factor (HR 8.07, 95% CI 2.23–29.13, p=0.0014), conferring more than eightfold increased risk of death and a median OS of 1 month compared with 14 months in TP53 wild-type. FLT3-ITD mutations correlated with higher CR achievement (43% vs 14%, p=0.037).
At 12 months, 65 evaluable patients yielded moderate agreement between model predictions and actual outcomes (weighted κ=0.57, SE=0.10, 95% CI 0.38–0.77), with best accuracy among refractory and continuously remitted patients. Misclassifications mainly involved elderly or non-intensively treated cases. At 36 months, concordance declined to κ=0.34 (SE=0.06, 95% CI 0.23–0.46), as relapsed and long-term survivors were less reliably predicted. Alive-vs-dead concordance confirmed this trend (κ=0.55 at 12 months; κ=0.39 at 36 months). Overall, the model captured early mortality well but underestimated relapse probability, reflecting evolving treatment paradigms.
Conclusions. TP53 mutation remains the dominant adverse determinant in AML. The Sanger multistage model retains good short-term but limited long-term accuracy, emphasizing the need for retraining with contemporary therapeutic datasets and integration of dynamic molecular variables for individualized prognostication.

Figures & Tables
Article Information

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.