Acute myeloid leukemia (AML) encompasses a biologically and clinically heterogeneous group of disorders. Among these, therapy-related AML (t-AML) and AML arising from antecedent myelodysplastic syndromes (MDS) or myeloproliferative neoplasms (MPN) are consistently associated with poorer prognosis. Contemporary classifications increasingly emphasize genetic and morphological criteria. The International Consensus Classification (ICC) prioritizes morphologic and molecular characteristics and subsequently applies the “therapy-related” qualifier (a complex clinical entity characterized by its association with prior exposure to cytotoxic agents or radiotherapy), whereas the 5th edition of the World Health Organization (WHO) classification replaces the former category of “AML with myelodysplasia-related changes (MRC)” with “AML, myelodysplasia-related (AML-MR).” AML-MR is defined by ≥20% myeloid blasts in the presence of specific myelodysplasia-associated cytogenetic or molecular abnormalities and may arise de novo or following MDS/MDS-MPN.13
CPX-351 is a liposomal co-formulation of cytarabine and daunorubicin at a fixed 5:1 molar ratio, engineered to deliver synergistic drug exposure directly to leukemic blasts.4 In a pivotal randomized phase III trial involving older adults with newly diagnosed high-risk or secondary AML, CPX-351 significantly improved overall survival (OS) compared to the conventional “7+3” regimen. Longer-term follow-up demonstrated a median OS of 9.56 months versus 5.95 months (P=0.003) favoring CPX-351, with sustained benefit observed in subsets eligible for allogeneic hematopoietic stem cell transplantation (allo-HSCT).5,6 These results have led to the broad integration of CPX-351 in clinical practice and spurred numerous real-world evidence (RWE) studies across diverse patient populations and healthcare settings. Nevertheless, a comprehensive synthesis comparing RWE outcomes with those from randomized controlled trials (RCT) - encompassing both efficacy and safety endpoints - is currently lacking in the literature.
To address the gap, we conducted a meta-analysis with two primary aims: (i) to synthesize outcomes for patients treated with CPX-351 across RCT and RWE cohorts, and (ii) to compare effect sizes between RCT and RWE studies for key endpoints - complete remission (CR), 1-year OS, rate of allo-HSCT post-response, and early mortality.
Following a prespecified protocol and adhering to PRISMA 2020 guidelines,7 we included RCT and observational RWE studies enrolling adults (≥18 years) with newly diagnosed AML receiving CPX-351 as single-agent induction therapy, with or without consolidation according to study-specific protocols or routine care (Online Supplementary Figure S1). Studies assessing CPX-351 in combination regimens (e.g., with venetoclax or gemtuzumab ozogamicin),8,9 and those involving relapsed/refractory AML, were excluded. The study respects the ethical rules of the country in which it has been performed.
A systematic search of MEDLINE (PubMed), Cochrane Central Register of Controlled Trials (CENTRAL), and Clinical-Trials.gov was conducted through December 2024, with no language or publication-status restrictions. Search terms included “acute myeloid leukemia,” “AML,” “CPX-351,” and “real-world.” Two independent reviewers performed study screening, data extraction, and quality assessment (MM and DG). Primary endpoints were CR, including CR with incomplete hematologic recovery (CRi) where consistently reported; 1-year OS; proportion of patients proceeding to allo-HSCT among responders or all treated patients, as specified; and 30- and 60-day mortality. Given anticipated clinical and methodological heterogeneity, random-effects models were applied for meta-analysis, with heterogeneity quantified using Q statistics and the I² index. Comparisons between RCT and RWE results were descriptive, based on subgroup-specific pooled estimates with corresponding confidence intervals.
We identified 16 eligible studies involving 3,280 patients treated with CPX-351. This included two RCT comparing CPX-351 versus standard cytarabine-daunorubicin (7+3) and versus FLAG-Ida, comprising 258 patients in CPX-351 arms.5,10 Fourteen were observational RWE cohorts from Europe and North America, representing 3,022 patients (list of RWE studies is available in the Online Supplementary Appendix). Median patient ages ranged from late 50s to late 60s, consistent with the target population of secondary/ high-risk AML. Across all studies, consolidation treatment was delivered with CPX-351 at doses and schedules consistent with the approved product labeling. Diagnostic criteria varied across studies, encompassing t-AML, AML-MR/MRC, and “secondary AML,” with variation especially notable among RWE cohorts, which also included some younger adults (Table 1). Most of the studies were conducted prior to 2022; consequently, the diagnostic definitions of s-AML relied on earlier classification systems, preceding the current ICC and WHO 2022 classifications. However, the aforementioned limitation did not significantly impact the meta-analytic results.
The RCT population comprised the US/Canada pivotal program (median age 67.8 years) and the UK NCRI AML19 high-risk cohort (median age 57 years). The RWE datasets included national registries (e.g., England’s NCRAS, N=353; England CAS, N=602), multicenter consortia (e.g., MARROW, N=267; CREST-UK, N=147), and single- or multicenter institutional series from Germany, Italy (2 cohorts), France, Spain (2 cohorts), and US centers (N ranges, 59-513). Median age in RWE cohorts ranged between 60 and 67 years, broadly aligning with RCT demographics.
The pooled CR rate across all studies was 56% (95% confidence interval [CI]: 50-61%) with significant heterogeneity (Q=82.46; P<0.001; I²=85%). Notably, pooled CR rates were nearly identical between RCT (56%; 95% CI: 40-71%) and RWE (56%; 95% CI: 50-62%) (P=0.23) (Figure 1A). The overall pooled 1-year OS was 54% (95% CI: 51-57%), also with significant heterogeneity (Q=51.19; P<0.001; I²=71%). Notably, 1-year OS was lower in RCT (47%; 95% CI: 35-60%) compared to RWE cohorts (55%; 95% CI: 5258%) (P=0.13) (Figure 1B). Among responders, 38% (95% CI: 33-43%) proceeded to allo-HSCT (the mean percentage of patients undergoing allo-HSCT in CR1 was 83% [95% CI: 75-100%]), with high variability (Q=126.57; P<0.001; I²=88%). RCT reported 41% (95% CI: 24-59%), and RWE cohorts 38% (95% CI: 32-43%) (P=0.193) (Figure 1C). Thirty-day mortality was uniformly 6% across all studies with moderate heterogeneity (Q=12.44; P=0.13; I²=36%) and was consistent between RCT and RWE (6%; 95% CI for RCT: 3-9%; RWE 5-8%; P=0.587) (Figure 1D). Sixty-day mortality, reported in nine studies, was 10% overall with higher heterogeneity (Q=50.76; P<0.001; I²=80%), but slightly higher in RCT (13%; 95% CI: 9-18%) versus RWE (9%; 95% CI: 6-13%; P=0.03) (Online Supplementary Figure S2; Table 2).
There are several aspects qualifying the results of this meta-analysis. Firstly, RWE did not underperform compared with RCT, contrary to common expectations. The pivotal phase III trial selectively enrolled older patients with uniformly high-risk disease and applied strict definition criteria for therapy-related and myelodysplasia-related AML, potentially enriching for adverse biology.5 In addition, RWE cohorts are heterogeneous, including younger adults, evolving diagnostic classifications (t-AML, AML-MR/MRC, secondary AML), and potentially benefiting from advances in supportive care and transplant practices over time. Also, the rate of patients accessing to an allo-HSCT procedure varied between the two RCT (approximately 33-50%), underscoring the impact of institutional policies and donor availability on survival beyond induction therapy.11-13 Finally, differences in treatment delivery, including outpatient administration and earlier referral to transplant, may mitigate toxicity and improve bridging to curative approaches.
Although substantial heterogeneity across studies is expected given geographic, temporal, and patient-selection diversity, the concordance of CR rates between RCT and RWE populations is reassuring, supporting the generalizability of CPX-351’s disease-reducing efficacy. In addition, the stability of 30-day mortality at approximately 6% supports acceptable early safety in routine clinical practice. The observed lower 60-day mortality in RWE relative to RCT (8.5-9% vs. 12-13%, depending on the cohort) may reflect contemporary care pathways and an evolving learning curve in managing CPX-351-related toxicities. These patterns suggest that real-world settings may benefit from improved familiarity with CPX-351 and that early safety signals can be favorable outside trial environments.
Table 1.Randomized controlled trials (RCTs) and real-world evidence (RWE) studies included in the metanalysis.
Figure 1.Forest plot of the composite complete remission, the rate of allogeneic transplantation, the 1-year overall survival and the 30-days mortality from the studies included in this meta-analysis. (A) Forest plot of the composite complete response (CR) from the studies included in this meta-analysis. In a forest-plot view, study-level CR estimates vary widely (reflecting differences in age, baseline risk, and transplant policies), yet the diamond-shaped pooled estimate for randomized controlled trials (RCT) overlaps closely with that for real-world evidence (RWE), visually underscoring the consistency of CR with CPX-351 across controlled and routine-practice settings. (B) Forest plot of the rate of allogeneic transplantation performed in patients from the included studies in this meta-analysis. The forest plot reveals substantial inter-study variation in transplant rates - driven by differences in fitness, donor availability, and national policies - yet the pooled estimates for RCT and RWE remain closely aligned, supporting the concept of CPX-351 as an effective “bridge-to-transplant” platform in both trial and practice. (C) Forest plot of the 1-year overall survival from the included studies in this meta-analysis. The forest plot indicates a rightward shift (higher survival proportions) in several large RWE datasets relative to the RCT. While confidence intervals partly overlap, the visual impression is that everyday practice yielded at least comparable - and in aggregate slightly higher - 1-year survival. (D) Forest plot of the 30-day mortality in patients from the included studies in this meta-analysis. The pooled estimate sits near 6% with narrow 95% confidence intervals (CI) across subgroups, visually conveying early safety consistency. RT: controlled trials.
Our findings complement prior meta-analytic work by Sandhu et al.,14 which reported an indirect comparison between cohorts treated exclusively with CPX-351 and those treated with the 3+7 regimen. Collectively, both analyses support CPX-351 as a consolidated benchmark for future studies testing novel agents in the context of secondary/ high-risk AML. This is particularly relevant for patients with AML-MR and t-AML, in whom CPX-351 demonstrates robust remission induction that facilitates allo-HSCT.
Early mortality rates with CPX-351 are comparable to, and survival outcomes are at least equivalent to, those observed with intensive chemotherapy, and may be superior in real-world settings. These findings endorse the sustained preferential use of CPX-351 in eligible patients and underscore the importance of optimizing transplant referral and timing after induction. Emerging data on dosing strategies and peri-transplant integration in real-world practice (for example, Italian RWE) are expected to further refine management.13
Limitations include heterogeneity in endpoint definitions and case-mix definitions across RWE cohorts, which can hinder direct comparability. The absence of individual patient data restricted our ability to perform adjusted analyses. Moreover, temporal improvements in supportive care may bias RWE in a favorable direction relative to older randomized CT. Additionally, the analysis is susceptible to publication and selection biases inherent to study availability and reporting.
Table 2.Summary of the meta-analytic statistical data according to the examined criteria.
Prospective registries with standardized phenotyping aligned to WHO/ICC criteria, harmonized response definitions, and comprehensive molecular risk profiling (including TP53 status, complex karyotype, and NPM1/FLT3 mutations), together with systematic documentation of transplant intent, are essential to enable more granular benchmarking and direct comparisons with alternative induction regimens such as venetoclax plus hypomethylating agents in elderly or unfit patients. Incorporating quality-of-life assessments and health-economic evaluations alongside clinical effectiveness would provide a more holistic basis for informed therapeutic decision-making.
In summary, this synthesis of 16 studies involving over 3,200 patients confirms that CPX-351 achieves consistent remission rates (~56%), low early mortality (~6% at 30 days), and meaningful bridge-to-transplant rates (~38-41%) across both RCT and real-world contexts. The 1-year OS in pooled RWE (~55%) is not inferior - and may modestly exceed - that of RCT populations, likely reflecting differences in patient selection and care delivery. These findings reinforce the external validity of CPX-351 as the induction therapy of choice for secondary/high-risk AML, including AML-MR and t-AML under current classification systems.
Footnotes
- Received October 21, 2025
- Accepted December 4, 2025
Correspondence
Disclosures
No conflicts of interest to disclose.
Contributions
References
- Arber DA, Orazi A, Hasserjian RP. International Consensus Classification of Myeloid Neoplasms and Acute Leukemias: integrating morphologic, clinical, and genomic data. Blood. 2022; 140(11):1200-1228. Google Scholar
- Khoury JD, Solary E, Abla O. The 5th edition of the World Health Organization Classification of haematolymphoid tumours: myeloid and histiocytic/dendritic neoplasms. Leukemia. 2022; 36(7):1703-1719. Google Scholar
- Döhner H, Weisdorf DJ, Bloomfield CD. Acute myeloid leukemia. N Engl J Med. 2015; 373(12):1136-1152. Google Scholar
- Molica M, Perrone S, Mazzone C, Cesini L, Canichella M, de Fabritiis P. CPX-351: An old scheme with a new formulation in the treatment of high-risk AML. Cancers (Basels). 2022; 14(12):2843. Google Scholar
- Lancet JE, Uy GL, Newell LF. CPX-351 versus 7+3 cytarabine and daunorubicin chemotherapy in older adults with newly diagnosed high-risk or secondary acute myeloid leukaemia: 5-year results of a randomised, open-label, multicentre, phase 3 trial. Lancet Haematol. 2021; 8(7):e481-e491. Google Scholar
- Lancet JE, Uy GL, Cortes JE. CPX-351 (cytarabine and daunorubicin) Liposome for injection versus conventional cytarabine plus daunorubicin in older patients with newly diagnosed secondary acute myeloid leukemia. J Clin Oncol. 2018; 36(26):2684-2692. Google Scholar
- Page MJ, McKenzie JE, Bossuyt PM. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021; 372:n71. Google Scholar
- Uy GL, Pullarkat V, Baratam P. Lower-intensity CPX-351 plus venetoclax induction for adults with newly diagnosed AML unfit for intensive chemotherapy. Blood Adv. 2024; 8(24):6248-6256. Google Scholar
- Ganzel C, Frisch A, Wolach O. CPX-351 +/- gemtuzumab ozogamicin as induction therapy for adult patients with newly diagnosed, favourable-intermediate risk, FLT3-ITD negative, AML: A pilot study. Br J Haematol. 2025; 206(2):551-555. Google Scholar
- Othman J, Wilhelm-Benartzi C, Dillon R. A randomized comparison of CPX-351 and FLAG-Ida in adverse karyotype AML and high-risk MDS: the UK NCRI AML19 trial. Blood Adv. 2023; 7(16):4539-4549. Google Scholar
- Mehta P, Campbell V, Maddox J. CREST-UK: Real-world effectiveness, safety and outpatient delivery of CPX-351 for first-line treatment of newly diagnosed therapy-related AML and AML with myelodysplasia-related changes in the UK. Br J Haematol. 2024; 205(4):1326-1336. Google Scholar
- Legg A, Lambova A, Broe A, Levy J, Medalla G. Real-world experience with CPX-351 treatment for acute myeloid leukemia in England: an analysis from the National Cancer Registration and Analysis Service. Clin Lymphoma Myeloma Leuk. 2023; 23(10):e323-e330. Google Scholar
- Guolo F, Fianchi L, Martelli MP. Real world study on the best CPX-351 treatment duration and timing for allogeneic stem cell transplantation. Am J Hematol. 2025; 100(12):2293-2304. Google Scholar
- Sandhu M, Ashok Kumar P, Shi CW. Evaluation of treatment outcomes with CPX-351 in adults with AML: A meta-analysis. J Clin Oncol. 2024; 42:6540-6540. Google Scholar
Data Supplements
Figures & Tables
Article Information

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.