In this issue of Blood, Ghione et al1 have compared outcomes for patients with multiply relapsed follicular lymphoma treated with axicabtagene ciloleucel (axi-cel) in the ZUMA-5 study with those in the retrospective SCHOLAR-5 registry. Axi-cel was highly active in the ZUMA-5 population, with an impressive complete response rate of 79% and the median progression free survival not yet reached after 17.5 months of follow-up.2 However, as a single-arm study, it is not possible to directly compare these outcomes with the plethora of currently available therapies, including various combinations of chemotherapy, anti-CD20 and bispecific antibodies, lenalidomide, phosphoinositide 3-kinase inhibitors, or enhancer of zeste homolog (EZH) 2 inhibitors. In the comparative effectiveness study, Ghione et al used propensity-scoring analysis on prespecified prognostic factors using standardized mortality weighting to match patients in the SCHOLAR-5 registry to those in the study. The authors charted marked improvements in response rate with axi-cel, as well as a notably improved time to next treatment and improved progression-free and overall survival, with a substantial 58% reduction in the risk of death (hazard ratio [HR], 0.42; 95% confidence interval [CI], 0.21-0.83).
The use of axi-cel for follicular lymphoma is accompanied by significant toxicities, with cytokine release syndrome in 78% of patients, which was grade 3 or worse in 6% of those treated. In the total trial population, 50% required treatment with tocilizumab and 5% needed vasopressors. Grade 3/4 neurological events were seen in 15% of patients, and grade 3 or worse infections were seen in 18%. As the authors note, it is not possible to directly compare these events to the historical controls.
The correlation between observational studies and formal randomized controlled trials (RCTs) has been debated for decades. In one study of 19 therapies examined across 53 observational studies and 83 RCTs, there was a high degree of correlation between the observational and randomized trials, with only 2 of the analyses showing an effect in the observational studies outside the 95% CI for the effect in the RCTs.3 However, in a larger analysis restricted to oncology treatments, which compared 350 observational studies of 2 treatment regimens compiled from various databases versus 121 RCTs, there was no significant correlation between the HRs.4
Propensity-scoring analysis has been proposed as a method to simulate randomized controlled trials without the need for a contemporaneous control arm.5 However, there is considerable debate as to how effective they are in this regard. Propensity scoring can adjust for a whole range of measured variables to decrease confounding but is unable to adjust for unmeasured variables. How can we be sure that the selected patients from a registry are truly comparable to those fit enough for inclusion in clinical trials?
Kumar et al compared the outcomes for patients in 141 RCTs across 8 cancer types versus matched patients in the National Cancer Database (NCDB).6 When analyzed using propensity-scoring analysis, the correlation between HRs in the clinical trials and the database was low (r = 0.25; 95% CI, 0.09-0.40; P = .003). In 92 RCTs, there was no significant difference (P ≥ .05) between the treatment arms, but the retrospective propensity-matched NCDB analysis in 59 of these studies (64%) showed significant P values. Of the 6 lymphoma studies analyzed, the HR was concordant in only 3 studies.
Despite these caveats, there are clearly patients with multiply relapsed follicular lymphoma for whom axi-cel will offer a significant benefit over existing therapies. Similarly, bispecific antibodies have shown comparable response rates in this population, with mosunetuzumab showing a complete response rate of 58% and overall response rate of 79% in a cohort of 90 patients with follicular lymphoma who had received ≥2 prior therapies.7 The optimal sequencing of chimeric antigen receptor (CAR) T-cell therapy and the patients who benefit the most remain to be determined.
Ghione et al cite feasibility constraints of an RCT in multiply relapsed follicular lymphoma, but, while acknowledging the challenges, an RCT is essential to inform optimal use of axi-cel, as recently achieved in the setting of relapsed/refractory diffuse large B-cell lymphoma.8 With improvements in the safe delivery of CAR T-cell therapy across populations and the impressive time to next treatment and overall survival advantages demonstrated in this comparative analysis, clinician and patient enthusiasm would be high, substantially improving the feasibility of conducting a phase 3 RCT, especially with a crossover design. Ideally, this would include a more racially diverse range of patients than the ZUMA-5 and SCHOLAR-1 cohorts, which were both >90% White. Based on these comparative effectiveness data, the number of patients needed to demonstrate superiority would not be large, and, given the multiple therapeutic options in the third-line setting, a comparator arm with physicians’ choice for the standard of care would facilitate recruitment. RCTs are needed to formally compare the activity and toxicity of axi-cel with other approaches as we prepare to roll out this efficacious immunotherapy globally.
Conflict-of-interest disclosure: R.G. has received honoraria from Abbvie, Astellas, Janssen, MSD, Novartis, Otsuka Pharmaceutical, and Takeda. J.T. reports research funding to institution from BeiGene, Celgene, Cellectar, Roche, PCYC, Takeda, and Janssen.