In this issue of Blood, Kwok and colleagues have provided a wealth of retrospective clinical data that greatly expands our appreciation for the prognostic importance of flow-cytometric residual disease measurement across a spectrum of clinical presentations in chronic lymphocytic leukemia (CLL).1 

Measuring minimal residual disease (MRD) to detect 1 or fewer CLL cells in 10 000 normal leukocytes has consistently shown correlation with long-term clinical outcomes when examined in the context of prospective clinical trials.2-4  Substantial international effort has provided a sound scientific foundation establishing valid and reproducible methods in flow cytometry and polymerase chain reaction–based molecular assays.5  The excitement for CLL MRD is justified both by the impressive methods science and the remarkable therapeutic advances in CLL unfolding with each contemporary major clinical trial. Each new achievement is revolutionizing the principles and practice of CLL medicine.6  In the current study, Kwok and colleagues have shown that the long-term prognostic value of MRD is valid with up to 18 years of follow-up across a spectrum of therapeutic approaches both in the up-front setting and for previously treated patients.1  Doubts about the prognostic validity of MRD in various settings should now be put to rest. The clinical utility of MRD must now be understood so that it can be properly applied.

MRD as a clinical tool rests on its powerful prognostic capacity. There is interest in using MRD to guide therapy and for the design of clinical trials, including as end points. Improving clinical outcomes for those living with CLL entails improving survival and quality of life while minimizing toxicity, including financial burden. MRD assessment will play an important role in achieving these goals.

In order to learn how to apply MRD toward these goals, randomized controlled clinical trials (RCTs) asking specific questions concerning the use of MRD must be conducted. For example, should MRD status support treatment decisions for shortening or intensifying therapy? Can MRD assessments be used to define treatment duration? Only through RCTs can answers to these types of critically important questions be ascertained. The essential clinical trial end points required for this work, progression-free survival (PFS) and overall survival (OS), require long-term follow-up. The work is daunting yet essential. There is interest in shorter-term clinical trial end points, such as MRD status, in hopes of more rapidly achieving these goals to improve the lives of those living with CLL. However, to confidently use a shorter-term end point in place of an accepted longer-term clinical benefit end point demands that key evidentiary requirements be met.

The strong association of MRD status with prognosis has raised the prospect that MRD may be suitable for use as a surrogate for the true clinical benefit end points PFS and OS.7  Indeed, if the treatment effect on MRD were able to reliably predict the treatment effect on PFS and OS—this is termed the trial-level effect—the ability to complete clinical trials could potentially be substantially shortened. One might ask if the current data already support the use of MRD as a surrogate for longer-term clinical benefit end points. Unfortunately, the answer to this is no, not yet. The analyses so far have evaluated MRD by pooling patients who have undetectable MRD levels and comparing them to those pooled on the basis of positive MRD detection—such an analysis reflects an amalgamation of varied patient-level effects. In this situation the improved outcomes in the undetectable MRD group are pooled over specific therapies at the patient level, and therefore, it is impossible to assign a causal effect or magnitude of effect on MRD to any given therapy. In other words, trial-level effects of treatment are not demonstrated. One way to think of such an analysis is that MRD serves as a tumor sensitivity assay for each patient, and so MRD results forecast on an individual patient level whether clinical outcome is likely to be favorable or not. We can only conclude that an individual patient with undetectable MRD status has a more favorable prognosis compared with a patient with detectable MRD.

For MRD to serve as a surrogate end point in RCTs, we need to establish that the trial-level effect of a specific treatment causes the MRD effect and that the MRD effect reliably predicts the true clinical benefit effect. To establish the desired trial-level surrogacy, a collection of RCTs is required in order to show how the experimental treatment affects MRD compared with how the control treatment effects MRD and whether the magnitude of effect on MRD enables one to reliably predict the magnitude of effect on PFS or OS.8,9  This is a formidable task. Many factors can confound the assessment. If patient clinical care is changed based on MRD knowledge outside of the protocol-directed approach, then the potential surrogacy relationship is uninterpretable. Further complicating the use of MRD as a surrogate is the fact that MRD dynamics vary incredibly by type of treatment. For example, compared with chemoimmunotherapy, novel biologic therapies such as ibrutinib have a much slower response time and often do not result in undetectable MRD status, yet they may improve PFS.5,10  This introduces uncertainty for timing of MRD assessment for various therapies in the quest toward establishing surrogacy. Establishment of surrogacy for one type of therapy does not necessarily support a claim of surrogacy for another therapeutic agent. Surrogacy has to be constantly verified with new therapeutic agents.

Importantly, surrogacy is not required for immediate incorporation of MRD assessment into clinical trials designed to improve clinical outcome for the individual patient and to address important individual and public health concerns. As novel therapies for CLL involve chronic administration of expensive medicines, trials focused on duration, intensification, or deintensification of therapy can provide precision in determining who will benefit from different treatment strategies. Provided such studies measure the true clinical benefit end points, they will also provide the necessary data to evaluate claims for potential surrogacy of MRD.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

1.
Kwok
M
,
Rawstron
AC
,
Varghese
A
, et al
.
Minimal residual disease is an independent predictor for 10-year survival in CLL
.
Blood
.
2016
;
128
(
24
):
2770
-
2773
.
2.
Moreno
C
,
Villamor
N
,
Colomer
D
, et al
.
Clinical significance of minimal residual disease, as assessed by different techniques, after stem cell transplantation for chronic lymphocytic leukemia
.
Blood
.
2006
;
107
(
11
):
4563
-
4569
.
3.
Böttcher
S
,
Ritgen
M
,
Fischer
K
, et al
.
Minimal residual disease quantification is an independent predictor of progression-free and overall survival in chronic lymphocytic leukemia: a multivariate analysis from the randomized GCLLSG CLL8 trial
.
J Clin Oncol
.
2012
;
30
(
9
):
980
-
988
.
4.
Kovacs
G
,
Robrecht
S
,
Fink
AM
, et al
.
Minimal residual disease assessment improves prediction of outcome in patients with chronic lymphocytic leukemia (CLL) who achieve partial response: comprehensive analysis of two phase III studies of the German CLL Study Group (published online ahead of print 29 August 2016)
.
J Clin Oncol
.
2016
.
5.
Thompson
PA
,
Wierda
WG
.
Eliminating minimal residual disease as a therapeutic end point: working toward cure for patients with CLL
.
Blood
.
2016
;
127
(
3
):
279
-
286
.
6.
Koffman
B
,
Schorr
A
.
The 21st century revolution in CLL: why this matters to patients
.
Best Pract Res Clin Haematol
.
2016
;
29
(
1
):
122
-
132
.
7.
Böttcher
S
,
Hallek
M
,
Ritgen
M
,
Kneba
M
.
The role of minimal residual disease measurements in the therapy for CLL: is it ready for prime time?
Hematol Oncol Clin North Am
.
2013
;
27
(
2
):
267
-
288
.
8.
Buyse
M
,
Sargent
DJ
,
Grothey
A
,
Matheson
A
,
de Gramont
A
.
Biomarkers and surrogate end points--the challenge of statistical validation
.
Nat Rev Clin Oncol
.
2010
;
7
(
6
):
309
-
317
.
9.
Korn
EL
,
Sachs
MC
,
McShane
LM
.
Statistical controversies in clinical research: assessing pathologic complete response as a trial-level surrogate end point for early-stage breast cancer
.
Ann Oncol
.
2016
;
27
(
1
):
10
-
15
.
10.
Woyach
JA
,
Smucker
K
,
Smith
LL
, et al
.
Prolonged lymphocytosis during ibrutinib therapy is associated with distinct molecular characteristics and does not indicate a suboptimal response to therapy
.
Blood
.
2014
;
123
(
12
):
1810
-
1817
.
Sign in via your Institution