• There is a high level of heterogeneity in cGVHD plasma biomarkers in a large cGVHD cohort, with CXCL10 being the most reproducible marker.

  • CXCR3+CD56bright natural killer regulatory cells have a strong inverse relationship with plasma CXCL10 in patients with or without cGVHD.

Chronic graft-versus-host disease (cGVHD) remains one of the most significant long-term complications after allogeneic blood and marrow transplantation. Diagnostic biomarkers for cGVHD are needed for early diagnosis and may guide identification of prognostic markers. No cGVHD biomarker has yet been validated for use in clinical practice. We evaluated both previously known markers and performed discovery-based analysis for cGVHD biomarkers in a 2 independent test sets (total of 36 cases ≤1 month from diagnosis and 31 time-matched controls with no cGVHD). On the basis of these results, 11 markers were selected and evaluated in 2 independent replication cohorts (total of 134 cGVHD cases and 154 controls). cGVHD cases and controls were evaluated for several clinical covariates, and their impact on biomarkers was identified by univariate analysis. The 2 replications sets were relatively disparate in the biomarkers they replicated. Only sBAFF and, most consistently, CXCL10 were identified as significant in both replication sets. Other markers identified as significant in only 1 replication set included intercellular adhesion molecule 1 (ICAM-1), anti-LG3, aminopeptidase N, CXCL9, endothelin-1, and gelsolin. Multivariate analysis found that all covariates evaluated affected interpretation of the biomarkers. CXCL10 had an increased significance in combination with anti-LG3 and CXCL9, or inversely with CXCR3+CD56bright natural killer (NK) cells. There was significant heterogeneity of cGVHD biomarkers in a large comprehensive evaluation of cGVHD biomarkers impacted by several covariates. Only CXCL10 strongly correlated in both replication sets. Future analyses for plasma cGVHD biomarkers will need to be performed on very large patient groups with consideration of multiple covariates.

Chronic graft-versus-host disease (cGVHD) remains one of the most significant long-term complications of allogeneic blood and marrow transplantation (BMT). Recent studies have shown that the quality of life of patients with cGVHD is much poorer, with a higher frequency of cardiovascular disease, metabolic syndrome, diabetes mellitus, obesity, cognitive issues, fatigue, sexual dysfunction, and endocrine abnormalities.1-4 

Biomarkers that can act as diagnostic, prognostic, and predictive markers for cGVHD are needed to improve the ability to treat cGVHD effectively. Criteria for such biomarkers are well defined in the recent National Institutes of Health (NIH) Chronic GVHD consensus report.5  Many candidate plasma or serum cGVHD diagnostic biomarkers have been identified but none have been validated for use in clinical practice.6  Many recurring markers include soluble B-cell activating factor (sBAFF), aminopeptidase N (sCD13), interleukin-2 receptor alpha (IL-2Rα), and CXCL9.6-8 

It has been 10 years since the first NIH consensus report was published, and the most recent report still concludes that there is a great need for biomarkers that can be rigorously replicated by several research groups. The conclusions of the most recent consensus report were that (1) larger replication studies are needed, (2) more work needs to focus on prognostic and predictive cGVHD biomarkers in addition to diagnostic markers, and (3) there is little consistency between the biomarkers currently identified to allow for clinical application. The report noted that collection methodology is highly variable, with little consideration of factors that may have an impact on the importance of the markers, and many potential covariates were identified.5  These covariates included method of sample collection, clinical data collected, clinical presentation, total body irradiation (TBI), previous acute GVHD (aGVHD), infection, type of preparative regimen, underlying disease, donor source, and level of HLA matching.

In this study, we performed a replication study of cGVHD diagnostic markers in adult cohorts after initial discovery analyses on independent test sets. We attempted to control our analyses for several covariates, including time of onset, presence of previous aGVHD, and TBI. We found that there was a high heterogeneity among plasma markers, but that CXCL10 was consistently elevated at diagnosis of cGVHD. We found that a high plasma CXCL10 level correlated with low numbers of peripheral blood CXCR3+CD56bright natural killer (NK) cells.

Sample collection and processing protocol

Samples in all test and replication sets were collected from patients age 18 years or older. Two test sets and 2 independent replication sets (Figure 1) were obtained, and laboratory analysis was performed after informed consent was provided in accordance with the Declaration of Helsinki and approval was given by the local ethics committee. Plasma samples for test set 1 were collected at the Fred Hutchinson Cancer Research Center from participants in a prospective longitudinal cohort study.9  Samples from late onset cases,10  diagnosed 9 or more months after transplantation, were collected within 1 month of the onset of cGVHD (n = 17). Control samples were obtained from patients with no cGVHD at the time of collection or in the subsequent 3 months. These samples were matched to cGVHD cases on the basis of time after BMT (n = 21). Any cases that had a previous diagnosis of cGVHD or a flare-up of old cGVHD were excluded from both groups.

Figure 1

Biomarker selection algorithm. Biomarkers for testing in the replication sets were selected from 2 test sets with discovery-based evaluations using proteomic analysis followed by MRM-MS replication, Luminex, and enzymatic assays. Other markers were selected from communication with collaborators (M.-J.H. and S.G.H.) as well as from recent biomarker studies in the literature (Kitko et al8 ). Using the selection criteria as outlined, only 11 biomarkers were evaluated in the replication sets using ELISA, MRM-MS, Luminex, and enzymatic assays.

Figure 1

Biomarker selection algorithm. Biomarkers for testing in the replication sets were selected from 2 test sets with discovery-based evaluations using proteomic analysis followed by MRM-MS replication, Luminex, and enzymatic assays. Other markers were selected from communication with collaborators (M.-J.H. and S.G.H.) as well as from recent biomarker studies in the literature (Kitko et al8 ). Using the selection criteria as outlined, only 11 biomarkers were evaluated in the replication sets using ELISA, MRM-MS, Luminex, and enzymatic assays.

Close modal

Twenty-three samples for test set 2 were collected from patients undergoing allogeneic BMT at the University Hospital Regensburg (19 cGVHD cases and 4 no GVHD controls). Serum samples were retrieved at diagnosis of cGVHD in the outpatient clinic and were prepared and stored at −80°C in aliquots within 4 hours. Clinical data were obtained within the prospective trial, including transplant characteristics, history of aGVHD, and NIH-based diagnosis, and organ grading of cGVHD, as well as intensity of immunosuppression at time of sampling.

Samples in replication set 1 (n = 198) were collected as part of the Chronic GVHD Consortium Protocol 6501 cohort study. In replication set 1, heparinized plasma was isolated from freshly processed blood without platelet depletion. Samples from cases were collected within 1 month of cGVHD diagnosis. Control samples were matched within 90 days of time since transplantation.

Samples in replication set 2 were collected as a part of a Canadian Institutes of Health Research (CIHR)–funded biomarker study based at the British Columbia Children’s Hospital Child and Family Research Institute from 9 Canadian BMT Group (CBMTG) centers, an additional 2 centers in the United States, and 1 BMT center in Saudi Arabia (n = 30 cGVHD cases; n = 53 no cGVHD controls). Onset samples were collected between 21 days before and 26 days after cGVHD diagnosis. In replication set 2, platelet-depleted plasmas were isolated and frozen within 24 hours of collection. Supplemental Table 1 (available on the Blood Web site) summarizes the comparison of the sample collection and processing differences between test sets and replication sets.

Proteomics for discovery of markers

Briefly, samples from cases and controls were sent to the University of Victoria Genome BC Proteomics Centre, where they were depleted of the most abundant proteins; albumin and immunoglobulins were precipitated, digested to form peptides, differentially labeled with iTRAQ, and separated by ion chromatography on a VISION workstation (Applied Biosystems). Resultant peptides were processed by using the 4800 MALDI TOF/TOF analyzer (Applied Biosystems). Spectra were searched against IPI Human (v3.2195) and MSDB (build 20063108) sequence databases.7 

Validation with multiple reaction monitoring mass spectrometry

All experiments were performed on simple tryptic digests of human plasma without prior affinity depletion or enrichment. Stable isotope-labeled standard peptides were added immediately after tryptic digestion. Proteotypic tryptic peptides containing isotopically coded amino acids were synthesized for all 45 proteins. Peptide purity was assessed by capillary zone electrophoresis, and the peptide quantity was determined by amino acid analysis. For maximum sensitivity and specificity, instrumental parameters were empirically determined to generate the most abundant precursor ions and ion fragments. Concentrations of individual peptide standards in the mixture were optimized to approximate endogenous concentrations of analytes and to ensure the maximum linear dynamic range of the multiple reaction monitoring mass spectrometry (MRM-MS) assays.11 

Anti-perlecan auto-antibodies (anti-LG-3)

Auto-antibody immunoglobulin G (IgG) titers against the bioactive C-terminal fragment of perlecan released by apoptotic endothelial cells is shown to be a biomarker of immune-mediated vascular injuries.12  Anti-LG3 titers were measured with previously described laboratory-made enzyme-linked immunosorbent assay (ELISA).13  Briefly, recombinant mouse LG-3 (10 μg/μL) was first coated on 96-well Nunc MaxiSorp plates (Thermo Scientific, Rochester, NY) for a total of 1 μg per well. The plasma was diluted (1:250), and 100 μL was added per well. After 1 hour of incubation at room temperature, the plates were washed, and bound IgG was detected by using horseradish peroxidase coupled with anti-human IgG antibody (Amersham, Little Chalfont, United Kingdom). Reactions were revealed with 100 μL of tetramethylbenzidine substrate (BD Biosciences) for 10 minutes and stopped with 50 μL of H2SO4. Spectrophotometric analysis was undertaken at 450 nm, and the results were expressed as optical density (OD) × 1000.

Cytokine measurements

ELISA and multiplex kits used to detect cytokines in replication sets were purchased from R&D Systems (Minneapolis, MN), except for CXCL9, which was purchased from RayBiotech (Norcross, GA). Manufacturer’s instructions were followed for every assay and the source of each assay is summarized in supplemental Table 2. The lowest standard concentration showed that OD ≥ 2.0 × OD of the blank was set as the detection limit. Samples evaluated for vascular markers at Oregon Health & Science University had levels of epidermal growth factor (EGF), fibroblast growth factor 1 (FGF-1,) FGF-2, heparin binding EGF-like growth factor, vascular endothelial growth factor A, (VEGFA), VEGFC, and VEGFD, angiopoietin-2, endothelin-1, endoglin, follistatin, leptin, and placental growth factor determined by Milliplex magnetic bead array (Millipore, Billerica, MA). For laboratory measurements that were detectable but below the lower limit of quantification, the values were imputed as the lower limit of quantification.

Immune cell phenotyping

Peripheral blood was collected, usually from a vein in the antecubital fossa, into blood collection tubes containing heparin (BD Vacutainer). Peripheral blood was processed, and immunophenotyping was performed on cryopreserved peripheral blood mononuclear cells with a panel consisting of FITC-CD3 (UCHT1; BioLegend), Brilliant Violet 510-CD8 (RPA-T8; BioLegend), PE-CD56 (MEM-188; BioLegend), Pacific Blue-CD4 (SK3; BioLegend), Brilliant Violet 785-CD45RA (HI100; BioLegend), APC/Cy7-CD183 (CXCR3) (G025H7; BioLegend), and 7-AAD viability staining solution (BioLegend). Cells were analyzed by using a Fortessa cell analyzer equipped with four lasers (yellow-green, blue, violet, and red) (BD Biosciences), and data were analyzed by using FlowJo, v9.8 software (Tree Star, Ashland, OR).

Criteria for selection of biomarkers

We used two separate criteria for biomarker selection (summarized in Figure 1). Markers identified in the test sets were selected for further evaluation in the replication sets if they were significantly different from the no cGVHD controls (P ≤ .05). They were considered of high interest if the difference was significant and the receiver operating characteristic area under the curve (ROC AUC) ≥0.75.

Statistical methods

Biomarker values were log-transformed before analysis; mean differences between groups were re-expressed as fold differences (case:control). Those recorded as below the lower limit of detection were assigned a value equal to one-half the lower limit. In test sets 1 and 2, differences between cGVHD case and control groups were assessed by Wilcoxon rank-sum test. In the replication sets, linear regression analysis was used to evaluate biomarker levels according to cGVHD case-control status, with or without inclusion of other covariates. In the replication data sets, logistic regression was used to compute ROC AUC by using single biomarkers or combinations of biomarkers to distinguish cGVHD cases and controls. Forward and backward selection was used with logistic regression to define optimal combinations of biomarkers.

Discovery evaluations performed on test sets 1 and 2

Two separate test sets were evaluated for a broad number of plasma-based biomarkers to determine which panel would be tested in the larger replication sets. Test set 1 was obtained from adult patients at the Fred Hutchinson Cancer Research Center (Table 1) upon diagnosis of cGVHD (17 cGVHD cases, all collected within 1 month of diagnosis (supplemental Table 1) and from 21 patients who did not develop cGVHD (controls). Proteomic analysis on test set 1 plasma was performed with matrix-assisted laser desorption ionization time of flight/time of flight methodology by using iTRAQ labels. MRM-MS was used to validate 71 of the most promising candidates identified by the initial proteomic studies. Proteomic analysis followed by MRM-MS validation identified 2 proteins of highest statistical significance: aminopeptidase N (sCD13) (P = .004) and protease inhibitor 16 (PI-16) (P = .005). In addition, gelsolin and kallikrein, although less significant, were determined to be of sufficient interest for further evaluation (Table 2). Luminex assays identified 2 cytokine or chemokine markers. One of the markers was soluble IL-2Rα (sIL-2Rα [sCD25]), previously identified (by our group) in children7  (P = .005). The other was ICAM-1 (P = .04). Thus, test set 1 identified 6 candidate markers for replication, including gelsolin, kallikrein, aminopeptidase N, IL-2Rα, ICAM-1, and PI-16.

Test set 2 (Table 2) was made up of 19 cGVHD onset samples and 4 time-matched controls obtained in collaboration with University Hospital Regensburg (D.W.). CXCL10 (P = .005) was identified by using a screening Luminex assay, ELISA, and enzymatic assays for markers previously identified in a pediatric cohort and evaluated in test set 1.7  Two collaborators (S.G.H. and M.-J.H.) identified 2 additional vascular inflammation markers. One marker was identified in collaboration with the Canadian National Transplantation Research Program (M.-J.H.)12,13  as anti-LG3, a potential validated kidney vascular rejection marker that may be elevated in GVHD.13  The second was identified in collaboration with Oregon Health & Science University (S.G.H.) by using a customized Luminex assay for vascular markers. Endothelin-114,15  was identified in 2 patients with samples drawn at the onset of cGVHD and 28 days later. Both had a significantly increased endothelin-1 concentration at diagnosis, with a 4.8-fold decrease in endothelin-1 after cGVHD treatment was initiated (19.7 to 4.1 pg/mL). We added two markers, CXCL98  and sBAFF,6,7  shown in previous publications as being significant for evaluation in the replication sets, even though they were not identified by the 2 test set evaluations or by our collaborators.

Replication of candidate diagnostic cGVHD biomarkers identified in test sets 1 and 2

On the basis of their significance in test sets 1 and 2 and in the literature, 11 biomarkers were selected for testing by using the algorithm in Figure 1. These included soluble aminopeptidase N (sCD13), gelsolin, kallikrein, PI-16, CXCL10, ICAM-1, endothelin-1, sBAFF, IL-2Rα (sCD25), anti-LG3, and CXCL9 (supplemental Table 2). Patient characteristics for replication sets 1 and 2 are summarized in Table 1. Replication set 1 included 198 patients (n = 104 [cGVHD]; n = 94 [controls after BMT]), and replication set 2 included 83 patients (n = 30 [cGVHD]; n = 53 [controls after BMT]).

In replication set 1 (Table 3), we found that in 4 significant biomarkers—sBAFF (P = .05), CXCL10 (P = .02), ICAM-1 (P = .01), and anti-LG3 (P = .04)—ROC AUCs were between 0.59 and 0.61. The highest ROC AUC was attained with CXCL10 in combination with anti-LG3 (0.65; Figure 2A).

Figure 2

Receiver operator characteristic curves. (A) For replication set 1 with CXCL10 and anti-LG3 each alone and in combination; the combination gave the highest AUC and (B) for replication set 2 with CXCL10 and CXCL9 each alone and in combination; the combination gave the highest AUC.

Figure 2

Receiver operator characteristic curves. (A) For replication set 1 with CXCL10 and anti-LG3 each alone and in combination; the combination gave the highest AUC and (B) for replication set 2 with CXCL10 and CXCL9 each alone and in combination; the combination gave the highest AUC.

Close modal

In replication set 2 (Table 3), an identical analysis was performed. Findings were different from those in replication set 1. Three biomarkers were identified as significant in replication set 2 that had not been identified in replication set 1, including aminopeptidase N (P = .02), endothelin-1 (P = .02), and gelsolin (P = .0007). Markers identified as significant in replication set 1, but not as significant in replication set 2, included ICAM-1 and anti-LG3. There were 2 markers identified as statistically significant in replication set 1 that had been previously identified in replication set 2. These were sBAFF (P = .01) and, most significantly, CXCL10 (P < .0001). Overall, the ROC AUCs were much higher in replication set 2 compared with replication set 1, with 4 markers—aminopeptidase N, sBAFF, CXCL9, and endothelin-1—having ROC AUCs between 0.60 and 0.69. Two markers, gelsolin (0.71) and CXCL10 (0.78), had ROC AUCs ≥0.70. The best model by forward or backward model selection was CXCL10 combined with CXCL9, with an AUC of 0.83 (Figure 2B). Analysis of the 4 markers, CXCL10, CXCL9, anti-LG3, and sBAFF, in combination in both replication sets 1 and 2 did not increase the AUC compared with the 2 combinations of CXCL10 plus anti-LG3 or CXCL10 and CXCL9 in the 2 replication sets (Table 3). The best cutoffs for CXCL10 (for highest sensitivity with high specificity) were 291 pg/mL (29% sensitivity and 90% specificity) in replication set 1 and 313 pg/mL (50% sensitivity and 91% specificity) in replication set 2.

Analysis of covariates that have an impact on cGVHD biomarkers

We performed a multivariate analysis evaluating major factors that may have an impact on the interpretation of biomarkers (Table 4): early vs late cGVHD, prior aGVHD, conditioning (myeloablative, myeloablative with TBI, or nonmyeloablative), female donor to male recipient, donor source (peripheral blood stem cells, bone marrow [BM], umbilical cord blood), donor matching (matched related, matched unrelated, mismatched), and age 50 years or older. For evaluation of time dependence, early cGVHD markers (cGVHD onset before 9 months) were matched with no GVHD controls (majority at 6 months after BMT) and late cGVHD (≥ 9 months cGVHD onset) with no GVHD controls (majority at 12 months after BMT). None of the control patients developed cGVHD within the first year after transplantation. Multiple factors were evaluated, including donor matching, donor source, TBI, previous aGVHD, time of onset of cGVHD (early vs late), and non-myeloablative preparatory regimen; all had an impact on the interpretation of at least 1 biomarker (Table 4). We found no concordance between the 2 replication sets for the impact of any of the factors on the marker results. Only 1 marker, PI-16, was not impacted by a covariate in either replication set. Some factors such as TBI, non-myeloablative transplantation, and donor mismatch were almost always associated with an increased effect size of the biomarker (CXCL10, endothelin-1, IL-2Rα). By contrast, the donor source of either BM or umbilical cord blood and late onset of cGVHD (≥ 9 months after BMT), were almost always (except for kallikrein) associated with a decreased biomarker effect size (sBAFF, endothelin-1, CXCL10) (Table 4). Evaluation for skin vs other organ involvement did not show any impact on the interpretation of the biomarker results (data not shown).

Evaluation for a correlation between CXCL10 and CXCR3+ immune cell populations and cGVHD severity

It has previously been shown that CXCL9, CXCL10, and CXCL11 all influence trafficking of CXCR3+ T cells toward the peripheral tissues.16  In addition, a decrease in CD4+CXCR3+ T cells was seen in the blood of a small number of patients with active skin cGVHD and elevated serum CXCL10.17  In that study, CD4+CXCR3+ T cells were increased in skin biopsies from the same patients, suggesting a CXCL10-mediated recruitment of CXCR3+ cells to tissues. In addition to CD4+ and CD8+ T cells, another population, CD56bright NK cells, also express CXCR3.18 

We evaluated whether there were any differences in the percentage of CXCR3+-expressing CD4+ T cells, CD8+ T cells, or CD56bright NK cells in the total lymphocyte count between patients with and those without cGVHD. Second, we evaluated whether there was a difference in the ratio of the plasma CXCL10 level to percentage of the 3 cell subsets in patients with or without cGVHD. These analyses were performed on a cohort of patients from replication sets 1 and 2 who had paired plasma and cell samples available. We found that the subset of replication sets 1 and 2 that was evaluated was representative of the larger replications sets with not more than a 20% difference between any of the clinical variables in the subset and the larger replication sets (supplemental Table 3). Supporting this conclusion, the CXCL10 ROC AUCs in the subsets of 0.61 and 0.73, respectively (Table 5), were similar to those in the larger replications sets. Evaluation of the 3 CXCR3+-expressing populations revealed that CXCR3+CD56bright NK cells were the only CXCR3+ population significantly associated with lower cell numbers in cGVHD for both replication set 1 (ROC AUC, 0.62; P = .03) and replication set 2 (ROC AUC, 0.70; P = .02). When a ratio of the 3 CXCR3+ cell populations was performed with CXCL10, all 3 populations had significant ROC AUCs in at least 1 replication set. CD56brightCXCR3+ NK cells remained the most significant in both replication sets 1 and 2 (Table 5). Correlation with CXCL9 and CXCR3+ populations was in fact slightly higher than that for CXCL10 in replication set 1 and the inverse in replication set 2 (Table 5). The most significant association of CXCL9 was with CXCR3+CD56bright NK cells, identical to that in CXCL10.

In replication set 1, cGVHD severity (overall NIH score) is positively correlated with the combined biomarker score derived from logistic regression of CXCL10 and anti-LG3 and cGVHD (Ptrend = .07 from linear regression). Among cases in replication set 2, there is no such evidence (Ptrend = .67), although the number of cases is smaller.

The results summarized in this article show that, in 2 separate replication sets, CXCL10 was the most significant diagnostic biomarker that distinguished patients with cGVHD from those without. This represents one of the largest such replication studies (total of 350 patients, including 170 cGVHD cases and 180 controls) performed to date in cGVHD. Interestingly, CXCL10 was not identified in the initial test set 1 of patients with late onset (≥9 months after BMT). There was a high variability in its significance with an ROC of only 0.61 in replication set 1 and 0.77 in replication set 2. In addition, its highest predictive value was increased in combination with other biomarkers, but they were not identical in the 2 replication sets. Its association was highest in replication set 1 with anti-LG3, a vascular rejection biomarker in kidney transplantation,12,13  and in replication set 2, with CXCL9 and CXCR3+ regulatory NK (NKreg) cells.

CXCL10 is an inflammatory chemokine binding to CXCR3, which is involved in the activation and recruitment of T cells, eosinophils, monocytes, and NK cells.16  Previously, in a smaller group of patients (n = 8), all 3 CXCR3-binding chemokines (CXCL9, CXCL10, and CXCL11) were increased in patients with cGVHD of the skin17  and conjunctiva.19  Moreover, a significant decrease in CD4+CXCR3+ T cells was seen in the blood with concomitant increase in the number of central arterioles in the dermis. Another recent study found that both CXCL10 and CXCL11, along with sBAFF, were reproducible diagnostic biomarkers for cGVHD.20  CXCL10 is well associated with aGVHD in mice21  and humans22-24  and in solid organ rejection.25 

Elevation of the CXCR3+ agonists—CXCL9, CXCL10, and CXCL11—will cause migration of CXCR3+ effector populations in the skin. We were not able to evaluate skin cell infiltration, but we did evaluate for an inverse relationship in peripheral blood. We evaluated 3 populations—CD4+ T cells, CD8+ T cells, and CD56bright NK cells—associated with NK regulatory function. Interestingly, we found the closest correlation of CXCL10 elevation with a decrease in CD56bright NK cells. There is now very solid evidence that the CD56bright NK cell population is a classic NKreg population. We found that the CXCR3+CD56bright populations demonstrated a significantly higher expression of CD335 (NKp46; not shown) compared with CD337 in both subpopulations, consistent with the NKreg phenotype. 1,2,26-28  CD56brightCD16 NK cells exert low cytotoxicity but produce high levels of cytokines upon stimulation, they primarily exhibit an immunoregulatory role, and they are usually called NKreg cells. There have been a number of NKreg populations with this phenotype associated with immune regulatory functions and immune tolerance.3,29 

There was high heterogeneity in our results. Markers were markedly elevated in either test set 1 or 2, as in previous studies, including aminopeptidase N and IL-2Rα,7,8  sBAFF,6,7  and CXCL9.8  Only 1 of the markers, sBAFF, was significant in both replication sets (excluding CXCL10). Six other markers were significant in either replication set 1 (ICAM-1, anti-LG3, and IL-2Rα) or replication set 2 (gelsolin, endothelin-1, CXCL9, and aminopeptidase N). Interestingly, 3 of those markers—endothelin-1, gelsolin, and anti-LG3—are considered vascular inflammation markers, supporting that this mechanism is important in cGVHD. We consider that an ROC AUC of ≥0.75 represents a marker of highest interest, whereas an AUC between 0.59 and 0.74 represents a marker that is of interest and that warrants further investigation. CXCL10 was the only biomarker to meet our criteria for future validation with an AUC of ≥0.75. In our opinion, aminopeptidase N, CXCL9, sBAFF, gelsolin, endothlin-1, ICAM-1, anti-LG3, and IL-2Rα (with AUCs between 0.59 and 0.74) all warrant further investigation in future replication studies. CXCL10 has now been identified by 2 separate groups in addition to our own, 1 group in a small study17  and another group in a recent larger study with 2 centers.20  On the basis of the criteria set out by the NIH consensus report for biomarkers, CXCL10 has been validated, but we would argue that before it is used in the clinic as a validated diagnostic biomarker for cGVHD, it should be further validated by at least 1 additional separate large cohort of patients.5  Validation should focus on a CXCL10 cutoff for maximum sensitivity on plasma or serum concentrations between 291 and 313 pg/mL (approximately 90% specificity).

The high heterogeneity of biomarkers in this study points out a number of factors that may influence interpretation of plasma and serum biomarkers being evaluated and replicated in cGVHD. As summarized in supplemental Table 1, the fact that the time from collection to processing and whether the sample was plasma or serum were different for each of the sample sets, which may have influenced the heterogeneity of the results. The NIH cGVHD biomarker consensus report5  identified a number of covariates that may influence the interpretation of results. In light of these concerns, we evaluated multiple covariates. All covariates evaluated (Table 4) had an impact on the interpretation of at least 1 marker. These findings support the conclusion that each biomarker used to diagnose cGVHD may require correction for different covariates. Our results also support the lack of consistency between previous larger and smaller cGVHD biomarker trials (summarized in Paczesny et al5 ). Large trials that consider multiple covariates will be needed to evaluate these cGVHD markers before they can be used in clinical applications.

The complexity and heterogeneity of cGVHD continues to make replication and validation of candidate biomarkers difficult. In this large discovery and replication study, we found that only 1 marker—CXCL10—met strict criteria for replication as a clinical biomarker for the diagnosis of cGVHD. The high heterogeneity of our results within the study emphasizes the critical need for more clinically well-documented samples that have been collected in a standardized way. We hypothesize that the heterogeneity of results can be minimized by considering the impact of covariates on each individual marker in its interpretation to create a biomarker profile that can be used for clinical decisions.

The online version of this article contains a data supplement.

The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

The authors thank centers that contributed to the Chronic GVHD Consortium, including Vanderbilt (Madan Jagasia), Roswell Park (George Chen), Stanford University (Sally Arai), Cleveland Clinic Taussig Cancer Institute (Betty Hamilton), Washington University (Iskra Pusic), and Cornell University (Sebastian Ma).

This work was supported by a Canadian Institutes of Health Operating Grant; by the CIHR/Wyeth Clinical Research Chair in Transplantation (K.R.S.); by the Canadian National Transplant Research Program (funded by the CIHR); by the Office of Research in Women’s Health and the National Institute of Child Health and Human Development, Oregon Building Interdisciplinary Research Careers in Women’s Health Award No. 2K12HD043488 (S.G.H.); by the Medical Foundation of Oregon (S.G.H.); and by the Deutsche José Carreras Leukämie-Stiftung (D.W.). The Chronic GVHD Consortium is funded via collaboration between the Center for Advancing Translational Sciences and the National Cancer Institute (NCI) (U54CA163438). Support for the discovery samples was provided by grant CA118953 from NCI. The Canadian Bone Marrow Transplant Group Clinical Trials Network provided samples from CBMTG 0601 (funded by National Institutes of Health/NCI 1R01CA108752-01A2) and samples from CBMTG 0801 (funded by CIHR).

Contribution: A.K. designed and performed research, wrote the manuscript, and analyzed and interpreted data; S.G.H. performed research, wrote the manuscript, and analyzed and interpreted data; S.I. designed and performed research, performed statistical analysis, and analyzed and interpreted data; J.R. performed research, wrote the manuscript, performed statistical analysis, and analyzed and interpreted data; M.-J.H. contributed vital new reagents or analytical tools; P.J.M., S.J.L., D.W., and P.S. collected data and wrote the manuscript; S.A., S.S., J.S., M. Aljurf, M. Arora, C.C., G.G., J.K., J.L., T.J.N., L.F.N., J.P., G.P., D.S., J.T., C.L.T., I.W., and S.C. collected data; M.L. advised on experimental design and interpretation; T.P. analyzed data from the CBMTG 0801 and 0601 studies (Vancouver cohort, replication set 2); B.E.S. wrote the manuscript and performed statistical analysis; and K.R.S. designed the research, wrote the manuscript, and analyzed and interpreted data.

Conflict-of-interest disclosure: P.J.M. participated in a single scientific advisory board meeting for Pharmacyclics. The remaining authors declare no competing financial interests.

Correspondence: Kirk R. Schultz, British Columbia Children’s Hospital, University of British Columbia, 4480 Oak St, Room A119, Vancouver, BC V6H 3V4, Canada; e-mail: kschultz@mail.ubc.ca.

1
Sun
 
CL
Kersey
 
JH
Francisco
 
L
et al. 
Burden of morbidity in 10+ year survivors of hematopoietic cell transplantation: report from the bone marrow transplantation survivor study.
Biol Blood Marrow Transplant
2013
, vol. 
19
 
7
(pg. 
1073
-
1080
)
2
Wilhelmsson
 
M
Vatanen
 
A
Borgström
 
B
et al. 
Adverse health events and late mortality after pediatric allogeneic hematopoietic SCT-two decades of longitudinal follow-up.
Bone Marrow Transplant
2015
, vol. 
50
 
6
(pg. 
850
-
857
)
3
Boyiadzis
 
M
Arora
 
M
Klein
 
JP
et al. 
Impact of Chronic Graft-versus-Host Disease on Late Relapse and Survival on 7,489 Patients after Myeloablative Allogeneic Hematopoietic Cell Transplantation for Leukemia.
Clin Cancer Res
2015
, vol. 
21
 
9
(pg. 
2020
-
2028
)
4
El-Jawahri
 
A
Pidala
 
J
Inamoto
 
Y
et al. 
Impact of age on quality of life, functional status, and survival in patients with chronic graft-versus-host disease.
Biol Blood Marrow Transplant
2014
, vol. 
20
 
9
(pg. 
1341
-
1348
)
5
Paczesny
 
S
Hakim
 
FT
Pidala
 
J
et al. 
National Institutes of Health Consensus Development Project on Criteria for Clinical Trials in Chronic Graft-versus-Host Disease: III. The 2014 Biomarker Working Group Report.
Biol Blood Marrow Transplant
2015
, vol. 
21
 
5
(pg. 
780
-
792
)
6
Sarantopoulos
 
S
Stevenson
 
KE
Kim
 
HT
et al. 
High levels of B-cell activating factor in patients with active chronic graft-versus-host disease.
Clin Cancer Res
2007
, vol. 
13
 
20
(pg. 
6107
-
6114
)
7
Fujii
 
H
Cuvelier
 
G
She
 
K
et al. 
Biomarkers in newly diagnosed pediatric-extensive chronic graft-versus-host disease: a report from the Children’s Oncology Group.
Blood
2008
, vol. 
111
 
6
(pg. 
3276
-
3285
)
8
Kitko
 
CL
Levine
 
JE
Storer
 
BE
et al. 
Plasma CXCL9 elevations correlate with chronicGVHD diagnosis.
Blood
2014
, vol. 
123
 
5
(pg. 
786
-
793
)
9
ChronicGVHD Consortium
Rationale and design of the chronicGVHD cohort study: improving outcomes assessment in chronicGVHD.
Biol Blood Marrow Transplant
2011
, vol. 
17
 
8
(pg. 
1114
-
1120
)
10
Filipovich
 
AH
Weisdorf
 
D
Pavletic
 
S
et al. 
National Institutes of Health consensus development project on criteria for clinical trials in chronic graft-versus-host disease: I. Diagnosis and staging working group report.
Biol Blood Marrow Transplant
2005
, vol. 
11
 
12
(pg. 
945
-
956
)
11
Kuzyk
 
MA
Smith
 
D
Yang
 
J
et al. 
Multiple reaction monitoring-based, multiplexed, absolute quantitation of 45 proteins in human plasma.
Mol Cell Proteomics
2009
, vol. 
8
 
8
(pg. 
1860
-
1877
)
12
Cardinal
 
H
Dieudé
 
M
Brassard
 
N
et al. 
Antiperlecan antibodies are novel accelerators of immune-mediated vascular injury.
Am J Transplant
2013
, vol. 
13
 
4
(pg. 
861
-
874
)
13
Soulez
 
M
Pilon
 
EA
Dieudé
 
M
et al. 
The perlecan fragment LG3 is a novel regulator of obliterative remodeling associated with allograft vascular rejection.
Circ Res
2012
, vol. 
110
 
1
(pg. 
94
-
104
)
14
Yamane
 
K
Miyauchi
 
T
Suzuki
 
N
et al. 
Significance of plasma endothelin-1 levels in patients with systemic sclerosis.
J Rheumatol
1992
, vol. 
19
 
10
(pg. 
1566
-
1571
)
15
Tomis
 
JF
Sanz-Rodriguez
 
C
de Soria
 
VG
et al. 
Plasma endothelin-1 levels after stem cell transplantation.
Bone Marrow Transplant
2000
, vol. 
26
 
11
(pg. 
1199
-
1204
)
16
Groom
 
JR
Luster
 
AD
CXCR3 in T cell function.
Exp Cell Res
2011
, vol. 
317
 
5
(pg. 
620
-
631
)
17
Croudace
 
JE
Inman
 
CF
Abbotts
 
BE
et al. 
Chemokine-mediated tissue recruitment of CXCR3+ CD4+ T cells plays a major role in the pathogenesis of chronicGVHD.
Blood
2012
, vol. 
120
 
20
(pg. 
4246
-
4255
)
18
Marquardt
 
N
Wilk
 
E
Pokoyski
 
C
Schmidt
 
RE
Jacobs
 
R
Murine CXCR3+CD27bright NK cells resemble the human CD56bright NK-cell population.
Eur J Immunol
2010
, vol. 
40
 
5
(pg. 
1428
-
1439
)
19
Westekemper
 
H
Meller
 
S
Citak
 
S
et al. 
Differential chemokine expression in chronicGVHD of the conjunctiva.
Bone Marrow Transplant
2010
, vol. 
45
 
8
(pg. 
1340
-
1346
)
20
Ahmed
 
SS
Wang
 
XN
Norden
 
J
et al. 
Identification and validation of biomarkers associated with acute and chronic graft versus host disease.
Bone Marrow Transplant
2015
, vol. 
50
 
12
(pg. 
1563
-
1571
)
21
Villarroel
 
VA
Okiyama
 
N
Tsuji
 
G
Linton
 
JT
Katz
 
SI
CXCR3-mediated skin homing of autoreactive CD8 T cells is a key determinant in murine graft-versus-host disease.
J Invest Dermatol
2014
, vol. 
134
 
6
(pg. 
1552
-
1560
)
22
Piper
 
KP
Horlock
 
C
Curnow
 
SJ
et al. 
CXCL10-CXCR3 interactions play an important role in the pathogenesis of acute graft-versus-host disease in the skin following allogeneic stem-cell transplantation.
Blood
2007
, vol. 
110
 
12
(pg. 
3827
-
3832
)
23
Mavin
 
E
Ahmed
 
SS
O’Boyle
 
G
et al. 
 
Regulatory T cells inhibit CD8(+) T-cell tissue invasion in human skin graft-versus-host reactions. Transplantation. 2012;94(5):456-464
24
Lamarthée
 
B
Malard
 
F
Gamonet
 
C
et al. 
Donor interleukin-22 and host type I interferon signaling pathway participate in intestinal graft-versus-host disease via STAT1 activation and CXCL10.
Mucosal Immunol
2016
, vol. 
9
 
2
(pg. 
309
-
321
)
25
Martinu
 
T
Kinnier
 
CV
Gowdy
 
KM
et al. 
Innate immune activation potentiates alloimmune lung disease independent of chemokine (C-X-C motif) receptor 3.
J Heart Lung Transplant
2011
, vol. 
30
 
6
(pg. 
717
-
725
)
26
Kheav
 
VD
Busson
 
M
Scieux
 
C
et al. 
Favorable impact of natural killer cell reconstitution on chronic graft-versus-host disease and cytomegalovirus reactivation after allogeneic hematopoietic stem cell transplantation.
Haematologica
2014
, vol. 
99
 
12
(pg. 
1860
-
1867
)
27
Crome
 
SQ
Lang
 
PA
Lang
 
KS
Ohashi
 
PS
Natural killer cells regulate diverse T cell responses.
Trends Immunol
2013
, vol. 
34
 
7
(pg. 
342
-
349
)
28
Larghero
 
J
Rocha
 
V
Porcher
 
R
et al. 
Association of bone marrow natural killer cell dose with neutrophil recovery and chronic graft-versus-host disease after HLA identical sibling bone marrow transplants.
Br J Haematol
2007
, vol. 
138
 
1
(pg. 
101
-
109
)
29
Tian
 
Z
Gershwin
 
ME
Zhang
 
C
Regulatory NK cells in autoimmune disease.
J Autoimmun
2012
, vol. 
39
 
3
(pg. 
206
-
215
)
Sign in via your Institution