Introduction

Infant acute lymphoblastic leukemia (ALL) with KMT2A rearrangement (KMT2A-r) is associated with a very poor prognosis. Disease free survival from the date of diagnosis is approximately 20% to 40%, depending on age, white blood cell count, and response to induction therapy. Refractory and relapsed infant ALL is often resistant to attempts at re-induction, and second remission is difficult to both achieve and maintain. Genomic sequencing studies of infant KMT2A-r ALL clinical samples have demonstrated an average of fewer than 3 additional non-silent somatic mutations per case at diagnosis, most commonly sub-clonal variants in RAS pathway genes. We previously reported relapse-associated gains in somatic variants associated with signaling, adhesion, and B-cell development pathways (Blood 2016 128:1735). We hypothesized that relapsed infant ALL is characterized by recurrent, altered patterns of gene expression. In this analysis, we utilized single cell RNA sequencing (scRNAseq) to identify candidate genes with differential expression in diagnostic vs. relapse leukemia specimens from 3 infants with KMT2A-r ALL.

Methods

Cryopreserved blood or bone marrow specimens from 3 infants enrolled in the Children's Oncology Group AALL0631 trial were selected for analysis. Samples from both diagnosis (DX) and relapse (RL) time points were thawed and checked for viability (>90% of cells viable) using trypan blue staining. Samples were multiplexed and processed for single cell RNA sequencing using the Chromium Single Cell 3' Library Kit (v2) and 10x Genomics Chromium controller per manufacturer's instructions (10x Genomics, Pleasanton, CA). Single cell libraries were converted to cDNA, amplified, and sequenced on an Illumina NovaSeq instrument. Two technical replicates were performed. Samples were de-multiplexed using genotype information acquired from previous whole exome sequencing (WES) and demuxlet software. Transcript alignment and counting were performed using the Cell Ranger pipeline (10x Genomics, default settings, Version 2.2.0, GRCh37 reference). Quality control, normalization, gene expression analysis, and unsupervised clustering were performed using the Seurat R package (Version 3.0). Dimensionality reduction and visualization were performed with the UMAP algorithm. Analyses were restricted to leukemia blasts with CD19 expression by scRNAseq.

Results

The clinical features for each case are shown in Table 1. Cells from the 3 infant ALL samples clustered together, distinct from cells of non-infant B-ALL, T-ALL, and mixed lineage acute leukemia biospecimens in the Children's Mercy scRNAseq database, but largely did not overlap with one another. For each of the 3 infant cases, cells from DX and RL time points could be distinguished by differential patterns of gene expression (Figure 1). Individual genes with statistically significant (p<0.05) log-fold change values were examined. Figure 2 summarizes the number of genes with up-regulation of expression by scRNAseq at RL compared to DX. Only 6 genes, DYNLL1, HMGB2, HMGN2, JUN, STMN1, and TUBA1B, were significantly increased at RL across all 3 cases. We repeated this analysis, restricting to leukemia blasts with CD79A expression, and identified these same 6 genes, and 4 additional genes: H2AFZ, NUCKS1, PRDX1, and TUBB, as consistently up-regulated in RL clusters. We examined the expression of candidate genes of interest, including clinically targetable genes, to compare the distribution of expression at DX and RL (Table 2).

Conclusion

Genomic factors underlying the aggressive, refractory clinical phenotype of relapsed infant ALL have yet to be defined. Each of these 3 cases demonstrates unique expression patterns at relapse, readily distinguishable from both the paired diagnostic sample and the other 2 relapse samples. Thus, scRNAseq is a powerful tool to identify heterogeneity in gene expression, with the potential to discover recurrent genomic drivers within resistant disease sub-clones. Ongoing analyses include scRNAseq in additional infant ALL samples, relative quantification of transcript expression in single cells, and comparison with bulk RNAseq data.

Disclosures

No relevant conflicts of interest to declare.

Author notes

*

Asterisk with author names denotes non-ASH members.

Sign in via your Institution