Abstract
Introduction: Burkitt lymphoma (BL) is an aggressive B-lineage non-Hodgkin lymphoma that has traditionally been classified into 3 subtypes: "endemic", "sporadic", and HIV-associated. The highest incidence of BL - prompting the "endemic" classification - is observed in children in sub-Saharan Africa, where the distribution of the disease is closely associated with that of P. falciparum malaria. The tumor cells in most cases of BL from sub-Saharan Africa are infected with Epstein Barr virus (EBV). In contrast, a minority of BL tumors from Europe and North America - which are commonly described as "sporadic" - are EBV positive. Although excellent outcomes have been observed for patients with BL treated with multi-agent chemotherapy in Europe and North America, the outcomes of BL patients in sub-Saharan Africa are suboptimal. We hypothesized that improved understanding of the molecular heterogeneity that exists within BL, particularly the molecular features that distinguish cases from sub-Saharan Africa from those from Europe/North America, will enable more effective treatment strategies that can readily be implemented in low-resources settings.
Methods: Tumor biopsies were collected from 19 pediatric patients who presented to the Uganda Cancer Institute with maxillofacial tumors histologically confirmed to be BL. Total RNA was extracted and polyA-selected sequencing libraries were prepared. Paired-end, 50-base pair sequencing was performed on the Illumina HiSeq 2500 platform at a depth of 100 million reads per sample. A publicly available RNA sequencing dataset from 28 BL cases from Europe and North America that were previously analyzed by both microarray (NEJM 2006; 354[23]: 2431-2442) and RNA sequencing (Nature 2012; 490[7418]: 116-120) was analyzed in parallel for comparison. RNA sequencing on the 28 sporadic BL tumors was performed on polyA-selected sequencing libraries with paired-end, 108-base pair sequence reads generated on the Illumina HiSeq 2000 platform. The reads from all 47 BL tumors were aligned to the human and EBV (GenBank ID KC207813.1) genomes using the STAR aligner. Tumors were deemed EBV positive if the ratio of mapped viral reads to human reads exceeded 0.001%, since the EBV genome is 0.005% the size of the human genome. Normalization and differential expression analysis were performed on aligned reads using the DESeq2 R package. Somatic variants were identified by the Genome Analysis Tool Kit. Analysis of alterative splicing was performed with the SGSeq R package.
Results: All of the Uganda BL cases but only 70% of the previously published European/North American cases were obtained from children less than 18 years; 61% and 87% were from male patients, respectively. One half of patients in both groups presented with Ann Arbor stage I or II disease, while the remainder presented with higher stage disease. The median survival in the 19 Ugandan patients was less than one year. Sequence reads from 15 (79%) of the Ugandan tumors and 4 (15%) of the European/North American tumors aligned to the EBV genome. Although BL is reported to express only latency type I EBV genes, expression of latency type I, II, and III genes was detected. Among the 4 tumors from Uganda that did not contain EBV-derived reads, 1 was from a child who was HIV-seropositive, while the others were from HIV-seronegative patients. Unsupervised hierarchical clustering and principal component analysis of all annotated genes failed to separate EBV positive and EBV negative tumors. Cluster analysis of the Ugandan tumors revealed no association with clinical variables. Ongoing analyses are being performed to identify associations between differentially expressed genes and somatic mutations and alternative splicing that may distinguish the tumor subtypes.
Conclusions: While the terms "endemic" and "sporadic" have traditionally been used to classify BL, this nomenclature is archaic and does not accurately capture the underlying biology of the tumor subtypes. We performed a comparative whole transcriptome analysis of BL tumors from patients in Uganda to a dataset of BL tumors from patients in Europe/North America to identify features that distinguish the two cohorts. While cluster analysis failed to clearly separate tumors by their EBV or clinical status, ongoing analyses are being performed to identify alterations that may contribute to their distinct gene expression profiles.
No relevant conflicts of interest to declare.
Author notes
Asterisk with author names denotes non-ASH members.
This feature is available to Subscribers Only
Sign In or Create an Account Close Modal