Diamond Blackfan Anemia Syndrome (DBA) is a rare, congenital bone marrow failure syndrome characterized by severe macrocytic anemia, most often diagnosed during infancy. Congenital anomalies and predisposition to cancer are also important features of DBA. Establishment of a molecular diagnosis in a patient with DBA is critical to determine treatment strategies (i.e. the identification of compatible related transplant donors), as well as developing reproductive strategies for genetically at risk families. The overwhelming majority (>98.75%) of DBA patients with a molecular diagnosis have mutations in a Ribosomal Protein (RP) gene. Targeted and exome sequencing (WES) strategies can identify RP mutations in >70% of DBA patients (Ulrisch et al. Am J Hum Genet. 2018). Single Nucleotide Polymorphism Comparative Genome Hybridization (SNP array) detects >30 kb deletions of RP genes (which cannot be identified by sequencing) in ~10% of DBA patients (Farrar et al. Blood. 2011), leaving ~20% of DBA patients without a molecular diagnosis.
We hypothesized that smaller copy number variants (CNVs - either insertions or deletions) in RP genes that are below the limit of detection of SNP array are responsible for the remaining 20%. To test this hypothesis we collected DNA with informed consent for whole genome sequencing (WGS) analysis from 6 patients who had no mutations detected by WES or SNP array.
On average, we aligned ~1x1010paired end reads of 250 base pairs for each patient (~83X coverage of the genome). The aligned sequences were analyzed for CNVs using two independent software packages. Delly analyzes the two ends of each sequence read and maps them to the current human reference genome. Read ends that map further apart than expected are flagged as potential CNVs. CNVkit estimates regions of copy loss by changes in average sequencing depth.
Using relatively relaxed thresholds in Delly and CNVkit we identified ~100 candidate CNVs in each patient. We filtered out CNVs present in public databases and focused on those CNVs in the region of the RP genes. This analysis identified 2-5 potential RP gene associated CNVs in each patient. We designed PCR primers that flanked each putative CNV and confirmed at least one RP CNV in all 6 patient DNAs.
At this time, the CNVs in two patients are in the process of evaluation. We have validated causative RP CNVs in the other 4 patients, representing one known and three novel DBA genes. One patient had a 464 bp deletion in 3rdintron of the RPL27 gene, which is mutated in other DBA patients. We hypothesized that the deletion caused a splicing defect. Using a mini gene in which the second intron of the gamma globin gene was replaced with the 3rdintron of either the wild type or mutant RPL27 gene, we showed that the mutant exon was not spliced. An alternative hypothesis, that the deletion removed an enhancer element, was also tested, but no enhancer activity was detected. We conclude that the RPL27exon deletion causes aberrant splicing leading to an unstable RPL27 mRNA and haploinsufficiency of RPL27.
A second patient had a 3.5 kb deletion at the 3' end of the RPS5 gene, including the stop codon and poly A addition site. We hypothesized that the lack of the 3' processing signals would lead to an unstable mRNA. To test this hypothesis we generated MYC and FLAG tagged wildtype and 3' deleted RPS5 genes and co-transfected them into 293T cells. Regardless of the tag used, RT-PCR analysis showed a severe reduction in the mutant mRNA levels. Western Blot analysis demonstrated that only the wild type protein was expressed, leading to the conclusion that the RPS5 truncation led to an unstable RPS5 mRNA and haploinsufficiency of RPS5.
A third patient had a 28 kb deletion that removes the RPS9 gene. shRNA knockdown of RPS9 mRNA in normal CD34+ cells inhibited erythroid differentiation, leading to the conclusion that RPS9 deficiency causes DBA. Finally, we observed a 3 bp insertion in exon 6 of the RPL14 gene. The deletion adds an alanine residue to a string of 10 alanines in the wild type allele. We confirmed the insertion by targeted sequence analysis of patient DNA.
Our data show that WGS can identify small CNVs that cause DBA in at least 2/3 of patients who do not have a mutation detectable by other methods. We believe that WGS analysis following targeted sequencing, SNP array and WES can identify virtually all DBA mutations. With declining WGS costs, we recommend adding WGS to the molecular diagnostic pipeline for genetic testing of DBA.
Farrar:Novartis: Research Funding. Vlachos:Novartis: Other: Steering committee member.
Author notes
Asterisk with author names denotes non-ASH members.
This icon denotes a clinically relevant abstract
This feature is available to Subscribers Only
Sign In or Create an Account Close Modal