A global estimation of sickle cell disease severity has been difficult to establish making the integration of clinical and laboratory abnormalities of into a predictive model of disease complications and death, an unrealized goal. A useful model might help us to understand the interactions among common clinical and laboratory abnormalities and allow patients at risk of certain complications and early death to be identified early so that targeted treatment, if available, could be provided. To approach this problem we first integrated clinical and laboratory data from nearly 3500 individuals from the Cooperative Study of Sickle Cell Disease and applied to this data advanced statistical machine learning techniques to identify the significant associations between selected complications of disease and laboratory variables. We looked for predictors of the risk for early death in three separate age groups. The resulting model revealed that complex networks of interactions between clinical and laboratory variables underlie common disease complications and ultimately death. Predictors of risk in this model were the HBA1, HBA2 genotype, stroke, sepsis and acute chest syndrome. While this model can predict the risk for early death, given the presence of other disease complications and variations among common laboratory variables, it did not provide an understanding of the genetic basis for our observations. Accordingly, in over 1000 patients with sickle cell disease we genotyped SNPs in genes chosen because of their possible link to the pathophysiology of disease. We then used our estimate of global disease severity to find associations of genotypes with severity. In the initial screening studies we identified several genes in the TGF-ß/BMP pathway that were associated with selected disease subphenotypes (

Sebastiani et al Nature Genet 37: 435, 2005
;
Baldwin et al. Blood 106: 372, 2005
). The TGF-b/BMP pathway regulates a wide range of biological functions including cell proliferation, cellular differentiation, extracellular matrix production, cell death, tissue repair and immune regulation. To expand our initial findings, we typed haplotype tagging SNPS in a more comprehensive fashion in 21 genes that are key members of this pathway. The genes included BMP6, BMP receptors, TGF-b receptors, SMADs, MAP kinases and their associated co-factors such as SARA, CDH1 and SMURF1. We used a Bayesian significance test to model the associations between the score of disease severity and individual SNPs, as well as a traditional association test with p-values computed using permutation tests. To reduce the problem of multiple comparisons, we selected as significantly associated with disease severity, only those SNPs that passed both tests with strong evidence of association. The most striking associations were found for BMP6, its receptors BMPR1 and BMPR2, several SMAD proteins and SARA1. Our results confirm the importance of the TGF-ß signaling pathway in sickle cell disease pathophysiology and suggest that subtle variation in this cell signaling network can be associated with the severity of disease.

Author notes

Corresponding author

Sign in via your Institution