de Haan et al have developed a risk score algorithm that is able to improve the individual prediction of venous thrombosis, taking into account information from multiple risk SNPs in addition to clinical data.1 

Who is going to suffer a venous thrombosis or pulmonary embolism? Predicting the individual risk of any multifaceted disease is a very difficult task. The complexity arises from the occurrence of multiple risk factors (with some likely yet to be identified) and their interplay. For venous thromboembolic disease, a substantial portion of these risk factors are genetic.2  There have been a few attempts to mathematically predict the individual risk of venous thromboembolism. One of them, by Eichinger et al, proposed a very simple risk score for recurrence in patients with previous spontaneous thrombosis,3  and 2 others tried to predict the risk of a first thromboembolic event in asymptomatic subjects.4,5  The risk score proposed by Hippisley-Cox and Coupland uses 14 variables, exclusively clinical and included comorbidities and concurrent medications.4  In contrast, Heinemann et al also suggested a simple algorithm that included 2 genetic risk factors, that is, the Factor V Leiden and the Prothrombin G20210A mutation.5 

In this issue of Blood, de Haan et al report an algorithm that incorporates a wider set of genetic risk factors discovered recently by genome-wide studies.1  The predictive power of the previous models, mainly based on nongenetic risk factors, is significantly improved by including genomic information. To our knowledge, these results are the first that demonstrate that sophisticated probabilistic models, which combine clinical, environmental, and genetic variables, can be efficient and useful predictors of the thromboembolic disease of an individual. One of the more striking results in the article by de Haan et al is the apparent ineffectiveness of a risk score composed of 31 SNPs compared with another composed of only 5 SNPs with the highest individual odds ratio for venous thrombosis (Factor V Leiden, PT G20210A, A1 blood group, 1 SNP in the Fibrinogen γ gene, 1 in the Factor XI gene).1  Remarkably, these 5 genetic variants have a direct or indirect functional effect on hemostatic proteins, highlighting the role of coagulation on thrombosis risk. Considering that all 31 SNPs are independently associated with thrombotic risk, although sometimes small, a better predictive power should be expected when increasing the number of SNPs in the model. We hypothesize 2 explanations for this paradox. Perhaps some of the proposed SNPs are false-positive. More likely, new and more powerful mathematical tools to integrate all of this multiplicity of genetic information are needed. To date, only classical methods of logistic regression have been applied. These methods are unable to cope with the interaction among risk factors. It is also possible that other variables play an important role in thrombotic risk. In this sense, a limitation to the study by de Haan et al is the exclusive use of common genetic variants in the analysis of the risk. Using these variants, only a very small portion of the disease susceptibility is explained and this is one of the more frustrating findings of the giant genomic studies published to date.6  One germane question is where the “missing heritability” underlying the complex and common diseases lies. Very probably, other rarer genetic variants, of the utmost importance at the individual level, must be included in the risk score of venous thromboembolic disease.

The clinical implications derived from the article by de Haan et al are clearly relevant because predictive medicine should be a high priority in prevalent diseases, such as cardiovascular disease and venous thromboembolic disease. The ability to objectively classify high-risk individuals will allow the best preventive strategies, both in asymptomatic individuals and in already affected patients (secondary prophylaxis). de Haan et al have placed one of the cornerstones of the edifice. To continue the construction of this building, given the enormous difficulties in addressing the complexity of the underlying disease, it is essential to use more and better items, such as the following:

  • New, not yet known, genetic risk factors. It is necessary to continue searching.

  • Incorporation of the rare genetic variants into the analysis of risk, with stronger effects in the carriers. Some of these rare variants are also affecting hemostatic factors, such as the A384S in the SERPINC1 gene,7  the R67X in SERPIN10 gene,8  or Ser219Gly in the PROCR gene.9 

  • In addition, all the protective factors should be incorporated to better estimate the actual individual risk of suffering the disease.

  • In terms of clinical practice, new laboratory methods for DNA genotyping and performing simultaneous and fast multiple SNP testing (diagnostic chips) would be welcome.

  • Use of other available biologic information, beyond DNA data, such as potential new markers based on plasma intermediate phenotypes such as thrombin generation, thromboelastometry, dynamics of the activated partial thromboplastin time, platelet volume, plasma levels of Factor VIII, and so on.

  • More specific mathematical models able to integrate this multidimensionality of variables (clinical, genetic, genomic, and biologic) will be necessary to accurately estimate the individual risk.

Conflict-of-interest disclosure: The authors declare no competing financial interests. ■

1
de Haan
 
HG
Bezemer
 
ID
Doggen
 
CJM
et al. 
Multiple SNP testing improves risk prediction of first venous thrombosis.
Blood
2012
, vol. 
120
 
3
(pg. 
656
-
663
)
2
Souto
 
JC
Almasy
 
L
Borrell
 
M
et al. 
Genetic susceptibility to thrombosis and its relationship with physiological risk factors: the GAIT Study.
Am J Hum Genet
2000
, vol. 
67
 
6
(pg. 
1452
-
1459
)
3
Eichinger
 
S
Heinze
 
G
Jandeck
 
LM
Kyrle
 
PA
Risk assessment of recurrence in patients with unprovoked deep vein thrombosis or pulmonary embolism. The Vienna prediction model.
Circulation
2010
, vol. 
121
 
14
(pg. 
1630
-
1636
)
4
Hippisley-Cox
 
J
Coupland
 
C
Development and validation of risk prediction algorithm (Qthrombosis) to estimate the future risk of venous thromboembolism: prospective cohort study.
BMJ
2011
, vol. 
343
 pg. 
d4656
 
5
Heinemann
 
LAJ
DoMinh
 
T
Assmann
 
A
et al. 
VTE risk assessment – a prognostic model: BATER cohort study of young women.
Thrombosis J
2005
, vol. 
3
 
1
pg. 
5
 
6
Manolio
 
TA
Collins
 
FS
Cox
 
NJ
et al. 
Finding the missing heritability of complex diseases.
Nature
2009
, vol. 
461
 
7265
(pg. 
747
-
753
)
7
Corral
 
J
Hernández-Espinosa
 
D
Soria
 
JM
et al. 
Antithrombin Cambridge II (A384S): an underestimated genetic risk factor for venous thrombosis.
Blood
2007
, vol. 
109
 
10
(pg. 
4258
-
4263
)
8
Corral
 
J
Gonzalez-Conejero
 
R
Soria
 
JM
et al. 
A nonsense polymorphism in the protein Z-dependent protease inhibitor increases the risk for venous thrombosis.
Blood
2006
, vol. 
108
 
1
(pg. 
177
-
183
)
9
Dennis
 
J
Johnson
 
CY
Adediran
 
AS
et al. 
The endothelial protein C receptor (PROCR) Ser219Gly variant and risk of common thrombotic disorders: a HuGE review and meta-analysis of evidence from observational studies.
Blood
2012
, vol. 
119
 
10
(pg. 
2392
-
2400
)
Sign in via your Institution