• We identified 38 significant loci associated with VTE through GWAS meta-analysis and prioritized causal genes through an integrative method.

  • Functional confirmatory studies of GWAS hits via genome editing in zebrafish showed a novel role for genes TC2N and RASIP1 in thrombosis.

Abstract

Venous thromboembolisms (VTEs) are a leading cause of morbidity and mortality. Although many genetic risk factors have been identified, a substantial portion of the heritability remains unexplained. In this study, we employed a genome-wide association study (GWAS) for VTE across 9 international cohorts of the Global Biobank Meta-Analysis Initiative to address this question, along with in vivo functional validation. In this multipopulation GWAS (VTE cases, 27 987; controls, 1 035 290), 38 genome-wide significant loci were identified, 4 of which were potentially novel. For each autosomal locus, we performed gene prioritization using 7 independent, yet converging, lines of evidence. Through prioritization, we identified genes associated with VTE through GWAS and/or functional studies (eg, F5, F11, VWF, STAB2, PLCG2, TC2N), functionally validated those that did not have evidence other than GWAS (TC2N, TSPAN15), and discovered 1 not previously associated with coagulation (RASIP1). We evaluated the function of 6 prioritized genes with strong genetic evidence, including F7 as a positive control, using laser-mediated endothelial injury to induce thrombosis in zebrafish after CRISPR/Cas9 knockdown. From this assay, we have supportive evidence for the role of RASIP1 and TC2N in the modification of human VTE and suggestive evidence for STAB2 and TSPAN15. This study expands on the currently identified genomic architecture of VTE through biobank-based, multipopulation GWASs, in silico candidate gene predictions, and in vivo functional follow-up of candidate genes.

Deep vein thrombosis and pulmonary embolism, collectively referred to as venous thromboembolism (VTE), are disorders characterized by the pathologic formation of thrombi in deep veins that risk embolization to the pulmonary circulation. VTE is a common cause of morbidity and mortality and affects >900 000 individuals per year in the United States.1-4 As a complex trait, VTE risk is influenced by an array of well-described environmental factors and genetics. Heritability studies have suggested that between 30% and 40% of VTE risk is a consequence of genetic factors.5 Before genotyping technology allowed a genome-wide assessment of common variants, a few polymorphisms were implicated in VTE risk, but only 2 were validated by initial European (EUR) population GWASs. Those were the ABO blood group6 and a common variant in the F5 gene (factor V Leiden).7,8 Previous large GWAS8-24 have identified common and rare variants of dozens of loci that are associated with the risk for VTE (supplemental Table 1). The interaction between common variation and rare pathogenic variants in genes such as TUBB1, PROC, and PROS1 has also been described.25 

The strongest signals observed in GWASs for VTE are those associated with loci with known roles in the hemostatic system, such as gain of function variants in procoagulant genes (F2, F5, F11, FGG20) or missense variants p.Ser219Gly and p.Arg113Cys in the anticoagulant gene PROCR.26,27 As the list of VTE risk variants continues to grow with the inclusion of loci with no previously described genes involved in thrombosis, so does the need for functional analyses of these variants. Zebrafish serve as a vertebrate model with genes that are highly conserved in the human genome,28 including the coagulation cascade.29 Their high fecundity, optical transparency, and external development make them amenable to the functional analysis of top selected GWAS signals in an in vivo model.30-33 We have previously shown that we can evaluate normal and pathologic hemostasis, as well as thrombosis, in zebrafish embryos and larvae using genome editing.34-36 

In this study, we expanded the list of loci associated with VTE through a multipopulation meta-analysis of GWASs from biobanks of the Global Biobank Meta-Analysis Initiative (GBMI). Using genetic associations that emerged from this meta-analysis at known and potentially novel loci, we performed integrative bioinformatics-driven gene prioritization and subsequent functional analyses of 7 candidate genes in zebrafish assessed for thrombosis.

GWAS meta-analysis

VTE is one of the pilot phenotypes of the GBMI. Each biobank conducted genotyping, imputation, quality controls, and completed the GWAS in accordance with the GBMI analysis plan.37 In brief, a logistic mixed model in Scalable and Accurate Implementation of a Generalized Mixed Model or REGENIE was used, and covariates included age, age2 sex, age × sex, first 20 principal components from genetic data, and any biobank-specific covariates. VTE was defined as in previous GWASs17 and refined to include cases with the International Classification of Diseases, 10th Revision codes I80.1, I80.2, I82.2, I26.0, and I26.9; the International Classification of Diseases, 9th Revision codes 451.11, 453.40, 453.2, 453.77, 453.87, and 415.1; and the Office of Population Censuses and Surveys-4 Procedure Codes L791 or L90.2 (supplemental Table 2). After phenotype harmonization, GWAS summary statistics across 9 international cohorts (BioMe, Vanderbilt University Biobank [BioVU], China Kadoorie Biobank, Estonian Biobank, FinnGen, Genes and Health Study, Michigan Genomics Initiative, University of California, Los Angeles, and United Kingdom Biobank) with representation across 5 super populations (American, African [AFR], East Asian, EUR, [including Finnish and non-Finnish EUR ancestries], and South Asian) were combined using an inverse variance–weighted meta-analysis with 27 987 cases, 1 035 290 controls, and an effective sample size of 107 409 (supplemental Tables 3-5; supplemental Figure 1). We defined genome-wide significant loci as described in Zhou et al37 by iteratively spanning the ±500 kb region around the most significant variant and merging overlapping regions until no genome-wide significant variants (P < 5 × 10–8) were detected within ±500 kb. The most significant variant in each locus was selected as the lead single nucleotide polymorphism (SNP). The genomic control factor (lambda) used to measure the inflation of test statistics was 1.042 (supplemental Figure 2), and therefore P values were not adjusted for inflation.

We looked up the novel loci in the 2 most recently published studies, namely the 2022 International Venous Thrombosis Network (INVENT) meta-analysis within the Million Veterans Program38 (supplemental Table 6) and a 2023 meta-analysis of individuals of EUR genetic ancestry.39 We do not consider this a formal replication because there is some sample overlap between these 2 studies and our meta-analysis.

Statistical analysis

Unless otherwise noted, the analysis was performed using R statistical software. Packages for data visualization and statistical tests included isoplotR,40 ggplot2,41 MASS,42 data.table,43 ggthemes,44 and dplyr.45 The code is available at https://github.com/bnwolford/gbmi_vte.

Credible sets

To fine-map the loci identified in the VTE meta-analysis, we generated credible sets of causal variants using the sum of single effects,46 an iterative, Bayesian, step-wise selection using sparse multiple regression–determined credible sets with a 95% posterior probability of containing potential causal variants. A linkage disequilibrium (LD) reference panel from 2504 unique individuals from all ancestral cohorts of 1000 Genomes was used. To create credible sets, we considered regions ±500 kb from the index variant.

Gene prioritization

For each autosomal locus, we performed gene prioritization using 7 independent, yet converging, lines of evidence. We used Data-driven Expression Prioritized Integration for Complex Traits (DEPICT) and polygenic priority score (PoPS) for gene prioritization for all 14 endpoints in the GBMI pilot study.37 Using the variants with a P value of <1 × 10–5 in the multipopulation meta-analysis, any gene with a false discovery rate of <0.05 with DEPICT47,48 was considered prioritized. Similarly, a gene in the top 10% of genes as ranked by PoPS49 was considered as the prioritized genes. For both analyses, individuals of EUR ancestry from the 1000 Genomes Project phase 3 were used as the LD reference panel,50 because 86% of the individuals included in the GWAS were of primarily EUR ancestry.

We compared the performance of DEPICT and PoPS using a gold standard set of coagulation and platelet genes (N = 41; supplemental Table 7), determined before the GWAS by a medical and molecular genetics expert in VTE and coagulation (J.A.S.) and based on a high-throughput sequencing panel containing a gold standard list of coagulation and platelet genes by the ThromboGenomics group.51 We used a significant false discovery rate threshold of <0.05 to define prioritized genes from the DEPICT result. Of the 54 genes prioritized by DEPICT, 11 of those were in the VTE gold standard gene list (area under the curve, 0.75; supplemental Figure 3). For the PoPS gene prioritization result, we selected the top 10% genes (N = 1839) with the highest PoP score as the prioritized genes, and 32 of the PoPS-defined prioritized genes were reported in the gold standard gene list (area under the curve, 0.84; supplemental Figure 3). We further evaluated the performance of DEPICT and PoPS in predicting functional genes using the DeLong test, which showed no significant difference between the accuracy of the 2 methods (P = .30). Therefore, we concluded that both methods can be used in the integrative prioritization.

For the proteome-wide Mendelian randomization52 and colocalization analysis, candidate SNPs from the non-Finnish EUR meta-analysis with a P value of <1 × 10–5 were selected for the genetic association with VTE. For the approximate colocalization, we aimed to test whether the leading protein quantitative trait locus was in LD (r2 ≥ 0.8) with a candidate SNP. Genes with proteome-wide Mendelian randomization and colocalization evidence were used for prioritization. We also looked up the lead SNPs in relevant Genotype-Tissue Expression tissues,53 including whole blood, atrial appendage, left ventricle, aortic artery, coronary artery, tibial artery, and Epstein-Barr virus-transformed lymphocytes, and reported significant expression quantitative trait loci (eQTLs; q value of < 0.05). We also considered deleterious mutations for gene prioritization. We identified genes with a pathogenic variant in ClinVar54 as of 7 May 2021 or genes with a nonsynonymous variant in the 95% credible set (calculated using the sum of single effects). For both the eQTL and ClinVar annotations, we also considered genome-wide significant variants within 50 kb upstream and downstream of the lead SNP, but this expanded variant set did not significantly impact the main prioritized gene (supplemental Table 8). Finally, we considered the nearest gene, as annotated with ANNOtate VARiation. For each lead SNP, a simple sum across these 7 lines of evidence was used to identify a potentially causal gene with the most evidence. We used Enrichr55-57 for the 38 prioritized candidates to identify Gene Ontology (GO) molecular function and biologic processes and Kyoto Encyclopedia of Genes and Genomes human pathways.

Targeted knockdown in zebrafish using genome engineering

We used the ChopChop server to identify highly efficient single-guide RNAs (sgRNAs) for CRISPR/Cas9–mediated genome editing.58 A total of 2 to 4 guides were selected for each gene of interest (supplemental Table 9). sgRNAs and Cas9 nuclease were ordered from Synthego. sgRNAs for each gene were pooled, mixed with Cas9, and injected into single-cell embryos produced from ABxTL hybrids. To prove that editing occurred, we validated each sgRNA by lysing injected embryos, performing a polymerase chain reaction assay across the target site, and running it on a sensitive electrophoresis system (Qiaxcel, Qiagen). Successful editing was indicated by a smear rather than a single band. Injection and subsequent laser-induced endothelial injury were performed at least twice for each knockdown, and the results were pooled.

Laser-induced endothelial injury in zebrafish

Zebrafish were maintained according to protocols approved by the University of Michigan Animal Care and Use Committee. Three days after fertilization, larvae were anesthetized using tricaine and mounted in 1.6% low melting agarose. Laser injury was performed using an Andor Micropoint pulsed-dye laser focusing system to target the posterior cardinal vein (PCV), 5 somites caudal to the anal pore,34,35 by an observer blinded to sample identify. Following injury, clotting was initiated in the PCV, and larvae were observed until complete occlusion of the vessel by this developing thrombus to block blood flow. The time to occlusion (TTO) of the PCV was recorded up to 120 seconds. The larvae were lysed and subjected to polymerase chain reaction assays using primers that flanked each target site to confirm successful editing.

Statistical analysis was performed using the ggpubr package in R, v4.1.1. Pairwise Wilcoxon signed rank tests were used for each gene to compare the TTO of uninjected controls and sgRNA-injected embryos. Injection with no sgRNA was used as a negative control. A sensitivity analysis was performed by pooling uninjected control measurements and comparing this distribution with that of each sgRNA-injected gene using a Wilcoxon signed rank test. A Bonferroni threshold of 0.0083 was used to account for 6 independent genes tested.

All animal studies were approved by the University of Michigan Institutional Animal Care and Use Committee with an approval date of 10 March 2022. The GWAS performed for each biobank was approved by each institution’s institutional review board, and only the summary statistics were used here.

Multipopulation meta-analysis yielded 4 potentially novel loci

We performed single variant association analyses across 9 biobanks with diverse ancestries (supplemental Figure 1), followed by a meta-analysis (27 987 cases, 1 035 290 controls), to look for VTE-associated loci. We identified 38 genome-wide significant loci (Figure 1; supplemental Table 10). Of these, 34 lead SNPs were within 500 kb of a variant previously reported in GWAS or sequencing studies, and there were 4 potentially novel associations in DHRS3, HOXB2, ARHGAP4, and LINC02411 (Table 1). We identified a potentially novel locus (rs112106699 near DHRS3) that is rare in EUR ancestries (gnomAD v4.1.0 non-Finnish EUR allele frequency, 0.08%) but that is observed at higher frequency in AFR ancestry cohorts (gnomAD AFR/AFR American allele frequency, 9.0%), highlighting the importance of a multipopulation meta-analysis (supplemental Figure 5). The variant’s imputation quality score ranged from 0.70 to 0.94 (median, 0.84) in 8 contributing GWASs. We also identified a common 166 bp structural variant, rs1459062246, near HOXB2 and a rare variant, rs115924439, with a large effect size (odds ratio, 1.9; 1.5-2.4) near LINC02411, highlighting the importance of the inclusion of common and rare indels, structural variants, and single variants in GWASs. On the X chromosome, we found 2 independent loci, namely common variant p.Thr194Ala, known as F9 Malmö (rs6048),59 and a variant ∼200 kb upstream of BCOR (rs3002417). An intronic variant of FUNDC2 (rs17328181) that was previously nominally associated with VTE (P = 2 × 10–7)19 was also associated.

Figure 1.

Schematic view of genes. Of the 38 genome-wide significant loci, 4 are potentially novel, and 34 are known from previous GWASs. Of the potentially novel genes, 2 have supportive evidence from recent GWAS meta-analyses.38,39 Six of the previously known genes were functionally validated in this study using a zebrafish model of blood clotting with 3 genes showing supportive evidence (significant validation) as the causal gene for modification of VTE in humans.

Figure 1.

Schematic view of genes. Of the 38 genome-wide significant loci, 4 are potentially novel, and 34 are known from previous GWASs. Of the potentially novel genes, 2 have supportive evidence from recent GWAS meta-analyses.38,39 Six of the previously known genes were functionally validated in this study using a zebrafish model of blood clotting with 3 genes showing supportive evidence (significant validation) as the causal gene for modification of VTE in humans.

Close modal
Table 1.

Potentially novel genome-wide significant VTE loci

Variant (risk-increasing allele/non–risk-increasing allele)Risk allele frequency in GBMIRisk allele frequency by ancestry in gnomAD 4.1.0Odds ratioP valuePrioritized gene
chr1:12563482
rs112106699 a/g 
0.004 AMR, 0.0082; AFR, 0.088; FIN, 0.00; NFE, 0.00078 1.57 2 × 10–8 DHRS3
Dehydrogenase/reductase 3 
chr11:100637868
rs11224340 g/a 
0.90 AMR, 0.93; AFR, 0.98; FIN, 0.90; NFE, 0.91 1.09 3 × 10–8 ARHGAP4
Rho GTPase activating protein 4 
chr12:127502539
rs115924439 a/g 
0.002 AMR, 0.0031; AFR, 0.038;
FIN, 0.00; NFE, 0.000088 
1.92 2 × 10–8 LINC02411
Long intergenic nonprotein coding RNA 2411 
chr17:48539858
rs1459062246
166-bp insertion 
0.80 AMR, 0.80; AFR, 0.54;
FIN, 0.77; NFE, 0.78 
1.12 3 × 10–8 HOXB2
Homeobox B2 
Variant (risk-increasing allele/non–risk-increasing allele)Risk allele frequency in GBMIRisk allele frequency by ancestry in gnomAD 4.1.0Odds ratioP valuePrioritized gene
chr1:12563482
rs112106699 a/g 
0.004 AMR, 0.0082; AFR, 0.088; FIN, 0.00; NFE, 0.00078 1.57 2 × 10–8 DHRS3
Dehydrogenase/reductase 3 
chr11:100637868
rs11224340 g/a 
0.90 AMR, 0.93; AFR, 0.98; FIN, 0.90; NFE, 0.91 1.09 3 × 10–8 ARHGAP4
Rho GTPase activating protein 4 
chr12:127502539
rs115924439 a/g 
0.002 AMR, 0.0031; AFR, 0.038;
FIN, 0.00; NFE, 0.000088 
1.92 2 × 10–8 LINC02411
Long intergenic nonprotein coding RNA 2411 
chr17:48539858
rs1459062246
166-bp insertion 
0.80 AMR, 0.80; AFR, 0.54;
FIN, 0.77; NFE, 0.78 
1.12 3 × 10–8 HOXB2
Homeobox B2 

Meta-analysis identified 38 genome-wide significant loci, 4 are potentially novel.

AFR, African ancestry as defined by gnomAD; AMR, Admixed American ancestry; FIN, Finnish ancestry; NFE, non-Finnish EUR.

We performed lookups of novel variants in 2 large previously published GWAS summary statistics (ncases = 42 032; ncases = 81 19038). Two of the 4 novel lead SNPs from the discovery meta-analysis were nominally significant in at least 1 of the studies with 1 being significant at a Bonferroni threshold of 0.013. All 4 loci, except LINC02411, showed a consistent direction of effect between the discovery and lookup cohorts (supplemental Table 11). Because of the potential overlap in individuals between the Electronic Medical Records and Genomics Network samples in the 2019 INVENT meta-analysis and the BioVU samples in GBMI, we also compared the 2022 INVENT meta-analysis with the GBMI without the BioVU samples as a sensitivity analysis (supplemental Figure 6). In addition, in the meta-analysis of the INVENT summary statistics and the GBMI summary statistics without the BioVU samples, all of the known loci had a combined P value of <5 × 10–8, and of the novel loci, only the ARGHAP4 locus met this threshold (supplemental Table 12). Although these novel loci require subsequent replication in genetic studies, the meta-analysis results are robust enough for integrative bioinformatic gene prioritization.

Integrative gene prioritization nominates likely causal genes

For each autosomal locus (n = 35), we performed bioinformatic gene prioritization using 7 independent lines of evidence from genetic, biologic, and clinical databases. More specifically, for each of the associated genetic regions, we recorded a gene that (1) had a missense variant within the 95% credible variant set (8 loci), (2) was closest to the lead associated variant within 1 kb (25 loci), (3) was the only gene prioritized by DEPICT (8 loci), (4) was within the top 10% according to the PoPS scores (17 loci), (5) was an expression quantitative trait loci (eQTL; 4 loci), (6) was found in ClinVar for related phenotypes (5 loci), or (7) was prioritized via colocalization and proteome-wide Mendelian randomization (15 genes at 23 loci; supplemental Table 10). Notably, for the eQTL and ClinVar variants, they must have been the lead SNP at that locus. By summing the number of lines of evidence that support a given gene, we prioritized at least 1 gene with ≥2 lines of supporting evidence at 30 of the 35 loci (supplemental Table 10). The genes with the most evidence (5/7) to be likely causal were F5, PLEK, and PROS1 (Figure 2). Similarly, genes that had 4 lines of evidence that supported their role as a likely causal gene were PROC, FGG, F11, F2, VWF, and PLCG2.

Figure 2.

Integrative gene prioritization. Autosomal genome-wide significant loci labeled by prioritized gene (x-axis) with shading for each line of evidence used in the bioinformatics-driven prioritization scheme (y-axis). Genes in bold were on the gold standard list (supplemental Table 7). The 7 lines of evidence evaluated were chosen to cover different mechanisms through which genetic variants contribute to disease risk, for example, regulatory changes vs protein perturbations. For VPS13D;DHRS3, F7;F10, and LINC02375;LINC02411, the genes had equal numbers of supporting lines of evidence. LINC00656 is also known as RP4-737E23.2. rs536995174 had 1 line of evidence each for SERPING1, SLC43A3, SLC43A1, F2, OR5AK4P, and LRRC55 and was excluded from this visualization. Genes with asterisks were selected for follow-up in a functional assay in zebrafish (Figure 3).

Figure 2.

Integrative gene prioritization. Autosomal genome-wide significant loci labeled by prioritized gene (x-axis) with shading for each line of evidence used in the bioinformatics-driven prioritization scheme (y-axis). Genes in bold were on the gold standard list (supplemental Table 7). The 7 lines of evidence evaluated were chosen to cover different mechanisms through which genetic variants contribute to disease risk, for example, regulatory changes vs protein perturbations. For VPS13D;DHRS3, F7;F10, and LINC02375;LINC02411, the genes had equal numbers of supporting lines of evidence. LINC00656 is also known as RP4-737E23.2. rs536995174 had 1 line of evidence each for SERPING1, SLC43A3, SLC43A1, F2, OR5AK4P, and LRRC55 and was excluded from this visualization. Genes with asterisks were selected for follow-up in a functional assay in zebrafish (Figure 3).

Close modal

Using our integrative gene prioritization approach, we identified 11 genes known to be involved in blood clotting (F2, F5, F9, F7, F10, F11, FGG, PROC, PROCR, PROS1, and VWF), including genes that are known to regulate blood clotting factors from functional analyses but that have not been identified previously in GWASs (eg, PROS1, STAB2, SERPINE2) and genes without known mechanisms (eg, TC2N, PLEK). Using Enrichr for the 38 prioritized candidate genes, we identified significantly enriched gene sets, including the Kyoto Encyclopedia of Genes and Genomes pathway of complement and coagulation cascades (supplemental Table 13; enrichment P = 4 × 10–14), the GO biologic process term of negative regulation of blood coagulation (supplemental Table 14; enrichment P = 6 × 10–8), and the GO molecular function term of serine-type endopeptidase activity (supplemental Table 15; enrichment P = 1 × 10–4). For the 4 novel loci from our GWAS, we prioritized 1 or 2 likely causal genes at each locus based on 2 lines of evidence (HOXB2, ARHGAP4) or only 1 line of evidence (DHRS3, LINC02411). The rare variant in DHRS3 and the large indel in HOXB2 did not have significant associations in the 2 recent GWASs and warrant further validation.

In vivo functional analyses in zebrafish provide evidence for the causal role of RASIP1 and TC2N

The bioinformatic gene prioritization showed evidence that a number of known coagulation factors contribute to VTE. Our previous studies validated several known coagulation factors using the genome-edited zebrafish models of hemostasis and thrombosis, including F2,60,F5,61,F10,62,PROS1,63 and PROC.63 We have also shown previously that the knockout of SERPINC1,34,PROS1, and PROC in zebrafish increased the TTO owing to a consumptive coagulopathy that was caused by excess thrombin activity and the consumption of fibrinogen. This can also be seen in severe human thrombosis, and therefore an increased TTO is consistent with VTE. An increased TTO was also observed with loss of function mutations in procoagulant genes (F2,60,F5,61 F1062). Because GWAS associations with VTE can either be protective or indicate increased risk, the TTO is a simple assay to confirm an individual gene’s causality and whether it is antithrombotic or prothrombotic.

Using this model, we evaluated 6 prioritized genes (F7, RASIP1, TC2N, STAB2, TSPAN15, and PLCG2) from regions that demonstrated conservation of synteny in the zebrafish genome. We chose genes across the spectrum of prioritized genes to see if those with more lines of evidence were more likely to be functional than genes with less lines of evidence. Because our aim was also to test the validity of the in silico gene prioritization, we focused on genes with evidence from multiple genetic studies rather than on our novel gene findings from the genetic discovery. PLCG2 had the most lines of supporting evidence of the genes that were novel when considering functional studies. From genes with 3 lines of evidence, F7 and STAB2 were selected—F7 was used as a positive control in this assay, and STAB2 has a missense variant in the 95% credible set. For the others, we selected those with the most significant P values among the genes with 2 lines of evidence that had either been associated with thrombosis through an unknown mechanism or that had not been previous implicated in coagulation.

CRISPR/Cas9 was used to create mosaic knockdown larvae64 for genotype-blinded evaluation of the TTO after induced endothelial injury. After accounting for multiple testing using the Bonferroni multiple testing correction, we have supportive evidence for RASIP1 (Wilcoxon signed-rank test P = 2.8 × 10–15) and TC2N (Wilcoxon signed-rank test P = 8.1 × 10–4) in the modification of human VTE (Figure 3). STAB2 and PLCG2 were nominally significant (P < .05). To increase power, we also pooled the noninjected controls from multiple experiments and compared the median TTO to the median for each knockdown. This secondary comparison provided additional evidence for RASIP1 and provided suggestive evidence for STAB2 and TSPAN15 (P < Bonferroni-adjusted threshold 0.0083) (supplemental Figure 7).

Figure 3.

Functional evidence for causal genes in genetically modified zebrafish.P values from Wilcoxon rank sum tests are listed at the top. The y-axis represents the experimental TTO for control and sgRNA-injected zebrafish embryos with the x-axis showing the genes targeted through CRISPR. Injections made without sgRNA served as a negative control. Factor 7 (F7) served as a positive control.

Figure 3.

Functional evidence for causal genes in genetically modified zebrafish.P values from Wilcoxon rank sum tests are listed at the top. The y-axis represents the experimental TTO for control and sgRNA-injected zebrafish embryos with the x-axis showing the genes targeted through CRISPR. Injections made without sgRNA served as a negative control. Factor 7 (F7) served as a positive control.

Close modal

In this study, we performed a multipopulation meta-analysis of GWASs of VTE, compared the findings with those of similar studies,38,39 and identify 4 potentially novel loci. Using a bioinformatics-driven gene prioritization heuristic, we identify prioritized genes at each locus and validated these through knockdown of the 6 putative causal genes in zebrafish. The integrative prioritization method is similar to previous studies, but no gold standard method exists for defining the most probable causal gene. One limitation of the credible sets used for prioritization is the unreliability of the fine-mapping results from a multipopulation meta-analysis.65 A purely bioinformatics-driven approach has its limitations, for example, SCARA5 only had 2 lines of evidence despite previous functional work suggesting its role in von Willebrand factor clearance.66 However, it can be useful for prioritizing genes for functional follow-up in situations in which the number of candidate genes in the region is too high to take forward into biologic models.

We can assign a candidate causal gene to 2 of the novel loci after supplementing bioinformatics-driven integrative gene prioritization with literature review. On chromosome 11, the intergenic variant rs11224340 both is an eQTL in tibial arterial tissue for ARHGAP4 and is 46 kb away. ARHGAP4 previously has been associated with blood pressure,67 whereas CNTN5, 278 kb away, has been associated with platelet count,68 white blood cell count,68 and red blood cell distribution width.69 On chromosome 17, the insertion falls into a gene-enhancer region between HOXB1 and HOXB2 and near HOXB2-AS1. Although there are blood trait associations in the GWAS Catalog, it is unclear which HOX gene in this region may be causal, although the associated lead variant is 2.4 kb upstream and also an eQTL for HOXB2. The potentially novel variants on chromosome 1 (DHRS3) and chromosome 12 (LINC02411) do not have clear candidate causal genes and are assigned based on proximity.

Other known genes had conflicting evidence in gene prioritization. On chromosome 6, the intronic variant rs10559566 is in CARMIL1, a gene previously associated with platelet counts,70 and was the highest prioritized gene based on PoPS, however, the strongest eQTLs for this variant point to SCGN. The location of the lead SNP and eQTLs indicated that JAZF1-AS was the likely causal gene of the intronic, noncoding RNA variant rs1513275 on chromosome 7, and JAZF1 is known to be associated with type 2 diabetes.71 This locus has been associated with F7 activity in a previous study72 in which no bioinformatic gene-prioritization was performed. However, de Vries et al did find that silencing JAZF1 in liver cells lowered the expression of F7 messenger RNA and protein. Because the original association was with F7 activity, which was only partially attenuated by the silencing, the authors concluded that the genetic variants in this locus might play independent roles in antigen and activity levels.

Although imperfect, the integrative gene prioritization provided a starting point for functional studies. Although the TTO was not significantly different after accounting for multiple testing, STAB2, TSPAN15, and PLCG2 remain candidates as causal genes. For STAB2, sequencing of 393 VTE cases and 6114 controls identified rare, damaging variants in the gene with strong evidence for a role in modifying thrombosis risk.73 Furthermore, mouse knockout of Stab2 was prothrombotic and led to the formation of large venous thrombi.74 The lack of functional confirmation in the zebrafish model could be because of limited statistical power, insufficient knockdown, or species-specific differences. In addition, this assay primarily tests the ability to produce fibrin-rich thrombi. It is possible that these genes affect thrombosis through modification of other pathways involved in clotting, including platelets and vasculature. For example, although RASIP1 knockdown did significantly alter the TTO, this gene is also a regulator of vascular integrity75 and therefore might mediate VTE through other mechanisms. The additional prioritized genes from the genome-wide significant loci remain intriguing for functional follow-up in future studies.

This work contributes to the understanding of genetic variation associated with VTE by layering information from genetic, medical, and biologic studies and models to link the genetic findings to biologic function. By combining evidence from a large-scale GWAS, a multitude of existing data sources, and bioinformatic tools for in silico follow-up and functional follow-up in in vivo model organism, we identified 38 genome-wide significant loci (4 potentially novel) with plausible underlying causal genes, 2 (RASIP1 and TC2N) of which had biologic support in the functional follow-up. These genes may be further studied to identify diagnostic or therapeutic targets that may aid in the management of VTE. Finally, further studies that integrate multiple layers of information will add to the understanding of the human genetic background of VTE and lead to new insights into VTE pathophysiology.

The authors acknowledge the biobank participants, recruitment teams, and project managers of the Global Biobank Meta-Analysis Initiative for providing their data for biomedical research and providing data aggregation, management, and distribution services in support of the research reported in this publication (especially Sinéad Chapman and Bethany Klunder). The authors acknowledge BioBank Japan (Yukinori Okada, Koichi Matsua, and Masahiro Kanai), BioMe (Ruth Loos, Judy Cho, Eimear Kenny, Michael Preuss, and Simon Lee), BioVU (Nancy Cox and Jibril Hirbo), Canadian Partnership for Tomorrow (Philip Awadalla and Marie-Julie Fave), China Kadoorie (Robin Walters, Kuang Lin, and Iona Millwood), Colorado Center for Personalized Medicine (Kathleen Barnes, Michelle Daya, and Chris Gignoux), deCODE Genetics (Kári Stefánsson and Unnur þorsteinsdóttir), East London Genes & Health (David A. van Heel, Sarah Finer, and Richard Trembath), Estonian Biobank (Andres Metspalu, Reedik Mägi, Tõnu Esko, and Priit Palta), FinnGen (Aarno Palotie, Mark Daly, Samuli Ripatti, Mitja Kurki, and Juha Karjalainen), Generation Scotland (Caroline Hayward and Riccardo Marioni), the Trøondelag Health Study (HUNT) (Kristian Hveem, Cristen Willer, Sarah Graham, Ben Brumpton, and Brooke Wolford), Lifelines (Serena Sanna and Esteban Lopera), Michigan Genomics Initiative (Sebastian Zoellner, Michael Boehnke, Lars Fritsche, and Anita Pandit), Million Veteran Program (Christopher J. O’Donnell), Netherlands Twin Register (D. I. Boomsma and M. G. Nivard), Partners Biobank (Jordan Smoller and Yen-Chen Feng), QIMR Berghofer (Sarah Medland, Stuart McGregor, and Nathan Ingold), Taiwan Biobank (Yen-Feng Lin, Yen-Chen Feng, and Hailiang Huang), University of California, Los Angeles Precision Health Biobank (Ruth Johnson, Yi Ding, Alec Chiu, Bogdan Pasaniuc, and Daniel Geschwind), and UK Biobank (Konrad Karczewski and Alicia Martin).

J.A.S. was supported by R35 HL150784 and the Henry and Mala Dorfman Family Professorship in Pediatric Hematology/Oncology. K.C.D. was supported by R01 HL172780. D.-A.T. was supported by the Multi-omics Approach to Trackle the Epidemiology of Venous Thromboembolism (EPIDEMIOM-VT) Senior Chair from the University of Bordeaux initiative of excellence and the Laboratory of Excellence on Medical Genomics (GENMED LabEx, ANR-10-LABX-0013), a research program managed by the National Research Agency (ANR) as part of the French Investment for the Future. C.J.W., I.S., K.-H.H.W., and B.N.W. were supported by R35-HL135824 (Willer, PI). S.M.D. was supported by IK2-CX001780. This research is based on data from the Million Veteran Program, Office of Research and Development, Veterans Health Administration, and was supported by award number BX003362. This work was supported by funding from the Department of Veterans Affairs Office of Research and Development, Million Veteran Program Grant MVP000; Department of Veterans. L.B. and B.M.B. work in a research unit funded by the Liaison Committee for education, research and innovation in Central Norway and the joint research committee of St. Olavs Hospital and the Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by the National Cancer Institute, National Human Genome Research Institute, National Heart, Lung, and Blood Institute, National Institute on Drug Abuse, National Institute of Mental Health, and National Institute of Neurological Disorders and Stroke. The data used for the analyses described in this manuscript were obtained from GTEx Analysis v8 on the GTEx Portal on 1 May 2021.

This publication does not represent the views of the Department of Veteran Affairs or the United States Government. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the United States Department of Health and Human Services.

Contribution: W.Z., B.N.W., I.S., and K.-H.H.W. performed the bioinformatic analyses; B.M.B. and L.B. performed the Mendelian randomization; F.T., A.D.J., N.L.S., S.M.D., D.K., and D.-A.T. performed the lookup; Q.Y.Z., X.Y., C.E.R., and J.A.S. performed the functional analyses; B.N.W., I.S., K.-H.H.W., K.C.D., C.E.R., and J.A.S. designed the research and wrote the manuscript; V.L.F. and K.T. provided critical feedback and revision of the manuscript; and C.J.W., M.J.D., and B.M.N. provided critical reviews of the manuscript.

Conflict-of-interest disclosure: C.J.W. and K.-H.H.W. report being employed at Regeneron Pharmaceuticals, although they were not at the time of this study. D.K. reports being employed at Bitterroot Bio, although he was not at the time of this study. S.M.D. reports research support from Novo Nordisk and Amgen, outside the scope of the current research; and is named as a coinventor on a government-owned US Patent application related to the use of genetic risk prediction for venous thromboembolic disease filed by the US Department of Veterans Affairs in accordance with Federal regulatory requirements. J.A.S. reports serving as a consultant for Sanofi, Novo Nordisk, Biomarin, Takeda, Pfizer, Genentech, CSL Behring, and Medexus. The remaining authors declare no competing financial interests.

A complete list of the members of the Global Biobank Meta-analysis Initiative (GBMI) study group and INVENT, MVP consortium appears in the supplemental Appendix.

Correspondence: Ida Surakka, Division of Cardiovascular Medicine, Department of Internal Medicine, University of Michigan, NCRC Building 26, Room 361S, 2800 Plymouth Rd, Ann Arbor, MI 48109-2800; email: isurakka@umich.edu.

1.
Henke
PK
,
Kahn
SR
,
Pannucci
CJ
, et al;
American Heart Association Advocacy Coordinating Committee
.
Call to action to prevent venous thromboembolism in hospitalized patients: a policy statement from the American Heart Association
.
Circulation
.
2020
;
141
(
24
):
e914
-
e931
.
2.
Heit
JA
.
Epidemiology of venous thromboembolism
.
Nat Rev Cardiol
.
2015
;
12
(
8
):
464
-
474
.
3.
Centers for Disease Control and Prevention
.
Data and statistics on venous thromboembolism
. Accessed 1 August 2021. https://www.cdc.gov/blood-clots/data-research/facts-stats/index.html.
4.
Centers for Disease Control and Prevention
.
Impact of blood clots on the United States
. Accessed 1 August 2021. https://www.cdc.gov/blood-clots/toolkit/impact-of-blood-clots.html?CDC_AAref_Val=https://www.cdc.gov/ncbddd/dvt/infographic-impact.html.
5.
Zöller
B
,
Pirouzifard
M
,
Svensson
PJ
, et al
.
Familial segregation of venous thromboembolism in Sweden: a nationwide family study of heritability and complex segregation analysis
.
J Am Heart Assoc
.
2021
;
10
(
24
):
e020323
.
6.
Souto
JC
,
Almasy
L
,
Muñiz-Diaz
E
, et al
.
Functional effects of the ABO locus polymorphism on plasma levels of von Willebrand factor, factor VIII, and activated partial thromboplastin time
.
Arterioscler Thromb Vasc Biol
.
2000
;
20
(
8
):
2024
-
2028
.
7.
Bertina
RM
,
Koeleman
BP
,
Koster
T
, et al
.
Mutation in blood coagulation factor V associated with resistance to activated protein C
.
Nature
.
1994
;
369
(
6475
):
64
-
67
.
8.
Trégouët
DA
,
Heath
S
,
Saut
N
, et al
.
Common susceptibility alleles are unlikely to contribute as strongly as the FV and ABO loci to VTE risk: results from a GWAS approach
.
Blood
.
2009
;
113
(
21
):
5298
-
5303
.
9.
Germain
M
,
Saut
N
,
Greliche
N
, et al
.
Genetics of venous thrombosis: insights from a new genome wide association study
.
PLoS One
.
2011
;
6
(
9
):
e25581
.
10.
Heit
JA
,
Armasu
SM
,
Asmann
YW
, et al
.
A genome-wide association study of venous thromboembolism identifies risk variants in chromosomes 1q24.2 and 9q
.
J Thromb Haemost
.
2012
;
10
(
8
):
1521
-
1531
.
11.
Tang
W
,
Teichert
M
,
Chasman
DI
, et al
.
A genome-wide association study for venous thromboembolism: the extended Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium
.
Genet Epidemiol
.
2013
;
37
(
5
):
512
-
521
.
12.
Germain
M
,
Chasman
DI
,
de Haan
H
, et al;
Cardiogenics Consortium
.
Meta-analysis of 65,734 individuals identifies TSPAN15 and SLC44A2 as two susceptibility loci for venous thromboembolism
.
Am J Hum Genet
.
2015
;
96
(
4
):
532
-
542
.
13.
Hernandez
W
,
Gamazon
ER
,
Smithberger
E
, et al
.
Novel genetic predictors of venous thromboembolism risk in African Americans
.
Blood
.
2016
;
127
(
15
):
1923
-
1929
.
14.
Hinds
DA
,
Buil
A
,
Ziemek
D
, et al;
METASTROKE Consortium, INVENT Consortium
.
Genome-wide association analysis of self-reported events in 6135 individuals and 252 827 controls identifies 8 loci associated with thrombosis
.
Hum Mol Genet
.
2016
;
25
(
9
):
1867
-
1874
.
15.
Rühle
F
,
Witten
A
,
Barysenka
A
, et al
.
Rare genetic variants in SMAP1, B3GAT2, and RIMS1 contribute to pediatric venous thromboembolism
.
Blood
.
2017
;
129
(
6
):
783
-
790
.
16.
Heit
JA
,
Armasu
SM
,
McCauley
BM
, et al
.
Identification of unique venous thromboembolism-susceptibility variants in African-Americans
.
Thromb Haemost
.
2017
;
117
(
4
):
758
-
768
.
17.
Klarin
D
,
Emdin
CA
,
Natarajan
P
,
Conrad
MF
,
Kathiresan
S
;
INVENT Consortium
.
Genetic analysis of venous thromboembolism in UK Biobank identifies the ZFPM2 locus and implicates obesity as a causal risk factor
.
Circ Cardiovasc Genet
.
2017
;
10
(
2
):
e001643
.
18.
Thibord
F
,
Hardy
L
,
Ibrahim-Kosta
M
, et al
.
A genome wide association study on plasma FV levels identified PLXDC2 as a new modifier of the coagulation process
.
J Thromb Haemost
.
2019
;
17
(
11
):
1808
-
1814
.
19.
Lindström
S
,
Wang
L
,
Smith
EN
, et al;
Million Veteran Program
CHARGE Hemostasis Working Group
.
Genomic and transcriptomic association studies identify 16 novel susceptibility loci for venous thromboembolism
.
Blood
.
2019
;
134
(
19
):
1645
-
1657
.
20.
Klarin
D
,
Busenkell
E
,
Judy
R
, et al;
INVENT Consortium
Veterans Affairs’ Million Veteran Program
.
Genome-wide association analysis of venous thromboembolism identifies new risk loci and genetic overlap with arterial vascular disease
.
Nat Genet
.
2019
;
51
(
11
):
1574
-
1579
.
21.
Deguchi
H
,
Shukla
M
,
Hayat
M
,
Torkamani
A
,
Elias
DJ
,
Griffin
JH
.
Novel exomic rare variants associated with venous thrombosis
.
Br J Haematol
.
2020
;
190
(
5
):
783
-
786
.
22.
Rodriguez
BAT
,
Bhan
A
,
Beswick
A
, et al;
lFinnGen Study
.
A platelet function modulator of thrombin activation is causally linked to cardiovascular disease and affects PAR4 receptor signaling
.
Am J Hum Genet
.
2020
;
107
(
2
):
211
-
221
.
23.
Mateos
MK
,
Tulstrup
M
,
Quinn
MC
, et al
.
Genome-wide association meta-analysis of single-nucleotide polymorphisms and symptomatic venous thromboembolism during therapy for acute lymphoblastic leukemia and lymphoma in Caucasian children
.
Cancers
.
2020
;
12
(
5
):
1285
.
24.
Herrera-Rivero
M
,
Stoll
M
,
Hegenbarth
J-C
, et al
.
Single- and multimarker genome-wide scans evidence novel genetic risk modifiers for venous thromboembolism
.
Thromb Haemost
.
2021
;
121
(
9
):
1169
-
1180
.
25.
Stefanucci
L
,
Collins
JH
,
Sims
MC
, et al
.
The effects of pathogenic and likely pathogenic variants for inherited hemostasis disorders in 140 214 UK Biobank participants
.
Blood
.
2023
;
142
(
24
):
2055
-
2068
.
26.
Manderstedt
E
,
Halldén
C
,
Lind-Halldén
C
, et al;
Regeneron Genetics Center
.
Thrombotic risk determined by protein C receptor (PROCR) variants among middle-aged and older adults: a population-based cohort study
.
Thromb Haemost
.
2022
;
122
(
8
):
1326
-
1332
.
27.
Dennis
J
,
Johnson
CY
,
Adediran
AS
, et al
.
The endothelial protein C receptor (PROCR) Ser219Gly variant and risk of common thrombotic disorders: a HuGE review and meta-analysis of evidence from observational studies
.
Blood
.
2012
;
119
(
10
):
2392
-
2400
.
28.
Howe
K
,
Clark
MD
,
Torroja
CF
, et al
.
The zebrafish reference genome sequence and its relationship to the human genome
.
Nature
.
2013
;
496
(
7446
):
498
-
503
.
29.
Kretz
CA
,
Weyand
AC
,
Shavit
JA
.
Modeling disorders of blood coagulation in the zebrafish
.
Curr Pathobiol Rep
.
2015
;
3
(
2
):
155
-
161
.
30.
Liu
LY
,
Fox
CS
,
North
TE
,
Goessling
W
.
Functional validation of GWAS gene candidates for abnormal liver function during zebrafish liver development
.
Dis Model Mech
.
2013
;
6
(
5
):
1271
-
1278
.
31.
Škorić-Milosavljević
D
,
Tadros
R
,
Bosada
FM
, et al;
KORA-Study Group
.
Common genetic variants contribute to risk of transposition of the great arteries
.
Circ Res
.
2022
;
130
(
2
):
166
-
180
.
32.
Gehlen
J
,
Stundl
A
,
Debiec
R
, et al
.
Elucidation of the genetic causes of bicuspid aortic valve disease
.
Cardiovasc Res
.
2023
;
119
(
3
):
857
-
866
.
33.
Adeyemo
AA
,
Zaghloul
NA
,
Chen
G
, et al;
South Africa Zulu Type 2 Diabetes Case-Control Study
.
ZRANB3 is an African-specific type 2 diabetes locus associated with beta-cell mass and insulin response
.
Nat Commun
.
2019
;
10
(
1
):
3195
.
34.
Liu
Y
,
Kretz
CA
,
Maeder
ML
, et al
.
Targeted mutagenesis of zebrafish antithrombin III triggers disseminated intravascular coagulation and thrombosis, revealing insight into function
.
Blood
.
2014
;
124
(
1
):
142
-
150
.
35.
Rost
MS
,
Grzegorski
SJ
,
Shavit
JA
.
Quantitative methods for studying hemostasis in zebrafish larvae
.
Methods Cell Biol
.
2016
;
134
:
377
-
389
.
36.
Raghunath
A
,
Ferguson
AC
,
Shavit
JA
.
Fishing for answers to hemostatic and thrombotic disease: genome editing in zebrafish
.
Res Pract Thromb Haemost
.
2022
;
6
(
5
):
e12759
.
37.
Zhou
W
,
Kanai
M
,
Wu
K-HH
, et al;
Biobank of the Americas
Biobank Japan Project
BioMe
BioVU
CanPath - Ontario Health Study
China Kadoorie Biobank Collaborative Group
Colorado Center for Personalized Medicine
deCODE Genetics
Estonian Biobank
FinnGen
Generation Scotland
Genes & Health Research Team
LifeLines
Mass General Brigham Biobank
Michigan Genomics Initiative
National Biobank of Korea
Penn Medicine BioBank
Qatar Biobank
QSkin Sun and Health Study
Taiwan Biobank
HUNT Study
UCLA ATLAS Community Health Initiative
Uganda Genome Resource
UK Biobank
.
Global Biobank meta-analysis Initiative: powering genetic discovery across human disease
.
Cell Genom
.
2022
;
2
(
10
):
100192
.
38.
Thibord
F
,
Klarin
D
,
Brody
JA
, et al;
Global Biobank Meta-Analysis Initiative; Estonian Biobank Research Team; 23andMe Research Team; Biobank Japan; CHARGE Hemostasis Working Group
.
Cross-ancestry investigation of venous thromboembolism genomic predictors
.
Circulation
.
2022
;
146
(
16
):
1225
-
1242
.
39.
Ghouse
J
,
Tragante
V
,
Ahlberg
G
, et al
.
Genome-wide meta-analysis identifies 93 risk loci and enables risk prediction equivalent to monogenic forms of venous thromboembolism
.
Nat Genet
.
2023
;
55
(
3
):
399
-
409
.
40.
Vermeesch
P
.
IsoplotR: a free and open toolbox for geochronology
.
Geosci Front
.
2018
;
9
(
5
):
1479
-
1493
.
41.
Wickham
H
. Ggplot2: Elegant Graphics for Data Analysis.
Springer
;
2016
.
42.
Venables
WN
,
Ripley
BD
. Modern Applied Statistics With S. Statistics and Computing.
Springer
;
2002
.
43.
Barrett
T
,
Dowle
M
,
Srinivasan
A
, et al
.
data.table: extension of ‘data.frame’. R package version 1.17.99
. 2025. Accessed 16 July 2025. https://r-datatable.com.
44.
Arnold
J
.
ggthemes: extra themes, scales and geoms for ‘ggplot2’. R package version 5.1.0.9000
. https://jrnold.github.io/ggthemes/.
45.
Wickham
H
,
Francois
R
,
Henry
L
,
Müller
K
,
Vaughan
D
.
dplyr: a grammar of data manipulation. R package version 1.1.4
. 2025. Accessed 16 July 2025. https://dplyr.tidyverse.org.
46.
Wang
G
,
Sarkar
A
,
Carbonetto
P
,
Stephens
M
.
A simple new approach to variable selection in regression, with application to genetic fine mapping
.
J R Stat Soc Ser B Stat Methodol
.
2020
;
82
(
5
):
1273
-
1300
.
47.
Pers
TH
,
Karjalainen
JM
,
Chan
Y
, et al;
Genetic Investigation of ANthropometric Traits GIANT Consortium
.
Biological interpretation of genome-wide association studies using predicted gene functions
.
Nat Commun
.
2015
;
6
:
5890
.
48.
Abecasis
GR
,
Altshuler
D
,
Auton
A
, et al;
1000 Genomes Project Consortium
.
A map of human genome variation from population-scale sequencing
.
Nature
.
2010
;
467
(
7319
):
1061
-
1073
.
49.
Weeks
EM
,
Ulirsch
JC
,
Cheng
NY
, et al
.
Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases
.
bioRxiv
.
Preprint posted online 10 September 2020
.
50.
Auton
A
,
Brooks
LD
,
Durbin
RM
, et al;
1000 Genomes Project Consortium
.
A global reference for human genetic variation
.
Nature
.
2015
;
526
(
7571
):
68
-
74
.
51.
Downes
K
,
Megy
K
,
Duarte
D
, et al;
NIHR BioResource
.
Diagnostic high-throughput sequencing of 2396 patients with bleeding, thrombotic, and platelet disorders
.
Blood
.
2019
;
134
(
23
):
2082
-
2091
.
52.
Zhao
H
,
Rasheed
H
,
Nøst
TH
, et al
.
Proteome-wide Mendelian randomization in global biobank meta-analysis reveals multi-ancestry drug targets for common diseases
.
Cell Genom
.
2022
;
2
(
11
):
100195
.
53.
GTEx Consortium
.
The GTEx Consortium atlas of genetic regulatory effects across human tissues
.
Science
.
2020
;
369
(
6509
):
1318
-
1330
.
54.
Landrum
MJ
,
Lee
JM
,
Benson
M
, et al
.
ClinVar: improving access to variant interpretations and supporting evidence
.
Nucleic Acids Res
.
2018
;
46
(
D1
):
D1062
-
D1067
.
55.
Chen
EY
,
Tan
CM
,
Kou
Y
, et al
.
Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool
.
BMC Bioinformatics
.
2013
;
14
:
128
.
56.
Kuleshov
MV
,
Jones
MR
,
Rouillard
AD
, et al
.
Enrichr: a comprehensive gene set enrichment analysis web server 2016 update
.
Nucleic Acids Res
.
2016
;
44
(
W1
):
W90
-
W97
.
57.
Xie
Z
,
Bailey
A
,
Kuleshov
MV
, et al
.
Gene set knowledge discovery with Enrichr
.
Curr Protoc
.
2021
;
1
(
3
):
e90
.
58.
Labun
K
,
Montague
TG
,
Krause
M
,
Torres Cleuren
YN
,
Tjeldnes
H
,
Valen
E
.
CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing
.
Nucleic Acids Res
.
2019
;
47
(
W1
):
W171
-
W174
.
59.
Bezemer
ID
,
Arellano
AR
,
Tong
CH
, et al
.
F9 Malmö, factor IX and deep vein thrombosis
.
Haematologica
.
2009
;
94
(
5
):
693
-
699
.
60.
Grzegorski
SJ
,
Hu
Z
,
Liu
Y
, et al
.
Disruption of the kringle 1 domain of prothrombin leads to late onset mortality in zebrafish
.
Sci Rep
.
2020
;
10
:
4049
.
61.
Weyand
AC
,
Grzegorski
SJ
,
Rost
MS
, et al
.
Analysis of factor V in zebrafish demonstrates minimal levels needed for early hemostasis
.
Blood Adv
.
2019
;
3
(
11
):
1670
-
1680
.
62.
Hu
Z
,
Liu
Y
,
Huarng
MC
, et al
.
Genome editing of factor X in zebrafish reveals unexpected tolerance of severe defects in the common pathway
.
Blood
.
2017
;
130
(
5
):
666
-
676
.
63.
Ku
C-J
,
Yu
X
,
Zhao
QY
, et al
.
Loss of protein C vs protein S results in discrepant thrombotic phenotypes
.
Blood Adv
.
2025
;
9
(
3
):
545
-
557
.
64.
Burger
A
,
Lindsay
H
,
Felker
A
, et al
.
Maximizing mutagenesis with solubilized CRISPR-Cas9 ribonucleoprotein complexes
.
Development
.
2016
;
143
(
11
):
2025
-
2037
.
65.
Kanai
M
,
Ulirsch
JC
,
Karjalainen
J
, et al
.
Insights from complex trait fine-mapping across diverse populations
.
bioRxiv
.
Preprint posted online 5 September 2021
.
66.
Swystun
LL
,
Ogiwara
K
,
Lai
JD
, et al
.
The scavenger receptor SCARA5 is an endocytic receptor for von Willebrand factor expressed by littoral cells in the human spleen
.
J Thromb Haemost
.
2019
;
17
(
8
):
1384
-
1396
.
67.
Kato
N
,
Loh
M
,
Takeuchi
F
, et al;
BIOS-consortium
CARDIo GRAMplusCD
LifeLines Cohort Study
InterAct Consortium
.
Trans-ancestry genome-wide association study identifies 12 genetic loci influencing blood pressure and implicates a role for DNA methylation
.
Nat Genet
.
2015
;
47
(
11
):
1282
-
1293
.
68.
Sakaue
S
,
Kanai
M
,
Tanigawa
Y
, et al;
FinnGen
.
A cross-population atlas of genetic associations for 220 human phenotypes
.
Nat Genet
.
2021
;
53
(
10
):
1415
-
1424
.
69.
Astle
WJ
,
Elding
H
,
Jiang
T
, et al
.
The allelic landscape of human blood cell trait variation and links to common complex disease
.
Cell
.
2016
;
167
(
5
):
1415
-
1429.e19
.
70.
Wei
Y
,
Tejera
P
,
Wang
Z
, et al
.
A missense genetic variant in LRRC16A/CARMIL1 improves acute respiratory distress syndrome survival by attenuating platelet count decline
.
Am J Respir Crit Care Med
.
2017
;
195
(
10
):
1353
-
1361
.
71.
Zeggini
E
,
Scott
LJ
,
Saxena
R
, et al;
Wellcome Trust Case Control Consortium
.
Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes
.
Nat Genet
.
2008
;
40
(
5
):
638
-
645
.
72.
de Vries
PS
,
Sabater-Lleal
M
,
Huffman
JE
, et al;
INVENT Consortium
MEGASTROKE Consortium of the International Stroke Genetics Consortium
.
A genome-wide association study identifies new loci for factor VII and implicates factor VII in ischemic stroke etiology
.
Blood
.
2019
;
133
(
9
):
967
-
977
.
73.
Desch
KC
,
Ozel
AB
,
Halvorsen
M
, et al
.
Whole-exome sequencing identifies rare variants in STAB2 associated with venous thromboembolic disease
.
Blood
.
2020
;
136
(
5
):
533
-
541
.
74.
Michels
A
,
Swystun
LL
,
Dwyer
CN
, et al
.
Stabilin-2 deficiency increases thrombotic burden and alters the composition of venous thrombi in a mouse model
.
J Thromb Haemost
.
2021
;
19
(
10
):
2440
-
2453
.
75.
Wilson
CW
,
Parker
LH
,
Hall
CJ
, et al
.
Rasip1 regulates vertebrate vascular endothelial junction stability through Epac1-Rap1 signaling
.
Blood
.
2013
;
122
(
22
):
3678
-
3690
.

Author notes

B.N.W. and Q.Y.Z. contributed equally to this study.

J.A.S and I.S. are joint senior authors.

Genome-wide association study summary statistics are available for download at https://www.globalbiobankmeta.org/resources and for browsing at http://results.globalbiobankmeta.org. The integrative gene prioritization data may be found in a data supplement available with the online version of this article.

The full-text version of this article contains a data supplement.