BACKGROUND. Most oncological and non-oncological hematological conditions fall under the category of rare diseases. Rare diseases present unique challenges due to the limited availability of data, which impacts diagnostic rates and the generation of clinical evidence. Overall, they constitute a public health concern, highlighting the urgent need to develop new methods for improving data accessibility. In this context, Federated Learning (FL) is a Machine Learning approach that allows multiple centers to collaborate on complex research questions without the need to centralize or share data.
This project was conducted by the Genomed4all and Synthema consortia with the goal of developing an innovative FL platform for rare hematological diseases. This platform enables the development of novel Artificial Intelligence (AI) models for personalized medicine without data sharing, to be implemented in the referral centers of EuroBloodNET, the European Reference Network for rare hematological diseases. The aims of the project were: 1) to develop robust federated models for personalized prediction using multicentric, real-world datasets; 2) to protect patients' privacy; and 3) to enhance collaboration between institutions while avoiding the creation of centralized data repositories.
METHODS. The FL platform includes a manager node (MN) and multiple worker nodes (WN). Users upload their model to the MN, which distributes it to WNs for local training. Trained weights are returned to the MN for aggregation, repeating until training is complete. The MN is hosted at the Humanitas Research Hospital, and the 3 WNs are located at Humanitas, Universität Leipzig, and the University of Bologna.
The testing approach is based on MOSAIC, an AI-based framework for multimodal analysis in rare cancers (PMID 38875514). We focused on myelodysplastic syndromes as a use case, since they are a rare hematological disease with high clinical and genomic heterogeneity (which represents a challenging scenario). The dataset includes comprehensive clinical and genomic data, with information on treatments and clinical outcomes. Each of the 3 WNs has a different sample size of patient data, with 2656, 1328, and 443 samples. Our goal was to build an AI-based model for prediction of survival probability, combining molecular information with clinical data. We planned 3 scenarios for testing: a best-case scenario using all available features, a worst-case scenario excluding up to 50% of genomic information, and a random scenario where an incremental percentage of the genomic variables were randomly removed. For each setting, a DeepSurv model is trained using 80% of local data for training and 20% for testing.
RESULTS. In the best-case scenario, the model achieved a concordance index (c-index) of 0.75. The node with the smallest dataset initially registered the lowest c-index value (0.4); however, after the first training step, its performance began to follow the trend at levels comparable to the nodes with larger samples (0.54). In the random scenarios, the model still achieved a c-index of 0.75 but required more training rounds to converge. The same pattern was also observed in the worst-case scenario that was able to achieve a c-index of 0.74. The most relevant difference between the tested scenarios is the increased training time required to reach a good and stable performance when missing information is present. The developed technology is fully compliant with European GDPR regulations.
In the upcoming months, the FL platform will be implemented in the referral centers of the EuroBloodNET clinical network. In addition to tabular data, we plan to address the federation of features extracted from medical images, including both histological slides and radiological scans (CT, MRI and PET-CT).
CONCLUSION. The FL platform for rare hematological diseases allows multicentric training of AI models without sharing sensitive patient data. This approach ensures data privacy and security, addressing specific challenges associated with rare diseases. Moreover, this platform manages scenarios with missing data or variables, maintaining robustness and accuracy of predictive models. Overall, FL approach is expected to facilitate the development of more advanced and reliable healthcare solutions, paving the way toward personalized medicine in hematology.
Kubasch:Curis: Research Funding; Novartis: Honoraria, Research Funding; BMS: Honoraria; Janssen: Honoraria, Research Funding. Platzbecker:MDS Foundation: Membership on an entity's Board of Directors or advisory committees; Abbvie: Consultancy, Research Funding; Curis: Consultancy, Honoraria, Research Funding; Geron: Consultancy; Janssen: Consultancy, Honoraria, Research Funding; Merck: Research Funding; BMS: Consultancy, Membership on an entity's Board of Directors or advisory committees, Other: Travel support, Research Funding; Amgen: Consultancy, Research Funding; Novartis: Consultancy, Research Funding. Diez-Campelo:GSK: Consultancy, Membership on an entity's Board of Directors or advisory committees; CURIS: Membership on an entity's Board of Directors or advisory committees; AGIOS: Consultancy, Membership on an entity's Board of Directors or advisory committees; BLUEPRINT MEDICINES: Consultancy, Membership on an entity's Board of Directors or advisory committees; KEROS: Honoraria, Membership on an entity's Board of Directors or advisory committees; Novartis: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees; ASTEX/OTSUKA: Membership on an entity's Board of Directors or advisory committees, Other: TRAVEL TO MEETINGS; BMS/Celgene: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Other: Advisory board fees; HEMAVAN: Membership on an entity's Board of Directors or advisory committees; SYROS: Membership on an entity's Board of Directors or advisory committees; Gilead: Other: Travel reimbursement. Fenaux:Jazz Pharmaceuticals: Honoraria, Research Funding; Novartis: Research Funding; Astex: Research Funding; Servier: Research Funding; Janssen: Research Funding; AbbVie: Honoraria, Research Funding; Agios: Research Funding; BMS: Honoraria, Research Funding. Colombatti:Forma Therapeutics: Membership on an entity's Board of Directors or advisory committees; Vertex/Addmedica: Membership on an entity's Board of Directors or advisory committees; Pfizer: Consultancy, Membership on an entity's Board of Directors or advisory committees; Agios: Membership on an entity's Board of Directors or advisory committees; Novo Nordisk: Membership on an entity's Board of Directors or advisory committees. Mañú-Pereira:Agios: Other: Advisory board, Research Funding; Bristol Myers Squibb: Research Funding; Novartis: Research Funding. Della Porta:Bristol Myers Squibb: Consultancy.
This feature is available to Subscribers Only
Sign In or Create an Account Close Modal