Subscribe to RSS

DOI: 10.1055/a-2710-4226
Best Practices to Design, Plan, and Execute Large-Scale Federated Analyses—Key Learnings and Suggestions from a Study Comprising 52 Databases
Authors
Abstract
Background
Federated network studies allow data to remain locally while the research is conducted through the sharing of analytical code and aggregated results across different health care settings and countries. A large number of databases have been mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), boosting the use of analytical pipelines for standardized observational research within this open science framework. Transparency, reproducibility, and robustness of results have positioned federated analyses using the OMOP CDM within the European Health Data and Evidence Network (EHDEN) as an essential tool for generating large-scale evidence.
Objectives
We conducted large-scale federated analyses involving 52 databases from 19 countries using the OMOP CDM. In this State-of-the-Art/Best Practice article, we aimed to share key lessons and strategies for conducting such complex, large multidatabase analyses.
Results
Meticulous planning, establishing a strong community of collaborators, efficient communication channels, standardized analytics, and strategic division of responsibilities are essential. We highlight the benefits of network engagement, cross-fertilization of ideas, and shared learning. Further key elements contributing to the study's success included an inclusive, incremental implementation of the analytical code, timely engagement of data partners, and community webinars to discuss and interpret study findings.
Conclusion
We received predominantly positive feedback from data custodians about their participation, and included input for further improvements for future large-scale federated network studies from this shared learning experience.
Protection of Human and Animal Subjects
This article suggests best practices for large-scale multinational federated network studies and therefore does not fall under human subject research, which means that no ethical approval was required for this publication. Ethical approval was obtained for the discussed research underlying the best practices recommendations.
Publication History
Received: 23 December 2024
Accepted: 24 August 2025
Accepted Manuscript online:
26 September 2025
Article published online:
30 October 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 Observational Health Data Sciences and Informatics. Chapter 8 OHDSI Analytics Tools. In: The Book of OHDSI. 2011
- 2 Reich C, Ostropolets A, Ryan P. et al. OHDSI Standardized Vocabularies: A large-scale centralized reference ontology for international data harmonization. J Am Med Inform Assoc 2024; 31 (03) 583-590
- 3 OHDSI. Our Journey: Where the OHDSI Community Has Been and Where We Are Going. 2024 . Accessed November 25, 2024 at: https://www.ohdsi.org/wp-content/uploads/2024/10/OurJourney2024.pdf
- 4 Voss EA, Shoaibi A, Yin Hui Lai L. et al. Contextualising adverse events of special interest to characterise the baseline incidence rates in 24 million patients with COVID-19 across 26 databases: A multinational retrospective cohort study. EClinicalMedicine 2023; 58: 101932
- 5 Voss EA, Blacketer C, van Sandijk S. et al. European Health Data & Evidence Network-learnings from building out a standardized international health data network. J Am Med Inform Assoc 2023; 31 (01) 209-219
- 6 Europe OHDSI. National Nodes in OHDSI Europe Chapter. Accessed December 6, 2024 at: https://ohdsi-europe.org/index.php/national-nodes
- 7 OHDSI. HADES - Health Analytics Data-to-Evidence Suite. Accessed December 6, 2024 at: https://ohdsi.github.io/Hades/
- 8 GitHub, Inc. Darwin EU® Public Code Repository. Accessed November 25, 2024 at: https://github.com/darwin-eu
- 9 Darwin EU Coordination Centre. Darwin EU. Accessed September 16, 2024 at: https://darwin-eu.org/
- 10 Pineda M, Lopez Gomez C, Maljković Heliant F. et al. Trends of use of drugs with suggested shortages and their alternatives across 52 real world data sources and 18 countries in Europe and North America. medRxiv 2024.08.28.24312695.
- 11 Voss EA, Makadia R, Matcho A. et al. Feasibility and utility of applications of the common data model to multiple, disparate observational health databases. J Am Med Inform Assoc 2015; 22 (03) 553-564
- 12 Medicine Shortages Single Point of Contact (SPOC) Working Party. EMA shortages catalogue. Accessed December 6, 2024 at: https://www.ema.europa.eu/en/committees/working-parties-other-groups/medicines-shortages-single-point-contact-spoc-working-party
- 13 Raventós B, Català M, Du M. et al. IncidencePrevalence: An R package to calculate population-level incidence rates and prevalence using the OMOP common data model. Pharmacoepidemiol Drug Saf 2024; 33 (01) e5717
