Towards Implementation of OMOP in a German University Hospital Consortium
31 July 2017
25 November 2017
24 January 2018 (online)
Background In 2015, the German Federal Ministry of Education and Research initiated a large data integration and data sharing research initiative to improve the reuse of data from patient care and translational research. The Observational Medical Outcomes Partnership (OMOP) common data model and the Observational Health Data Sciences and Informatics (OHDSI) tools could be used as a core element in this initiative for harmonizing the terminologies used as well as facilitating the federation of research analyses across institutions.
Objective To realize an OMOP/OHDSI-based pilot implementation within a consortium of eight German university hospitals, evaluate the applicability to support data harmonization and sharing among them, and identify potential enhancement requirements.
Methods The vocabularies and terminological mapping required for importing the fact data were prepared, and the process for importing the data from the source files was designed. For eight German university hospitals, a virtual machine preconfigured with the OMOP database and the OHDSI tools as well as the jobs to import the data and conduct the analysis was provided. Last, a federated/distributed query to test the approach was executed.
Results While the mapping of ICD-10 German Modification succeeded with a rate of 98.8% of all terms for diagnoses, the procedures could not be mapped and hence an extension to the OMOP standard terminologies had to be made.
Overall, the data of 3 million inpatients with approximately 26 million conditions, 21 million procedures, and 23 million observations have been imported.
A federated query to identify a cohort of colorectal cancer patients was successfully executed and yielded 16,701 patient cases visualized in a Sunburst plot.
Conclusion OMOP/OHDSI is a viable open source solution for data integration in a German research consortium. Once the terminology problems can be solved, researchers can build on an active community for further development.
MIRACUM is funded by the German Federal Ministry of Education and Research (BMBF) within the “Medical Informatics Funding Scheme” (FKZ 01ZZ1606H). The research has been cofunded by the German Federal Ministry of Economics and Technology within the Trusted Cloud initiative (FKZ 01MD11009).
The present work was performed in fulfillment of the requirements for obtaining the degree “Dr. rer. biol. hum.” from the Friedrich-Alexander-Universität Erlangen-Nürnberg.
- 1 Safran C, Bloomrosen M, Hammond WE. , et al; Expert Panel. Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper. J Am Med Inform Assoc 2007; 14 (01) 1-9
- 2 Overhage JM, Ryan PB, Reich CG, Hartzema AG, Stang PE. Validation of a common data model for active safety surveillance research. J Am Med Inform Assoc 2012; 19 (01) 54-60
- 3 Gaye A, Marcon Y, Isaeva J. , et al. DataSHIELD: taking the analysis to the data, not the data to the analysis. Int J Epidemiol 2014; 43 (06) 1929-1944
- 4 Hripcsak G, Duke JD, Shah NH. , et al. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers. Stud Health Technol Inform 2015; 216: 574-578
- 5 OHDSI Homepage. Available at: https://www.ohdsi.org . Accessed October 14, 2017
- 6 Hripcsak G, Ryan PB, Duke JD. , et al. Characterizing treatment pathways at scale using the OHDSI network. Proc Natl Acad Sci U S A 2016; 113 (27) 7329-7336
- 7 Federal Ministry of Education and Research (BMBF). Medical Informatics Funding Scheme – Networking data – improving healthcare. 2015. Available at: https://www.gesundheitsforschung-bmbf.de/files/Medical_Informatics_Funding_Scheme.pdf . Accessed December 25, 2017
- 8 MIRACUM Homepage. Available at: https://www.miracum.org . Accessed July 25, 2017
- 9 Mahlke L, Lefering R, Siebert H, Windolf J, Roeder N, Franz D. Description of the severely injured in the DRG system: is treatment of the severely injured still affordable? [Article in German]. Chirurg 2013; 84 (11) 978-986
- 10 Barufka S, Heller M, Prayon V, Fegert JM. Nonnative guidelines for allocating human resources in child and adolescent psychiatry using average values under convergence conditions instead of price determination - analysis of the data of university hospitals in Germany concerning the costs of calculating day and minute values according to Psych-PV and PEPP-System [Article in German]. Z Kinder Jugendpsychiatr Psychother 2015; 43 (06) 397-409
- 11 Bauer CR, Ganslandt T, Baum B. , et al. Integrated Data Repository Toolkit (IDRT). A suite of programs to facilitate health analytics on heterogeneous medical data. Methods Inf Med 2016; 55 (02) 125-135
- 12 Klann JG, Abend A, Raghavan VA, Mandl KD, Murphy SN. Data interchange using i2b2. J Am Med Inform Assoc 2016; 23 (05) 909-915
- 13 Voss EA, Ma Q, Ryan PB. The impact of standardizing the definition of visits on the consistency of multi-database observational health research. BMC Medical Research Methodology. BioMed Central 2015; 15: 13
- 14 Sun H, Depraetere K, De Roo J. , et al. Semantic processing of EHR data for clinical research. J Biomed Inform 2015; 58: 247-259
- 15 Hussain S, Sun H, Sinaci A. , et al. A framework for evaluating and utilizing medical terminology mappings. Stud Health Technol Inform 2014; 205: 594-598
- 16 Schuemie MJ, Gini R, Coloma PM. , et al. Replication of the OMOP experiment in Europe: evaluating methods for risk identification in electronic health record databases. Drug Saf 2013; 36 (Suppl. 01) S159-S169
- 17 Makadia R, Ryan PB. Transforming the Premier Perspective Hospital Database into the Observational Medical Outcomes Partnership (OMOP) Common Data Model. EGEMS (Wash DC) 2014; 2 (01) 1110
- 18 Matcho A, Ryan P, Fife D, Reich C. Fidelity assessment of a clinical practice research datalink conversion to the OMOP common data model. Drug Saf 2014; 37 (11) 945-959
- 19 Fitz Henry F, Resnic FS, Robbins SL. , et al. Creating a Common Data Model for Comparative Effectiveness with the Observational Medical Outcomes Partnership. Appl Clin Inform 2015; 6 (03) 536-547
- 20 Yoon D, Ahn EK, Park MY. , et al. Conversion and data quality assessment of electronic health record data at a Korean tertiary teaching hospital to a common data model for distributed network research. Healthc Inform Res 2016; 22 (01) 54-58
- 21 Gini R, Schuemie M, Brown J. , et al. Data Extraction and Management in Networks of Observational Health Care Databases for Scientific Research: A Comparison of EU-ADR, OMOP, Mini-Sentinel and MATRICE Strategies. EGEMS (Wash DC) 2016; 4 (01) 1189
- 22 Xu Y, Zhou X, Suehs BT. , et al. A comparative assessment of observational medical outcomes partnership and mini-sentinel common data models and analytics: implications for active drug safety surveillance. Drug Saf 2015; 38 (08) 749-765
- 23 Mandel JC, Kreda DA, Mandl KD, Kohane IS, Ramoni RB. SMART on FHIR: a standards-based, interoperable apps platform for electronic health records. J Am Med Inform Assoc 2016; 23 (05) 899-908