Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility StudyFunding This work is part of the “PAERPA: Health Pathway of Seniors for Preserved Autonomy” project funded by the Agence Régionale de Santé (ARS) Hauts-de-France. The ARS Hauts-de-France provided the data and funding. The funding bodies did not have an influence on study design and data analysis but were involved in drafting the manuscript.
27 June 2019
25 November 2019
08 January 2020 (online)
Background Common data models (CDMs) enable data to be standardized, and facilitate data exchange, sharing, and storage, particularly when the data have been collected via distinct, heterogeneous systems. Moreover, CDMs provide tools for data quality assessment, integration into models, visualization, and analysis. The observational medical outcome partnership (OMOP) provides a CDM for organizing and standardizing databases. Common data models not only facilitate data integration but also (and especially for the OMOP model) extends the range of available statistical analyses.
Objective This study aimed to evaluate the feasibility of implementing French national electronic health records in the OMOP CDM.
Methods The OMOP's specifications were used to audit the source data, specify the transformation into the OMOP CDM, implement an extract–transform–load process to feed data from the French health care system into the OMOP CDM, and evaluate the final database.
Results Seventeen vocabularies corresponding to the French context were added to the OMOP CDM's concepts. Three French terminologies were automatically mapped to standardized vocabularies. We loaded nine tables from the OMOP CDM's “standardized clinical data” section, and three tables from the “standardized health system data” section. Outpatient and inpatient data from 38,730 individuals were integrated. The median (interquartile range) number of outpatient and inpatient stays per patient was 160 (19–364).
Conclusion Our results demonstrated that data from the French national health care system can be integrated into the OMOP CDM. One of the main challenges was the use of international OMOP concepts to annotate data recorded in a French context. The use of local terminologies was an obstacle to conceptual mapping; with the exception of an adaptation of the International Classification of Diseases 10th Revision, the French health care system does not use international terminologies. It would be interesting to extend our present findings to the 65 million people registered in the French health care system.
Keywordsdata integration - secondary use - observational medical outcome partnership - Observational Health Data Sciences and Informatics
Protection of Human and Animal Subjects
Human or animal subjects were not included in the project.
- 1 Sentinel Initiativetp. Available at: www.mini-sentinel.org/ . Accessed April 4, 2019
- 2 National Patient-Centered Clinical Research Network. PCORnet. Available at: https://pcornet.org/pcornet-common-data-model/ . Accessed April 4, 2019
- 3 Chazard E, Merlin B, Ficheur G, Sarfati J-C, Beuscart R. ; PSIP Consortium. Detection of adverse drug events: proposal of a data model. Stud Health Technol Inform 2009; 148: 63-74
- 4 i2b2: Informatics for Integrating Biology & the Bedside. Available at: https://www.i2b2.org/ . Accessed April 4, 2019
- 5 OHDSI—Observational Health Data Sciences and Informatics. Available at: https://www.ohdsi.org/ . Accessed April 4, 2019
- 6 Weeks J, Pardee R. Learning to share health care data: a brief timeline of influential common data models and distributed health data networks in U.S. health care research. EGEMS Wash DC 2019; 7 (01) 4
- 7 Liyanage H, Liaw S-T, Jonnagaddala J, Hinton W, de Lusignan S. Common data models (CDMs) to enhance international big data analytics: a diabetes use case to compare three CDMs. Stud Health Technol Inform 2018; 255: 60-64
- 8 Garza M, Del Fiol G, Tenenbaum J, Walden A, Zozus MN. Evaluating common data models for use with a longitudinal community registry. J Biomed Inform 2016; 64: 333-341
- 9 CDISC. Study Data Tabulation Model (SDTM). Available at: https://www.cdisc.org/standards/foundational/sdtm . Accessed April 4, 2019
- 10 Xu Y, Zhou X, Suehs BT. , et al. A comparative assessment of observational medical outcomes partnership and mini-sentinel common data models and analytics: implications for active drug safety surveillance. Drug Saf 2015; 38 (08) 749-765
- 11 Hripcsak G, Duke JD, Shah NH. , et al. Observational health data sciences and informatics (OHDSI): opportunities for observational researchers. Stud Health Technol Inform 2015; 216: 574-578
- 12 Ryan PB, Stang PE, Overhage JM. , et al. A comparison of the empirical performance of methods for a risk identification system. Drug Saf 2013; 36 (Suppl. 01) S143-S158
- 13 Schuemie MJ, Gini R, Coloma PM. , et al. Replication of the OMOP experiment in Europe: evaluating methods for risk identification in electronic health record databases. Drug Saf 2013; 36 (Suppl. 01) S159-S169
- 14 Maier C, Lang L, Storf H. , et al. Towards implementation of OMOP in a German university hospital consortium. Appl Clin Inform 2018; 9 (01) 54-61
- 15 Makadia R, Ryan PB. Transforming the premier perspective hospital database into the observational medical outcomes partnership (OMOP) common data model. EGEMS (Wash DC) 2014; 2 (01) 1110
- 16 You SC, Lee S, Cho S-Y. , et al. Conversion of national health insurance service-national sample cohort (NHIS-NSC) database into observational medical outcomes partnership-common data model (OMOP-CDM). Stud Health Technol Inform 2017; 245: 467-470
- 17 Yoon D, Ahn EK, Park MY. , et al. Conversion and data quality assessment of electronic health record data at a korean tertiary teaching hospital to a common data model for distributed network research. Healthc Inform Res 2016; 22 (01) 54-58
- 18 Le dispositif Paerpa. Ministère des Solidarités et de la Santé. Available at: https://solidarites-sante.gouv.fr/systeme-de-sante-et-medico-social/parcours-des-patients-et-des-usagers/le-parcours-sante-des-aines-paerpa/article/le-dispositif-paerpa . Accessed April 4, 2019
- 19 Tuppin P, de Roquefeuil L, Weill A, Ricordeau P, Merlière Y. French national health insurance information system and the permanent beneficiaries sample. Rev Epidemiol Sante Publique 2010; 58 (04) 286-290
- 20 Moulis G, Lapeyre-Mestre M, Palmaro A, Pugnet G, Montastruc J-L, Sailler L. French health insurance databases: what interest for medical research?. Rev Med Interne 2015; 36 (06) 411-417
- 21 OHDSI/Common Data Model. Definition and DDLs for the OMOP Common Data Model (CDM). Available at: https://github.com/OHDSI/CommonDataModel . Accessed April 4, 2019
- 22 Observational Health Data Sciences and Informatics. ELT creation best practices. Available at: http://www.ohdsi.org/web/wiki/doku.php?id=documentation:etl_best_practices . Accessed April 4, 2019
- 23 OHDSI/Achilles. Automated Characterization of Health Information at Large-scale Longitudinal Evidence Systems (ACHILLES) - descriptive statistics about a OMOP CDM database; 2019. Available at: https://github.com/OHDSI/Achilles . Accessed October 17, 2019
- 24 Observational Health Data Sciences and Informatics. WhiteRabbit. Available at: http://www.ohdsi.org/web/wiki/doku.php?id=documentation:software:whiterabbit . Accessed April 4, 2019
- 25 Open Source ETL—Talend Open Studio for Data Integration. Talend real-time open source data integration software. Available at: https://www.talend.com/products/data-integration/data-integration-open-studio/ . Accessed April 4, 2019
- 26 Postgre SQL. The world's most advanced open source database. Available at: https://www.postgresql.org/ . Accessed April 4, 2019
- 27 Ubuntu. The leading operating system for PCs, IoT devices, servers and the cloud. Available at: https://www.ubuntu.com/ . Accessed April 4, 2019
- 28 Athena. Available at: http://athena.ohdsi.org/search-terms/terms . Accessed April 4, 2019
- 29 Lima DM, Rodrigues-Jr JF, Traina AJM, Pires FA, Gutierrez MA. Transforming two decades of ePR data to OMOP CDM for clinical research. Stud Health Technol Inform 2019; 264: 233-237
- 30 FitzHenry F, Resnic FS, Robbins SL. , et al. Creating a common data model for comparative effectiveness with the observational medical outcomes partnership. Appl Clin Inform 2015; 6 (03) 536-547
- 31 Zhang X, Wang L, Miao S. , et al. Analysis of treatment pathways for three chronic diseases using OMOP CDM. J Med Syst 2018; 42 (12) 260
- 32 Viernes B, Lynch KE, South B, Coronado G, DuVall SL. Characterizing VA users with the OMOP common data model. Stud Health Technol Inform 2019; 264: 1614-1615
- 33 Zhou X, Murugesan S, Bhullar H. , et al. An evaluation of the THIN database in the OMOP Common Data Model for active drug safety surveillance. Drug Saf 2013; 36 (02) 119-134
- 34 European Health Data Evidence Network Available at: https://www.ehden.eu/ . Accessed October 9, 2019