CC BY-NC-ND 4.0 · Appl Clin Inform 2023; 14(01): 054-064
DOI: 10.1055/s-0042-1760436
Original Article

“fhircrackr”: An R Package Unlocking Fast Healthcare Interoperability Resources for Statistical Analysis

Julia Palm
1   Institute of Medical Statistics, Computer and Data Sciences, Jena University Hospital, Jena, Thüringen, Germany
,
Frank A. Meineke
2   Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, Germany
,
Jens Przybilla
2   Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, Germany
3   Clinical Trial Centre Leipzig, University of Leipzig, Leipzig, Germany
,
Thomas Peschel
2   Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, Germany
› Author Affiliations
Funding This work was funded in part by the German Federal Ministry of Education and Research (BMBF) within the Medical Informatics Initiative in the SMITH Consortium and the POLAR project under the funding numbers 01ZZ1910[A-Z] (POLAR) and 01ZZ1803[A-Z] (SMITH).

Abstract

Background The growing interest in the secondary use of electronic health record (EHR) data has increased the number of new data integration and data sharing infrastructures. The present work has been developed in the context of the German Medical Informatics Initiative, where 29 university hospitals agreed to the usage of the Health Level Seven Fast Healthcare Interoperability Resources (FHIR) standard for their newly established data integration centers. This standard is optimized to describe and exchange medical data but less suitable for standard statistical analysis which mostly requires tabular data formats.

Objectives The objective of this work is to establish a tool that makes FHIR data accessible for standard statistical analysis by providing means to retrieve and transform data from a FHIR server. The tool should be implemented in a programming environment known to most data analysts and offer functions with variable degrees of flexibility and automation catering to users with different levels of FHIR expertise.

Methods We propose the fhircrackr framework, which allows downloading and flattening FHIR resources for data analysis. The framework supports different download and authentication protocols and gives the user full control over the data that is extracted from the FHIR resources and transformed into tables. We implemented it using the programming language R [1] and published it under the GPL-3 open source license.

Results The framework was successfully applied to both publicly available test data and real-world data from several ongoing studies. While the processing of larger real-world data sets puts a considerable burden on computation time and memory consumption, those challenges can be attenuated with a number of suitable measures like parallelization and temporary storage mechanisms.

Conclusion The fhircrackr R package provides an open source solution within an environment that is familiar to most data scientists and helps overcome the practical challenges that still hamper the usage of EHR data for research.

Protection of Human and Animal Subjects

Artificial EHR data were used for developing and testing this software. No formal intervention was performed and no additional (patient-) data were collected.




Publication History

Received: 31 August 2022

Accepted: 09 November 2022

Article published online:
25 January 2023

© 2023. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany

 
  • References

  • 1 Cosgriff CV, Ebner DK, Celi LA. Data sharing in the era of COVID-19. Lancet Digit Health 2020; 2 (05) e224
  • 2 Dagliati A, Malovini A, Tibollo V, Bellazzi R. Health informatics and EHR to support clinical research in the COVID-19 pandemic: an overview. Brief Bioinform 2021; 22 (02) 812-822
  • 3 Khan MS, Dar O, Erondu NA. et al. Using critical information to strengthen pandemic preparedness: the role of national public health agencies. BMJ Glob Health 2020; 5 (09) e002830
  • 4 Prokosch HU, Bahls T, Bialke M. et al. The COVID-19 Data Exchange Platform of the German University Medicine. Stud Health Technol Inform 2022; 294: 674-678
  • 5 Brat GA, Weber GM, Gehlenborg N. et al. International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium. NPJ Digit Med 2020; 3: 109
  • 6 Lehne M, Luijten S, Vom Felde Genannt Imbusch P, Thun S. The use of FHIR in digital health - a review of the scientific literature. Stud Health Technol Inform 2019; 267: 52-58
  • 7 Matney SA, Heale B, Hasley S. et al. Lessons learned in creating interoperable Fast Healthcare Interoperability Resources profiles for large-scale public health programs. Appl Clin Inform 2019; 10 (01) 87-95
  • 8 Kiourtis A, Mavrogiorgou A, Menychtas A, Maglogiannis I, Kyriazis D. Structurally mapping healthcare data to HL7 FHIR through ontology alignment. J Med Syst 2019; 43 (03) 62
  • 9 Garcia SJ, Zayas-Cabán T, Freimuth RR. Sync for genes: making clinical genomics available for precision medicine at the point-of-care. Appl Clin Inform 2020; 11 (02) 295-302
  • 10 Gordon WJ, Baronas J, Lane WJ. A FHIR human leukocyte antigen (HLA) interface for platelet transfusion support. Appl Clin Inform 2017; 8 (02) 603-611
  • 11 McClure RC, Macumber CL, Skapik JL, Smith AM. Igniting harmonized digital clinical quality measurement through terminology, CQL, and FHIR. Appl Clin Inform 2020; 11 (01) 23-33
  • 12 Dorr DA, D'Autremont C, Pizzimenti C. et al. Assessing data adequacy for high blood pressure clinical decision support: a quantitative analysis. Appl Clin Inform 2021; 12 (04) 710-720
  • 13 Mavrogiorgou A, Kiourtis A, Touloupou M. et al. Internet of medical things (IoMT): acquiring and transforming data into HL7 FHIR through 5G network slicing. Emerg Sci J 2019; 3: 64-77
  • 14 Semler SC, Wissing F, Heyder R. German Medical Informatics Initiative. Methods Inf Med 2018; 57 (S 01): e50-e56
  • 15 Bender D, Sartipi K. HL7 FHIR: An Agile and RESTful approach to healthcare information exchange. In: Proceedings of the 26th IEEE International Symposium on Computer-Based Medical Systems. IEEE; 2013: 326-331
  • 16 Duda SN, Kennedy N, Conway D. et al. HL7 FHIR-based tools and initiatives to support clinical research: a scoping review. J Am Med Inform Assoc 2022; 29 (09) 1642-1653
  • 17 World Wide Web Consortium. XML Path Language (XPath) 1.0. W3C Recommendation. 2016. Accessed August 28, 2022, at: https://www.w3.org/TR/1999/REC-xpath-19991116
  • 18 R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2021
  • 19 Peschel T, Palm J, Przybilla J, Meineke F.. Handling HL7 FHIR resources in R with fhircrackr. R package version 2.0.1; 2022
  • 20 The Comprehensive R Archive Network. Accessed August 29, 2022 at: https://cran.r-project.org/
  • 21 Hardt D. RFC6749 - The OAuth 2.0 Authorization Framework. Internet Engeneering Task Force. 2012. Accessed August 29, 2022 at: https://www.rfc-editor.org/rfc/rfc6749
  • 22 Wickham H. httr: Tools for Working with URLs and HTTP. R package version 1.4.4. 2022. Accessed October 27, 2022 at: https://CRAN.R-project.org/package=httr
  • 23 Wickham H. S4. In: Advanced R. 2nd ed. London, United Kingdom: Chapman & Hall/CRC; 2019
  • 24 Wickham H, Hester J, Ooms J. xml2: Parse XML. R package version 1.3.3. 2021. Accessed August 29, 2022 at: https://CRAN.R-project.org/package=xml2
  • 25 HL7 FHIR Resource Patient. Accessed August 29, 2022 at: https://www.hl7.org/fhir/patient.html
  • 26 Codd EF. A relational model of data for large shared data banks. Commun ACM 1970; 13: 377-387
  • 27 Yu D, Wang X, Wu H. EHR Data Pre-Processing and Preparation. In: Statistics and Machine Learning Methods for EHR Data. 1st ed. New York: Chapman and Hall/CRC; 2020:37
  • 28 Leipzig Health Atlas. FHIR Testdata. Accessed August 29, 2022 at: https://www.health-atlas.de/studies/43
  • 29 Zautke A, Essenwanger A, Wulff A. et al. Kerndatensatz Modul Fall. Accessed August 29, 2022 at: https://www.medizininformatik-initiative.de/Kerndatensatz/Modul_Fall/IGMIIKDSModulFall.html
  • 30 Scherag A, Andrikyan W, Dreischulte T. et al. POLAR–„POLypharmazie, Arzneimittelwechselwirkungen und Risiken “–wie können Daten aus der stationären Krankenversorgung zur Beurteilung beitragen?. Prävent Gesundhförd 2022; 1-10
  • 31 Hagel S, Gantner J, Spreckelsen C. et al. Hospital-wide ELectronic medical record evaluated computerised decision support system to improve outcomes of Patients with staphylococcal bloodstream infection (HELP): study protocol for a multicentre stepped-wedge cluster randomised trial. BMJ Open 2020; 10 (02) e033391
  • 32 WEather-based STroke event and Outcome Risk Modeling (WE-STORM) project description. Accessed August 29, 2022 at: https://forschen-fuer-gesundheit.de/project.php?fdpgid=20
  • 33 NT-proBNP als Marker bei Vorhofflimmern (MII-VHF) project description. Accessed August 29, 2022 at: https://forschen-fuer-gesundheit.de/project.php?fdpgid=19
  • 34 Palm J. Code for MII Projectathon #6: MII-VHF. Accessed August 29, 2022 at: https://github.com/medizininformatik-initiative/Projectathon6-smith2
  • 35 Santhanam N, Maros M. Code for MII Projectathon #6: WE-STORM. Accessed August 29, 2022 at: https://github.com/medizininformatik-initiative/Projectathon6-miracum1
  • 36 Gruendner J, Deppenwiese N, Folz M. et al. The architecture of a Feasibility Query Portal for Distributed COVID-19 Fast Healthcare Interoperability Resources (FHIR) patient data repositories: design and implementation study. JMIR Med Inform 2022; 10 (05) e36709
  • 37 Marcon Y, Bishop T, Avraam D. et al. Orchestrating privacy-protected big data analyses of data from different resources with R and DataSHIELD. PLOS Comput Biol 2021; 17 (03) e1008880
  • 38 Kahn MG, Callahan TJ, Barnard J. et al. A harmonized data quality assessment terminology and framework for the secondary use of electronic health record data. EGEMS (Wash DC) 2016; 4 (01) 1244
  • 39 Kapsner LA, Mang JM, Mate S. et al. Linking a consortium-wide data quality assessment tool with the MIRACUM metadata repository. Appl Clin Inform 2021; 12 (04) 826-835
  • 40 Kohane IS, Aronow BJ, Avillach P. et al; Consortium For Clinical Characterization Of COVID-19 By EHR (4CE). What every reader should know about studies using electronic health record data but may be afraid to ask. J Med Internet Res 2021; 23 (03) e22219
  • 41 Cerner. Bunsen: FHIR Data with Apache Spark. Accessed October 27, 2022 at: https://engineering.cerner.com/bunsen/0.5.10-SNAPSHOT/
  • 42 Hosch R, Baldini G, Parmar V. et al. UMEssen/FHIR-PYrate: v0.2.0-beta.1 (v0.2.0-beta.1). Zenodo; 2022. Accessed December 27, 2022 at: https://doi.org/10.5281/zenodo.7025227