Subscribe to RSS

DOI: 10.1055/a-2521-4250
Large-Scale Integration of DICOM Metadata into HL7-FHIR for Medical Research
Funding This work was funded by the German Federal Ministry of Education and Research (BMBF) within the Medical Informatics Initiative (MIRACUM Consortium) under the Funding Number FKZ: 01ZZ1801A and by the Bavarian Cancer Research Center (BZKF).
Abstract
Background The current gap between the availability of routine imaging data and its provisioning for medical research hinders the utilization of radiological information for secondary purposes. To address this, the German Medical Informatics Initiative (MII) has established frameworks for harmonizing and integrating clinical data across institutions, including the integration of imaging data into research repositories, which can be expanded to routine imaging data.
Objectives This project aims to address this gap by developing a large-scale data processing pipeline to extract, convert, and pseudonymize DICOM (Digital Imaging and Communications in Medicine) metadata into “ImagingStudy” Fast Healthcare Interoperability Resources (FHIR) and integrate them into research repositories for secondary use.
Methods The data processing pipeline was developed, implemented, and tested at the Data Integration Center of the University Hospital Erlangen. It leverages existing open-source solutions and integrates seamlessly into the hospital's research IT infrastructure. The pipeline automates the extraction, conversion, and pseudonymization processes, ensuring compliance with both local and MII data protection standards. A large-scale evaluation was conducted using the imaging studies acquired by two departments at University Hospital Erlangen within 1 year. Attributes such as modality, examined body region, laterality, and the number of series and instances were analyzed to assess the quality and availability of the metadata.
Results Once established, the pipeline processed a substantial dataset comprising over 150,000 DICOM studies within an operational period of 26 days. Data analysis revealed significant heterogeneity and incompleteness in certain attributes, particularly the DICOM tag “Body Part Examined.” Despite these challenges, the pipeline successfully generated valid and standardized FHIR, providing a robust basis for future research.
Conclusion We demonstrated the setup and test of a large-scale end-to-end data processing pipeline that transforms DICOM imaging metadata directly from clinical routine into the Health Level 7-FHIR format, pseudonymizes the resources, and stores them in an FHIR server. We showcased that the derived FHIRs offer numerous research opportunities, for example, feasibility assessments within Bavarian and Germany-wide research infrastructures. Insights from this study highlight the need to extend the “ImagingStudy” FHIR with additional attributes and refine their use within the German MII.
Keywords
radiology information system - health information interoperability - medical informatics applications - radiology department - hospitalData Availability Statement
The DICOM orchestration API (https://github.com/alexa-ian/DICOM-Interval-Scheduler) and the fork of the DICOM-FHIR converter including adaptions for the local DIC at the UHE (https://github.com/alexa-ian/dicom-fhir-converter) are both available on GitHub.
Publication History
Received: 15 July 2024
Accepted: 15 January 2025
Article published online:
15 April 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 Arvanitis TN. Informatics opportunities and challenges in medical imaging: a journey. Stud Health Technol Inform 2022; 300: 19-29
- 2 Digital Imaging and Communications in Medicine (DICOM) - PS 3.1–PS 3.22. The National Electrical Manufacturers Association (NEMA); 2024
- 3 Bidgood Jr WD, Horii SC, Prior FW, Van Syckle DE. Understanding and using DICOM, the data interchange standard for biomedical imaging. J Am Med Inform Assoc 1997; 4 (03) 199-212
- 4 Hsu W, Markey MK, Wang MD. Biomedical imaging informatics in the era of precision medicine: progress, challenges, and opportunities. J Am Med Inform Assoc 2013; 20 (06) 1010-1013
- 5 FHIR Release 4 (R4). Health Level Seven International; 2024
- 6 Semler SC, Wissing F, Heyder R. German Medical Informatics initiative. Methods Inf Med 2018; 57 (S 01): e50-e56
- 7 Prokosch HU, Acker T, Bernarding J. et al. MIRACUM: medical informatics in research and care in university medicine. Methods Inf Med 2018; 57 (S 01): e82-e91
- 8 Ammon D, Kurscheidt M, Buckow K. et al. Arbeitsgruppe Interoperabilität: Kerndatensatz und Informationssysteme für Integration und Austausch von Daten in der Medizininformatik-Initiative. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2024; 67: 656-667
- 9 Synedra AIM. synedra Deutschland GmbH. https://www.synedra.com/hcm/pacs
- 10 Clinical Trial Processor. John Perry. Accessed May 21, 2024 at: https://github.com/johnperry/CTP
- 11 Erickson BJ, Fajnwaks P, Langer SG, Perry J. Multisite image data collection and management using the RSNA image sharing network. Transl Oncol 2014; 7 (01) 36-39
- 12 DICOM-FHIR-Converter. LinuxForHealth. Accessed May 21, 2024 at: https://github.com/LinuxForHealth/dicom-fhir-converter
- 13 Iancu A. Fork of DICOM-FHIR-Converter. Accessed May 21, 2024 at: https://github.com/alexa-ian/dicom-fhir-converter
- 14 Apache Kafka. Apache Software Foundation. Accessed May 21, 2024 at: https://kafka.apache.org
- 15 Iancu A. GitHub: DICOM-Interval-Scheduler. Accessed December 19, 2024; available at: https://github.com/alexa-ian/DICOM-Interval-Scheduler/
- 16 Ronacher A. Flask (version 3.0.1). Pallets. Accessed May 21, 2024 at: https://palletsprojects.com/projects/flask
- 17 Kubernetes (K8s). Cloud Native Computing Foundation. Accessed December 19, 2024; available at: https://kubernetes.io
- 18 Pathling. The Australian e-Health Research Centre - Commonwealth Scientific and Industrial Research Organization. Accessed May 21, 2024 at: https://pathling.csiro.au
- 19 Zaharia M. Apache Software Foundation. Accessed May 21, 2024 at: https://spark.apache.org
- 20 DICOM PS3.16 2024b - Content Mapping Resource. NEMA. Accessed May 21, 2024 at: https://dicom.nema.org/medical/dicom/current/output/chtml/part16/chapter_L.html
- 21 FHIR-Gateway. . MIRACUM. Accessed May 21, 2024 at: https://github.com/miracum/fhir-gateway
- 22 Larobina M. Thirty years of the DICOM standard. Tomography 2023; 9 (05) 1829-1838
- 23 Gueld MO, Kohnen M, Keysers D. et al. Quality of DICOM header information for image categorization. In: Siegel EL, Huang HK. eds. Medical Imaging 2002: PACS and Integrated Medical Information Systems: Design and Evaluation. SPIE; 2002: 280-287 . SPIE Proceedings
- 24 Bidgood Jr WD. The SNOMED DICOM microglossary: controlled terminology resource for data interchange in biomedical imaging. Methods Inf Med 1998; 37 (4-5): 404-414
- 25 Towbin AJ, Roth CJ, Petersilge CA, Garriott K, Buckwalter KA, Clunie DA. The importance of body part labeling to enable enterprise imaging: a HIMSS-SIIM enterprise imaging community collaborative white paper. J Digit Imaging 2021; 34 (01) 1-15
- 26 Zhennan Yan, Yiqiang Zhan, Zhigang Peng. et al; Xiang Sean Zhou. Multi-instance deep learning: discover discriminative local anatomies for bodypart recognition. IEEE Trans Med Imaging 2016; 35 (05) 1332-1343
- 27 Wasserthal J, Breit H-C, Meyer MT. et al. TotalSegmentator: robust segmentation of 104 anatomic structures in CT images. Radiol Artif Intell 2023; 5 (05) e230024
- 28 Gruendner J, Deppenwiese N, Folz M. et al. The architecture of a feasibility query portal for distributed COVID-19 Fast Healthcare Interoperability Resources (FHIR) patient data repositories: design and implementation study. JMIR Med Inform 2022; 10 (05) e36709
- 29 Prokosch H-U, Gebhardt M, Gruendner J. et al. Towards a national portal for medical research data (FDPG): vision, status, and lessons learned. Stud Health Technol Inform 2023; 302: 307-311
- 30 Ziegler J, Gruendner J, Rosenau L, Erpenbeck M, Prokosch H-U, Deppenwiese N. Towards a Bavarian oncology real world data research platform. Stud Health Technol Inform 2023; 307: 78-85
- 31 Heyder R. NUM Coordination Office, NUKLEUS Study Group, NUM-RDP Coordination, RACOON Coordination, AKTIN Coordination, GenSurv Study Group. [The German Network of University Medicine: technical and organizational approaches for research data platforms]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2023; 66 (02) 114-125
- 32 BZKF BORN-Project. University Hospital Erlangen. Accessed May 24, 2024 at: https://bzkf.de/born/?lang=en
- 33 Tseng H-H, Wei L, Cui S, Luo Y, Ten Haken RK, El Naqa I. Machine learning and imaging informatics in oncology. Oncology 2020; 98 (06) 344-362
- 34 Chennubhotla C, Clarke LP, Fedorov A. et al. An assessment of imaging informatics for precision medicine in cancer. Yearb Med Inform 2017; 26 (01) 110-119