Conversion of Automated 12-Lead Electrocardiogram Interpretations to OMOP CDM Vocabulary

Sunho Choi; Hyung Joon Joo; Yoojoong Kim; Jong-Ho Kim; Junhee Seok

doi:10.1055/s-0042-1756427

CC BY-NC-ND 4.0 · Appl Clin Inform 2022; 13(04): 880-890
DOI: 10.1055/s-0042-1756427

Research Article

Sunho Choi¹^*, Hyung Joon Joo²,³^*, Yoojoong Kim⁴, Jong-Ho Kim²,³, Junhee Seok¹

¹School of Electrical Engineering, Korea University, Seoul, South Korea
²Korea University Research Institute for Medical Bigdata Science, Korea University, Seoul, South Korea
³Department of Cardiology, Cardiovascular Center, Korea University College of Medicine, Seoul, South Korea
⁴School of Computer Science and Information Engineering, The Catholic University of Korea, Seoul, South Korea

Funding This research was supported by a grant from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number: HI19C0360), and by a grant from the National Research Foundation of Korea (grant number: NRF-2022R1A2C2004003).

Abstract

Background A computerized 12-lead electrocardiogram (ECG) can automatically generate diagnostic statements, which are helpful for clinical purposes. Big data analysis of ECG data generated by different interpretation algorithms requires standardization. The common data model (CDM) is a standard schema designed to overcome heterogeneity among medical data sources. A diagnostic statement usually contains multiple CDM concepts along with nonessential noise, which should be removed during CDM conversion. Existing CDM conversion tools have several limitations: they require manual validation, cannot extract multiple CDM concepts from one statement, and remove noise inadequately.

Objectives We aim to develop a fully automated text data conversion algorithm that overcomes limitations of existing tools and manual conversion.

Methods We used interpretations generated by 12-lead resting ECG devices from three different vendors: GE Medical Systems, Philips Medical Systems, and Nihon Kohden. For automatic mapping, we first constructed an ontology-lexicon of ECG interpretations. After clinical coding, we developed an optimized tool for converting ECG interpretations to CDM terminology using term-based text processing.
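As an illustration of term-based text processing against a lexicon, the following is a minimal Python sketch, not the authors' implementation. The lexicon entries, the concept labels, and the function names `normalize` and `extract_concepts` are all hypothetical; the labels are placeholders and are not actual OMOP CDM concept names or identifiers. The sketch shows how one statement can yield several concepts while unmatched noise text is simply ignored.

```python
import re

# Hypothetical toy lexicon: surface term -> illustrative concept label.
# The labels are placeholders, not actual OMOP CDM concept names or IDs.
LEXICON = {
    "sinus rhythm": "Sinus rhythm",
    "sinus bradycardia": "Sinus bradycardia",
    "left ventricular hypertrophy": "Left ventricular hypertrophy",
    "lvh": "Left ventricular hypertrophy",
    "atrial fibrillation": "Atrial fibrillation",
}


def normalize(statement):
    """Lowercase the text, replace punctuation with spaces, collapse whitespace."""
    text = statement.lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def extract_concepts(statement):
    """Return every lexicon concept found in the statement, without duplicates."""
    text = normalize(statement)
    found = []
    # Try longer terms first so that a long term is matched before any
    # shorter term that could overlap with it.
    for term in sorted(LEXICON, key=len, reverse=True):
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, text):
            concept = LEXICON[term]
            if concept not in found:
                found.append(concept)
            # Remove the matched span so it is not matched again.
            text = re.sub(pattern, " ", text)
    return found


if __name__ == "__main__":
    print(extract_concepts("Sinus bradycardia with LVH, *** unconfirmed report ***"))
```

In this sketch, text that matches no lexicon term (such as "unconfirmed report") never reaches the output, which is one simple way to treat noise; a production tool would need a far larger, clinically validated lexicon.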

Results Using the ontology-lexicon, the cosine similarity-based algorithm and the rule-based hierarchical algorithm showed comparable conversion accuracy (97.8% and 99.6%, respectively), while ECG2CDM, an integrated algorithm based on a heuristic approach, demonstrated superior performance (99.9%) on datasets from the three major vendors.
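For readers unfamiliar with cosine similarity-based matching, the following is a minimal Python sketch of the general technique using bag-of-words vectors. It is an assumption-laden illustration rather than the algorithm evaluated in the study: the candidate terms, the `threshold` value of 0.5, and the function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical candidate terms; a real system would draw these from a lexicon.
CANDIDATES = [
    "sinus rhythm",
    "sinus bradycardia",
    "atrial fibrillation",
    "left bundle branch block",
]


def bag_of_words(text):
    """Count whitespace-separated, lowercased tokens."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_match(statement, candidates=CANDIDATES, threshold=0.5):
    """Return (term, score) for the most similar candidate.

    The term is None when the best score falls below the threshold
    (the threshold value is an arbitrary assumption for this sketch).
    """
    query = bag_of_words(statement)
    scored = [(cosine_similarity(query, bag_of_words(c)), c) for c in candidates]
    score, term = max(scored)
    return (term, score) if score >= threshold else (None, score)


if __name__ == "__main__":
    print(best_match("normal sinus rhythm"))
    print(best_match("artifact present"))
```

A single best match per statement cannot capture a statement that contains several concepts, which is one reason a similarity score alone may fall short for this task.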

Conclusion We developed user-friendly software that runs the ECG2CDM algorithm and is easy to use even for users who are not familiar with CDM or medical terminology. We propose that automated algorithms can support further big data analysis by providing an integrated and standardized ECG dataset.

Keywords

OMOP common data model - ontology - electrocardiogram - clinical coding - ECG interpretation - ontology-lexicon

Protection of Human and Animal Subjects

The study protocol was approved by the institutional review board of Korea University Anam Hospital (IRB No. 2019AN0227). The board waived the requirement for written informed consent because the retrospective study design posed minimal risk to the participants. The study complied with the principles of the Declaration of Helsinki.

^* These authors contributed equally to the study.

Publication History

Received: 04 April 2022

Accepted: 29 July 2022

Article published online: 21 September 2022

© 2022. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)

Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany
