Subscribe to RSS

DOI: 10.1055/a-2797-4219
Clinical Terminology Mapping Service Based on Information Retrieval
Authors
Funding Information This research was supported by the Regional Innovation System & Education (RISE) program through the Gangwon RISE Center, funded by the Ministry of Education (MOE) and the Gangwon State (G.S.), Republic of Korea (2025-RISE-10-005). This study was supported by 2023 Research Grant from Kangwon National University (202305120001).
Abstract
Background
Standardized clinical terminology is essential for semantic interoperability. Typically, a hospital's terminology expert manually maps local terminology with international standards such as SNOMED CT. The manual mapping process is demanding, labor-intensive, and time-consuming, and its effectiveness relies on the expertise of the professional handling it.
Objective
We developed a method to map clinical terms to SNOMED CT concept descriptions using an information retrieval (IR) approach with rich synonyms. We also provide a free mapping support service to help terminology experts alleviate the challenges of manual mapping without the need for additional manipulation.
Methods
We created indexes using edge n-grams and synonyms. We adopted Elasticsearch for indexing and query processing, incorporating data from the SPECIALIST Lexicon to enrich the synonym database. Eight different indexes were initially created, but only four were retained based on performance. We tested indexes individually and in combination, using a dataset of 1,753 one-to-one mapped instances from the National Library of Medicine ICD-9-CM Procedure codes to the SNOMED CT Map. We compared our approach with MetaMap for evaluation.
Results
We found that using rich synonyms and edge n-gram indexing significantly improved the accuracy of mapping clinical terms to SNOMED CT. The indexes incorporating synonyms and edge n-grams performed better than those using either technique alone. Combining these methods captured more relevant terms and synonyms, resulting in more precise mappings. Our method outperformed the baseline provided by MetaMap, demonstrating enhanced capability in handling complex medical terminology and improving the overall mapping quality.
Conclusion
Our study introduced an IR method with rich synonyms for mapping clinical terms to SNOMED CT, analyzing 40 unmapped terms, and identifying key issues. The approach shows promise in improving terminology mapping, and future work will explore advanced methods to enhance accuracy further, aiming to reduce manual mapping efforts and improve result evaluation.
Keywords
semantic interoperability - term mapping - information retrieval - rich synonym - query expansionPublication History
Received: 27 August 2025
Accepted: 23 January 2026
Accepted Manuscript online:
03 February 2026
Article published online:
16 February 2026
© 2026. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 Palojoki S, Lehtonen L, Vuokko R. Semantic interoperability of electronic health records: systematic review of alternative approaches for enhancing patient information availability. JMIR Med Inform 2024; 12: e53535
- 2 Facile R, Chronaki C, van Reusel P, Kush R. Standards in sync: five principles to achieve semantic interoperability for TRUE research for healthcare. Front Digit Health 2025; 7: 1567624
- 3 Sung S, Park HA, Jung H, Kang H. A SNOMED CT mapping guideline for the local terms used to document clinical findings and procedures in electronic medical records in South Korea: methodological study. JMIR Med Inform 2023; 11: e46127
- 4 SNOMED International. Accessed August 27, 2025 at: https://www.snomed.org
- 5 World Health Organization. WHO-FIC Classifications and Terminology Mapping: Principles and Best Practice. Geneva: WHO; 2021
- 6 Thandi M, Brown S, Wong ST. Mapping frailty concepts to SNOMED CT. Int J Med Inform 2021; 149: 104409
- 7 Lougheed MD, Thomas NJ, Wasilewski NV, Morra AH, Minard JP. Use of SNOMED CT® and LOINC® to standardize terminology for primary care asthma electronic health records. J Asthma 2018; 55 (06) 629-639
- 8 Block L, Handfield S. Mapping wound assessment data elements in SNOMED CT. Stud Health Technol Inform 2016; 225: 1078-1079
- 9 Mészáros Á, Kovács S, Héja T, Bagyura Z, Zemplényi A. Mapping Hungarian procedure codes to SNOMED CT. BMC Med Res Methodol 2023; 23 (01) 240
- 10 EDI-SNOMED CT mapping table. Accessed February 13, 2026; available at: https://hins.or.kr/menu/viewMenu.do?menuNo=3070200
- 11 KCD-SNOMED CT mapping table. Accessed February 13, 2026; available at: https://hins.or.kr/menu/viewMenu.do?menuNo=3070100
- 12 Pedersen MK, Eriksson R, Reguant R. et al. A unidirectional mapping of ICD-8 to ICD-10 codes, for harmonized longitudinal analysis of diseases. Eur J Epidemiol 2023; 38 (10) 1043-1052
- 13 Rajput AM, Triep K, Endrich O. Semi-automated approach to map clinical concepts to SNOMED CT terms by using terminology server. In: dHealth 2022. Amsterdam: IOS Press; 2022: 67-72
- 14 Gaudet-Blavignac C, Foufi V, Bjelogrlic M, Lovis C. Use of the systematized nomenclature of medicine clinical terms (SNOMED CT) for processing free text in health care: systematic scoping review. J Med Internet Res 2021; 23 (01) e24594
- 15 Torres FBG, Gomes DC, Hino AAF, Moro C, Cubas MR. Comparison of the results of manual and automated processes of cross-mapping between nursing terms: quantitative study. JMIR Nurs 2020; 3 (01) e18501
- 16 Gupta S, MacLean DL, Heer J, Manning CD. Induced lexico-syntactic patterns improve information extraction from online medical forums. J Am Med Inform Assoc 2014; 21 (05) 902-909
- 17 de Bruijn B, Cherry C, Kiritchenko S, Martin J, Zhu X. Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. J Am Med Inform Assoc 2011; 18 (05) 557-562
- 18 Wu Y, Denny JC, Trent Rosenbloom S. et al. A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD). J Am Med Inform Assoc 2017; 24 (e1): e79-e86
- 19 So EY, Park HA. Exploring the possibility of information sharing between the medical and nursing domains by mapping medical records to SNOMED CT and ICNP. Healthc Inform Res 2011; 17 (03) 156-161
- 20 Wade G, Rosenbloom ST. Experiences mapping a legacy interface terminology to SNOMED CT. BMC Med Inform Decis Mak 2008; 8 (Suppl. 01) S3
- 21 Fung KW, Bodenreider O. Utilizing the UMLS for semantic mapping between terminologies. AMIA Annu Symp Proc 2005; 2005: 266-270
- 22 Wang Y, Patrick J, Miller G, O'Hallaran J. A computational linguistics motivated mapping of ICPC-2 PLUS to SNOMED CT. BMC Med Inform Decis Mak 2008; 8 (Suppl. 01) S5
- 23 Brown SH, Husser CS, Wahner-Roedler D. et al. Using SNOMED CT as a reference terminology to cross map two highly pre-coordinated classification systems. Stud Health Technol Inform 2007; 129 (Pt 1): 636-639
- 24 Cartagena FP, Schaeffer M, Rifai D, Doroshenko V, Goldberg HS. Leveraging the NLM map from SNOMED CT to ICD-10-CM to facilitate adoption of ICD-10-CM. J Am Med Inform Assoc 2015; 22 (03) 659-670
- 25 Allones JL, Martinez D, Taboada M. Automated mapping of clinical terms into SNOMED-CT. An application to codify procedures in pathology. J Med Syst 2014; 38 (10) 134
- 26 Nadkarni PM, Darer JA. Migrating existing clinical content from ICD-9 to SNOMED. J Am Med Inform Assoc 2010; 17 (05) 602-607
- 27 Kate RJ. Towards converting clinical phrases into SNOMED CT expressions. Biomed Inform Insights 2013; 6 (Suppl. 01) 29-37
- 28 Schütze H, Manning CD, Raghavan P. Introduction to Information Retrieval. Vol 39. Cambridge: Cambridge University Press; 2008
- 29 InfoClinic. Mapping support service. Accessed August 27, 2025 at: http://stom.infoclinic.co
- 30 Gormley C, Tong Z. Elasticsearch: The Definitive Guide: A Distributed Real-time Search and Analytics Engine. Sebastopol, CA: O'Reilly Media, Inc.; 2015
- 31 Elasticsearch. Accessed August 27, 2025 at: https://www.elastic.co
- 32 Chen X, Gururaj AE, Ozyurt B. et al. DataMed—an open source discovery index for finding biomedical datasets. J Am Med Inform Assoc 2018; 25 (03) 300-308
- 33 National Library of Medicine. Unified Medical Language System (UMLS): The SPECIALIST Lexicon. Accessed August 27, 2025 at: https://www.nlm.nih.gov/research/umls/new_users/online_learning/LEX_001.html
- 34 Batch MetaMap. Accessed August 27, 2025 at: https://ii.nlm.nih.gov/Batch/UTS_Required/MetaMap.html
- 35 ICD-9-CM procedure codes to SNOMED CT map. Accessed August 27, 2025 at: https://www.nlm.nih.gov/research/umls/mapping_projects/icd9cmv3_to_snomedct.html
- 36 Mikolov T, Le QV, Sutskever I. Exploiting similarities among languages for machine translation. arXiv preprint arXiv:1309.4168. Published 2013. Updated 2022