Appl Clin Inform
DOI: 10.1055/a-2616-9858
Research Article

Extracting social determinants of health from dental clinical notes

Farhana Pethani
1   Biomedical Informatics and Digital Health, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
2   Data61, Commonwealth Scientific and Industrial Research Organisation, Sydney, Australia (Ringgold ID: RIN2221)
,
Alec Chapman
3   Informatics, Decision-Enhancement and Analytic Sciences Center, Veterans Affairs Salt Lake City Health Care System, Salt Lake City, United States
4   Division of Epidemiology, School of Medicine, University of Utah Division of Epidemiology Biostatistics, Salt Lake City, United States (Ringgold ID: RIN208352)
,
Mike Conway
5   Biomedical Informatics, The University of Melbourne, Melbourne, Australia (Ringgold ID: RIN2281)
,
Xiang Dai
2   Data61, Commonwealth Scientific and Industrial Research Organisation, Sydney, Australia (Ringgold ID: RIN2221)
,
Demiana Bishay
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Victor Jun Xiang Choh
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Alexander He
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Su-Elle Lim
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Huey Ying Ng
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Tanya Mahony
7   Oral Health Services, Nepean Blue Mountains Local Health District, Penrith, Australia (Ringgold ID: RIN223690)
,
Albert Yaacoub
7   Oral Health Services, Nepean Blue Mountains Local Health District, Penrith, Australia (Ringgold ID: RIN223690)
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Sarvnaz Karimi
2   Data61, Commonwealth Scientific and Industrial Research Organisation, Sydney, Australia (Ringgold ID: RIN2221)
,
Heiko Spallek
6   The University of Sydney School of Dentistry, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
,
Adam G Dunn
1   Biomedical Informatics and Digital Health, The University of Sydney Faculty of Medicine and Health, Sydney, Australia (Ringgold ID: RIN522555)
› Institutsangaben
Gefördert durch: Commonwealth Scientific and Industrial Research Organisation Postgraduate Scholarship
Gefördert durch: Department of Education, Australian Government Research Training Program Scholarship

Objective In dentistry, social determinants of health (SDoH) are potentially recorded in the clinical notes of Electronic Dental Records (EDRs). The objective of this study was to examine the availability of SDoH data in dental clinical notes and evaluate NLP methods to extract SDoH from dental clinical notes. Methods A set of 1,000 dental clinical notes was sampled from a dataset of 105,311 patient visits to a dental clinic and manually annotated for information pertaining to sugar, tobacco, alcohol, methamphetamine, housing, and employment. Annotations included temporality, dose, type, duration, and frequency where appropriate. Experiments were to compare extraction using fine-tuned pre-trained language models (PLMs) with a rule-based approach. Performance was measured by F1-score. Results For identifying SDoH, the best performing PLM method produced F1-scores of 0.75 (sugar), 0.69 (tobacco), 0.67 (alcohol), 0.42 (housing), and 0 (employment). The rule-based method produced F1-scores of 0.70 (sugar), 0.69 (tobacco), 0.53 (alcohol), 0.44 (housing), and 0 (employment). The overall difference between PLMs and rule-based methods was F1-score of 0.04 (95% confidence interval -0.01, 0.09). SDoH were relatively rare in dental clinical notes, from sugar (9.1%), tobacco (3.9%), alcohol (1.2%), housing (1.2%), employment (0.2%), and methamphetamine use (0%). Conclusions The main challenge of extracting SDoH information from dental clinical notes was the frequency with which they are recorded, and the brevity and inconsistency where they are recorded. Improved surveillance likely needs new ways to standardise how SDoH are reported in dental clinical notes.



Publikationsverlauf

Eingereicht: 23. Januar 2025

Angenommen nach Revision: 20. Mai 2025

Accepted Manuscript online:
21. Mai 2025

© . Thieme. All rights reserved.

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany