Subscribe to RSS

DOI: 10.1055/a-2703-7227
A Natural Language Processing Approach to Identify Negative Patient Descriptors in Electronic Health Records for Maternal Care
Authors
Abstract
Background
Maternal harm, especially for Black women, is a significant health care issue. Unstructured clinical notes in electronic health records (EHRs) may reveal unsafe maternal care. Prior studies using natural language processing (NLP) have shown that tone and sentiment in notes contribute to preventable safety events.
Objective
This study aimed to examine whether negative patient descriptors in EHR clinical notes are associated with adverse maternal outcomes and how their use varies by patient demographics.
Methods
We conducted a retrospective cohort study of women who delivered at two large birthing hospitals in Washington, DC between January 1, 2016 and March 31, 2020. Using a predefined list of negative keywords (e.g., combative) and NLP, we identified sentences from clinical notes for manual review. Two subject matter experts labeled keywords as “negative descriptors” if they negatively described patients. A logistic regression model with elastic net regularization was trained on the labeled sentences to classify the remaining corpus. We evaluated the prevalence of negative descriptors by race, age, insurance type, and pregnancy outcomes, and calculated adjusted odds ratios.
Results
Among 190,026 clinical notes from 9,302 patients, 719 notes associated with 444 patients contained at least one negative descriptor. Of these, 313 (70.5%) were Black, 45 (10.1%) were White, and 86 (19.4%) were from Other racial groups (p < 0.001). Negative descriptors were more common among younger patients (18–29 years: 49.3%) and those with Medicare/Medicaid insurance (65.3%). Although case patients—defined as those with postpartum readmission or severe maternal morbidity—had slightly fewer descriptors overall, they had higher adjusted odds of having them. Black patients were associated with higher odds, and commercial insurance with lower odds, of having negative descriptors.
Conclusion
Negative descriptors appear disproportionately in the notes of Black patients and those with public insurance, suggesting implicit bias in documentation. Addressing biased language is essential for improving equity in maternal care.
Protection of Human and Animal Subjects
The study was performed in compliance with the World Medical Association Declaration of Helsinki on Ethical Principles for Medical Research Involving Human Subjects and was reviewed by our Institutional Review Board.
Data Availability
The datasets generated during and/or analyzed during this study are not publicly available due to the sensitive nature of the electronic health records data and identifiable information embedded in clinical notes but are available from the corresponding author on reasonable request with proper data use agreement.
Publication History
Received: 30 April 2025
Accepted: 17 September 2025
Article published online:
28 October 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 CDC. Data from the Pregnancy Mortality Surveillance System. Accessed July 18, 2025 at: https://www.cdc.gov/maternal-mortality/php/pregnancy-mortality-surveillance-data/index.html?cove-tab=1
- 2 CDC. Working Together to Reduce Black Maternal Mortality. Accessed July 18, 2025 at: https://www.cdc.gov/womens-health/features/maternal-mortality.html
- 3 Lawton B, MacDonald EJ, Brown SA. et al. Preventability of severe acute maternal morbidity. Am J Obstet Gynecol 2014; 210 (06) 557.e1-557.e6
- 4 Lawton BA, Jane MacDonald E, Stanley J, Daniells K, Geller SE. Preventability review of severe maternal morbidity. Acta Obstet Gynecol Scand 2019; 98 (04) 515-522
- 5 Montesi G, Lechi A. Prevention of medication errors: detection and audit. Br J Clin Pharmacol 2009; 67 (06) 651-655
- 6 Thomas AD, Pandit C, Krevat SA. Race differences in reported “near miss” patient safety events in health care system high reliability organizations. J Patient Saf 2021; 17 (08) e1605-e1608
- 7 Thomas ADPC, Pandit C, Krevat SA. Race differences in reported harmful patient safety events in healthcare system high reliability organizations. J Patient Saf 2020; 16 (04) e235-e239
- 8 Alhusen JL, Bower KM, Epstein E, Sharps P. Racial discrimination and adverse birth outcomes: an integrative review. J Midwifery Womens Health 2016; 61 (06) 707-720
- 9 Crawley LM, Ahn DK, Winkleby MA. Perceived medical discrimination and cancer screening behaviors of racial and ethnic minority adults. Cancer Epidemiol Biomarkers Prev 2008; 17 (08) 1937-1944
- 10 Feagin J, Bennefield Z. Systemic racism and U.S. health care. Soc Sci Med 2014; 103: 7-14
- 11 Giurgescu C, McFarlin BL, Lomax J, Craddock C, Albrecht A. Racial discrimination and the black-white gap in adverse birth outcomes: a review. J Midwifery Womens Health 2011; 56 (04) 362-370
- 12 Lothian JA. The continued mistreatment of women during pregnancy and childbirth. Springer; 2019: 183-185
- 13 Nelson A. Unequal treatment: confronting racial and ethnic disparities in health care. J Natl Med Assoc 2002; 94 (08) 666-668
- 14 Schulman KA, Berlin JA, Harless W. et al. The effect of race and sex on physicians' recommendations for cardiac catheterization. N Engl J Med 1999; 340 (08) 618-626
- 15 Vedam S, Stoll K, Taiwo TK. et al; GVtM-US Steering Council. The Giving Voice to Mothers study: inequity and mistreatment during pregnancy and childbirth in the United States. Reprod Health 2019; 16 (01) 77
- 16 Williams DR, Wyatt R. Racial bias in health care and health: challenges and opportunities. JAMA 2015; 314 (06) 555-556
- 17 Mehra R, Boyd LM, Magriples U, Kershaw TS, Ickovics JR, Keene DE. Black pregnant women “get the most judgment”: a qualitative study of the experiences of Black women at the intersection of race, gender, and pregnancy. Womens Health Issues 2020; 30 (06) 484-492
- 18 Badreldin N, Grobman WA, Yee LM. Racial disparities in postpartum pain management. Obstet Gynecol 2019; 134 (06) 1147-1153
- 19 Boakye PN, Prendergast N, Bailey A. et al. Anti-Black medical gaslighting in healthcare: experiences of Black women in Canada. Can J Nurs Res 2025; 57 (01) 59-68
- 20 Gary F, Still C, Mickels P, Hassan M, Evans E. Muddling through the health system: experiences of three groups of black women in three regions. Journal of National Black Nurses' Association. J Natl Black Nurses Assoc 2015; 26 (01) 22-28
- 21 Thompson T-aM, Young Y-Y, Bass TM. et al. Racism runs through it: examining the sexual and reproductive health experience of Black women in the South: study examines the sexual and reproductive health experiences of Black women in the South. Health Aff 2022; 41 (02) 195-202
- 22 Park J, Saha S, Chee B, Taylor J, Beach MC. Physician use of stigmatizing language in patient medical records. JAMA Netw Open 2021; 4 (07) e2117052-e2117052
- 23 Cohan A, Fong A, Goharian N, Ratwani R. A neural attention model for categorizing patient safety events. Springer; 2017: 720-726
- 24 Cohan A, Fong A, Ratwani RM, Goharian N. Identifying harm events in clinical care through medical narratives. 2017: 52-59
- 25 Gilboa L, Hoffman D, Fong A, Ratwani R. A Sentiment Analysis of Patient Safety Event Reports. SAGE Publications; New Delhi, India: 2015: 162-164
- 26 CDC. Identifying Severe Maternal Morbidity (SMM). Accessed July 18, 2025 at: https://www.cdc.gov/maternal-infant-health/php/severe-maternal-morbidity/icd.html
- 27 Sun M, Oliwa T, Peek ME, Tung EL. Negative patient descriptors: documenting racial bias in the electronic health record. Health Aff (Millwood) 2022; 41 (02) 203-211
- 28 Mikolov TSI, Chen K, Corrado GS, Dean J. Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst 2013; 26
- 29 Vajjala S, Majumder B, Gupta A, Surana H. Practical natural language processing: a comprehensive guide to building real-world NLP systems. O'Reilly Media; 2020
- 30 Marquardt DW, Snee RD. Ridge regression in practice. Am Stat 1975; 29 (01) 3-20
- 31 Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc B 1996; 58 (01) 267-288
- 32 P Goddu A, O'Conor KJ, Lanzkron S. et al. Do words matter? Stigmatizing language and the transmission of bias in the medical record. J Gen Intern Med 2018; 33 (05) 685-691
- 33 Breen KJ, Greenberg PB. Difficult physician-patient encounters. Intern Med J 2010; 40 (10) 682-688
- 34 Haas LJ, Leiser JP, Magill MK, Sanyer ON. Management of the difficult patient. Am Fam Physician 2005; 72 (10) 2063-2068
- 35 Wang MD, Khanna R, Najafi N. Characterizing the source of text in electronic health record progress notes. JAMA Intern Med 2017; 177 (08) 1212-1213
- 36 Peek ME, Odoms-Young A, Quinn MT, Gorawara-Bhat R, Wilson SC, Chin MH. Racism in healthcare: its relationship to shared decision-making and health disparities: a response to Bradby. Soc Sci Med 2010; 71 (01) 13-17
