Subscribe to RSS

DOI: 10.1055/a-2702-1574
Identifying Pediatric Long COVID: Comparing an EHR Algorithm to Manual Review
Authors
Funding The research reported in this publication was conducted using PEDSnet, A Pediatric Clinical Research Network. PEDSnet has been developed with funding from the Patient-Centered Outcomes Research Institute (PCORI); PEDSnet's participation in PCORnet is funded through PCORI award RI-CHOP-01-PS10. This publication includes data from the following PEDSnet institutions: Ann & Robert H. Lurie Children's Hospital of Chicago, Children's Hospital of Philadelphia, Children's Hospital Colorado, Cincinnati Children's Hospital Medical Center, Children's National Medical Center, Nationwide Children's Hospital, Nemours Children's Health, Seattle Children's Hospital, and Stanford Medicine Children's Health. This study is funded by NIH Researching COVID to Enhance Recovery (RECOVER) Initiative (Agreement no.: OT2HL161847-01), which seeks to understand, treat, and prevent the postacute sequelae of SARS-CoV-2 infection (PASC). For more information on RECOVER, visit https://recovercovid.org/
Abstract
Background
Long COVID, characterized by persistent or recurring symptoms post-COVID-19 infection, poses challenges for pediatric care and research due to the lack of a standardized clinical definition. Adult-focused phenotypes do not translate well to children, given developmental and physiological differences, and pediatric-specific phenotypes have not been compared with chart review.
Objective
This study introduces and evaluates a pediatric-specific rule-based computable phenotype (CP) to identify long COVID using electronic health record data. We compare its performance to manual chart review.
Methods
We applied the CP, composed of diagnostic codes empirically associated with long COVID, to 339,467 pediatric patients with SARS-CoV-2 infection in the RECOVER PCORnet EHR database. The CP identified 31,781 patients with long COVID. Clinicians conducted chart reviews on a subset of patients across 16 hospital systems to assess performance. We qualitatively reviewed discordant cases to understand differences between CP and clinician identification.
Results
Among the 651 reviewed patients (339 females, M age = 10.10 years), the CP showed moderate agreement with clinician identification (accuracy = 0.62, positive predictive value [PPV] = 0.49, negative predictive value [NPV] = 0.75, sensitivity = 0.52, specificity = 0.84). Performance was largely consistent across age and dominant variant but varied by symptom cluster count. Most discrepancies between the CP and chart review occurred when the CP identified a case, but the clinician did not, often because clinicians attributed symptoms to preexisting conditions (73%). When clinicians identified cases missed by the CP, they often used broader symptom or timing criteria (69%). Model performance improved when the CP accounted for preexisting conditions (accuracy = 0.71, PPV = 0.65, NPV = 0.74, sensitivity = 0.59, specificity = 0.79).
Conclusion
This study presents a CP for pediatric long COVID. While agreement with manual review was moderate, most discrepancies were explained by differences in interpreting symptoms when patients had preexisting conditions. Accounting for these conditions improved accuracy and highlights the need for a consensus definition. These findings support the development of reliable, scalable tools for pediatric long COVID research.
Keywords
long COVID - COVID-19 - pediatrics - electronic health records - computable phenotype - PEDSnetProtection of Human and Animal Subjects
This study constitutes human subjects' research. Institutional Review Board (IRB) approval was obtained under Biomedical Research Alliance of New York (BRANY) protocol #21–08–508. BRANY waived the need for consent and HIPAA authorization.
Note
This content is solely the responsibility of the authors and does not necessarily represent the official views of the RECOVER Initiative, the NIH, or other funders. The views, statements, and opinions presented in this publication are solely the responsibility of the author(s) and do not necessarily represent the views of other organizations participating in, collaborating with, or funding PCORnet or of the Patient-Centered Outcomes Research Institute (PCORI).
* Co-first authors.
# Membership of the RECOVER Consortium is provided in [Supplementary Appendix] (available in the online version only).
Publication History
Received: 26 January 2025
Accepted: 09 September 2025
Article published online:
24 October 2025
© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)
Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany
-
References
- 1 Davis HE, Assaf GS, McCorkell L. et al. Characterizing long COVID in an international cohort: 7 months of symptoms and their impact. EClinicalMedicine 2021; 38: 101019
- 2 Katz BZ, Shiraishi Y, Mears CJ, Binns HJ, Taylor R. Chronic fatigue syndrome after infectious mononucleosis in adolescents. Pediatrics 2009; 124 (01) 189-193
- 3 Sellers SA, Hagan RS, Hayden FG, Fischer II WA. The hidden burden of influenza: a review of the extra-pulmonary complications of influenza infection. Influenza Other Respir Viruses 2017; 11 (05) 372-393
- 4 Mandel H, Yoo YJ, Allen AJ. et al. Long-COVID incidence proportion in adults and children between 2020 and 2024. Clin Infect Dis 2025; 80 (06) 1247-1261
- 5 Pellegrino R, Chiappini E, Licari A, Galli L, Marseglia GL. Prevalence and clinical presentation of long COVID in children: a systematic review. Eur J Pediatr 2022; 181 (12) 3995-4009
- 6 Pinto Pereira SM, Nugawela MD, Stephenson T. et al; CLoCk Consortium. Post-COVID-19 condition (long COVID) in children and young people 12 months after infection or reinfection with the Omicron variant: a prospective observational study. Sci Rep 2024; 14 (01) 9957
- 7 Vahratian A, Adjaye-Gbewonyo D, Lin JS, Saydah S. Long COVID in children: United States, 2022. NCHS Data Brief, no 479. Hyattsville, MD: National Center for Health Statistics; 2023.
- 8 Lopez-Leon S, Wegman-Ostrosky T, Ayuzo Del Valle NC. et al. Long-COVID in children and adolescents: a systematic review and meta-analyses. Sci Rep 2022; 12 (01) 9950
- 9 Lorman V, Rao S, Jhaveri R. et al. Understanding pediatric long COVID using a tree-based scan statistic approach: an EHR-based cohort study from the RECOVER program. JAMIA Open 2023; 6 (01) ooad016
- 10 Zheng Y-B, Zeng N, Yuan K. et al. Prevalence and risk factor for long COVID in children and adolescents: a meta-analysis and systematic review. J Infect Public Health 2023; 16 (05) 660-672
- 11 Reese J, Blau H, Bergquist T. et al. Generalizable long COVID subtypes: findings from the NIH N3C and RECOVER program. MedRxiv Prepr Serv Health Sci 2022:2022.05.24.22275398
- 12 Rao S, Lee GM, Razzaghi H. et al. Clinical features and burden of postacute sequelae of SARS-CoV-2 infection in children and adolescents. JAMA Pediatr 2022; 176 (10) 1000-1009
- 13 Rao S, Gross RS, Mohandas S. et al. Postacute sequelae of SARS-CoV-2 in children. Pediatrics 2024; 153 (03) e2023062570
- 14 Feldstein LR, Rose EB, Horwitz SM. et al; Overcoming COVID-19 Investigators, CDC COVID-19 Response Team. Multisystem inflammatory syndrome in U.S. children and adolescents. N Engl J Med 2020; 383 (04) 334-346
- 15 Wu EY, Campbell MJ. Cardiac manifestations of multisystem inflammatory syndrome in children (MIS-C) following COVID-19. Curr Cardiol Rep 2021; 23 (11) 168
- 16 Fritsche LG, Jin W, Admon AJ, Mukherjee B. Characterizing and predicting post-acute sequelae of SARS CoV-2 infection (PASC) in a large academic medical center in the US. J Clin Med 2023; 12 (04) 1328
- 17 Pfaff ER, Madlock-Brown C, Baratta JM. et al; N3C Consortium, RECOVER Consortium. Coding long COVID: characterizing a new disease through an ICD-10 lens. BMC Med 2023; 21 (01) 58
- 18 International Classification of Diseases. Tenth Revision (ICD-10), World Health Organization (WHO) 2019/2021. Licensed under Creative Commons Attribution-NoDerivatives 3.0 IGO license (CC BY-ND 3.0 IGO). . Accessed September 22, 2025 at: https://icd.who.int/browse11
- 19 Centers for Disease Control and Prevention. Post-COVID conditions: Information for healthcare providers. Accessed January 5, 2024 at: https://www.cdc.gov/coronavirus/2019-ncov/hcp/clinical-care/post-covid-conditions.html
- 20 Pfaff ER, Girvin AT, Bennett TD. et al; N3C Consortium. Identifying who has long COVID in the USA: a machine learning approach using N3C data. Lancet Digit Health 2022; 4 (07) e532-e541
- 21 Lorman V, Razzaghi H, Song X. et al. A machine learning-based phenotype for long COVID in children: An EHR-based study from the RECOVER program. PLoS One 2023; 18 (08) e0289774
- 22 Garai R, Krivácsy P, Herczeg V. et al. Clinical assessment of children with long COVID syndrome. Pediatr Res 2023; 93 (06) 1616-1625
- 23 Kikkenborg Berg S, Palm P, Nygaard U. et al. Long COVID symptoms in SARS-CoV-2-positive children aged 0-14 years and matched controls in Denmark (LongCOVIDKidsDK): a national, cross-sectional study. Lancet Child Adolesc Health 2022; 6 (09) 614-623
- 24 RECOVER. Researching COVID to Enhance Recovery. RECOVER: Researching COVID to Enhance Recovery. . Accessed January 4, 2024 at: https://recovercovid.org
- 25 Mejias A, Schuchard J, Rao S. et al. Leveraging serologic testing to identify children at risk for post-acute sequelae of SARS-CoV-2 infection: an electronic health record-based cohort study from the RECOVER program. J Pediatr 2023; 257: 113358
- 26 Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)–a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform 2009; 42 (02) 377-381
- 27 Gross RS, Thaweethai T, Kleinman LC. et al; RECOVER-Pediatrics Consortium, RECOVER-Pediatrics Group Authors. Characterizing long COVID in children and adolescents. JAMA 2024; 332 (14) 1174-1188
- 28 Posit team. (2023). RStudio: Integrated Development Environment for R. Posit Software, PBC, Boston, MA; . Accessed September 22, 2025 at: http://www.posit.co/
- 29 Simon TD, Cawthon ML, Stanford S. et al; Center of Excellence on Quality of Care Measures for Children with Complex Needs (COE4CCN) Medical Complexity Working Group. Pediatric medical complexity algorithm: a new method to stratify children by medical complexity. Pediatrics 2014; 133 (06) e1647-e1654
- 30 Wang L, Davis PB, Berger N, Kaelber DC, Volkow N, Xu R. Association of COVID-19 with respiratory syncytial virus (RSV) infections in children aged 0-5 years in the USA in 2022: a multicentre retrospective cohort study. Fam Med Community Health 2023; 11 (04) e002456
- 31 Xie Y, Choi T, Al-Aly Z. Long-term outcomes following hospital admission for COVID-19 versus seasonal influenza: a cohort study. Lancet Infect Dis 2024; 24 (03) 239-255
- 32 World Health Organization. 8D89.2 Postural orthostatic tachycardia syndrome. In International statistical classification of diseases and related health problems (11th ed.). 2019 . Accessed September 22, 2025 at: https://icd.who.int/browse/2024-01/mms/en#1533647472
- 33 George A, Winters B. Postural orthostatic tachycardia syndrome (POTS): a frequently missed diagnosis. J Nurse Pract 2023; 19 (02) 104439
- 34 National Academies of Sciences, Engineering, and Medicine. A long COVID definition: a chronic, systemic disease state with profound consequences. Washington, DC: National Academies Press; 2024
