TY - JOUR AU - Wang, Zhan; Talburt, John R.; Wu, Ningning; Dagtas, Serhan; Zozus, Meredith Nahm TI - A Rule-Based Data Quality Assessment System for Electronic Health Record Data SN - 1869-0327 PY - 2020 JO - Appl Clin Inform JF - Applied Clinical Informatics LA - EN VL - 11 IS - 04 SP - 622 EP - 634 DA - 2020/09/23 KW - data quality KW - electronic health records KW - health care AB - Objective Rule-based data quality assessment in health care facilities was explored through compilation, implementation, and evaluation of 63,397 data quality rules in a single-center case study to assess the ability of rules-based data quality assessment to identify data errors of importance to physicians and system owners.Methods We applied a design science framework to design, demonstrate, test, and evaluate a scalable framework with which data quality rules can be managed and used in health care facilities for data quality assessment and monitoring.Results We identified 63,397 rules partitioned into 28 logic templates. A total of 819,683 discrepancies were identified by 4.5% of the rules. Nine out of 11 participating clinical and operational leaders indicated that the rules identified data quality problems and articulated next steps that they wanted to take based on the reported information.Discussion The combined rule template and knowledge table approach makes governance and maintenance of otherwise large rule sets manageable. Identified challenges to rule-based data quality monitoring included the lack of curated and maintained knowledge sources relevant to data error detection and lack of organizational resources to support clinical and operational leaders with investigation and characterization of data errors and pursuit of corrective and preventative actions. Limitations of our study included implementation within a single center and dependence of the results on the implemented rule set.Conclusion This study demonstrates a scalable framework (up to 63,397 rules) with which data quality rules can be implemented and managed in health care facilities to identify data errors. The data quality problems identified at the implementation site were important enough to prompt action requests from clinical and operational leaders. PB - Georg Thieme Verlag KG DO - 10.1055/s-0040-1715567 UR - http://www.thieme-connect.com/products/ejournals/abstract/10.1055/s-0040-1715567 ER -