Implementing endoscopy video recording in routine clinical practice: Strategies from three tertiary care centers

Jonas L. Steinhäuser; Tyler M. Berzin; Mark E. Geissler; Cornelius Weber; Nora Herzog; Maxime Le Floch; Stefan Brückner; Jochen Hampe; Sami Elamin; Joel Troya; Alexander Hann; Franz Brinkmann

doi:10.1055/a-2592-3338

Subscribe to RSS

Please copy the URL and add it into your RSS Feed Reader.

https://www.thieme-connect.de/rss/thieme/en/10.1055-s-00025476.xml

Share / Bookmark

Facebook Linkedin Weibo

Download PDF

CC BY 4.0 · Endosc Int Open 2025; 13: a25923338
DOI: 10.1055/a-2592-3338

Original article

Implementing endoscopy video recording in routine clinical practice: Strategies from three tertiary care centers

Jonas L. Steinhäuser

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Tyler M. Berzin

³Center for Advanced Endoscopy, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, United States

,

Mark E. Geissler

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

³Center for Advanced Endoscopy, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, United States

,

Cornelius Weber

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Nora Herzog

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Maxime Le Floch

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Stefan Brückner

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Jochen Hampe

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

,

Sami Elamin

³Center for Advanced Endoscopy, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, United States

,

Joel Troya

⁴Interventional and Experimental Endoscopy (InExEn), Department of Internal Medicine II, University Hospital Würzburg, Würzburg, Germany

,

Alexander Hann

⁴Interventional and Experimental Endoscopy (InExEn), Department of Internal Medicine II, University Hospital Würzburg, Würzburg, Germany

,

Franz Brinkmann

¹Else Kröner Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany

²Department of Medicine I and Faculty of Medicine, University Hospital Carl Gustav Carus, Dresden, Germany (Ringgold ID: RIN39063)

› Author Affiliations

Supported by: Bundesministerium für Bildung und Forschung 03ZU1210GA ,03ZU121HB

› Further Information

Also available at

Abstract
Full Text
References
Figures

PDF Download Permissions and Reprints

Abstract
Introduction
Methods
Results

Dresden

Boston

Würzburg

Discussion

Dresden

Boston

Würzburg

Privacy, data security, and standardization

Conclusions
References

Abstract

Background and study aims

Endoscopy video recordings are valuable data for training and deploying artificial intelligence (AI) models. However, collecting these data is challenging and time-consuming, demanding new workflows and robust data management strategies.

Methods

Here, we outline the challenges associated with routinely recording endoscopy data in clinical practice and share experiences and solutions from three endoscopy centers in Germany and the United States.

Results

Each center uses a recording setup tailored to specific needs of that endoscopy unit. Common challenges include integrating with the hospital’s electronic health records, automating video recording, and addressing data privacy concerns. In all cases, having dedicated research staff to manage daily operations has proven essential for successful implementation.

Conclusions

By describing successful strategies, we aim to inspire gastroenterology divisions worldwide to adapt routine video recording for endoscopy procedures, thereby increasing the volume and diversity of datasets necessary for developing clinically impactful AI applications.

Keywords

Quality and logistical aspects - Image and data processing, documentatiton - Training

Introduction

Real-world clinical data that are routinely generated in hospitals represent a rich source of information for scientific research and innovation. Development and propagation of “clinical data warehouses” centralizes data organization and streamlines the process of reusing this data for scientific purposes or improving clinical care [1]. Although gastrointestinal endoscopy generates vast amounts of high-definition video data, only a few selected still images are typically archived alongside the written examination report in routine clinical practice. Nevertheless, complete recordings of endoscopic examinations are a valuable resource for teaching and research. For scientific purposes in general and for developing artificial intelligence (AI) models in particular, access to large amounts of real-world endoscopy data is necessary [2]. This endeavor requires implementation of appropriate infrastructure and workflows to integrate endoscopy video recording into routine clinical operations. Here, we share experiences from three large, tertiary care endoscopy centers in establishing endoscopic video recording of routine clinical examinations for scientific purposes. Our aim is to encourage others to join our effort in harnessing the valuable endoscopic videos generated by clinicians to advance the field by leveraging this rich and diverse data.

Methods

Recording endoscopic videos necessitates capturing the video signal between the endoscopic video processor and the monitor, without compromising the primary clinical feed. Therefore, consultation with the on-site medical technician is necessary to jointly develop a solution for determining at which point in the video signal chain it is most reasonable to capture the endoscopic video for archiving. Depending on the system, manufacturer consultation and adjustments may be required to access the raw video signal before processing. Auxiliary, medical-grade hardware, certified for clinical use, is typically required to process and record the video signal. The recorded video can be output to hard disk (HDD) or network drives and cloud storage.

When recording videos for scientific analysis, ensuring data protection and privacy compliance is crucial, which involves removing identifying information and assigning pseudonyms to the recordings. In Europe, the General Data Protection Regulation (GDPR), and in the United States, the Health Insurance Portability and Accountability Act (HIPAA), provide legal basis for recording and reusing patient data for scientific purposes. In general, recording fully anonymized data involves fewer legal and ethical requirements than handling pseudonymized videos. Some hardware can anonymize videos, excluding patient information (entered manually or obtained from the hospital worklist) from recordings and output filenames, but this can complicate retrospective merging of the endoscopic procedure with the written report and/or other clinical data of the patient. In addition, caution is needed when third parties, such as commercial recording solution providers, handle data, for example when using cloud storage. Also, manual review of the videos is needed to remove any personal data (images of the endoscopy suite, staff, or the patient) from the video before it can be used for scientific purposes. During this essential step, the process of pseudonymization (e.g., by assigning a study ID and renaming files accordingly) can be integrated. This addresses privacy concerns in light of ongoing data ownership debates [3]. Ideally, deidentification and pseudonymization would be managed at the institutional level, but because establishing such processes can take years, it is currently more practical to keep them within the research group. For raw recorded videos, it is advisable to set up a pipeline for semi-automatic preprocessing (i.e., preparing the recordings for future analysis) and management of procedure-related data in a dedicated research database.

Minimizing additional work steps for the endoscopy staff on-site is essential for ensuring compliance with the video recording protocol. Ideally, any additional effort required to facilitate the recording should not interfere with routine workflows in the endoscopy suite. Change management in healthcare and modifying routine workflows within a well-established medical team can be challenging [4] and requires careful preparation and testing before starting data collection. Prior to implementation, it is advisable to explain the proposal and actively seek input from the endoscopy staff. For the recording process itself, clear and concise practical instructions must be provided both verbally and in writing, supplemented by brief instructions (e.g., pictograms) kept close to the recording device. Especially during the initial implementation phase, it is recommended to have a dedicated contact person available that the staff on-site can address in case of uncertainties or problems. It is advisable to routinely check recordings for patterns of failures or non-compliance. Often, errors occur when staff are unsure how to use the recording device and avoid it for fear of mistakes. Identifying this can lead to providing additional instructions as needed.

In addition, the perspective of the examiner should be considered. Endoscopists may be reluctant to have their procedures recorded [5]. It is crucial to communicate transparently the purposes for which the videos are used, ensuring that quality control or evaluation of examiners is not conducted without their informed consent.

Results

Dresden

Each examination room in the gastrointestinal endoscopy unit at the University Hospital Dresden, Germany, is equipped with an Ibox Touch (meso international GmbH, Mittweida, Germany) recording device, which connects to the hospital local area network via ethernet. The device receives input from the endoscopic video processor (Olympus Deutschland GmbH, Hamburg, Germany) via a HD-SDI output. For each examination, endoscopy staff manually enters the patient ID and initiates and terminates the recording. The resulting video is automatically exported in chunks of approximately 10 minutes and saved at a resolution of 1920×1080 pixels to a hospital network drive. Following basic quality control, a research assistant assigns a study ID and creates a new entry in a dedicated REDCap [6] database.

Subsequently, a series of Python scripts execute general preprocessing. Semi-automated preprocessing steps include: 1) concatenating video chunks of the recording; 2) cropping the video to retain only the endoscopic video signal by removing extraneous black borders around the video frame; 3) automatically detecting and blurring nuisance text (e.g., text displayed by the recording device or the video processor); and 4) trimming videos to only contain the intraluminal part of the procedure. Only the final step requires human assistance to provide time stamps, or more precisely, frame indices of the relevant video sections to the script.

Recorded ERCP videos are currently being processed and prepared for publication as a publicly available dataset. Analysis of the processed videos shows that their duration ranges from 2 to 113 minutes, with an average length of 30.6 minutes. The videos have resolutions of either 1080 × 1080 or 1232 × 1048 pixels, depending on the endoscope used. According to the corresponding written reports, 32% of the videos capture examinations of patients undergoing their first ERCP, whereas 61% are repeat ERCPs, and in 7% of cases, this information is unknown. A papillotomy was performed in 59% of all cases, and severe bleeding—defined as bleeding requiring additional intervention such as coagulation or clipping—occurred in 8% of videos.

Also, we implemented a workflow to simultaneously capture the endoscopy and fluoroscopy signal side-by-side e.g., during endoscopic retrograde cholangiopancreatography (ERCP). In contrast to our current approach, this task requires a video recording device capable of handling two independent video sources simultaneously, for which we use the DIANA (DEKOM Engineering GmbH, Hamburg, Germany) video recording system ([Fig. 1]).

Fig. 1 Schematic overview of the endoscopy video recording setup at Dresden. Illustration of signal flow within the endoscopy room, highlighting points of video signal capture. Devices for video signal generation are marked in blue, video processing devices in turquoise, elements responsible for data recording in orange, and data recording destinations in purple.

Boston

The endoscopy unit of the Beth Israel Deaconess Medical Center (BIDMC), a Harvard Medical School teaching hospital, employs multiple recording technologies across 18 endoscopy rooms, enabling a comparison of various systems. All rooms are equipped with Olympus endoscopic video processors (Olympus America Inc., Center Valley, Pennsylvania, United States) and Virgo recording devices (Virgo Surgical Video Solutions Inc., San Francisco, California, United States), which capture 1080p videos and store them on a commercial cloud server ([Fig. 2]). The Virgo system utilizes AI to automate initiation and termination of recordings based on scope insertion and removal, although manual trimming by research staff is often necessary due to limitations of the AI algorithm. The Virgo system lacks integration with the hospital electronic health record (EHR) system, necessitating manual mapping of recorded videos to patient identities based on procedure room and start time. This task can be performed retrospectively by research staff independently of clinical personnel. In addition, research staff can also match the videos to downloaded procedure reports, histology results, and preoperative, intraoperative, and postoperative reports obtained from the EHR. Videos and their corresponding endoscopy reports are subsequently stored in a Microsoft Teams cloud, with patient information documented in a secure Excel spreadsheet. Although the Virgo system demonstrates proficiency in single-stream recording, capture of multiple data streams (e.g., concurrent fluoroscopy and endoscopy sources), has presented challenges.

Fig. 2 Schematic overview of the endoscopy video recording setup at BIDMC. Illustration of signal flow within the endoscopy room, highlighting points of video signal capture. Devices for video signal generation are marked in blue, video processing devices in turquoise, elements responsible for data recording in orange, and data recording destinations in purple.

In addition to the main recording system, two other systems are employed in the BIDMC endoscopy unit. An Olympus Image Stream (Olympus America Inc., Center Valley, Pennsylvania, United States) system is used in the advanced endoscopy unit because of its capability for handling multiple datastreams simultaneously. However, this older system requires endoscopy staff to enter patient data by hand and to manually initiate and terminate the recording process. This approach was found to be less reliable, because staff may be too busy to enter patient data and do not reliably initiate and terminate the recording, resulting in loss of recordings or requiring manual trimming of video recordings. A third video recording system, the AIDA system (Karl Storz SE & Co. KG, Tuttlingen, Germany), is integrated with the hospital EHR and, therefore, able to obtain patient information and intraprocedural matching of the recording to patient records. However, it is not an “always-on” recording either, necessitating staff to manually initiate and terminate each recording.

A first subset of endoscopy video recordings has been processed and collected in a dataset for ERCP [7].

Würzburg

The endoscopy unit at the University Hospital Würzburg, Germany, and the eight associated academic and non-academic centers participating in clinical AI evaluations, exhibit a diverse range of setups, workflows, and endoscopic processors handling both digital and analog signals. Beyond mere recording, our objective encompassed real-time application of one or more in-house developed AIs on the endoscopic video feed, presenting the AI output to the examiner on the primary or secondary display with minimal latency.

Our system comprises a computer running Ubuntu (Canonical Ltd., London, UK), equipped with a DeckLink Mini Recorder 4K (Blackmagic Design Pty. Ltd., Port Melbourne, Australia) video capture card and a MSI-GeForce RTX3080Ti (NVIDIA Corporation, Santa Clara, California, United States) graphics card ([Fig. 3]). Utilizing this capture card facilitates acquisition of SDI and HDMI signals without the necessity for conversion, thereby eliminating potential latency. Our custom-developed C++ application for image stream acquisition enables parallel execution of distinct threads: image capture, processing, and display. This architecture ensures imperceptible image delay. Within the capturing pipeline, we have the option to directly anonymize data through image cropping, overlaying a black bar on the patient information, or selective blurring. The video file is securely stored on an encrypted external HDD and redundantly on an internal HDD for backup purposes.

Fig. 3 Schematic overview of the endoscopy video recording setup at Würzburg. Diagram illustrating signal flow within the endoscopy room, highlighting the points of video signal capture. Devices for video signal generation are marked in blue, video processing devices in turquoise, elements responsible for data recording in orange, and data recording destinations in purple.

To ensure recording commences only after informed consent has been obtained, we developed an associated device featuring a button LED that indicates active recording. The system with all hardware components is described in detail in Lux et al. [8]. The software framework is freely available for research purposes from https://ukw.de/inexen.

[Table 1] provides an overview of the different recording strategies employed at the three institutions.

Table 1 Comparison of recording setups among three centers.
	Dresden	Boston	Würzburg
EGD, esophagogastroduodenoscopy; ERCP, endoscopic retrograde cholangiopancreatography; ESD, endoscopic submucosal dissection; EUS, endoscopic ultrasound; POEM, peroral endoscopic myotomy. Ibox Touch (meso international GmbH, Mittweida, Germany). DIANA (DEKOM Engineering GmbH, Hamburg, Germany). Virgo (Virgo Surgical Video Solutions Inc., San Francisco, California, United States). AIDA (Karl Storz SE & Co. KG, Tuttlingen, Germany). Image Stream (Olympus America Inc., Center Valley, Pennsylvania, United States).
Endoscopy recording device(s)	Ibox Touch	Virgo, AIDA, Image Stream	Custom grabber card-based recording
Fluoroscopy recording device(s)	DIANA	Image Stream, Virgo	Not applied
Procedures	ERCP, ESD, POEM	EGD, Colonoscopy, ERCP, EUS, ESD, POEM	EGD, Colonoscopy, EUS
Recording destination	Hospital network drive	Cloud, hospital network drive	Hospital network drive
Research database	REDCap	Microsoft Teams	Custom SQL database
Dedicated staff	Research assistant	Research assistant	Research assistant