RSS-Feed abonnieren
DOI: 10.1055/s-0039-1681509
INCORPORATION OF TEMPORAL INFORMATION IN A DEEP NEURAL NETWORK IMPROVES PERFORMANCE LEVEL FOR AUTOMATED POLYP DETECTION AND DELINEATION
Publikationsverlauf
Publikationsdatum:
18. März 2019 (online)
Aims:
As opposed to current automated polyp detection techniques, endoscopists will use information from previous video-frames to indicate the presence of a polyp. We aim to exploit this type of temporal information by introducing memory cells into an artificial intelligence (AI) system.
Methods:
Colonoscopy videos from 104 patients are included with 258 polyps. Shorter video-clips of each polyp are extracted and only a few frames were annotated by experts. These manual annotations are automatically propagated over the entire clip. The resulting, much larger annotated dataset is then used to train a convolutional neural network (CNN). This network is extended with a recurrent module, resulting in an AI system that uses knowledge from previous timesteps.
Frame-level sensitivity and specificity describe detection power. For delineation accuracy, the soft Dice score quantifies the amount of overlap between a delineation map and its ground truth considering the confidence of the network (a number between 0 and 1 where the latter means perfect overlap with 100% confidence).
Results:
Two different networks are trained for evaluation. A first CNN is trained solely on the expert annotated frames and a second CNN includes the temporal module and is trained on all the auto-generated annotations (called EXP and REC respectively). The results are shown in table 1. The incorporation of temporal information improves the network for each metric and especially increases specificity since it makes the network less sensitive to confusing frames. Pairwise t-tests show that all differences are significant with p < 0,00001 (significance level of 0,05).
N |
Sensitivity |
Specificity |
Soft Dice score |
|
CNN1 – EXP |
758 |
0,83 |
0,54 |
0,38 |
CNN2 – REC |
40887 |
0,91 |
0,74 |
0,56 |
Conclusions:
The inclusion of temporal information provides more accurate and confident results for polyp detection and delineation on endoscopic videos.