Open Access
CC BY 4.0 · Methods Inf Med 2024; 63(03/04): 122-136
DOI: 10.1055/a-2562-2163
Original Article for a Focus Theme

Alternative Strategies to Generate Class Activation Maps Supporting AI-based Advice in Vertebral Fracture Detection in X-ray Images

Samuele Pe
1   Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
,
Lorenzo Famiglini
2   Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
,
Enrico Gallazzi
3   ASST G. Pini – CTO, Milan, Italy
,
Chandra Bortolotto
4   Unit of Radiology, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy
5   Department of Radiology, I.R.C.C.S. Policlinic San Matteo Foundation, Pavia, Italy
,
Luisa Carone
5   Department of Radiology, I.R.C.C.S. Policlinic San Matteo Foundation, Pavia, Italy
,
Andrea Cisarri
4   Unit of Radiology, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy
,
Alberto Salina
4   Unit of Radiology, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy
,
Lorenzo Preda
4   Unit of Radiology, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Pavia, Italy
5   Department of Radiology, I.R.C.C.S. Policlinic San Matteo Foundation, Pavia, Italy
,
Riccardo Bellazzi
1   Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
,
Federico Cabitza
2   Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
6   Department of Reconstructive Surgery and Osteo-articular Infections C.R.I.O. Unit, I.R.C.C.S. Galeazzi Orthopaedic Institute, Milan, Italy
,
Enea Parimbelli
1   Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
7   Telfer School of Management, University of Ottawa, Ottawa, Ontario, Canada
› Institutsangaben

Funding All research described in the article has been reviewed in compliance with ethical standards of the Italian Lombardia Region health systems and medical research bodies, and it is in line with the World Medical Association Declaration of Helsinki on Ethical Principles for Medical Research Involving Human Subjects. This work was carried out as part of the main author's master thesis at the University of Pavia. Samuele Pe is currently a PhD student enrolled in the National PhD program in Artificial Intelligence, XXXIX cycle, course on Health and Life Sciences, organized by Università Campus Bio-Medico di Roma. This work was supported by the Italian Ministry of Research, under the complementary actions to the NRRP “Fit4MedRob - Fit for Medical Robotics” Grant (# PNC0000007). Enea Parimbelli and Federico Cabitza acknowledge funding support provided by the Italian project PRIN PNRR 2022 InXAID - Interaction with eXplainable Artificial Intelligence in (medical) Decision-making. CUP: H53D23008090001 funded by the European Union - Next Generation EU.
Preview

Abstract

Background

Balancing artificial intelligence (AI) support with appropriate human oversight is challenging, with associated risks such as algorithm aversion and technology dominance. Research areas like eXplainable AI (XAI) and Frictional AI aim to address these challenges. Studies have shown that presenting XAI explanations as “juxtaposed evidence” supporting contrasting classifications, rather than just providing predictions, can be beneficial.

Objectives

This study aimed to design and compare multiple pipelines for generating juxtaposed evidence in the form of class activation maps (CAMs) that highlight areas of interest in a fracture detection task with X-ray images.

Materials and Methods

We designed three pipelines to generate such evidence. The pipelines are based on a fracture detection task from 630 thoraco-lumbar X-ray images (48% of which contained fractures). The first, a single-model approach, uses an algorithm of the Grad-CAM family applied to a ResNeXt-50 network trained through transfer learning. The second, a dual-model approach, employs two networks—one optimized for sensitivity and the other for specificity—providing targeted explanations for positive and negative cases. The third, a generative approach, leverages autoencoders to create activation maps from feature tensors, extracted from the raw images. Each approach produced two versions of activation maps: AM3—as we termed it—which captures fine-grained, low-level features, and AM4, highlighting high-level, aggregated features. We conducted a validation study by comparing the generated maps with binary ground-truth masks derived from a consensus of four clinician annotators, identifying the actual locations of fractures in a subset of positive cases.

Results

HiResCAM proved to be the best performing Grad-CAM variant and was used in both the single- and dual-model strategies. The generative approach demonstrated the greatest overlap with the clinicians' assessments, indicating its ability to align with human expertise.

Conclusion

The results highlight the potential of Judicial AI to enhance diagnostic decision-making and foster a synergistic collaboration between humans and AI.



Publikationsverlauf

Eingereicht: 28. September 2024

Angenommen: 16. Dezember 2024

Artikel online veröffentlicht:
03. Juni 2025

© 2025. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. (https://creativecommons.org/licenses/by/4.0/)

Georg Thieme Verlag KG
Oswald-Hesse-Straße 50, 70469 Stuttgart, Germany