Breast Cancer Malignancy Prediction through Explainable Models based on a Multimodal Signature of Features
- Autori: Prinzi, F.; Militello, C.; Bartolotta, T.V.; Vitabile, S.
- Anno di pubblicazione: 2025
- Tipologia: Contributo in atti di convegno pubblicato in volume
- Parole Chiave: Breast cancer; Clinical features; Explainable model; Interpretable features; Multimodal signature; Radiomic features
- OA Link: http://hdl.handle.net/10447/686846
Abstract
Breast cancer classification through ultrasound imaging poses a significant challenge due to the inherent noise present in ultrasound images. The radiologist’s reporting process aims to assess the lesions within the images following the Breast Imaging-Reporting and Data System (BI-RADS). This work investigates whether the medical knowledge, represented by the BI-RADS information, augmented by pixel-based quantitative features, can improve breast cancer classification. Machine learning classifiers, including XGBoost, Random Forest, and Support Vector Machine, were trained with an intelligible multimodal signature composed of the BI-RADS and radiomic features. Exploiting the intrinsic interpretability of our model input, the work aims to obtain an explainable predictive model using post-hoc explanation methods. A proprietary dataset composed of 237 B-mode ultrasound scans was acquired, and a total of 103 radiomics features were extracted. Before the training of classifiers, a pipeline for selecting an informative and non-redundant signature was implemented. A 10-fold Cross-Validation repeated 20 times was considered for the training in 80% of the dataset, and the best model in terms of accuracy was selected for testing on the remaining 20%. Our results prove how the medical knowledge, represented by the BI-RADS information, is enhanced with the use of radiomic features. XGBoost was the best model, showing an AUROC of 0.977 ± 0.029 and 0.956 in the training and test phases, respectively. In addition, the implemented global explanation using the SHAP method and exploiting the intelligibility of radiomic features, allowed us to confirm some important model findings.