Exploring Fairness and Performance Drivers Across State-of-the-Art Pulmonary Nodule Detection Algorithms
John McCabe1, Daryl O. Cheng1, Andrew Crossingham2, Junaid Choudhary2, Samantha L Quaife3, Tanya Patrick4, Monica Mullin4,5, Amyn Bhamani4, Esther Arthur-Darkwa6, Aoife Walker6, Arjun Nair2, Alan Hackshaw6, SUMMIT consortium , Sam M. Janes4, Joseph Jacob1, Carole H. Sudre7,8
1: Satsuma Lab, Hawkes Institute, University College London, United Kingdom, 2: University College London Hospitals NHS Foundation Trust, London, United Kingdom, 3: Centre for Cancer Screening, Prevention, Detection and Early Diagnosis, Wolfson Institute of Population Health, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom, 4: Lungs for Living Research Centre, UCL Respiratory, University College London, United Kingdom, 5: Department of Respirology, University of British Columbia, Vancouver, Canada, 6: CRUK & UCL Cancer Trials Centre, University College London, London, United Kingdom, 7: Unit for Lifelong Health and Ageing at UCL, Department of Population Science and Experimental Medicine, University College London, United Kingdom, 8: Hawkes Institute, University College London, United Kingdom
Publication date: 2025/12/21
https://doi.org/10.59275/j.melba.2025-6838
Abstract
Lung cancer is the leading cause of cancer-related deaths in the UK. Its high mortality rate is primarily due to its asymptomatic nature in the early stages, leading to late-stage diagnoses. However, effective early detection methods, such as Low-Dose Computed Tomography (LDCT), and treatments for early-stage disease make lung cancer an ideal candidate for screening. The UK Government aims to implement a national lung cancer screening programme targeting high-risk populations by 2029. This will significantly increase the workload on an already stretched radiology workforce, driving the adoption of computer-aided detection (CADe) systems to support radiologists. The datasets used to train these algorithms are typically drawn from previous lung cancer screening trials and studies (National Lung Screening Trial Research Team (2011); de Koning (2020)), which often lack balanced representation of protected groups, such as sex and ethnicity. This project examines whether training nodule detection algorithms on low-dose computed tomography (LDCT) scans from a London-based lung screening study, where these groups are typically under-represented, affects algorithm performance for under-represented categories. Our results indicate that overall performance remains equitable across all categories, even when trained on unbalanced datasets. The discriminative performance of deep learning-based pulmonary nodule detection algorithms is primarily driven by the composition of the dataset, specifically, the relative proportion of nodule types and sizes, rather than by protected attributes such as sex or ethnic group. The features learned from the nodules themselves drive detection outcomes, meaning that in populations where the prevalent nodule characteristics closely match the training data, performance is likely to be strong. While this study found no demographic disparities for nodule detection, there is no guarantee that this will be true across all populations, particularly those in populations where cancer risk predominates within different nodule distributions. This study provides an early assessment of performance variations of deep learning models across under-represented groups within a standard lung cancer screening dataset. While previous research has focused on improving how well nodule detection algorithms identify pulmonary nodules, this study uniquely focuses on demographic performance disparities and the impact of training data composition and algorithm design on model generalisability. The findings highlight critical considerations for the deployment of CADe systems in lung cancer screening, ensuring equitable performance across diverse patient populations. Our code is available at https://github.com/johnmccabe44/fairness-in-nodule-detection
Keywords
Nodule Detection Algorithms · Fairness in AI · Lung Cancer Screening
Bibtex
@article{melba:2025:029:mccabe,
title = "Exploring Fairness and Performance Drivers Across State-of-the-Art Pulmonary Nodule Detection Algorithms",
author = "McCabe, John and Cheng, Daryl O. and Crossingham, Andrew and Choudhary, Junaid and Quaife, Samantha L and Patrick, Tanya and Mullin, Monica and Bhamani, Amyn and Arthur-Darkwa, Esther and Walker, Aoife and Nair, Arjun and Hackshaw, Alan and , SUMMIT consortium and Janes, Sam M. and Jacob, Joseph and Sudre, Carole H.",
journal = "Machine Learning for Biomedical Imaging",
volume = "3",
issue = "Special issue on FAIMI",
year = "2025",
pages = "665--686",
issn = "2766-905X",
doi = "https://doi.org/10.59275/j.melba.2025-6838",
url = "https://melba-journal.org/2025:029"
}
RIS
TY - JOUR
AU - McCabe, John
AU - Cheng, Daryl O.
AU - Crossingham, Andrew
AU - Choudhary, Junaid
AU - Quaife, Samantha L
AU - Patrick, Tanya
AU - Mullin, Monica
AU - Bhamani, Amyn
AU - Arthur-Darkwa, Esther
AU - Walker, Aoife
AU - Nair, Arjun
AU - Hackshaw, Alan
AU - , SUMMIT consortium
AU - Janes, Sam M.
AU - Jacob, Joseph
AU - Sudre, Carole H.
PY - 2025
TI - Exploring Fairness and Performance Drivers Across State-of-the-Art Pulmonary Nodule Detection Algorithms
T2 - Machine Learning for Biomedical Imaging
VL - 3
IS - Special issue on FAIMI
SP - 665
EP - 686
SN - 2766-905X
DO - https://doi.org/10.59275/j.melba.2025-6838
UR - https://melba-journal.org/2025:029
ER -