A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems
Gökhan Özbulak1,2
, Oscar Jimenez-del-Toro1
, Maíra Fatoretto3
, Lilian Berton3
, André Anjos1
1: Idiap Research Institute, Martigny, Switzerland, 2: École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland, 3: Federal University of São Paulo (UNIFESP), São Paulo, Brazil
Publication date: 2025/12/30
https://doi.org/10.59275/j.melba.2025-ab9a
Abstract
The evaluation of fairness models in Machine Learning involves complex challenges, such as defining appropriate metrics, balancing trade-offs between utility and fairness, and there are still gaps in this stage. This work presents a novel multi-objective evaluation framework that enables the analysis of utility-fairness trade-offs in Machine Learning systems. The framework was developed using criteria from Multi-Objective Optimization that collect comprehensive information regarding this complex evaluation task. The assessment of multiple Machine Learning systems is summarized, both quantitatively and qualitatively, in a straightforward manner through a radar chart and a measurement table encompassing various aspects such as convergence, system capacity, and diversity. The framework’s compact representation of performance facilitates the comparative analysis of different Machine Learning strategies for decision-makers, in real-world applications, with single or multiple fairness requirements. In particular, this study focuses on the medical imaging domain, where fairness considerations are crucial due to the potential impact of biased diagnostic systems on patient outcomes. The proposed framework enables a systematic evaluation of multiple fairness constraints helping to identify and mitigate disparities among demographic groups while maintaining diagnostic performance. The framework is model-agnostic and flexible to be adapted to any kind of Machine Learning systems, that is, black- or white-box, any kind and quantity of evaluation metrics, including multidimensional fairness criteria. The functionality and effectiveness of the proposed framework is shown with different simulations, and an empirical study conducted on three real-world medical imaging datasets with various Machine Learning systems. Our evaluation framework is publicly available at https://pypi.org/project/fairical
Keywords
Machine Learning · Multidimensional Evaluation · Multi-Objective Optimization · Utility-Fairness Trade-off · Medical Image Analysis
Bibtex
@article{melba:2025:050:özbulak,
title = "A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems",
author = "Özbulak, Gökhan and Jimenez-del-Toro, Oscar and Fatoretto, Maíra and Berton, Lilian and Anjos, André",
journal = "Machine Learning for Biomedical Imaging",
volume = "3",
issue = "Special issue on FAIMI",
year = "2025",
pages = "938--957",
issn = "2766-905X",
doi = "https://doi.org/10.59275/j.melba.2025-ab9a",
url = "https://melba-journal.org/2025:050"
}
RIS
TY - JOUR
AU - Özbulak, Gökhan
AU - Jimenez-del-Toro, Oscar
AU - Fatoretto, Maíra
AU - Berton, Lilian
AU - Anjos, André
PY - 2025
TI - A Multi-Objective Evaluation Framework for Analyzing Utility-Fairness Trade-Offs in Machine Learning Systems
T2 - Machine Learning for Biomedical Imaging
VL - 3
IS - Special issue on FAIMI
SP - 938
EP - 957
SN - 2766-905X
DO - https://doi.org/10.59275/j.melba.2025-ab9a
UR - https://melba-journal.org/2025:050
ER -