Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation

Sweta Banerjee1Orcid, Viktoria Weiss2Orcid, Taryn A. Donovan3Orcid, Rutger H.J. Fick4, Thomas Conrad5, Jonas Ammeling6Orcid, Nils Porsche1, Robert Klopfleisch5Orcid, Christopher C. Kaltenecker7Orcid, Katharina Breininger8Orcid, Marc Aubreville1Orcid, Christof A. Bertram2Orcid
1: Flensburg University of Applied Sciences, Germany, 2: University of Veterinary Medicine, Vienna, Austria, 3: The Schwarzman Animal Medical Center, New York, USA, 4: Diffusely, Paris, France, 5: Freie Universität Berlin, Berlin, Germany, 6: Technische Hochschule Ingolstadt, Ingolstadt, Germany, 7: Medical University of Vienna, Austria, 8: Julius-Maximilians-Universität Würzburg, Würzburg, Germany
Publication date: 2026/03/12
https://doi.org/10.59275/j.melba.2026-6c1g
PDF · Code

Abstract

Atypical mitosis marks a deviation in the cell division process that has been shown be an independent prognostic marker for tumor malignancy. However, atypical mitosis classification remains challenging due to low prevalence, at times subtle morphological differences from normal mitotic figures, low inter-rater agreement among pathologists, and class imbalance in datasets. Building on the Atypical Mitosis dataset for Breast Cancer (AMi-Br), this study presents a comprehensive benchmark comparing deep learning approaches for automated atypical mitotic figure (AMF) classification, including end-to-end fine-tuned deep learning models, foundation models with linear probing, and foundation models fine-tuned with low-rank adaptation (LoRA). For rigorous evaluation, we further introduce two new held-out AMF datasets - AtNorM-Br, a dataset of mitotic figures from the TCGA breast cancer cohort, and AtNorM-MD, a multi-domain dataset of mitotic figures from a subset of the MIDOG++ training set. We found average balanced accuracy values of up to 0.8135, 0.7788, and 0.7723 on the in-domain AMi-Br and the out-of-domain AtNorm-Br and AtNorM-MD datasets, respectively. Our work shows that atypical mitotic figure classification, while being a challenging problem, can be effectively addressed through the use of recent advances in transfer learning and model fine-tuning techniques. We make all code and data used in this paper available in this github repository: https://github.com/DeepMicroscopy/AMi-Br_Benchmark

Keywords

Atypical Mitosis · Deep Learning · Foundation Models · Classification · Benchmarking · Histopathology · low-rank adaptation

Bibtex @article{melba:2026:006:banerjee, title = "Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation", author = "Banerjee, Sweta and Weiss, Viktoria and Donovan, Taryn A. and Fick, Rutger H.J. and Conrad, Thomas and Ammeling, Jonas and Porsche, Nils and Klopfleisch, Robert and Kaltenecker, Christopher C. and Breininger, Katharina and Aubreville, Marc and Bertram, Christof A.", journal = "Machine Learning for Biomedical Imaging", volume = "2026", issue = "MELBA–BVM 2025 Special Issue", year = "2026", pages = "115--125", issn = "2766-905X", doi = "https://doi.org/10.59275/j.melba.2026-6c1g", url = "https://melba-journal.org/2026:006" }
RISTY - JOUR AU - Banerjee, Sweta AU - Weiss, Viktoria AU - Donovan, Taryn A. AU - Fick, Rutger H.J. AU - Conrad, Thomas AU - Ammeling, Jonas AU - Porsche, Nils AU - Klopfleisch, Robert AU - Kaltenecker, Christopher C. AU - Breininger, Katharina AU - Aubreville, Marc AU - Bertram, Christof A. PY - 2026 TI - Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation T2 - Machine Learning for Biomedical Imaging VL - 2026 IS - MELBA–BVM 2025 Special Issue SP - 115 EP - 125 SN - 2766-905X DO - https://doi.org/10.59275/j.melba.2026-6c1g UR - https://melba-journal.org/2026:006 ER -

2026:006 cover