BaMBo: An Annotated Bone Marrow Biopsy Dataset for Segmentation Task

Anilpreet Singh1,2, Satyender Dharamdasani3, Praveen Sharma3, Sukrit Gupta1
1: Department of Computer Science and Engineering, Indian Institute of Technology Ropar, Roopnagar, India, 2: Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala, India, 3: Department of Hematology, Postgraduate Institute of Medical Education and Research, Chandigarh, India
Publication date: 2025/12/31
https://doi.org/10.59275/j.melba.2025-dcae
PDF · Code

Abstract

Bone marrow examination has become increasingly important for the diagnosis and treatment of hematologic and other illnesses. The present methods for analyzing bone marrow biopsy samples involve subjective and inaccurate assessments by visual estimation by pathologists. Thus, there is a need to develop automated tools to assist in the analysis of bone marrow samples. However, there is a lack of publicly available standardized and high-quality datasets that can aid in the research and development of automated tools that can provide consistent and objective measurements. In this paper, we present a comprehensive Bone Marrow Biopsy (BaMBo) dataset consisting 185 semantic-segmented bone marrow biopsy images, specifically designed for the automated calculation of bone marrow cellularity. Our dataset comprises high-resolution, generalized images of bone marrow biopsies, each annotated with precise semantic segmentation of different haematological components. These components are divided into 4 classes: Bony trabeculae, adipocytes, cellular region and Background (BG). The annotations were performed with the help of two experienced hematopathologists that were supported by state-of-the-art Deep Learning (DL) models and image processing techniques. We then used our dataset to train a custom U-Net based DL model that performs multi-class semantic segmentation of the images (Dice Score: 0.831± 0.099) and predicts the cellularity of these images with an error of 5.9% ± 8.8%. This shows the applicability of our data for future research in this domain. Our code is available at https://github.com/AI-in-Medicine-IIT-Ropar/BaMbo-Bone-Marrow-Biopsy

Keywords

medical imaging · dataset · Bone Marrow Biopsy · cellularity · semi-automatic annotation

Bibtex @article{melba:2025:041:singh, title = "BaMBo: An Annotated Bone Marrow Biopsy Dataset for Segmentation Task", author = "Singh, Anilpreet and Dharamdasani, Satyender and Sharma, Praveen and Gupta, Sukrit", journal = "Machine Learning for Biomedical Imaging", volume = "3", issue = "Special Issue on Open Data at MICCAI 2024–2025", year = "2025", pages = "856--865", issn = "2766-905X", doi = "https://doi.org/10.59275/j.melba.2025-dcae", url = "https://melba-journal.org/2025:041" }
RISTY - JOUR AU - Singh, Anilpreet AU - Dharamdasani, Satyender AU - Sharma, Praveen AU - Gupta, Sukrit PY - 2025 TI - BaMBo: An Annotated Bone Marrow Biopsy Dataset for Segmentation Task T2 - Machine Learning for Biomedical Imaging VL - 3 IS - Special Issue on Open Data at MICCAI 2024–2025 SP - 856 EP - 865 SN - 2766-905X DO - https://doi.org/10.59275/j.melba.2025-dcae UR - https://melba-journal.org/2025:041 ER -

2025:041 cover