No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation

Gutiérrez, Juan D.; Rodriguez-Echeverria, Roberto; Delgado, Emilio; Suero-Rodrigo, Miguel Ángel; Sánchez-Figueroa, Fernando

doi:10.1109/ACCESS.2024.3353142

No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation

dc.contributor.affiliation	Universidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.author	Gutiérrez, Juan D.
dc.contributor.author	Rodriguez-Echeverria, Roberto
dc.contributor.author	Delgado, Emilio
dc.contributor.author	Suero-Rodrigo, Miguel Ángel
dc.contributor.author	Sánchez-Figueroa, Fernando
dc.date.accessioned	2025-04-24T07:35:44Z
dc.date.available	2025-04-24T07:35:44Z
dc.date.issued	2024-01-11
dc.description.abstract	Semantic segmentation of medical images presents an enormous potential for diagnosis and surgery. However, achieving precise results involves designing and training complex Deep Learning (DL) models specifically for this task, which is only available to some. SAM is a model developed by Meta capable of segmenting objects present in virtually any type of image. This paper showcases SAM’s robustness and exceptional performance in medical image segmentation, even in the absence of direct training on these image types (lung Computed Tomographies (CTs) and chest X-rays, in particular). Additionally, it achieves this impressive outcome while requiring minimal user intervention. Although the dataset used to train SAM does not contain a single sample of both medical image types, processing a popular dataset comprised of 20 volumes with a total of 3520 slices using the ViT-L version of the model yields an average Jaccard index of 91.45% and an average Dice score of 94.95% . The same version of the model achieves a 93.19% Dice score and a 87.45% Jaccard index when segmenting a frequently-used chest X-ray dataset. The values obtained are above the 70% mark recommended in the literature, and close to state-of-the art models developed specifically for medical segmentation. These results are achieved without user interaction by providing the model with positive prompts based on the masks of the dataset used and a negative prompt located in the center of bounding box that contains the masks.
dc.description.peerreviewed	SI
dc.description.sponsorship	This work was supported in part by MCIN/AEI/10.13039/50100011033 under Grant CPP2021-008491, and in part by the European Union NextGenerationEU/PRTR.
dc.identifier.citation	J. D. Gutiérrez, R. Rodriguez-Echeverria, E. Delgado, M. Á. S. Rodrigo and F. Sánchez-Figueroa, "No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation," in IEEE Access, vol. 12, pp. 24205-24216, 2024, doi: 10.1109/ACCESS.2024.3353142
dc.identifier.doi	10.1109/ACCESS.2024.3353142
dc.identifier.essn	2169-3536
dc.identifier.issn	2169-3536
dc.identifier.uri	https://hdl.handle.net/10347/41028
dc.journal.title	IEEE Access
dc.language.iso	eng
dc.page.final	24216
dc.page.initial	24205
dc.publisher	IEEE
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/CPP2021-008491/ES/MUSICGENIA: Una Plataforma en la Nube para de Generación de Música bajo Demanda por medio de Inteligencia Artificial/
dc.relation.publisherversion	https://ieeexplore.ieee.org/document/10388320
dc.rights	© 2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
dc.rights.accessRights	open access
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject	Image segmentation
dc.subject	Lung
dc.subject	X-ray imaging
dc.subject	Computed tomography
dc.subject	Medical diagnostic imaging
dc.subject	Training
dc.subject	Task analysis
dc.subject	Image segmentation
dc.subject	Deep learning
dc.subject	Zero-shot learning
dc.subject	Medical imaging
dc.subject	Semantic segmentation
dc.title	No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation
dc.type	journal article
dc.type.hasVersion	VoR
dc.volume.number	12
dspace.entity.type	Publication
relation.isAuthorOfPublication	34f83200-7a0f-4455-a120-b9c6daf3bcd4
relation.isAuthorOfPublication.latestForDiscovery	34f83200-7a0f-4455-a120-b9c6daf3bcd4

Files

Original bundle

Now showing 1 - 1 of 1

Name:: No_More_Training.pdf
Size:: 3.28 MB
Format:: Adobe Portable Document Format

Download

Collections

Electrónica e Computación