No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorGutiérrez, Juan D.
dc.contributor.authorRodriguez-Echeverria, Roberto
dc.contributor.authorDelgado, Emilio
dc.contributor.authorSuero-Rodrigo, Miguel Ángel
dc.contributor.authorSánchez-Figueroa, Fernando
dc.date.accessioned2025-04-24T07:35:44Z
dc.date.available2025-04-24T07:35:44Z
dc.date.issued2024-01-11
dc.description.abstractSemantic segmentation of medical images presents an enormous potential for diagnosis and surgery. However, achieving precise results involves designing and training complex Deep Learning (DL) models specifically for this task, which is only available to some. SAM is a model developed by Meta capable of segmenting objects present in virtually any type of image. This paper showcases SAM’s robustness and exceptional performance in medical image segmentation, even in the absence of direct training on these image types (lung Computed Tomographies (CTs) and chest X-rays, in particular). Additionally, it achieves this impressive outcome while requiring minimal user intervention. Although the dataset used to train SAM does not contain a single sample of both medical image types, processing a popular dataset comprised of 20 volumes with a total of 3520 slices using the ViT-L version of the model yields an average Jaccard index of 91.45% and an average Dice score of 94.95% . The same version of the model achieves a 93.19% Dice score and a 87.45% Jaccard index when segmenting a frequently-used chest X-ray dataset. The values obtained are above the 70% mark recommended in the literature, and close to state-of-the art models developed specifically for medical segmentation. These results are achieved without user interaction by providing the model with positive prompts based on the masks of the dataset used and a negative prompt located in the center of bounding box that contains the masks.
dc.description.peerreviewedSI
dc.description.sponsorshipThis work was supported in part by MCIN/AEI/10.13039/50100011033 under Grant CPP2021-008491, and in part by the European Union NextGenerationEU/PRTR.
dc.identifier.citationJ. D. Gutiérrez, R. Rodriguez-Echeverria, E. Delgado, M. Á. S. Rodrigo and F. Sánchez-Figueroa, "No More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation," in IEEE Access, vol. 12, pp. 24205-24216, 2024, doi: 10.1109/ACCESS.2024.3353142
dc.identifier.doi10.1109/ACCESS.2024.3353142
dc.identifier.essn2169-3536
dc.identifier.issn2169-3536
dc.identifier.urihttps://hdl.handle.net/10347/41028
dc.journal.titleIEEE Access
dc.language.isoeng
dc.page.final24216
dc.page.initial24205
dc.publisherIEEE
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/CPP2021-008491/ES/MUSICGENIA: Una Plataforma en la Nube para de Generación de Música bajo Demanda por medio de Inteligencia Artificial/
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/10388320
dc.rights© 2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectImage segmentation
dc.subjectLung
dc.subjectX-ray imaging
dc.subjectComputed tomography
dc.subjectMedical diagnostic imaging
dc.subjectTraining
dc.subjectTask analysis
dc.subjectImage segmentation
dc.subjectDeep learning
dc.subjectZero-shot learning
dc.subjectMedical imaging
dc.subjectSemantic segmentation
dc.titleNo More Training: SAM’s Zero-Shot Transfer Capabilities for Cost-Efficient Medical Image Segmentation
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number12
dspace.entity.typePublication
relation.isAuthorOfPublication34f83200-7a0f-4455-a120-b9c6daf3bcd4
relation.isAuthorOfPublication.latestForDiscovery34f83200-7a0f-4455-a120-b9c6daf3bcd4

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
No_More_Training.pdf
Size:
3.28 MB
Format:
Adobe Portable Document Format