Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorCanosa García, Pablo
dc.contributor.authorOrdóñez Iglesias, Álvaro
dc.contributor.authorBlanco Heras, Dora
dc.contributor.authorArgüello Pedreira, Francisco
dc.date.accessioned2026-01-29T12:08:57Z
dc.date.available2026-01-29T12:08:57Z
dc.date.issued2025-08-08
dc.description.abstractSemantic segmentation for EO is a process that involves assigning a specific label or category to each pixel in an image, enabling precise analysis for land cover applications such as environmental conservation, urban planning or disaster management. Deep learning-based segmentation models have proliferated in recent years, but they often are not well adapted to the unique properties of multi and hyperspectral images, frequently used in remote sensing. Mask2Former is a universal segmentation model based on the concept of masked attention and employs a pretrained classification model as backbone to create intermediate representations. This article presents a preliminary adaptation of Mask2Former for the segmentation of multispectral remote sensing images. This adaptation includes modifying the backbone to accept multispectral inputs and adapting the data processing pipelines to leverage all available spectral bands effectively. The computational cost of the method has also been analyzed as an initial assessment of potential scalability and efficiency for large-scale applications. Experimental results using the FiveBillionPixels dataset reveal a notable improvement in segmentation accuracy when incorporating multispectral bands, outperforming RGB-only performance without a relevant increase in computational cost.
dc.description.peerreviewedSI
dc.description.sponsorshipThis work was supported in part by grants PID2022-141623NB-I00 and TED2021-130367B-I00 funded by MCIN/AEI/10.13039/501100011033 and by European Union NextGenerationEU/PRTR'. It was also supported by Xunta de Galicia - Consellería de Cultura, Educación, Formación Profesional e Universidades [Centro de investigación de Galicia accreditation 2024-2027 ED431G-2023/04 and Reference Competitive Group accreditation, ED431C-2022/16], and by ERDF/EU.
dc.identifier.citationP. Canosa, Á. Ordóñez, D. B. Heras and F. Argüello, "Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former," IGARSS 2025 - 2025 IEEE International Geoscience and Remote Sensing Symposium, Brisbane, Australia, 2025, pp. 1621-1625, doi: 10.1109/IGARSS55030.2025.11243109.
dc.identifier.doi10.1109/IGARSS55030.2025.11243109
dc.identifier.isbn979-8-3315-0810-4
dc.identifier.issn2153-7003
dc.identifier.urihttps://hdl.handle.net/10347/45580
dc.journal.titleIGARSS 2025 - 2025 IEEE International Geoscience and Remote Sensing Symposium
dc.language.isoeng
dc.publisherIEEE
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141623NB-I00/ES/COMPUTACION DE ALTAS PRESTACIONES, HETEROGENEA Y EN LA NUBE PARA APLICACIONES DE ALTA DEMANDA
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Proyectos Estratégicos Orientados a la Transición Ecológica y a la Transición Digital/TED2021-130367B-I00/ES/MONITORIZACION DIGITAL RAPIDA DE ECOSISTEMAS FLUVIALES
dc.relation.publisherversionhttp://doi.org/10.1109/IGARSS55030.2025.11243109
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.rights.accessRightsopen access
dc.subjectLand cover
dc.subjectTransformer
dc.subjectSemantic segmentation
dc.subjectMultispectral
dc.subjectComputational cost
dc.subject.classification330406 Arquitectura de ordenadores
dc.subject.classification120304 Inteligencia artificial
dc.titleEfficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former
dc.typejournal article
dc.type.hasVersionAM
dspace.entity.typePublication
relation.isAuthorOfPublicationa22a0ed8-b87b-473e-b16c-58d78c852dfd
relation.isAuthorOfPublication24b7bf8f-61a5-44da-9a17-67fb85eab726
relation.isAuthorOfPublication01d58a96-54b8-492d-986c-f9005bac259c
relation.isAuthorOfPublication.latestForDiscoverya22a0ed8-b87b-473e-b16c-58d78c852dfd

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
IGARSS2025_Mask2Former_PostPrint.pdf
Size:
707.31 KB
Format:
Adobe Portable Document Format