Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former

Canosa García, Pablo; Ordóñez Iglesias, Álvaro; Blanco Heras, Dora; Argüello Pedreira, Francisco

doi:10.1109/IGARSS55030.2025.11243109

Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former

Files

IGARSS2025_Mask2Former_PostPrint.pdf (707.31 KB)

Identifiers

URI: https://hdl.handle.net/10347/45580

ISSN: 2153-7003

ISBN: 979-8-3315-0810-4

DOI: 10.1109/IGARSS55030.2025.11243109

Publication date

2025-08-08

Authors

Canosa García, Pablo

Ordóñez Iglesias, Álvaro

Blanco Heras, Dora

Argüello Pedreira, Francisco

Publisher

IEEE

Metrics

Export

Abstract

Semantic segmentation for EO is a process that involves assigning a specific label or category to each pixel in an image, enabling precise analysis for land cover applications such as environmental conservation, urban planning or disaster management. Deep learning-based segmentation models have proliferated in recent years, but they often are not well adapted to the unique properties of multi and hyperspectral images, frequently used in remote sensing. Mask2Former is a universal segmentation model based on the concept of masked attention and employs a pretrained classification model as backbone to create intermediate representations. This article presents a preliminary adaptation of Mask2Former for the segmentation of multispectral remote sensing images. This adaptation includes modifying the backbone to accept multispectral inputs and adapting the data processing pipelines to leverage all available spectral bands effectively. The computational cost of the method has also been analyzed as an initial assessment of potential scalability and efficiency for large-scale applications. Experimental results using the FiveBillionPixels dataset reveal a notable improvement in segmentation accuracy when incorporating multispectral bands, outperforming RGB-only performance without a relevant increase in computational cost.

Keywords

Land cover| Transformer| Semantic segmentation| Multispectral| Computational cost

Bibliographic citation

P. Canosa, Á. Ordóñez, D. B. Heras and F. Argüello, "Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former," IGARSS 2025 - 2025 IEEE International Geoscience and Remote Sensing Symposium, Brisbane, Australia, 2025, pp. 1621-1625, doi: 10.1109/IGARSS55030.2025.11243109.

Publisher version

http://doi.org/10.1109/IGARSS55030.2025.11243109

Rights

© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Collections

Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
Electrónica e Computación

Full item page

Efficient Semantic Segmentation of Multispectral Land Cover Images Using Mask2Former

Files

Identifiers

Publication date

Authors

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Metrics

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Publisher version

Sponsors

Rights

Collections