Relation networks for few-shot video object detection

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.authorCores Costa, Daniel
dc.contributor.authorSeidenari, Lorenzo
dc.contributor.authorBimbo, Alberto del
dc.contributor.authorBrea Sánchez, Víctor Manuel
dc.contributor.authorMucientes Molina, Manuel
dc.date.accessioned2025-11-17T12:46:37Z
dc.date.available2025-11-17T12:46:37Z
dc.date.issued2023-06-25
dc.description.abstractThis paper describes a new few-shot video object detection framework that leverages spatio-temporal information through a relation module with attention mechanisms to mine relationships among proposals in different frames. The output of the relation module feeds a spatio-temporal double head with a category-agnostic confidence predictor to decrease overfitting in order to address the issue of reduced training sets inherent to few-shot solutions. The predicted score is the input to a long-term object linking approach that provides object tubes across the whole video, which ensures spatio-temporal consistency. Our proposal establishes a new state-of-the-art in the FSVOD500 dataset.
dc.description.sponsorshipThis research was partially funded by the Spanish Ministerio de Ciencia e Innovación (grant number PID2020-112623GB-I00), and the Galician Consellería de Cultura, Educación e Universidade (grant numbers ED431C 2018/29, ED431C 2021/048, ED431G 2019/04). These grants are co-funded by the European Regional Development Fund (ERDF).
dc.identifier.citationCores, D., Seidenari, L., Bimbo, A.D., Brea, V.M., Mucientes, M. (2023). Relation Networks for Few-Shot Video Object Detection. In: Pertusa, A., Gallego, A.J., Sánchez, J.A., Domingues, I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2023. Lecture Notes in Computer Science, vol 14062. Springer, Cham. https://doi.org/10.1007/978-3-031-36616-1_19
dc.identifier.doi10.1007/978-3-031-36616-1_19
dc.identifier.isbn978-3-031-36616-1
dc.identifier.urihttps://hdl.handle.net/10347/43854
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofseriesLecture Notes in Computer Science (LNCS); 14062
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-112623GB-I000/ES/IA RESPONSABLE PARA MINERIA DE PROCESOS 2.0
dc.relation.publisherversionhttps://doi.org/10.1007/978-3-031-36616-1_19
dc.rights.accessRightsopen access
dc.subjectFew-shot object detection
dc.subjectVideo object detection
dc.subject.classification120304 Inteligencia artificial
dc.titleRelation networks for few-shot video object detection
dc.typebook part
dc.type.hasVersionAM
dspace.entity.typePublication
relation.isAuthorOfPublication3daa2166-1c2d-4b3d-bbb0-3d0036bd8cf2
relation.isAuthorOfPublication22d4aeb8-73ba-4743-a84e-9118799ab1f2
relation.isAuthorOfPublication21112b72-72a3-4a96-bda4-065e7e2bb262
relation.isAuthorOfPublication.latestForDiscovery3daa2166-1c2d-4b3d-bbb0-3d0036bd8cf2

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cores23_ibpria_fsv.pdf
Size:
570.36 KB
Format:
Adobe Portable Document Format