Relation networks for few-shot video object detection
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS) | |
| dc.contributor.author | Cores Costa, Daniel | |
| dc.contributor.author | Seidenari, Lorenzo | |
| dc.contributor.author | Bimbo, Alberto del | |
| dc.contributor.author | Brea Sánchez, Víctor Manuel | |
| dc.contributor.author | Mucientes Molina, Manuel | |
| dc.date.accessioned | 2025-11-17T12:46:37Z | |
| dc.date.available | 2025-11-17T12:46:37Z | |
| dc.date.issued | 2023-06-25 | |
| dc.description.abstract | This paper describes a new few-shot video object detection framework that leverages spatio-temporal information through a relation module with attention mechanisms to mine relationships among proposals in different frames. The output of the relation module feeds a spatio-temporal double head with a category-agnostic confidence predictor to decrease overfitting in order to address the issue of reduced training sets inherent to few-shot solutions. The predicted score is the input to a long-term object linking approach that provides object tubes across the whole video, which ensures spatio-temporal consistency. Our proposal establishes a new state-of-the-art in the FSVOD500 dataset. | |
| dc.description.sponsorship | This research was partially funded by the Spanish Ministerio de Ciencia e Innovación (grant number PID2020-112623GB-I00), and the Galician Consellería de Cultura, Educación e Universidade (grant numbers ED431C 2018/29, ED431C 2021/048, ED431G 2019/04). These grants are co-funded by the European Regional Development Fund (ERDF). | |
| dc.identifier.citation | Cores, D., Seidenari, L., Bimbo, A.D., Brea, V.M., Mucientes, M. (2023). Relation Networks for Few-Shot Video Object Detection. In: Pertusa, A., Gallego, A.J., Sánchez, J.A., Domingues, I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2023. Lecture Notes in Computer Science, vol 14062. Springer, Cham. https://doi.org/10.1007/978-3-031-36616-1_19 | |
| dc.identifier.doi | 10.1007/978-3-031-36616-1_19 | |
| dc.identifier.isbn | 978-3-031-36616-1 | |
| dc.identifier.uri | https://hdl.handle.net/10347/43854 | |
| dc.language.iso | eng | |
| dc.publisher | Springer | |
| dc.relation.ispartofseries | Lecture Notes in Computer Science (LNCS); 14062 | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-112623GB-I000/ES/IA RESPONSABLE PARA MINERIA DE PROCESOS 2.0 | |
| dc.relation.publisherversion | https://doi.org/10.1007/978-3-031-36616-1_19 | |
| dc.rights.accessRights | open access | |
| dc.subject | Few-shot object detection | |
| dc.subject | Video object detection | |
| dc.subject.classification | 120304 Inteligencia artificial | |
| dc.title | Relation networks for few-shot video object detection | |
| dc.type | book part | |
| dc.type.hasVersion | AM | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 3daa2166-1c2d-4b3d-bbb0-3d0036bd8cf2 | |
| relation.isAuthorOfPublication | 22d4aeb8-73ba-4743-a84e-9118799ab1f2 | |
| relation.isAuthorOfPublication | 21112b72-72a3-4a96-bda4-065e7e2bb262 | |
| relation.isAuthorOfPublication.latestForDiscovery | 3daa2166-1c2d-4b3d-bbb0-3d0036bd8cf2 |
Files
Original bundle
1 - 1 of 1