Tracking more than 100 arbitrary objects at 25 FPS through deep learning

Vaquero Otal, Lorenzo; Brea Sánchez, Víctor Manuel; Mucientes Molina, Manuel

doi:10.1016/j.patcog.2021.108205

Tracking more than 100 arbitrary objects at 25 FPS through deep learning

Files

2022_patcog_vaquero_tracking.pdf (3.26 MB)

Identifiers

URI: http://hdl.handle.net/10347/26777

ISSN: 0031-3203

DOI: 10.1016/j.patcog.2021.108205

Publication date

2022

Authors

Vaquero Otal, Lorenzo

Brea Sánchez, Víctor Manuel

Mucientes Molina, Manuel

Publisher

Elsevier

Metrics

Export

Abstract

Most video analytics applications rely on object detectors to localize objects in frames. However, when real-time is a requirement, running the detector at all the frames is usually not possible. This is somewhat circumvented by instantiating visual object trackers between detector calls, but this does not scale with the number of objects. To tackle this problem, we present SiamMT, a new deep learning multiple visual object tracking solution that applies single-object tracking principles to multiple arbitrary objects in real-time. To achieve this, SiamMT reuses feature computations, implements a novel crop-and-resize operator, and defines a new and efficient pairwise similarity operator. SiamMT naturally scales up to several dozens of targets, reaching 25 fps with 122 simultaneous objects for VGA videos, or up to 100 simultaneous objects in HD720 video. SiamMT has been validated on five large real-time benchmarks, achieving leading performance against current state-of-the-art trackers

Keywords

Multiple visual object tracking| Motion estimation| Deep learning| Siamese networks

Bibliographic citation

Pattern Recognition 2022, 121: 108205. https://doi.org/10.1016/j.patcog.2021.108205

Publisher version

https://doi.org/10.1016/j.patcog.2021.108205

Rights

© 2021 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Attribution-NonCommercial-NoDerivatives 4.0 Internacional

Collections

Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
Electrónica e Computación

Full item page

Tracking more than 100 arbitrary objects at 25 FPS through deep learning

Files

Identifiers

Publication date

Authors

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Metrics

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Publisher version

Sponsors

Rights

Collections