Towards generic 3D tracking in RGBD videos: benchmark and baseline

Jinyu Yang, Zhongqun Zhang, Zhe Li, Hyung Jin Chang, Ales Leonardis, Feng Zheng*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

164 Downloads (Pure)

Abstract

Tracking in 3D scenes is gaining momentum because of its numerous applications in robotics, autonomous driving, and scene understanding. Currently, 3D tracking is limited to specific model-based approaches involving point clouds, which impedes 3D trackers from applying in natural 3D scenes. RGBD sensors provide a more reasonable and acceptable solution for 3D object tracking due to their readily available synchronised color and depth information. Thus, in this paper, we investigate a novel problem: is it possible to track a generic (class-agnostic) 3D object in RGBD videos and predict 3D bounding boxes of the object of interest? To inspire research on this topic, we newly construct a standard benchmark for generic 3D object tracking, ‘Track-it-in-3D’, which contains 300 RGBD video sequences with dense 3D annotations and corresponding evaluation protocols. Furthermore, we propose an effective tracking baseline to estimate 3D bounding boxes for arbitrary objects in RGBD videos, by fusing appearance and spatial information effectively. Resources are available on https://github.com/yjybuaa/Track-it-in-3D.
Original languageEnglish
Title of host publicationComputer Vision – ECCV 2022
Subtitle of host publication17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXII
EditorsShai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
PublisherSpringer
Pages112–128
Number of pages17
Edition1
ISBN (Electronic)9783031200472
ISBN (Print)9783031200465
DOIs
Publication statusPublished - 23 Oct 2022
Event17th European Conference on Computer Vision (ECCV 2022) - Tel Aviv, Israel
Duration: 24 Oct 202228 Oct 2022

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume13682
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th European Conference on Computer Vision (ECCV 2022)
Abbreviated titleECCV 2022
Country/TerritoryIsrael
CityTel Aviv
Period24/10/2228/10/22

Fingerprint

Dive into the research topics of 'Towards generic 3D tracking in RGBD videos: benchmark and baseline'. Together they form a unique fingerprint.

Cite this