Tracking by 3D Model Estimation of Unknown Objects in Videos

Open Access
Authors
  • D. Rozumnyi
  • J. Matas
  • M. Pollefeys
  • V. Ferrari
Publication date 2023
Book title 2023 IEEE/CVF International Conference on Computer Vision
Book subtitle ICCV 2023 : Paris, France, 2-6 October 2023 : proceedings
ISBN
  • 9798350307191
ISBN (electronic)
  • 9798350307184
Event 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
Pages (from-to) 14040-14050
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to guide and improve 2D tracking with an explicit object representation, namely the textured 3D shape and 6DoF pose in each video frame. Our representation tackles a complex long-term dense correspondence problem between all 3D points on the object for all video frames, including frames where some points are invisible. To achieve that, the estimation is driven by re-rendering the input video frames as well as possible through differentiable rendering, which has not been used for tracking before. The proposed optimization minimizes a novel loss function to estimate the best 3D shape, texture, and 6DoF pose. We improve the state-of-the-art in 2D segmentation tracking on three different datasets with mostly rigid objects.
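The abstract's central idea, estimating object state by re-rendering each frame and minimizing the difference to the observed frame, can be illustrated with a deliberately tiny sketch. This is not the authors' method or loss: it assumes a toy 1D "image", a Gaussian blob in place of a textured 3D shape, a single scalar position in place of a 6DoF pose, and a hand-derived gradient in place of a real differentiable renderer. All names and constants (render, fit_center, track, WIDTH, SIGMA, the learning rate) are hypothetical.

```python
import math

WIDTH = 64    # number of pixels in the toy 1D "image" (assumption)
SIGMA = 4.0   # blob width; a smooth renderer keeps the loss differentiable


def render(center):
    """Render a 1D Gaussian blob at `center` (stand-in for a differentiable renderer)."""
    return [math.exp(-((x - center) ** 2) / (2 * SIGMA ** 2)) for x in range(WIDTH)]


def loss_and_grad(center, target):
    """Mean squared re-rendering error and its analytic derivative w.r.t. `center`."""
    loss, grad = 0.0, 0.0
    for x, observed in enumerate(target):
        rendered = math.exp(-((x - center) ** 2) / (2 * SIGMA ** 2))
        diff = rendered - observed
        loss += diff * diff
        # d(rendered)/d(center) = rendered * (x - center) / SIGMA^2
        grad += 2.0 * diff * rendered * (x - center) / SIGMA ** 2
    return loss / WIDTH, grad / WIDTH


def fit_center(target, init, steps=200, lr=20.0):
    """Plain gradient descent on the re-rendering loss for one frame."""
    center = init
    for _ in range(steps):
        _, grad = loss_and_grad(center, target)
        center -= lr * grad
    return center


def track(frames, init):
    """Track frame by frame, starting each fit from the previous estimate."""
    estimates = []
    center = init
    for frame in frames:
        center = fit_center(frame, center)
        estimates.append(center)
    return estimates


# Synthetic "video": the blob moves a little in every frame.
true_centers = [20.0, 22.5, 25.0, 28.0]
frames = [render(c) for c in true_centers]
estimates = track(frames, init=18.0)
```

Starting each frame from the previous estimate keeps the optimization close to the solution, which matters because a re-rendering loss of this kind has a useful gradient only when the rendered and observed objects overlap.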
Document type Conference contribution
Note With supplemental material
Language English
Published at
  • https://doi.org/10.48550/arXiv.2304.06419
  • https://doi.org/10.1109/ICCV51070.2023.01295
  • https://openaccess.thecvf.com/content/ICCV2023/html/Rozumnyi_Tracking_by_3D_Model_Estimation_of_Unknown_Objects_in_Videos_ICCV_2023_paper.html
Other links https://www.proceedings.com/72328.html