Cited time in webofscience Cited time in scopus

Self-Supervised Monocular Depth and Motion Learning in Dynamic Scenes: Semantic Prior to Rescue

Title
Self-Supervised Monocular Depth and Motion Learning in Dynamic Scenes: Semantic Prior to Rescue
Author(s)
Lee, SeokjuRameau, FrancoisIm, SunghoonKweon, In So
Issued Date
2022-09
Citation
International Journal of Computer Vision, v.130, no.9, pp.2265 - 2285
Type
Article
Author Keywords
3D visual perceptionMonocular depth estimationMotion estimationSelf-supervised learning
ISSN
0920-5691
Abstract
We introduce an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion, and depth in a monocular camera setup without geometric supervision. Our technical contributions are three-fold. First, we highlight the fundamental difference between inverse and forward projection while modeling the individual motion of each rigid object, and propose a geometrically correct projection pipeline using a neural forward projection module. Second, we propose two types of residual motion learning frameworks to explicitly disentangle camera and object motions in dynamic driving scenes with different levels of semantic prior knowledge: video instance segmentation as a strong prior, and object detection as a weak prior. Third, we design a unified photometric and geometric consistency loss that holistically imposes self-supervisory signals for every background and object region. Lastly, we present a unsupervised method of 3D motion field regularization for semantically plausible object motion representation. Our proposed elements are validated in a detailed ablation study. Through extensive experiments conducted on the KITTI, Cityscapes, and Waymo open dataset, our framework is shown to outperform the state-of-the-art depth and motion estimation methods. Our code, dataset, and models are publicly available. © 2023 Springer Nature Switzerland AG. Part of Springer Nature.
URI
http://hdl.handle.net/20.500.11750/17172
DOI
10.1007/s11263-022-01641-5
Publisher
Springer
Related Researcher
  • 임성훈 Im, Sunghoon
  • Research Interests Computer Vision; Deep Learning; Robot Vision
Files in This Item:

There are no files associated with this item.

Appears in Collections:
Department of Electrical Engineering and Computer Science Computer Vision Lab. 1. Journal Articles

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE