Home>Results

  • Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

[期刊论文]

DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving

Share
Edit Delete 报错

author:

Chen, J. (Chen, J..) [1] | Song, Q. (Song, Q..) [2] | Guo, W. (Guo, W..) [3] (Scholars:郭文忠) | Unfold

Indexed by:

Scopus

Abstract:

Camera-based stereo 3D object detection estimates 3D properties of objects with binocular images only, which is a cost-effective solution for autonomous driving. The state-of-the-art methods mainly improve the detection accuracy of general objects by designing ingenious stereo matching algorithms or complex pipeline modules. Moreover, additional fine-grained annotations, such as masks or LiDAR point clouds, are often introduced to deal with the occlusion problems, which brings in high manual costs for this task. To address the detection bottleneck caused by occlusion in a more cost-effective manner, we develop a novel stereo 3D object detection method named DSC3D, which achieves significant improvements for occluded objects without introducing additional supervision. Specifically, we first report the ambiguity in feature sampling, which refers to the presence of noisy features in the sampling for occluded objects. Then, we propose the Epipolar Constraint Deform-Attention (ECDA) module to address the unreliable left-right correspondence computation in stereo matching caused by occlusion, which reweights epipolar features by adaptively aggregating local neighbor information. Furthermore, to ensure that 3D property estimation is based on robust object features, we propose visible regions guided constraint to explicitly guide the offset learning for feature sampling. Extensive experiments conducted on the KITTI benchmark have demonstrated the proposed DSC3D outperforms the state-of-the-art camera-based methods.  © 1991-2012 IEEE.

Keyword:

3D Object Detection Autonomous Driving Binocular Images Occluded Object Stereo Matching

Community:

  • [ 1 ] [Chen J.]Fuzhou University, College of Computer and Big Data, Fuzhou, China
  • [ 2 ] [Chen J.]The Chinese University of Hong Kong, School of Science and Engineering, Guangdong, Shenzhen, China
  • [ 3 ] [Song Q.]The Chinese University of Hong Kong, School of Science and Engineering, Guangdong, Shenzhen, China
  • [ 4 ] [Guo W.]Fuzhou University, College of Computer and Big Data, Fuzhou, China
  • [ 5 ] [Huang R.]The Chinese University of Hong Kong, School of Science and Engineering, Guangdong, Shenzhen, China

Reprint 's Address:

Show more details

Related Article:

Source :

IEEE Transactions on Circuits and Systems for Video Technology

ISSN: 1051-8215

Year: 2024

8 . 3 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

30 Days PV: 1

Online/Total:80/10034030
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1