AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Liu, Zongdai; Zhou, Dingfu; Lu, Feixiang; Fang, Jin; Zhang, Liangjun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.11127 (cs)

[Submitted on 25 Aug 2021]

Title:AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Authors:Zongdai Liu, Dingfu Zhou, Feixiang Lu, Jin Fang, Liangjun Zhang

View PDF

Abstract:Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for incorporating the shape-aware 2D/3D constraints into the 3D detection framework. Specifically, we employ the deep neural network to learn distinguished 2D keypoints in the 2D image domain and regress their corresponding 3D coordinates in the local 3D object coordinate first. Then the 2D/3D geometric constraints are built by these correspondences for each object to boost the detection performance. For generating the ground truth of 2D/3D keypoints, an automatic model-fitting approach has been proposed by fitting the deformed 3D object model and the object mask in the 2D image. The proposed framework has been verified on the public KITTI dataset and the experimental results demonstrate that by using additional geometrical constraints the detection performance has been significantly improved as compared to the baseline method. More importantly, the proposed framework achieves state-of-the-art performance with real time. Data and code will be available at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.11127 [cs.CV]
	(or arXiv:2108.11127v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.11127

Submission history

From: Zongdai Liu [view email]
[v1] Wed, 25 Aug 2021 08:50:06 UTC (5,510 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators