Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

McCraith, Robert; Insafutdinov, Eldar; Neumann, Lukas; Vedaldi, Andrea

Computer Science > Computer Vision and Pattern Recognition

arXiv:2109.07945 (cs)

[Submitted on 16 Sep 2021 (v1), last revised 9 Oct 2021 (this version, v2)]

Title:Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

Authors:Robert McCraith, Eldar Insafutdinov, Lukas Neumann, Andrea Vedaldi

View PDF

Abstract:We present a system for automatic converting of 2D mask object predictions and raw LiDAR point clouds into full 3D bounding boxes of objects. Because the LiDAR point clouds are partial, directly fitting bounding boxes to the point clouds is meaningless. Instead, we suggest that obtaining good results requires sharing information between \emph{all} objects in the dataset jointly, over multiple frames. We then make three improvements to the baseline. First, we address ambiguities in predicting the object rotations via direct optimization in this space while still backpropagating rotation prediction through the model. Second, we explicitly model outliers and task the network with learning their typical patterns, thus better discounting them. Third, we enforce temporal consistency when video data is available. With these contributions, our method significantly outperforms previous work despite the fact that those methods use significantly more complex pipelines, 3D models and additional human-annotated external sources of prior information.

Comments:	ICRA 2022 submission
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2109.07945 [cs.CV]
	(or arXiv:2109.07945v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2109.07945

Submission history

From: Robert McCraith [view email]
[v1] Thu, 16 Sep 2021 13:01:13 UTC (24,889 KB)
[v2] Sat, 9 Oct 2021 14:50:26 UTC (46,141 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators