MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Shamsafar, Faranak; Woerz, Samuel; Rahim, Rafia; Zell, Andreas

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.09770 (cs)

[Submitted on 22 Aug 2021]

Title:MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Authors:Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell

View PDF

Abstract:Recent methods in stereo matching have continuously improved the accuracy using deep models. This gain, however, is attained with a high increase in computation cost, such that the network may not fit even on a moderate GPU. This issue raises problems when the model needs to be deployed on resource-limited devices. For this, we propose two light models for stereo vision with reduced complexity and without sacrificing accuracy. Depending on the dimension of cost volume, we design a 2D and a 3D model with encoder-decoders built from 2D and 3D convolutions, respectively. To this end, we leverage 2D MobileNet blocks and extend them to 3D for stereo vision application. Besides, a new cost volume is proposed to boost the accuracy of the 2D model, making it performing close to 3D networks. Experiments show that the proposed 2D/3D networks effectively reduce the computational expense (27%/95% and 72%/38% fewer parameters/operations in 2D and 3D models, respectively) while upholding the accuracy. Our code is available at this https URL.

Comments:	Under review. Further figures and tables in the appendix. Code provided
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.09770 [cs.CV]
	(or arXiv:2108.09770v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.09770

Submission history

From: Faranak Shamsafar [view email]
[v1] Sun, 22 Aug 2021 16:14:27 UTC (36,640 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Rafia Rahim
Andreas Zell

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators