Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Wu, Yang; Wang, Dingheng; Lu, Xiaotong; Yang, Fan; Li, Guoqi; Dong, Weisheng; Shi, Jianbo

doi:10.1007/s11633-022-1340-5

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.13055 (cs)

[Submitted on 30 Aug 2021 (v1), last revised 9 Sep 2021 (this version, v2)]

Title:Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Authors:Yang Wu, Dingheng Wang, Xiaotong Lu, Fan Yang, Guoqi Li, Weisheng Dong, Jianbo Shi

View PDF

Abstract:Visual recognition is currently one of the most important and active research areas in computer vision, pattern recognition, and even the general field of artificial intelligence. It has great fundamental importance and strong industrial needs. Deep neural networks (DNNs) have largely boosted their performances on many concrete tasks, with the help of large amounts of training data and new powerful computation resources. Though recognition accuracy is usually the first concern for new progresses, efficiency is actually rather important and sometimes critical for both academic research and industrial applications. Moreover, insightful views on the opportunities and challenges of efficiency are also highly required for the entire community. While general surveys on the efficiency issue of DNNs have been done from various perspectives, as far as we are aware, scarcely any of them focused on visual recognition systematically, and thus it is unclear which progresses are applicable to it and what else should be concerned. In this paper, we present the review of the recent advances with our suggestions on the new possible directions towards improving the efficiency of DNN-related visual recognition approaches. We investigate not only from the model but also the data point of view (which is not the case in existing surveys), and focus on three most studied data types (images, videos and points). This paper attempts to provide a systematic summary via a comprehensive survey which can serve as a valuable reference and inspire both researchers and practitioners who work on visual recognition problems.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.13055 [cs.CV]
	(or arXiv:2108.13055v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.13055
Journal reference:	Mach. Intell. Res. (2022)
Related DOI:	https://doi.org/10.1007/s11633-022-1340-5

Submission history

From: Dingheng Wang [view email]
[v1] Mon, 30 Aug 2021 08:19:34 UTC (4,610 KB)
[v2] Thu, 9 Sep 2021 02:47:15 UTC (4,610 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators