Greedy Network Enlarging

Liu, Chuanjian; Han, Kai; Xiao, An; Deng, Yiping; Zhang, Wei; Xu, Chunjing; Wang, Yunhe

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.00177 (cs)

[Submitted on 31 Jul 2021 (v1), last revised 26 Nov 2021 (this version, v3)]

Title:Greedy Network Enlarging

Authors:Chuanjian Liu, Kai Han, An Xiao, Yiping Deng, Wei Zhang, Chunjing Xu, Yunhe Wang

View PDF

Abstract:Recent studies on deep convolutional neural networks present a simple paradigm of architecture design, i.e., models with more MACs typically achieve better accuracy, such as EfficientNet and RegNet. These works try to enlarge all the stages in the model with one unified rule by sampling and statistical methods. However, we observe that some network architectures have similar MACs and accuracies, but their allocations on computations for different stages are quite different. In this paper, we propose to enlarge the capacity of CNN models by improving their width, depth and resolution on stage level. Under the assumption that the top-performing smaller CNNs are a proper subcomponent of the top-performing larger CNNs, we propose an greedy network enlarging method based on the reallocation of computations. With step-by-step modifying the computations on different stages, the enlarged network will be equipped with optimal allocation and utilization of MACs. On EfficientNet, our method consistently outperforms the performance of the original scaling method. In particular, with application of our method on GhostNet, we achieve state-of-the-art 80.9% and 84.3% ImageNet top-1 accuracies under the setting of 600M and 4.4B MACs, respectively.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2108.00177 [cs.CV]
	(or arXiv:2108.00177v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.00177

Submission history

From: Chuanjian Liu [view email]
[v1] Sat, 31 Jul 2021 08:36:30 UTC (932 KB)
[v2] Wed, 4 Aug 2021 08:07:19 UTC (968 KB)
[v3] Fri, 26 Nov 2021 03:36:45 UTC (968 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Greedy Network Enlarging

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Greedy Network Enlarging

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators