Communication-Computation Efficient Device-Edge Co-Inference via AutoML

Zhang, Xinjie; Shao, Jiawei; Mao, Yuyi; Zhang, Jun

Computer Science > Machine Learning

arXiv:2108.13009 (cs)

[Submitted on 30 Aug 2021 (v1), last revised 31 Aug 2021 (this version, v2)]

Title:Communication-Computation Efficient Device-Edge Co-Inference via AutoML

Authors:Xinjie Zhang, Jiawei Shao, Yuyi Mao, Jun Zhang

View PDF

Abstract:Device-edge co-inference, which partitions a deep neural network between a resource-constrained mobile device and an edge server, recently emerges as a promising paradigm to support intelligent mobile applications. To accelerate the inference process, on-device model sparsification and intermediate feature compression are regarded as two prominent techniques. However, as the on-device model sparsity level and intermediate feature compression ratio have direct impacts on computation workload and communication overhead respectively, and both of them affect the inference accuracy, finding the optimal values of these hyper-parameters brings a major challenge due to the large search space. In this paper, we endeavor to develop an efficient algorithm to determine these hyper-parameters. By selecting a suitable model split point and a pair of encoder/decoder for the intermediate feature vector, this problem is casted as a sequential decision problem, for which, a novel automated machine learning (AutoML) framework is proposed based on deep reinforcement learning (DRL). Experiment results on an image classification task demonstrate the effectiveness of the proposed framework in achieving a better communication-computation trade-off and significant inference speedup against various baseline schemes.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Signal Processing (eess.SP); Machine Learning (stat.ML)
Cite as:	arXiv:2108.13009 [cs.LG]
	(or arXiv:2108.13009v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.13009

Submission history

From: Xinjie Zhang [view email]
[v1] Mon, 30 Aug 2021 06:36:30 UTC (3,822 KB)
[v2] Tue, 31 Aug 2021 15:13:59 UTC (3,817 KB)

Computer Science > Machine Learning

Title:Communication-Computation Efficient Device-Edge Co-Inference via AutoML

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Communication-Computation Efficient Device-Edge Co-Inference via AutoML

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators