Automatic Tuning of Tensorflow's CPU Backend using Gradient-Free Optimization Algorithms

Mebratu, Derssie; Hasabnis, Niranjan; Mercati, Pietro; Sharma, Gaurit; Najnin, Shamima

arXiv:2109.06266 (cs)

[Submitted on 13 Sep 2021]

Abstract: Modern deep learning (DL) applications are built using DL libraries and frameworks such as TensorFlow and PyTorch. These frameworks have complex parameters, and tuning them to obtain good training and inference performance is challenging for typical users, such as DL developers and data scientists. Manual tuning requires deep knowledge of the user-controllable parameters of DL frameworks as well as the underlying hardware. It is a slow and tedious process, and it typically delivers sub-optimal solutions.
In this paper, we treat the problem of tuning parameters of DL frameworks to improve training and inference performance as a black-box optimization problem. We then investigate the applicability and effectiveness of Bayesian optimization (BO), a genetic algorithm (GA), and Nelder-Mead simplex (NMS) for tuning the parameters of TensorFlow's CPU backend. While prior work has already investigated the use of Nelder-Mead simplex for a similar problem, it does not provide insights into the applicability of other, more popular algorithms. Towards that end, we provide a systematic comparative analysis of all three algorithms in tuning TensorFlow's CPU backend on a variety of DL models. Our findings reveal that Bayesian optimization performs the best on the majority of models. There are, however, cases where it does not deliver the best results.
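
To make the black-box framing concrete, the following is a minimal sketch of a genetic-algorithm search over a small integer parameter space. It is not the authors' implementation. The parameter names and ranges (`intra_op_threads`, `inter_op_threads`, `omp_num_threads`) are assumptions chosen for illustration, since the abstract does not list the tuned parameters, and `synthetic_throughput` is a hypothetical stand-in for an actual measured training or inference run.

```python
import random

# Hypothetical search space: the abstract does not name the tuned parameters.
SEARCH_SPACE = {
    "intra_op_threads": (1, 56),
    "inter_op_threads": (1, 4),
    "omp_num_threads": (1, 56),
}


def synthetic_throughput(cfg):
    """Stand-in objective (higher is better); a real tuner would time a model run."""
    return -(
        (cfg["intra_op_threads"] - 28) ** 2
        + 25 * (cfg["inter_op_threads"] - 2) ** 2
        + (cfg["omp_num_threads"] - 28) ** 2
    )


def random_config(rng):
    # Sample each parameter uniformly within its inclusive bounds.
    return {k: rng.randint(lo, hi) for k, (lo, hi) in SEARCH_SPACE.items()}


def crossover(a, b, rng):
    # Uniform crossover: each parameter comes from one parent at random.
    return {k: (a[k] if rng.random() < 0.5 else b[k]) for k in SEARCH_SPACE}


def mutate(cfg, rng, rate=0.3):
    # Perturb some parameters by a small integer step, clamped to the bounds.
    out = dict(cfg)
    for k, (lo, hi) in SEARCH_SPACE.items():
        if rng.random() < rate:
            out[k] = min(hi, max(lo, out[k] + rng.randint(-3, 3)))
    return out


def genetic_search(objective, generations=40, pop_size=20, elite=4, seed=0):
    # The optimizer only calls objective(cfg); it never needs gradients.
    rng = random.Random(seed)
    population = [random_config(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=objective, reverse=True)
        parents = population[:elite]  # elitism: the best configs survive unchanged
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - elite)
        ]
        population = parents + children
    return max(population, key=objective)


best = genetic_search(synthetic_throughput)
print(best, synthetic_throughput(best))
```

Because the search touches the system only through `objective(cfg)`, a Bayesian-optimization or Nelder-Mead routine could be swapped in behind the same interface, which is what makes a like-for-like comparison of the three algorithms possible.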

Comments:	To appear in the Proceedings of the Machine Learning on HPC Systems (MLHPCS) workshop held in conjunction with International Supercomputing Conference (ISC), July 2, 2021
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2109.06266 [cs.LG]
	(or arXiv:2109.06266v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.06266

Submission history

From: Niranjan Hasabnis
[v1] Mon, 13 Sep 2021 19:10:23 UTC (2,424 KB)
