Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark

Dubey, Shiv Ram; Singh, Satish Kumar; Chaudhuri, Bidyut Baran

Computer Science > Machine Learning

arXiv:2109.14545 (cs)

[Submitted on 29 Sep 2021 (v1), last revised 28 Jun 2022 (this version, v3)]

Title:Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark

Authors:Shiv Ram Dubey, Satish Kumar Singh, Bidyut Baran Chaudhuri

View PDF

Abstract:Neural networks have shown tremendous growth in recent years to solve numerous problems. Various types of neural networks have been introduced to deal with different types of problems. However, the main goal of any neural network is to transform the non-linearly separable input data into more linearly separable abstract features using a hierarchy of layers. These layers are combinations of linear and nonlinear functions. The most popular and common non-linearity layers are activation functions (AFs), such as Logistic Sigmoid, Tanh, ReLU, ELU, Swish and Mish. In this paper, a comprehensive overview and survey is presented for AFs in neural networks for deep learning. Different classes of AFs such as Logistic Sigmoid and Tanh based, ReLU based, ELU based, and Learning based are covered. Several characteristics of AFs such as output range, monotonicity, and smoothness are also pointed out. A performance comparison is also performed among 18 state-of-the-art AFs with different networks on different types of data. The insights of AFs are presented to benefit the researchers for doing further research and practitioners to select among different choices. The code used for experimental comparison is released at: \url{this https URL}.

Comments:	Accepted in Neurocomputing, Elsevier
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2109.14545 [cs.LG]
	(or arXiv:2109.14545v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.14545

Submission history

From: Shiv Ram Dubey [view email]
[v1] Wed, 29 Sep 2021 16:41:19 UTC (1,429 KB)
[v2] Tue, 15 Feb 2022 16:10:59 UTC (1,395 KB)
[v3] Tue, 28 Jun 2022 04:13:53 UTC (774 KB)

Computer Science > Machine Learning

Title:Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators