The edge of chaos: quantum field theory and deep neural networks

Grosvenor, Kevin T.; Jefferson, Ro

High Energy Physics - Theory

arXiv:2109.13247 (hep-th)

[Submitted on 27 Sep 2021 (v1), last revised 25 Jan 2022 (this version, v2)]

Title:The edge of chaos: quantum field theory and deep neural networks

Authors:Kevin T. Grosvenor, Ro Jefferson

View PDF

Abstract:We explicitly construct the quantum field theory corresponding to a general class of deep neural networks encompassing both recurrent and feedforward architectures. We first consider the mean-field theory (MFT) obtained as the leading saddlepoint in the action, and derive the condition for criticality via the largest Lyapunov exponent. We then compute the loop corrections to the correlation function in a perturbative expansion in the ratio of depth $T$ to width $N$, and find a precise analogy with the well-studied $O(N)$ vector model, in which the variance of the weight initializations plays the role of the 't Hooft coupling. In particular, we compute both the $\mathcal{O}(1)$ corrections quantifying fluctuations from typicality in the ensemble of networks, and the subleading $\mathcal{O}(T/N)$ corrections due to finite-width effects. These provide corrections to the correlation length that controls the depth to which information can propagate through the network, and thereby sets the scale at which such networks are trainable by gradient descent. Our analysis provides a first-principles approach to the rapidly emerging NN-QFT correspondence, and opens several interesting avenues to the study of criticality in deep neural networks.

Comments:	Matches published version. Added appendix on NN-QFT dictionary. Various minor edits & improvements
Subjects:	High Energy Physics - Theory (hep-th); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2109.13247 [hep-th]
	(or arXiv:2109.13247v2 [hep-th] for this version)
	https://doi.org/10.48550/arXiv.2109.13247

Submission history

From: Ro Jefferson [view email]
[v1] Mon, 27 Sep 2021 18:00:00 UTC (1,088 KB)
[v2] Tue, 25 Jan 2022 19:00:01 UTC (1,093 KB)

High Energy Physics - Theory

Title:The edge of chaos: quantum field theory and deep neural networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

High Energy Physics - Theory

Title:The edge of chaos: quantum field theory and deep neural networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators