Acceleration Method for Learning Fine-Layered Optical Neural Networks

Aoyama, Kazuo; Sawada, Hiroshi

Computer Science > Machine Learning

arXiv:2109.01731 (cs)

[Submitted on 1 Sep 2021]

Title:Acceleration Method for Learning Fine-Layered Optical Neural Networks

Authors:Kazuo Aoyama, Hiroshi Sawada

View PDF

Abstract:An optical neural network (ONN) is a promising system due to its high-speed and low-power operation. Its linear unit performs a multiplication of an input vector and a weight matrix in optical analog circuits. Among them, a circuit with a multiple-layered structure of programmable Mach-Zehnder interferometers (MZIs) can realize a specific class of unitary matrices with a limited number of MZIs as its weight matrix. The circuit is effective for balancing the number of programmable MZIs and ONN performance. However, it takes a lot of time to learn MZI parameters of the circuit with a conventional automatic differentiation (AD), which machine learning platforms are equipped with. To solve the time-consuming problem, we propose an acceleration method for learning MZI parameters. We create customized complex-valued derivatives for an MZI, exploiting Wirtinger derivatives and a chain rule. They are incorporated into our newly developed function module implemented in C++ to collectively calculate their values in a multi-layered structure. Our method is simple, fast, and versatile as well as compatible with the conventional AD. We demonstrate that our method works 20 times faster than the conventional AD when a pixel-by-pixel MNIST task is performed in a complex-valued recurrent neural network with an MZI-based hidden unit.

Comments:	9 pages, 9 figures
Subjects:	Machine Learning (cs.LG); Optics (physics.optics); Quantum Physics (quant-ph); Machine Learning (stat.ML)
Cite as:	arXiv:2109.01731 [cs.LG]
	(or arXiv:2109.01731v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.01731

Submission history

From: Kazuo Aoyama [view email]
[v1] Wed, 1 Sep 2021 06:46:55 UTC (886 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
physics
physics.optics
quant-ph
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hiroshi Sawada

export BibTeX citation

Computer Science > Machine Learning

Title:Acceleration Method for Learning Fine-Layered Optical Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Acceleration Method for Learning Fine-Layered Optical Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators