Towards Size-Independent Generalization Bounds for Deep Operator Nets

Gopalani, Pulkit; Karmakar, Sayar; Kumar, Dibyakanti; Mukherjee, Anirbit

Computer Science > Machine Learning

arXiv:2205.11359 (cs)

[Submitted on 23 May 2022 (v1), last revised 4 Dec 2024 (this version, v3)]

Title:Towards Size-Independent Generalization Bounds for Deep Operator Nets

Authors:Pulkit Gopalani, Sayar Karmakar, Dibyakanti Kumar, Anirbit Mukherjee

View PDF HTML (experimental)

Abstract:In recent times machine learning methods have made significant advances in becoming a useful tool for analyzing physical systems. A particularly active area in this theme has been "physics-informed machine learning" which focuses on using neural nets for numerically solving differential equations. In this work, we aim to advance the theory of measuring out-of-sample error while training DeepONets - which is among the most versatile ways to solve P.D.E systems in one-shot. Firstly, for a class of DeepONets, we prove a bound on their Rademacher complexity which does not explicitly scale with the width of the nets involved. Secondly, we use this to show how the Huber loss can be chosen so that for these DeepONet classes generalization error bounds can be obtained that have no explicit dependence on the size of the nets. The effective capacity measure for DeepONets that we thus derive is also shown to correlate with the behavior of generalization error in experiments.

Comments:	33 pages, 7 figures; Published in TMLR, December 2024
Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as:	arXiv:2205.11359 [cs.LG]
	(or arXiv:2205.11359v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.11359

Submission history

From: Pulkit Gopalani [view email]
[v1] Mon, 23 May 2022 14:45:34 UTC (76 KB)
[v2] Mon, 22 Jan 2024 18:01:37 UTC (933 KB)
[v3] Wed, 4 Dec 2024 17:37:38 UTC (3,140 KB)

Computer Science > Machine Learning

Title:Towards Size-Independent Generalization Bounds for Deep Operator Nets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Size-Independent Generalization Bounds for Deep Operator Nets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators