Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Loaiza-Ganem, Gabriel; Ross, Brendan Leigh; Cresswell, Jesse C.; Caterini, Anthony L.

Statistics > Machine Learning

arXiv:2204.07172 (stat)

[Submitted on 14 Apr 2022 (v1), last revised 28 Nov 2022 (this version, v4)]

Title:Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Authors:Gabriel Loaiza-Ganem, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini

View PDF

Abstract:Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this dimensionality mismatch. We formally prove that degenerate optima are achieved wherein the manifold itself is learned but not the distribution on it, a phenomenon we call manifold overfitting. We propose a class of two-step procedures consisting of a dimensionality reduction step followed by maximum-likelihood density estimation, and prove that they recover the data-generating distribution in the nonparametric regime, thus avoiding manifold overfitting. We also show that these procedures enable density estimation on the manifolds learned by implicit models, such as generative adversarial networks, hence addressing a major shortcoming of these models. Several recently proposed methods are instances of our two-step procedures; we thus unify, extend, and theoretically justify a large class of models.

Comments:	Accepted for publication in TMLR
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:2204.07172 [stat.ML]
	(or arXiv:2204.07172v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2204.07172

Submission history

From: Gabriel Loaiza-Ganem [view email]
[v1] Thu, 14 Apr 2022 18:00:03 UTC (10,127 KB)
[v2] Wed, 22 Jun 2022 18:01:06 UTC (10,366 KB)
[v3] Wed, 10 Aug 2022 18:00:03 UTC (10,367 KB)
[v4] Mon, 28 Nov 2022 19:00:00 UTC (18,200 KB)

Statistics > Machine Learning

Title:Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators