Towards a theory of out-of-distribution learning

Dey, Jayanta; Geisa, Ali; Mehta, Ronak; Tomita, Tyler M.; Helm, Hayden S.; Xu, Haoyin; Eaton, Eric; Dick, Jeffery; Priebe, Carey E.; Vogelstein, Joshua T.


arXiv:2109.14501 (stat)

[Submitted on 29 Sep 2021 (v1), last revised 7 Jun 2024 (this version, v5)]



Abstract: Learning is a process wherein a learning agent enhances its performance through exposure to experience or data. Throughout this journey, the agent may encounter diverse learning environments. For example, data may be presented to the learner all at once, in multiple batches, or sequentially. Furthermore, the data samples could be either independent and identically distributed (iid) or non-iid. Additionally, there may exist computational and space constraints for the deployment of the learning algorithms. The complexity of a learning task can vary significantly, depending on the learning setup and the constraints imposed upon it. However, the current literature lacks formal definitions for many of the in-distribution and out-of-distribution learning paradigms. Establishing proper and universally agreed-upon definitions for these learning setups is essential for thoroughly exploring the evolution of ideas across different learning scenarios and deriving generalized mathematical bounds for these learners. In this paper, we aim to address this issue by proposing a chronological approach to defining different learning tasks using the probably approximately correct (PAC) learning framework. We start with in-distribution learning and progress to the recently proposed lifelong or continual learning. We employ consistent terminology and notation to demonstrate how each of these learning frameworks represents a specific instance of a broader, more generalized concept of learnability. Our hope is that this work will inspire a universally agreed-upon approach to quantifying different types of learning, fostering greater understanding and progress in the field.

Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2109.14501 [stat.ML]
	(or arXiv:2109.14501v5 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2109.14501

Submission history

From: Jayanta Dey
[v1] Wed, 29 Sep 2021 15:35:16 UTC (627 KB)
[v2] Thu, 7 Oct 2021 17:46:04 UTC (628 KB)
[v3] Wed, 24 Nov 2021 18:18:39 UTC (635 KB)
[v4] Thu, 6 Jan 2022 16:46:24 UTC (635 KB)
[v5] Fri, 7 Jun 2024 17:24:36 UTC (637 KB)
