On Tilted Losses in Machine Learning: Theory and Applications

Li, Tian; Beirami, Ahmad; Sanjabi, Maziar; Smith, Virginia

Computer Science > Machine Learning

arXiv:2109.06141 (cs)

[Submitted on 13 Sep 2021 (v1), last revised 1 Jun 2023 (this version, v3)]

Title:On Tilted Losses in Machine Learning: Theory and Applications

Authors:Tian Li, Ahmad Beirami, Maziar Sanjabi, Virginia Smith

View PDF

Abstract:Exponential tilting is a technique commonly used in fields such as statistics, probability, information theory, and optimization to create parametric distribution shifts. Despite its prevalence in related fields, tilting has not seen widespread use in machine learning. In this work, we aim to bridge this gap by exploring the use of tilting in risk minimization. We study a simple extension to ERM -- tilted empirical risk minimization (TERM) -- which uses exponential tilting to flexibly tune the impact of individual losses. The resulting framework has several useful properties: We show that TERM can increase or decrease the influence of outliers, respectively, to enable fairness or robustness; has variance-reduction properties that can benefit generalization; and can be viewed as a smooth approximation to the tail probability of losses. Our work makes rigorous connections between TERM and related objectives, such as Value-at-Risk, Conditional Value-at-Risk, and distributionally robust optimization (DRO). We develop batch and stochastic first-order optimization methods for solving TERM, provide convergence guarantees for the solvers, and show that the framework can be efficiently solved relative to common alternatives. Finally, we demonstrate that TERM can be used for a multitude of applications in machine learning, such as enforcing fairness between subgroups, mitigating the effect of outliers, and handling class imbalance. Despite the straightforward modification TERM makes to traditional ERM objectives, we find that the framework can consistently outperform ERM and deliver competitive performance with state-of-the-art, problem-specific approaches.

Comments:	arXiv admin note: substantial text overlap with arXiv:2007.01162
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2109.06141 [cs.LG]
	(or arXiv:2109.06141v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.06141

Submission history

From: Tian Li [view email]
[v1] Mon, 13 Sep 2021 17:33:42 UTC (11,272 KB)
[v2] Wed, 19 Oct 2022 21:01:11 UTC (12,211 KB)
[v3] Thu, 1 Jun 2023 06:30:40 UTC (12,296 KB)

Computer Science > Machine Learning

Title:On Tilted Losses in Machine Learning: Theory and Applications

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Tilted Losses in Machine Learning: Theory and Applications

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators