Risk averse non-stationary multi-armed bandits

Benac, Leo; Godin, Frédéric

arXiv:2109.13977 (cs)

[Submitted on 28 Sep 2021]

Abstract: This paper tackles the risk-averse multi-armed bandit problem when incurred losses are non-stationary. The conditional value-at-risk (CVaR) is used as the objective function. Two estimation methods are proposed for this objective function in the presence of non-stationary losses: one relies on a weighted empirical distribution of losses, the other on the dual representation of the CVaR. Such estimates can then be embedded into classic arm selection methods such as epsilon-greedy policies. Simulation experiments assess the performance of the arm selection algorithms based on the two novel estimation approaches, and these policies are shown to outperform naive benchmarks that do not take non-stationarity into account.
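To make the first idea concrete, the following is a minimal Python sketch of a CVaR estimate computed from a weighted empirical distribution of losses and plugged into an epsilon-greedy policy. It is an illustration under stated assumptions, not the authors' algorithm: the abstract does not specify the weighting scheme, so the exponential discount `gamma`, the function names `weighted_cvar` and `epsilon_greedy_cvar`, and the `sample_loss` callback are all hypothetical. Losses are treated as "higher is worse", and the CVaR at level `alpha` is taken as the average of the worst `1 - alpha` share of the weighted probability mass.

```python
import numpy as np


def weighted_cvar(losses, weights, alpha):
    """CVaR at level alpha of the discrete law putting weights[i] on losses[i]."""
    losses = np.asarray(losses, dtype=float)
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()          # normalise to a probability vector
    order = np.argsort(losses)[::-1]           # worst (largest) losses first
    l, w = losses[order], weights[order]
    tail = 1.0 - alpha                         # probability mass of the tail
    mass_before = np.cumsum(w) - w             # mass of strictly worse atoms
    take = np.clip(tail - mass_before, 0.0, w) # mass each atom contributes to the tail
    return float(np.dot(take, l) / tail)


def epsilon_greedy_cvar(sample_loss, n_arms, horizon,
                        alpha=0.9, gamma=0.95, epsilon=0.1, seed=0):
    """Epsilon-greedy arm selection minimising a discounted CVaR estimate.

    gamma is an assumed exponential forgetting factor: an observation made
    at time s receives weight gamma ** (t - s) at decision time t.
    """
    rng = np.random.default_rng(seed)
    history = [[] for _ in range(n_arms)]      # per-arm list of (time, loss)
    choices = []
    for t in range(horizon):
        untried = [a for a in range(n_arms) if not history[a]]
        if untried:
            arm = untried[0]                   # pull every arm once first
        elif rng.random() < epsilon:
            arm = int(rng.integers(n_arms))    # explore uniformly at random
        else:
            estimates = []
            for a in range(n_arms):
                times, losses = zip(*history[a])
                weights = gamma ** (t - np.array(times))
                estimates.append(weighted_cvar(losses, weights, alpha))
            arm = int(np.argmin(estimates))    # exploit: smallest estimated CVaR
        loss = sample_loss(arm, t, rng)
        history[arm].append((t, loss))
        choices.append(arm)
    return choices
```

Setting `gamma=1.0` gives every past loss the same weight, which corresponds to the kind of naive stationary benchmark the abstract mentions; values below 1 let the estimate track drifting loss distributions at the cost of a smaller effective sample size.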

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2109.13977 [cs.LG]
	(or arXiv:2109.13977v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.13977

Submission history

From: Frédéric Godin
[v1] Tue, 28 Sep 2021 18:34:54 UTC (799 KB)
