Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Zheng, Liyuan; Fiez, Tanner; Alumbaugh, Zane; Chasnov, Benjamin; Ratliff, Lillian J.

Computer Science > Machine Learning

arXiv:2109.12286 (cs)

[Submitted on 25 Sep 2021]

Title:Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Authors:Liyuan Zheng, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov, Lillian J. Ratliff

View PDF

Abstract:The hierarchical interaction between the actor and critic in actor-critic based reinforcement learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this viewpoint and model the actor and critic interaction as a two-player general-sum game with a leader-follower structure known as a Stackelberg game. Given this abstraction, we propose a meta-framework for Stackelberg actor-critic algorithms where the leader player follows the total derivative of its objective instead of the usual individual gradient. From a theoretical standpoint, we develop a policy gradient theorem for the refined update and provide a local convergence guarantee for the Stackelberg actor-critic algorithms to a local Stackelberg equilibrium. From an empirical standpoint, we demonstrate via simple examples that the learning dynamics we study mitigate cycling and accelerate convergence compared to the usual gradient dynamics given cost structures induced by actor-critic formulations. Finally, extensive experiments on OpenAI gym environments show that Stackelberg actor-critic algorithms always perform at least as well and often significantly outperform the standard actor-critic algorithm counterparts.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2109.12286 [cs.LG]
	(or arXiv:2109.12286v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.12286

Submission history

From: Liyuan Zheng [view email]
[v1] Sat, 25 Sep 2021 06:18:41 UTC (804 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Liyuan Zheng
Tanner Fiez
Benjamin Chasnov
Lillian J. Ratliff

export BibTeX citation

Computer Science > Machine Learning

Title:Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators