Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

Peng, Baiyu; Duan, Jingliang; Chen, Jianyu; Li, Shengbo Eben; Xie, Genjin; Zhang, Congsheng; Guan, Yang; Mu, Yao; Sun, Enxin

Computer Science > Machine Learning

arXiv:2108.11623 (cs)

[Submitted on 26 Aug 2021]

Title:Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

Authors:Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun

View PDF

Abstract:Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address these shortcomings by proposing a separated proportional-integral Lagrangian (SPIL) algorithm. We first review the constrained policy optimization process from a feedback control perspective, which regards the penalty weight as the control input and the safe probability as the control output. Based on this, the penalty method is formulated as a proportional controller, and the Lagrangian method is formulated as an integral controller. We then unify them and present a proportional-integral Lagrangian method to get both their merits, with an integral separation technique to limit the integral value in a reasonable range. To accelerate training, the gradient of safe probability is computed in a model-based manner. We demonstrate our method can reduce the oscillations and conservatism of RL policy in a car-following simulation. To prove its practicality, we also apply our method to a real-world mobile robot navigation task, where our robot successfully avoids a moving obstacle with highly uncertain or even aggressive behaviors.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO); Systems and Control (eess.SY)
Cite as:	arXiv:2108.11623 [cs.LG]
	(or arXiv:2108.11623v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.11623

Submission history

From: Baiyu Peng [view email]
[v1] Thu, 26 Aug 2021 07:34:14 UTC (10,310 KB)

Computer Science > Machine Learning

Title:Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators