Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Zhou, Zhehua; Oguz, Ozgur S.; Ren, Yi; Leibold, Marion; Buss, Martin

Electrical Engineering and Systems Science > Systems and Control

arXiv:2109.05077 (eess)

[Submitted on 10 Sep 2021]

Title:Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Authors:Zhehua Zhou, Ozgur S. Oguz, Yi Ren, Marion Leibold, Martin Buss

View PDF

Abstract:Safe reinforcement learning aims to learn a control policy while ensuring that neither the system nor the environment gets damaged during the learning process. For implementing safe reinforcement learning on highly nonlinear and high-dimensional dynamical systems, one possible approach is to find a low-dimensional safe region via data-driven feature extraction methods, which provides safety estimates to the learning algorithm. As the reliability of the learned safety estimates is data-dependent, we investigate in this work how different training data will affect the safe reinforcement learning approach. By balancing between the learning performance and the risk of being unsafe, a data generation method that combines two sampling methods is proposed to generate representative training data. The performance of the method is demonstrated with a three-link inverted pendulum example.

Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2109.05077 [eess.SY]
	(or arXiv:2109.05077v1 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2109.05077

Submission history

From: Zhehua Zhou [view email]
[v1] Fri, 10 Sep 2021 19:22:43 UTC (1,634 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators