An experimental study of two predictive reinforcement learning methods and comparison with model-predictive control

Dobriborsci, Dmitrii; Osinenko, Pavel

arXiv:2108.04857 (cs)

[Submitted on 10 Aug 2021 (v1), last revised 23 Aug 2021 (this version, v2)]

Abstract: Reinforcement learning (RL) has been successfully used in various simulations and computer games. Industry-related applications, such as autonomous mobile robot motion control, nevertheless remain challenging for RL to date. This paper presents an experimental evaluation of predictive RL controllers for optimal mobile robot motion control, with model-predictive control (MPC) as the baseline for comparison. Two RL methods are tested: roll-out Q-learning, which may be regarded as MPC with the terminal cost being a Q-function approximation, and so-called stacked Q-learning, which in turn resembles MPC with the running cost replaced by a Q-function approximation. The experimental platform is a mobile robot with a differential drive (Robotis Turtlebot3). The experimental results showed that both RL methods outperformed the baseline in terms of accumulated cost, with the stacked variant performing best. Given the series of previous works on stacked Q-learning, this study supports the idea that MPC with a running-cost adaptation inspired by Q-learning has the potential to boost performance while retaining the desirable properties of MPC.
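
The three controllers compared in the abstract differ only in the objective that is minimized over a predicted action sequence. The following is a minimal sketch of those three objectives, not the authors' implementation. The names `step`, `running_cost`, and `q_hat`, the scalar state and action types, and the exact horizon indexing are assumptions made for illustration; in practice `q_hat` would be a learned critic, and an optimizer would search over `actions`.

```python
from typing import Callable, List, Sequence

# Hypothetical scalar types; a real robot would use state and action vectors.
Model = Callable[[float, float], float]   # next_state = step(state, action)
Cost = Callable[[float, float], float]    # cost or Q-value of (state, action)


def rollout(step: Model, x0: float, actions: Sequence[float]) -> List[float]:
    """Predict the states x_0 .. x_{N-1} at which the N actions are applied."""
    states = [x0]
    for u in actions[:-1]:
        states.append(step(states[-1], u))
    return states


def mpc_objective(step: Model, running_cost: Cost,
                  x0: float, actions: Sequence[float]) -> float:
    """MPC baseline: sum of running costs over the whole horizon."""
    states = rollout(step, x0, actions)
    return sum(running_cost(x, u) for x, u in zip(states, actions))


def rollout_q_objective(step: Model, running_cost: Cost, q_hat: Cost,
                        x0: float, actions: Sequence[float]) -> float:
    """Roll-out Q-learning: running costs, then a Q-approximation as terminal cost."""
    states = rollout(step, x0, actions)
    head = sum(running_cost(x, u) for x, u in zip(states[:-1], actions[:-1]))
    return head + q_hat(states[-1], actions[-1])


def stacked_q_objective(step: Model, q_hat: Cost,
                        x0: float, actions: Sequence[float]) -> float:
    """Stacked Q-learning: the Q-approximation replaces every running cost."""
    states = rollout(step, x0, actions)
    return sum(q_hat(x, u) for x, u in zip(states, actions))
```

With a toy integrator model `step = lambda x, u: x + u` and a quadratic cost, the three functions can be evaluated on the same action sequence to see how the choice of objective changes the value being minimized.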

Subjects: Robotics (cs.RO); Dynamical Systems (math.DS)
Cite as: arXiv:2108.04857 [cs.RO] (or arXiv:2108.04857v2 [cs.RO] for this version)
DOI: https://doi.org/10.48550/arXiv.2108.04857

Submission history

From: Pavel Osinenko
[v1] Tue, 10 Aug 2021 18:17:35 UTC (1,556 KB)
[v2] Mon, 23 Aug 2021 19:36:44 UTC (1,556 KB)
