Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Saglam, Baturay; Duran, Enes; Cicek, Dogan C.; Mutlu, Furkan B.; Kozat, Suleyman S.

doi:10.1109/ICTAI52525.2021.00027

Computer Science > Machine Learning

arXiv:2109.10736 (cs)

[Submitted on 22 Sep 2021 (v1), last revised 23 Sep 2021 (this version, v2)]

Title:Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Authors:Baturay Saglam, Enes Duran, Dogan C. Cicek, Furkan B. Mutlu, Suleyman S. Kozat

View PDF

Abstract:In value-based deep reinforcement learning methods, approximation of value functions induces overestimation bias and leads to suboptimal policies. We show that in deep actor-critic methods that aim to overcome the overestimation bias, if the reinforcement signals received by the agent have a high variance, a significant underestimation bias arises. To minimize the underestimation, we introduce a parameter-free, novel deep Q-learning variant. Our Q-value update rule combines the notions behind Clipped Double Q-learning and Maxmin Q-learning by computing the critic objective through the nested combination of maximum and minimum operators to bound the approximate value estimates. We evaluate our modification on the suite of several OpenAI Gym continuous control tasks, improving the state-of-the-art in every environment tested.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2109.10736 [cs.LG]
	(or arXiv:2109.10736v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.10736
Related DOI:	https://doi.org/10.1109/ICTAI52525.2021.00027

Submission history

From: Baturay Sağlam [view email]
[v1] Wed, 22 Sep 2021 13:49:35 UTC (3,681 KB)
[v2] Thu, 23 Sep 2021 16:05:23 UTC (3,682 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

export BibTeX citation

Computer Science > Machine Learning

Title:Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators