Error Controlled Actor-Critic

Gao, Xingen; Chao, Fei; Zhou, Changle; Ge, Zhen; Lin, Chih-Min; Yang, Longzhi; Chang, Xiang; Shang, Changjing

arXiv:2109.02517 (cs)

[Submitted on 6 Sep 2021 (v1), last revised 7 Sep 2021 (this version, v2)]

Abstract: The approximation error of the value function inevitably causes an overestimation phenomenon and has a negative impact on the convergence of reinforcement learning algorithms. To mitigate the negative effects of the approximation error, we propose Error Controlled Actor-Critic, which confines the approximation error in the value function. We present an analysis of how the approximation error can hinder the optimization process of actor-critic methods. We then derive an upper bound on the approximation error of the Q-function approximator and find that the error can be lowered by restricting the KL-divergence between every two consecutive policies when training the policy. The results of experiments on a range of continuous control tasks demonstrate that the proposed actor-critic algorithm clearly reduces the approximation error and significantly outperforms other model-free RL algorithms.
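To make the idea of restricting the KL-divergence between two consecutive policies concrete, the following is a minimal sketch and not the authors' implementation. It assumes diagonal Gaussian policies, the direction KL(old || new), and a simple penalty form of the actor loss; the names `gaussian_kl`, `kl_penalized_actor_loss`, and `kl_coef` are hypothetical and do not come from the paper.

```python
import math
from typing import Sequence


def gaussian_kl(
    mu_old: Sequence[float],
    std_old: Sequence[float],
    mu_new: Sequence[float],
    std_new: Sequence[float],
) -> float:
    """Closed-form KL(old || new) between two diagonal Gaussian policies.

    Assumption: the policy outputs a mean and a standard deviation per
    action dimension; the abstract does not specify the policy class.
    """
    kl = 0.0
    for m0, s0, m1, s1 in zip(mu_old, std_old, mu_new, std_new):
        kl += math.log(s1 / s0) + (s0 ** 2 + (m0 - m1) ** 2) / (2.0 * s1 ** 2) - 0.5
    return kl


def kl_penalized_actor_loss(q_value: float, kl: float, kl_coef: float = 1.0) -> float:
    """Hypothetical actor loss: maximize Q while penalizing policy drift.

    Assumption: the KL restriction is applied as a weighted penalty term;
    the paper may use a different mechanism (for example, a hard constraint).
    """
    return -q_value + kl_coef * kl


# Example: the new policy shifts its mean by 1.0 in a single action dimension.
kl = gaussian_kl([0.0], [1.0], [1.0], [1.0])
loss = kl_penalized_actor_loss(q_value=2.0, kl=kl, kl_coef=0.5)
```

A larger `kl_coef` keeps consecutive policies closer together, which, per the abstract's analysis, is what lowers the bound on the Q-function approximation error; how that coefficient is chosen or adapted is not stated in this section.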

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2109.02517 [cs.LG]
	(or arXiv:2109.02517v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.02517

Submission history

From: Fei Chao Dr
[v1] Mon, 6 Sep 2021 14:51:20 UTC (3,991 KB)
[v2] Tue, 7 Sep 2021 03:08:50 UTC (3,989 KB)
