Temporal Shift Reinforcement Learning

Thomas, Deepak George; Wongpiromsarn, Tichakorn; Jannesari, Ali

Computer Science > Machine Learning

arXiv:2109.02145 (cs)

[Submitted on 5 Sep 2021 (v1), last revised 27 Oct 2021 (this version, v3)]

Title:Temporal Shift Reinforcement Learning

Authors:Deepak George Thomas, Tichakorn Wongpiromsarn, Ali Jannesari

View PDF

Abstract:The function approximators employed by traditional image-based Deep Reinforcement Learning (DRL) algorithms usually lack a temporal learning component and instead focus on learning the spatial component. We propose a technique, Temporal Shift Reinforcement Learning (TSRL), wherein both temporal, as well as spatial components are jointly learned. Moreover, TSRL does not require additional parameters to perform temporal learning. We show that TSRL outperforms the commonly used frame stacking heuristic on both of the Atari environments we test on while beating the SOTA for one of them. This investigation has implications in the robotics as well as sequential decision-making domains.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2109.02145 [cs.LG]
	(or arXiv:2109.02145v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.02145

Submission history

From: Deepak-George Thomas [view email]
[v1] Sun, 5 Sep 2021 18:47:13 UTC (955 KB)
[v2] Tue, 5 Oct 2021 13:56:04 UTC (599 KB)
[v3] Wed, 27 Oct 2021 01:24:52 UTC (962 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Deepak-George Thomas

export BibTeX citation

Computer Science > Machine Learning

Title:Temporal Shift Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Temporal Shift Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators