Deep Exploration for Recommendation Systems

Zhu, Zheqing; Van Roy, Benjamin

Computer Science > Information Retrieval

arXiv:2109.12509 (cs)

[Submitted on 26 Sep 2021 (v1), last revised 30 Jul 2023 (this version, v4)]

Title:Deep Exploration for Recommendation Systems

Authors:Zheqing Zhu, Benjamin Van Roy

View PDF

Abstract:Modern recommendation systems ought to benefit by probing for and learning from delayed feedback. Research has tended to focus on learning from a user's response to a single recommendation. Such work, which leverages methods of supervised and bandit learning, forgoes learning from the user's subsequent behavior. Where past work has aimed to learn from subsequent behavior, there has been a lack of effective methods for probing to elicit informative delayed feedback. Effective exploration through probing for delayed feedback becomes particularly challenging when rewards are sparse. To address this, we develop deep exploration methods for recommendation systems. In particular, we formulate recommendation as a sequential decision problem and demonstrate benefits of deep exploration over single-step exploration. Our experiments are carried out with high-fidelity industrial-grade simulators and establish large improvements over existing algorithms.

Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2109.12509 [cs.IR]
	(or arXiv:2109.12509v4 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2109.12509

Submission history

From: Zheqing Zhu [view email]
[v1] Sun, 26 Sep 2021 06:54:26 UTC (976 KB)
[v2] Wed, 24 May 2023 01:14:52 UTC (2,245 KB)
[v3] Wed, 19 Jul 2023 21:28:52 UTC (2,245 KB)
[v4] Sun, 30 Jul 2023 08:39:53 UTC (2,246 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IR

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zheqing Zhu
Benjamin Van Roy

export BibTeX citation

Computer Science > Information Retrieval

Title:Deep Exploration for Recommendation Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Deep Exploration for Recommendation Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators