On Assessing the Usefulness of Proxy Domains for Developing and Evaluating Embodied Agents

Courchesne, Anthony; Censi, Andrea; Paull, Liam

Computer Science > Machine Learning

arXiv:2109.14516 (cs)

[Submitted on 29 Sep 2021 (v1), last revised 7 Oct 2021 (this version, v2)]

Authors: Anthony Courchesne (1 and 2), Andrea Censi (3), Liam Paull (1 and 2) ((1) Mila, (2) Université de Montréal, (3) ETH Zürich)

Abstract: In many situations it is either impossible or impractical to develop and evaluate agents entirely on the target domain on which they will be deployed. This is particularly true in robotics, where running experiments on hardware is much more arduous than running them in simulation, and arguably even more so for learning-based agents. Considerable recent effort has therefore been devoted to developing increasingly realistic, higher-fidelity simulators. However, we lack a principled way to evaluate how good a "proxy domain" is, specifically in terms of how useful it is in helping us achieve our end objective of building an agent that performs well in the target domain. In this work, we investigate methods to address this need. We begin by clearly separating two uses of proxy domains that are often conflated: 1) their ability to be a faithful predictor of agent performance and 2) their ability to be a useful tool for learning. We then establish new proxy usefulness (PU) metrics for comparing different proxy domains: the relative predictive PU, which assesses the predictive ability of a proxy domain, and the learning PU, which quantifies the usefulness of a proxy as a tool for generating learning data. Furthermore, we argue that the value of a proxy is conditioned on the task that it is being used to help solve. We demonstrate how these new metrics can be used to optimize parameters of the proxy domain for which obtaining ground truth via system identification is not trivial.

Comments:	8 pages, 6 figures. Accepted and presented at IROS 2021. For associated code, see this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO); Systems and Control (eess.SY)
MSC classes:	68T40 ("Primary"), 68T07 ("Secondary")
ACM classes:	I.2.9; I.6.4
Cite as:	arXiv:2109.14516 [cs.LG]
	(or arXiv:2109.14516v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.14516

Submission history

From: Anthony Courchesne
[v1] Wed, 29 Sep 2021 16:04:39 UTC (5,518 KB)
[v2] Thu, 7 Oct 2021 14:32:44 UTC (5,518 KB)
