Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function

Dogan, Ilgin; Shen, Zuo-Jun Max; Aswani, Anil

Mathematics > Optimization and Control

arXiv:2108.02307 (math)

[Submitted on 4 Aug 2021 (v1), last revised 27 Jan 2023 (this version, v2)]

Title:Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function

Authors:Ilgin Dogan, Zuo-Jun Max Shen, Anil Aswani

View PDF

Abstract:The exploration/exploitation trade-off is an inherent challenge in data-driven adaptive control. Though this trade-off has been studied for multi-armed bandits (MAB's) and reinforcement learning for linear systems; it is less well-studied for learning-based control of nonlinear systems. A significant theoretical challenge in the nonlinear setting is that there is no explicit characterization of an optimal controller for a given set of cost and system parameters. We propose the use of a finite-horizon oracle controller with full knowledge of parameters as a reasonable surrogate to optimal controller. This allows us to develop policies in the context of learning-based MPC and MAB's and conduct a control-theoretic analysis using techniques from MPC- and optimization-theory to show these policies achieve low regret with respect to this finite-horizon oracle. Our simulations exhibit the low regret of our policy on a heating, ventilation, and air-conditioning model with partially-unknown cost function.

Comments:	16 pages, 2 figures
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:2108.02307 [math.OC]
	(or arXiv:2108.02307v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2108.02307

Submission history

From: Ilgin Dogan [view email]
[v1] Wed, 4 Aug 2021 22:43:51 UTC (1,738 KB)
[v2] Fri, 27 Jan 2023 14:54:57 UTC (1,158 KB)

Mathematics > Optimization and Control

Title:Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators