FORESEE: Prediction with Expansion-Compression Unscented Transform for Online Policy Optimization

Parwana, Hardik; Panagou, Dimitra

Computer Science > Robotics

arXiv:2209.12644 (cs)

[Submitted on 26 Sep 2022 (v1), last revised 1 Feb 2024 (this version, v2)]

Title:FORESEE: Prediction with Expansion-Compression Unscented Transform for Online Policy Optimization

Authors:Hardik Parwana, Dimitra Panagou

View PDF HTML (experimental)

Abstract:Propagating state distributions through a generic, uncertain nonlinear dynamical model is known to be intractable and usually begets numerical or analytical approximations. We introduce a method for state prediction, called the Expansion-Compression Unscented Transform, and use it to solve a class of online policy optimization problems. Our proposed algorithm propagates a finite number of sigma points through a state-dependent distribution, which dictates an increase in the number of sigma points at each time step to represent the resulting distribution; this is what we call the expansion operation. To keep the algorithm scalable, we augment the expansion operation with a compression operation based on moment matching, thereby keeping the number of sigma points constant across predictions over multiple time steps. Its performance is empirically shown to be comparable to Monte Carlo but at a much lower computational cost. Under state and control input constraints, the state prediction is subsequently used in tandem with a proposed variant of constrained gradient-descent for online update of policy parameters in a receding horizon fashion. The framework is implemented as a differentiable computational graph for policy training. We showcase our framework for a quadrotor stabilization task as part of a benchmark comparison in safe-control-gym and for optimizing the parameters of a Control Barrier Function based controller in a leader-follower problem.

Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
Cite as:	arXiv:2209.12644 [cs.RO]
	(or arXiv:2209.12644v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2209.12644

Submission history

From: Hardik Parwana [view email]
[v1] Mon, 26 Sep 2022 12:47:08 UTC (1,367 KB)
[v2] Thu, 1 Feb 2024 02:14:20 UTC (4,571 KB)

Computer Science > Robotics

Title:FORESEE: Prediction with Expansion-Compression Unscented Transform for Online Policy Optimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:FORESEE: Prediction with Expansion-Compression Unscented Transform for Online Policy Optimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators