Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

Hashmi, Hassaan; Kalogerias, Dionysios S.

Electrical Engineering and Systems Science > Systems and Control

arXiv:2108.10352 (eess)

[Submitted on 23 Aug 2021 (v1), last revised 26 Sep 2021 (this version, v2)]

Title:Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

Authors:Hassaan Hashmi, Dionysios S. Kalogerias

View PDF

Abstract:Wireless systems resource allocation refers to perpetual and challenging nonconvex constrained optimization tasks, which are especially timely in modern communications and networking setups involving multiple users with heterogeneous objectives and imprecise or even unknown models and/or channel statistics. In this paper, we propose a technically grounded and scalable primal-dual deterministic policy gradient method for efficiently learning optimal parameterized resource allocation policies. Our method not only efficiently exploits gradient availability of popular universal policy representations, such as deep neural networks, but is also truly model-free, as it relies on consistent zeroth-order gradient approximations of the associated random network services constructed via low-dimensional perturbations in action space, thus fully bypassing any dependence on critics. Both theory and numerical simulations confirm the efficacy and applicability of the proposed approach, as well as its superiority over the current state of the art in terms of both achieving near-optimal performance and scalability.

Comments:	6 pages, 4 figures
Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG)
Cite as:	arXiv:2108.10352 [eess.SY]
	(or arXiv:2108.10352v2 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2108.10352

Submission history

From: Hassaan Hashmi [view email]
[v1] Mon, 23 Aug 2021 18:26:16 UTC (1,896 KB)
[v2] Sun, 26 Sep 2021 21:52:33 UTC (1,897 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators