Minimax Optimal Quantization of Linear Models: Information-Theoretic Limits and Efficient Algorithms

Saha, Rajarshi; Pilanci, Mert; Goldsmith, Andrea J.

arXiv:2202.11277 (cs)

[Submitted on 23 Feb 2022 (v1), last revised 30 Aug 2022 (this version, v2)]

Abstract: High-dimensional models often have a large memory footprint and must be quantized after training before being deployed on resource-constrained edge devices for inference tasks. In this work, we develop an information-theoretic framework for the problem of quantizing a linear regressor learned from training data $(\mathbf{X}, \mathbf{y})$, for some underlying statistical relationship $\mathbf{y} = \mathbf{X}\boldsymbol{\theta} + \mathbf{v}$. The learned model, which is an estimate of the latent parameter $\boldsymbol{\theta} \in \mathbb{R}^d$, is constrained to be representable using only $Bd$ bits, where $B \in (0, \infty)$ is a pre-specified budget and $d$ is the dimension. We derive an information-theoretic lower bound for the minimax risk under this setting and establish a matching upper bound, tight up to constant factors, using randomized embedding-based algorithms. The lower and upper bounds together characterize the minimum threshold bit-budget required to achieve a performance risk comparable to the unquantized setting. We also propose randomized Hadamard embeddings that are computationally efficient and are optimal to within a mild logarithmic factor of the lower bound. Our model quantization strategy generalizes, and we show its efficacy by extending the method and upper bounds to two-layer ReLU neural networks for non-linear regression. Numerical simulations show the improved performance of our proposed scheme as well as its closeness to the lower bound.
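
As a rough illustration of the randomized Hadamard idea mentioned in the abstract, the sketch below rotates a parameter vector with random sign flips followed by an orthonormal Walsh-Hadamard transform, quantizes each rotated coordinate uniformly, and then inverts the rotation. This is not the authors' algorithm: the function names (`fwht`, `quantize_uniform`, `hadamard_quantize`), the uniform scalar quantizer, the integer bit-width, the power-of-two dimension, and the treatment of the scale `r` as free side information are all assumptions made for illustration.

```python
import math
import random


def fwht(x):
    """Orthonormal fast Walsh-Hadamard transform (length must be a power of two)."""
    x = list(x)
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a, b = x[j], x[j + h]
                x[j], x[j + h] = a + b, a - b
        h *= 2
    scale = 1.0 / math.sqrt(n)
    return [v * scale for v in x]


def quantize_uniform(x, bits):
    """Round each entry to the nearest of 2**bits evenly spaced levels in [-r, r].

    Assumption: r = max|x_i| is stored separately as side information.
    """
    r = max(abs(v) for v in x)
    if r == 0.0:
        return list(x)
    levels = 2 ** bits
    step = 2.0 * r / (levels - 1)
    return [-r + step * round((v + r) / step) for v in x]


def hadamard_quantize(theta, bits, seed=0):
    """Hypothetical helper: rotate, quantize with `bits` bits per coordinate, rotate back."""
    rng = random.Random(seed)
    signs = [rng.choice((-1.0, 1.0)) for _ in theta]
    rotated = fwht([s * t for s, t in zip(signs, theta)])
    quantized = quantize_uniform(rotated, bits)
    # The orthonormal Hadamard matrix is its own inverse, and so is the sign matrix.
    back = fwht(quantized)
    return [s * v for s, v in zip(signs, back)]


theta = [1.0, -2.0, 0.5, 3.0, 0.0, -1.5, 2.5, 0.25]
theta_hat = hadamard_quantize(theta, bits=8)
error = math.sqrt(sum((a - b) ** 2 for a, b in zip(theta, theta_hat)))
print("l2 reconstruction error:", error)
```

Because the rotation is orthonormal, the reconstruction error in the original coordinates equals the quantization error in the rotated coordinates. The rotation tends to spread the energy of the vector evenly across coordinates, which is why a simple per-coordinate quantizer can work reasonably well after it.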

Comments:	50 pages, 31 figures, 9 tables
Subjects:	Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP); Machine Learning (stat.ML)
Cite as:	arXiv:2202.11277 [cs.IT]
	(or arXiv:2202.11277v2 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2202.11277

Submission history

From: Rajarshi Saha
[v1] Wed, 23 Feb 2022 02:39:04 UTC (499 KB)
[v2] Tue, 30 Aug 2022 19:53:38 UTC (3,976 KB)
