Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

Bertsimas, Dimitris; Cory-Wright, Ryan; Johnson, Nicholas A. G.

Statistics > Machine Learning

arXiv:2109.12701 (stat)

[Submitted on 26 Sep 2021 (v1), last revised 2 Oct 2023 (this version, v3)]

Title:Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

Authors:Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson

View PDF

Abstract:We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth. SLR is a fundamental problem in Operations Research and Machine Learning which arises in various applications, including data compression, latent semantic indexing, collaborative filtering, and medical imaging. We introduce a novel formulation for SLR that directly models its underlying discreteness. For this formulation, we develop an alternating minimization heuristic that computes high-quality solutions and a novel semidefinite relaxation that provides meaningful bounds for the solutions returned by our heuristic. We also develop a custom branch-and-bound algorithm that leverages our heuristic and convex relaxations to solve small instances of SLR to certifiable (near) optimality. Given an input $n$-by-$n$ matrix, our heuristic scales to solve instances where $n=10000$ in minutes, our relaxation scales to instances where $n=200$ in hours, and our branch-and-bound algorithm scales to instances where $n=25$ in minutes. Our numerical results demonstrate that our approach outperforms existing state-of-the-art approaches in terms of rank, sparsity, and mean-square error while maintaining a comparable runtime.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2109.12701 [stat.ML]
	(or arXiv:2109.12701v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2109.12701
Journal reference:	Journal of Machine Learning Research, 24(267), 1-51 (2023)

Submission history

From: Nicholas Johnson [view email]
[v1] Sun, 26 Sep 2021 20:49:16 UTC (1,090 KB)
[v2] Wed, 19 Apr 2023 05:57:25 UTC (227 KB)
[v3] Mon, 2 Oct 2023 01:38:45 UTC (1,228 KB)

Statistics > Machine Learning

Title:Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators