Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Nguyen, Tri; Fu, Xiao; Wu, Ruiyuan

doi:10.1109/TSP.2022.3177845

Electrical Engineering and Systems Science > Signal Processing

arXiv:2109.11135 (eess)

[Submitted on 23 Sep 2021 (v1), last revised 9 May 2022 (this version, v2)]

Title:Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Authors:Tri Nguyen, Xiao Fu, Ruiyuan Wu

View PDF

Abstract:Nonnegative matrix factorization (NMF) often relies on the separability condition for tractable algorithm design. Separability-based NMF is mainly handled by two types of approaches, namely, greedy pursuit and convex programming. A notable convex NMF formulation is the so-called self-dictionary multiple measurement vectors (SD-MMV), which can work without knowing the matrix rank a priori, and is arguably more resilient to error propagation relative to greedy pursuit. However, convex SD-MMV renders a large memory cost that scales quadratically with the problem size. This memory challenge has been around for a decade, and a major obstacle for applying convex SD-MMV to big data analytics. This work proposes a memory-efficient algorithm for convex SD-MMV. Our algorithm capitalizes on the special update rules of a classic algorithm from the 1950s, namely, the Frank-Wolfe (FW) algorithm. It is shown that, under reasonable conditions, the FW algorithm solves the noisy SD-MMV problem with a memory cost that grows linearly with the amount of data. To handle noisier scenarios, a smoothed group sparsity regularizer is proposed to improve robustness while maintaining the low memory footprint with guarantees. The proposed approach presents the first linear memory complexity algorithmic framework for convex SD-MMV based NMF. The method is tested over a couple of unsupervised learning tasks, i.e., text mining and community detection, to showcase its effectiveness and memory efficiency.

Subjects:	Signal Processing (eess.SP); Machine Learning (cs.LG)
Cite as:	arXiv:2109.11135 [eess.SP]
	(or arXiv:2109.11135v2 [eess.SP] for this version)
	https://doi.org/10.48550/arXiv.2109.11135
Related DOI:	https://doi.org/10.1109/TSP.2022.3177845

Submission history

From: Tri Nguyen [view email]
[v1] Thu, 23 Sep 2021 04:25:33 UTC (329 KB)
[v2] Mon, 9 May 2022 20:43:08 UTC (636 KB)

Electrical Engineering and Systems Science > Signal Processing

Title:Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Signal Processing

Title:Memory-Efficient Convex Optimization for Self-Dictionary Separable Nonnegative Matrix Factorization: A Frank-Wolfe Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators