Analysis of spectral clustering algorithms for community detection: the general bipartite setting

Zhou, Zhixin; Amini, Arash A.

arXiv:1803.04547 (math)

[Submitted on 12 Mar 2018 (v1), last revised 22 Dec 2018 (this version, v2)]

Abstract: We consider spectral clustering algorithms for community detection under a general bipartite stochastic block model (SBM). A modern spectral clustering algorithm consists of three steps: (1) regularization of an appropriate adjacency or Laplacian matrix, (2) a form of spectral truncation, and (3) a k-means type algorithm in the reduced spectral domain. We focus on adjacency-based spectral clustering and, for the first step, propose a new data-driven regularization that can restore the concentration of the adjacency matrix even for sparse networks. This result is based on recent work on regularization of random binary matrices, but avoids using unknown population-level parameters and instead estimates the necessary quantities from the data. We also propose and study a novel variation of the spectral truncation step and show how this variation changes the nature of the misclassification rate in a general SBM. We then show how the consistency results can be extended to models beyond SBMs, such as inhomogeneous random graph models with approximate clusters, including a graphon clustering problem, as well as general sub-Gaussian biclustering. A theme of the paper is providing a better understanding of the analysis of spectral methods for community detection and establishing consistency results under fairly general clustering models and for a wide regime of degree growths, including sparse cases where the average expected degree grows arbitrarily slowly.
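
The three-step pipeline named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' algorithm: the abstract does not specify their data-driven regularization or their truncation variant, so the sketch assumes a simple stand-in for each (zeroing rows and columns whose degree exceeds `tau` times the average, a plain rank-k SVD, and basic Lloyd iterations). All function names, the threshold `tau`, and the simulation helper are hypothetical.

```python
from itertools import permutations

import numpy as np


def regularize(A, tau=3.0):
    """Step 1 (stand-in): zero out rows/columns of unusually high degree.

    Assumption: a fixed multiple `tau` of the average degree is used as the
    cutoff; the paper's data-driven rule is not reproduced here.
    """
    A = A.astype(float).copy()
    row_deg = A.sum(axis=1)
    col_deg = A.sum(axis=0)
    A[row_deg > tau * row_deg.mean(), :] = 0.0
    A[:, col_deg > tau * col_deg.mean()] = 0.0
    return A


def spectral_truncation(A, k):
    """Step 2 (stand-in): embed rows using the top-k singular pairs."""
    U, s, _ = np.linalg.svd(A, full_matrices=False)
    return U[:, :k] * s[:k]


def kmeans(X, k, n_iter=50, seed=0):
    """Step 3: Lloyd iterations with farthest-point initialization."""
    rng = np.random.default_rng(seed)
    centers = [X[rng.integers(len(X))]]
    for _ in range(1, k):
        # Pick the point farthest from all centers chosen so far.
        d = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(n_iter):
        dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels


def sample_bipartite_sbm(z_row, z_col, B, seed=0):
    """Draw a binary biadjacency matrix with P[i, j] = B[z_row[i], z_col[j]]."""
    rng = np.random.default_rng(seed)
    P = B[np.ix_(z_row, z_col)]
    return (rng.random(P.shape) < P).astype(float)


def misclassification_rate(labels, truth, k):
    """Fraction of mislabeled nodes, minimized over label permutations."""
    return min(
        np.mean(np.array(perm)[labels] != truth)
        for perm in permutations(range(k))
    )
```

For example, with row labels `z_row`, column labels `z_col`, and a connectivity matrix `B`, calling `kmeans(spectral_truncation(regularize(sample_bipartite_sbm(z_row, z_col, B)), k), k)` returns estimated row communities. In a dense, well-separated setting the regularization step is typically a no-op; it is meant to matter in the sparse regime the abstract emphasizes.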

Subjects:	Statistics Theory (math.ST); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
Cite as:	arXiv:1803.04547 [math.ST]
	(or arXiv:1803.04547v2 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1803.04547

Submission history

From: Zhixin Zhou
[v1] Mon, 12 Mar 2018 21:50:58 UTC (60 KB)
[v2] Sat, 22 Dec 2018 23:24:58 UTC (95 KB)
