Compressive Visual Representations

Lee, Kuang-Huei; Arnab, Anurag; Guadarrama, Sergio; Canny, John; Fischer, Ian

Computer Science > Machine Learning

arXiv:2109.12909 (cs)

[Submitted on 27 Sep 2021 (v1), last revised 4 Dec 2021 (this version, v3)]

Title:Compressive Visual Representations

Authors:Kuang-Huei Lee, Anurag Arnab, Sergio Guadarrama, John Canny, Ian Fischer

View PDF

Abstract:Learning effective visual representations that generalize well without human supervision is a fundamental problem in order to apply Machine Learning to a wide variety of tasks. Recently, two families of self-supervised methods, contrastive learning and latent bootstrapping, exemplified by SimCLR and BYOL respectively, have made significant progress. In this work, we hypothesize that adding explicit information compression to these algorithms yields better and more robust representations. We verify this by developing SimCLR and BYOL formulations compatible with the Conditional Entropy Bottleneck (CEB) objective, allowing us to both measure and control the amount of compression in the learned representation, and observe their impact on downstream tasks. Furthermore, we explore the relationship between Lipschitz continuity and compression, showing a tractable lower bound on the Lipschitz constant of the encoders we learn. As Lipschitz continuity is closely related to robustness, this provides a new explanation for why compressed models are more robust. Our experiments confirm that adding compression to SimCLR and BYOL significantly improves linear evaluation accuracies and model robustness across a wide range of domain shifts. In particular, the compressed version of BYOL achieves 76.0% Top-1 linear evaluation accuracy on ImageNet with ResNet-50, and 78.8% with ResNet-50 2x.

Comments:	NeurIPS 2021. 27 pages, 4 figures. Code and pretrained models at this https URL
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
Cite as:	arXiv:2109.12909 [cs.LG]
	(or arXiv:2109.12909v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.12909

Submission history

From: Kuang-Huei Lee [view email]
[v1] Mon, 27 Sep 2021 09:53:43 UTC (1,682 KB)
[v2] Wed, 29 Sep 2021 07:12:12 UTC (1,682 KB)
[v3] Sat, 4 Dec 2021 12:22:08 UTC (1,663 KB)

Computer Science > Machine Learning

Title:Compressive Visual Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Compressive Visual Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators