Scale-invariant representation of machine learning

Lee, Sungyeop; Jo, Junghyo

doi:10.1103/PhysRevE.105.044306

Computer Science > Machine Learning

arXiv:2109.02914 (cs)

[Submitted on 7 Sep 2021 (v1), last revised 23 Mar 2022 (this version, v2)]

Title:Scale-invariant representation of machine learning

Authors:Sungyeop Lee, Junghyo Jo

View PDF

Abstract:The success of machine learning has resulted from its structured representation of data. Similar data have close internal representations as compressed codes for classification or emerged labels for clustering. We observe that the frequency of internal codes or labels follows power laws in both supervised and unsupervised learning models. This scale-invariant distribution implies that machine learning largely compresses frequent typical data, and simultaneously, differentiates many atypical data as outliers. In this study, we derive the process by which these power laws can naturally arise in machine learning. In terms of information theory, the scale-invariant representation corresponds to a maximally uncertain data grouping among possible representations that guarantee a given learning accuracy.

Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Data Analysis, Statistics and Probability (physics.data-an)
Cite as:	arXiv:2109.02914 [cs.LG]
	(or arXiv:2109.02914v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.02914
Related DOI:	https://doi.org/10.1103/PhysRevE.105.044306

Submission history

From: Junghyo Jo [view email]
[v1] Tue, 7 Sep 2021 07:56:15 UTC (3,201 KB)
[v2] Wed, 23 Mar 2022 08:11:08 UTC (5,330 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.IT
math
math.IT
physics
physics.data-an

References & Citations

DBLP - CS Bibliography

listing | bibtex

Junghyo Jo

export BibTeX citation

Computer Science > Machine Learning

Title:Scale-invariant representation of machine learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Scale-invariant representation of machine learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators