Quantitative Biology > Quantitative Methods
[Submitted on 4 Mar 2025]
Title: Measurement noise scaling laws for cellular representation learning
Abstract: Deep learning scaling laws predict how performance improves with increased model and dataset size. Here we identify measurement noise in data as another performance scaling axis, governed by a distinct logarithmic law. We focus on representation learning models of biological single cell genomic data, where a dominant source of measurement noise is due to molecular undersampling. We introduce an information-theoretic metric for cellular representation model quality, and find that it scales with sampling depth. A single quantitative relationship holds across several model types and across several datasets. We show that the analytical form of this relationship can be derived from a simple Gaussian noise model, which in turn provides an intuitive interpretation for the scaling law. Finally, we show that the same relationship emerges in image classification models with respect to two types of imaging noise, suggesting that measurement noise scaling may be a general phenomenon. Scaling with noise can serve as a guide in generating and curating data for deep learning models, particularly in fields where measurement quality can vary dramatically between datasets.
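The intuition behind a logarithmic noise scaling law can be sketched with a toy Gaussian channel. This is an illustrative assumption, not the paper's actual derivation or metric: if molecular undersampling at depth d produces shot-noise-like variance proportional to 1/d, then the mutual information between signal and observation grows logarithmically with d.

```python
import numpy as np

def gaussian_channel_info(signal_var, noise_var):
    """Mutual information (bits) of a scalar Gaussian channel:
    I = 0.5 * log2(1 + signal_var / noise_var)."""
    return 0.5 * np.log2(1.0 + signal_var / noise_var)

# Hypothetical sampling depths; assume noise variance ~ 1/depth
# (a stand-in for molecular undersampling noise).
depths = np.array([1e2, 1e3, 1e4, 1e5])
info = gaussian_channel_info(signal_var=1.0, noise_var=1.0 / depths)

# Each 10x increase in depth adds a near-constant amount of
# information -- the signature of a logarithmic scaling law.
gains = np.diff(info)
print(info)
print(gains)
```

Under these assumptions the information gain per decade of depth approaches 0.5 * log2(10) ≈ 1.66 bits, so doubling data quality (depth) yields diminishing, logarithmic returns in representation quality.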