Private measures, random walks, and synthetic data

Boedihardjo, March; Strohmer, Thomas; Vershynin, Roman

Computer Science > Cryptography and Security

arXiv:2204.09167 (cs)

[Submitted on 20 Apr 2022 (v1), last revised 23 Mar 2024 (this version, v2)]

Title:Private measures, random walks, and synthetic data

Authors:March Boedihardjo, Thomas Strohmer, Roman Vershynin

View PDF HTML (experimental)

Abstract:Differential privacy is a mathematical concept that provides an information-theoretic security guarantee. While differential privacy has emerged as a de facto standard for guaranteeing privacy in data sharing, the known mechanisms to achieve it come with some serious limitations. Utility guarantees are usually provided only for a fixed, a priori specified set of queries. Moreover, there are no utility guarantees for more complex - but very common - machine learning tasks such as clustering or classification. In this paper we overcome some of these limitations. Working with metric privacy, a powerful generalization of differential privacy, we develop a polynomial-time algorithm that creates a private measure from a data set. This private measure allows us to efficiently construct private synthetic data that are accurate for a wide range of statistical analysis tools. Moreover, we prove an asymptotically sharp min-max result for private measures and synthetic data for general compact metric spaces. A key ingredient in our construction is a new superregular random walk, whose joint distribution of steps is as regular as that of independent random variables, yet which deviates from the origin logarithmicaly slowly.

Subjects:	Cryptography and Security (cs.CR); Probability (math.PR); Statistics Theory (math.ST)
Cite as:	arXiv:2204.09167 [cs.CR]
	(or arXiv:2204.09167v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2204.09167

Submission history

From: Thomas Strohmer [view email]
[v1] Wed, 20 Apr 2022 00:06:52 UTC (1,273 KB)
[v2] Sat, 23 Mar 2024 04:41:45 UTC (1,065 KB)

Computer Science > Cryptography and Security

Title:Private measures, random walks, and synthetic data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Private measures, random walks, and synthetic data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators