Differentially Private Synthetic Data without Training

Published March 24, 2025, 15:48
Speakers: Zinan Lin
Host: Kim Laine

Generating differentially private (DP) synthetic data that closely resembles original data while preserving user privacy is a scalable solution to address privacy concerns in today's data-driven world.

In this talk, I will introduce Private Evolution (PE), a new training-free framework for DP synthetic data generation, which contrasts with existing approaches that rely on training DP generative models. PE treats foundation models as black boxes and uses only their inference APIs. We demonstrate that across both images and text, PE: (1) matches or even outperforms prior state-of-the-art (SoTA) methods in the fidelity-privacy trade-off without any model training; (2) enables the use of advanced open-source models (e.g., Mixtral) and API-based models (e.g., GPT-3.5), where previous SoTA approaches are inapplicable; and (3) is more computationally efficient than prior SoTA methods.
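The abstract does not spell out PE's steps, so the following is only a toy sketch of how a training-free, API-only loop of this kind might look. It assumes the model exposes two black-box calls, one that generates random samples and one that generates variations of given samples. The private data influences the loop only through a noisy nearest-neighbor vote histogram. The functions `random_api` and `variation_api` are hypothetical stand-ins (plain Gaussian noise on 2-D points) and not any real model API. The calibration of `noise_std` to a concrete (epsilon, delta) guarantee is deliberately left out.

```python
import numpy as np


def random_api(n, dim, rng):
    # Hypothetical stand-in for a model's "generate random samples" API.
    return rng.normal(0.0, 3.0, size=(n, dim))


def variation_api(samples, rng, scale=0.3):
    # Hypothetical stand-in for a "generate variations of these samples" API.
    return samples + rng.normal(0.0, scale, size=samples.shape)


def dp_nn_histogram(private, synthetic, noise_std, rng):
    # Each private record casts one vote for its nearest synthetic sample.
    dists = np.linalg.norm(private[:, None, :] - synthetic[None, :, :], axis=2)
    votes = np.bincount(dists.argmin(axis=1), minlength=len(synthetic)).astype(float)
    # Gaussian noise on the vote counts; choosing noise_std for a target
    # (epsilon, delta) is an assumption left outside this sketch.
    noisy = votes + rng.normal(0.0, noise_std, size=votes.shape)
    return np.clip(noisy, 0.0, None)


def private_evolution(private, n_syn=200, iterations=10, noise_std=1.0, seed=0):
    rng = np.random.default_rng(seed)
    synthetic = random_api(n_syn, private.shape[1], rng)
    for _ in range(iterations):
        hist = dp_nn_histogram(private, synthetic, noise_std, rng)
        total = hist.sum()
        # Fall back to uniform resampling if every noisy count was clipped to zero.
        probs = hist / total if total > 0 else np.full(n_syn, 1.0 / n_syn)
        # Resample synthetic points in proportion to their noisy votes,
        # then ask the (stand-in) model for variations of the survivors.
        idx = rng.choice(n_syn, size=n_syn, p=probs)
        synthetic = variation_api(synthetic[idx], rng)
    return synthetic
```

In this sketch no model is trained and no gradient is computed: the only operations applied to the "model" are the two inference-style calls, which is the property that would let such a loop work with API-only models.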

Additionally, I will discuss recent extensions of PE--both from our work and contributions from the broader community--including the integration of data simulators, fusion of knowledge from multiple models for DP data synthesis, and applications in federated learning. We hope that PE unlocks the full potential of foundation models in privacy-preserving machine learning and accelerates the adoption of DP synthetic data across industries.