Private Post-GAN Boosting

Real samples from 25 multivariate normal distributions (black dots), synthetic examples from the last generator of a GAN (teal dots, left column) and synthetic examples from a GAN with our post-GAN Boosting method (teal dots, right column).

Abstract

Differentially private GANs have proven to be a promising approach for generating realistic synthetic data without compromising the privacy of individuals. However, due to the privacy-protective noise introduced during training, the convergence of GANs becomes even more elusive, which often leads to poor utility in the output generator at the end of training. We propose Private post-GAN boosting (Private PGB), a differentially private method that combines samples produced by the sequence of generators obtained during GAN training to create a high-quality synthetic dataset. Our method leverages the Private Multiplicative Weights method (Hardt and Rothblum, 2010) and the discriminator rejection sampling technique (Azadi et al., 2019) to reweight generated samples, yielding high-quality synthetic data even in cases where GAN training does not converge. We evaluate Private PGB on a Gaussian mixture dataset and two US Census datasets, and demonstrate that Private PGB improves upon the standard private GAN approach across a collection of quality measures. Finally, we provide a non-private variant of PGB that improves the data quality of standard GAN training.
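To give a sense of the core idea, here is a minimal, non-private sketch of discriminator-based reweighting of pooled generator samples. Everything in it is illustrative: the samples and discriminator logits are random stand-ins, and the exponential weighting follows the generic discriminator rejection sampling intuition rather than the exact Private PGB algorithm.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: samples pooled from several saved generator
# checkpoints, each scored with a discriminator logit d(x).
# Higher logits mean the discriminator finds the sample more realistic.
samples = rng.normal(size=(1000, 2))   # stand-in generated samples
logits = rng.normal(size=1000)         # stand-in discriminator logits

# Reweight each sample proportionally to exp(d(x)) and normalize
# (subtracting the max logit for numerical stability), then resample
# with these probabilities to form a "boosted" synthetic dataset.
weights = np.exp(logits - logits.max())
weights /= weights.sum()

idx = rng.choice(len(samples), size=len(samples), replace=True, p=weights)
boosted = samples[idx]
```

In the private setting, the reweighting itself must also be computed with differential privacy, which is where the Private Multiplicative Weights machinery comes in.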

Marcel Neunhoeffer
Ph.D. Candidate

I’m a quantitative social scientist interested in how new methods from computer science can be of use to social scientists.
