paperJanuary 2024

User-level Differentially Private Stochastic Convex Optimization: Efficient Algorithms with Optimal Rates

AuthorsHilal Asi, Daogao Liu

We study differentially private stochastic convex optimization (DP-SCO) under user-level privacy, where each user may hold multiple data items. Existing work for user-level DP-SCO either requires super-polynomial runtime or requires a number of users that grows polynomially with the dimensionality of the problem. We develop new algorithms for user-level DP-SCO that obtain optimal rates, run in polynomial time, and require a number of users that grow logarithmically in the dimension. Moreover, our algorithms are the first to obtain optimal rates for non-smooth functions in polynomial time. These algorithms are based on multiple-pass DP-SGD, combined with a novel private mean estimation procedure for concentrated data, which applies an outlier removal step before estimating the mean of the gradients.

Related readings and updates.

Faster Algorithms for User-Level Private Stochastic Convex Optimization

November 20, 2024research area Methods and Algorithms, research area Privacyconference NeurIPS

We study private stochastic convex optimization (SCO) under user-level differential privacy (DP) constraints. In this setting, there are $n$ users, each possessing $m$ data items, and we need to protect the privacy of each user’s entire collection of data items. Existing algorithms for user-level DP SCO are impractical in many large-scale machine learning scenarios because: (i) they make restrictive assumptions on the smoothness parameter of the…

Mean Estimation with User-level Privacy under Data Heterogeneity

November 15, 2022research area Methods and Algorithms, research area Privacyconference NeurIPS

A key challenge in many modern data analysis tasks is that user data is heterogeneous. Different users may possess vastly different numbers of data points. More importantly, it cannot be assumed that all users sample from the same underlying distribution. This is true, for example in language data, where different speech styles result in data heterogeneity. In this work we propose a simple model of heterogeneous user data that differs in both…

User-level Differentially Private Stochastic Convex Optimization: Efficient Algorithms with Optimal Rates

Related readings and updates.

Faster Algorithms for User-Level Private Stochastic Convex Optimization

Mean Estimation with User-level Privacy under Data Heterogeneity

Discover opportunities in Machine Learning.