View publication

*= Equal Contributors

Online prediction from experts is a fundamental problem in machine learning and several works have studied this problem under privacy constraints. We propose and analyze new algorithms for this problem that improve over the regret bounds of the best existing algorithms for non-adaptive adversaries. For approximate differential privacy, our algorithms achieve regret bounds of O(Tlogd+logd/ε)O(\sqrt{T \log d} + \log d/\varepsilon) for the stochastic setting and O(Tlogd+T1/3logd/ε)O(\sqrt{T \log d} + T^{1/3} \log d/\varepsilon) for oblivious adversaries (where dd is the number of experts). For pure DP, our algorithms are the first to obtain sub-linear regret for oblivious adversaries in the high-dimensional regime dTd \ge T. Moreover, we prove new lower bounds for adaptive adversaries. Our results imply that unlike the non-private setting, there is a strong separation between the optimal regret for adaptive and non-adaptive adversaries for this problem. Our lower bounds also show a separation between pure and approximate differential privacy for adaptive adversaries where the latter is necessary to achieve the non-private O(T)O(\sqrt{T}) regret.

Related readings and updates.

Private Online Learning via Lazy Algorithms

We study the problem of private online learning, specifically, online prediction from experts (OPE) and online convex optimization (OCO). We propose a new transformation that transforms lazy online learning algorithms into private algorithms. We apply our transformation for differentially private OPE and OCO using existing lazy algorithms for these problems. Our final algorithms obtain regret which significantly improves the regret in the high…
See paper details

Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime

*=Equal Contributors We consider online learning problems in the realizable setting, where there is a zero-loss solution, and propose new Differentially Private (DP) algorithms that obtain near-optimal regret bounds. For the problem of online prediction from experts, we design new algorithms that obtain near-optimal regret O(ε−1log⁡1.5d)O \big( \varepsilon^{-1} \log^{1.5}{d} \big)O(ε−1log1.5d) where ddd is the number of experts. This…
See paper details