Active Learning with Expected Error Reduction
In collaboration with Stanford University, University of Washington
AuthorsSteve Mussmann, Julia Reisler, Daniel Tsai, Ehsan Mousavi, Shayne O’Brien, Moises Goldszmidt
In collaboration with Stanford University, University of Washington
AuthorsSteve Mussmann, Julia Reisler, Daniel Tsai, Ehsan Mousavi, Shayne O’Brien, Moises Goldszmidt
Active learning has been studied extensively as a method for efficient data col- lection. Among the many approaches in literature, Expected Error Reduction (EER) Roy & McCallum (2001) has been shown to be an effective method for ac- tive learning: select the candidate sample that, in expectation, maximally decreases the error on an unlabeled set. However, EER requires the model to be retrained for every candidate sample and thus has not been widely used for modern deep neural networks due to this large computational cost. In this paper we reformulate EER under the lens of Bayesian active learning and derive a computationally efficient version that can use any Bayesian parameter sampling method (such as Gal & Ghahramani (2016)). We then compare the empirical performance of our method using Monte Carlo dropout for parameter sampling against state of the art methods in the deep active learning literature. Experiments are performed on four standard benchmark datasets and three WILDS datasets (Koh et al., 2021). The results indicate that our method outperforms all other methods except one in the data shift scenario – a model dependent, non-information theoretic method that requires an order of magnitude higher computational cost (Ash et al., 2019).