Directions in ML: Taking Advantage of Randomness in Expensive Optimization Problems

Microsoft Research

Published March 8, 2021, 20:52

Optimization is at the heart of machine learning, and gradient computation is central to many optimization techniques. Stochastic optimization, in particular, has taken center stage as the principal method of fitting many models, from deep neural networks to variational Bayesian posterior approximations. Generally, one uses data subsampling to efficiently construct unbiased gradient estimators for stochastic optimization, but this is only one possibility. In this talk, I discuss two alternative approaches to constructing unbiased gradient estimates in machine learning problems. The first approach uses randomized truncation of objective functions defined as loops or limits. Such objectives arise in settings ranging from hyperparameter selection, to fitting parameters of differential equations, to variational inference using lower bounds on the log-marginal likelihood. The second approach revisits the Jacobian accumulation problem at the heart of automatic differentiation, observing that it is possible to collapse the linearized computational graph of, e.g., deep neural networks, in a randomized way such that less memory is used but little performance is lost. These projects are joint work with students Alex Beatson, Deniz Oktay, Joshua Aduol, and Nick McGreivy.
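
To make the first approach concrete, here is a minimal sketch of a randomized-truncation ("Russian roulette") estimator, written in plain Python. It is an illustration under stated assumptions, not the speaker's implementation: the toy objective `partial_sum`, the continuation probability `q`, and the function names are all hypothetical. The objective is the limit of a sequence of partial sums; the estimator evaluates a random number of terms and reweights each increment by the inverse probability that it was reached, which keeps the expectation equal to the full limit.

```python
import random


def partial_sum(x, n):
    """Hypothetical stand-in for an expensive objective defined as a limit:
    the n-term truncation of sum_{k>=1} x**k / k**2."""
    return sum(x ** k / k ** 2 for k in range(1, n + 1))


def russian_roulette_estimate(x, rng, q=0.6):
    """Single-sample estimate of lim_n partial_sum(x, n).

    After each term we continue with probability q (an assumed, untuned value),
    so term n is reached with probability q**(n - 1). Dividing the n-th
    increment by that probability makes the estimate unbiased.
    """
    estimate = 0.0
    prev = 0.0       # value of the previous truncation, L_0 = 0
    survival = 1.0   # probability of reaching the current term
    n = 1
    while True:
        curr = partial_sum(x, n)
        estimate += (curr - prev) / survival  # reweighted increment
        prev = curr
        if rng.random() >= q:  # stop with probability 1 - q
            return estimate
        survival *= q
        n += 1
```

Averaging many such estimates at `x = 0.5` should approach the value of a long truncation such as `partial_sum(0.5, 200)`, while each individual sample evaluates only a few terms on average. Under suitable regularity conditions the same reweighting can be applied to the gradients of the increments, which is the sense in which randomized truncation yields unbiased gradient estimates. The choice of `q` trades compute against variance; this sketch does not attempt to tune it.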

Learn more about the 2020-2021 Directions in ML: AutoML and Automating Algorithms virtual speaker series: aka.ms/diml
