Machine Learning Work Shop - Online Learning Against Adaptive Adversaries

Microsoft Research

Published August 12, 2016, 3:05

Most machine learning algorithms rely on the assumption that the data is generated by a stochastic process. Many online learning algorithms go one step further and allow the data to be generated by an oblivious adversary (a.k.a. a non-adaptive or non-reactive adversary). Very little is known about the machine learning scenario in which the data is generated by a more powerful adversary, such as a switching-cost adversary, a memory-bounded adversary, or a general adaptive adversary. In this talk, I will describe ongoing efforts to understand what can and cannot be done against these powerful rivals. First, I will define policy regret, a meaningful way of measuring the performance of a learning algorithm in the adversarial setting. Then, I will present the current state-of-the-art upper and lower bounds on policy regret against different adversary types, in both the full-information and the bandit-feedback settings. This talk represents joint work with Ambuj Tewari, Raman Arora, Nicolo Cesa-Bianchi, and Ohad Shamir.
