Microsoft Research
Published June 27, 2016, 17:09
The linear bandit problem is a far-reaching extension of the classical multi-armed bandit problem. In recent years linear bandits have emerged as a core problem of sequential decision making, somewhat analogously to linear programming in optimization or linear regression in statistics. Despite its importance, we still do not have a complete picture of this problem: in some cases we have optimal strategies (from an information-theoretic point of view) but they are algorithmically intractable, while in other cases we lack even information-theoretically optimal strategies. In this talk I will describe precisely where we stand and the contributions I have made to this problem.