NIPS: Oral Session 6 - Nishant A. Mehta

108

Microsoft Research334 тыс

Следующее

18.08.16 – 11318:40

NIPS: Spotlight Session 6 - Learning Theory Spotlights

Популярные

84 дня – 4 47715:12

Analog optical computing for sustainable AI and beyond

24.01.23 – 1 7243:34

SmartKC: A Low-cost, Smartphone-based Corneal Topographer

Опубликовано 18 августа 2016, 18:28

From Stochastic Mixability to Fast Rates Empirical risk minimization (ERM) is a fundamental learning rule for statistical learning problems where the data is generated according to some unknown distribution P and returns a hypothesis f chosen from a fixed class F with small loss Γäô . In the parametric setting, depending upon (Γäô,F,P) ERM can have slow (1/nΓêÜ) or fast (1/n) rates of convergence of the excess risk as a function of the sample size n . There exist several results that give sufficient conditions for fast rates in terms of joint properties of Γäô , F , and P , such as the margin condition and the Bernstein condition. In the non-statistical prediction with expert advice setting, there is an analogous slow and fast rate phenomenon, and it is entirely characterized in terms of the mixability of the loss Γäô (there being no role there for F or P ). The notion of stochastic mixability builds a bridge between these two models of learning, reducing to classical mixability in a special case. The present paper presents a direct proof of fast rates for ERM in terms of stochastic mixability of (Γäô,F,P) , and in so doing provides new insight into the fast-rates phenomenon. The proof exploits an old result of Kemperman on the solution to the general moment problem. We also show a partial converse that suggests a characterization of fast rates for ERM in terms of stochastic mixability is possible.

Свежие видео