Bandit Learning with Switching Costs

Consider the adversarial two-armed bandit problem in a setting where the player incurs a unit cost each time he switches actions. We prove that the player's T-round regret in this setting (i.e., his excess loss compared to the better of the two actions) is of order T^{2/3} (up to a logarithmic factor). In the corresponding full-information problem, the minimax regret is known to grow at the slower rate of T^{1/2}. The gap between these two rates indicates that learning with bandit feedback (i.e., observing only the loss of the chosen action, not that of the alternative) can be significantly harder than learning with full-information feedback. It also shows that any regret-minimizing algorithm for the standard bandit problem (without switching costs) must sometimes switch actions very frequently. The proof is based on an information-theoretic analysis of a loss process arising from a multi-scale random walk. (Joint work with Ofer Dekel, Jian Ding, and Tomer Koren; to appear in STOC 2014, available at arxiv.org/abs/1310.2997.)
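To give a concrete picture of the kind of loss process mentioned above, here is a minimal Python sketch of one common way to build a multi-scale random walk: each round t links back to a parent rho(t) = t minus the lowest set bit of t, so every round has only O(log T) ancestors while consecutive rounds can differ at many scales. The specific parameter choices (epsilon, sigma, the clipping to [0, 1]) and the function names are assumptions made for this illustration and need not match the exact construction in the paper.

import numpy as np

def parent(t: int) -> int:
    """Parent of round t: drop the lowest set bit (t - 2^{trailing zeros of t})."""
    return t - (t & -t)

def multiscale_losses(T: int, epsilon: float, sigma: float, better_arm: int,
                      rng: np.random.Generator) -> np.ndarray:
    """Return a (T+1) x 2 array of losses driven by a multi-scale random walk.

    Illustrative sketch only: both arms share the same walk W, and the better
    arm's loss is shifted down by a small gap epsilon, then clipped to [0, 1].
    """
    W = np.zeros(T + 1)
    for t in range(1, T + 1):
        # Each increment is added on top of the value at the parent round,
        # so the walk has depth O(log T) rather than depth T.
        W[t] = W[parent(t)] + rng.normal(scale=sigma)
    losses = np.tile(W[:, None] + 0.5, (1, 2))   # same walk value for both arms
    losses[:, better_arm] -= epsilon             # better arm is epsilon cheaper
    return np.clip(losses, 0.0, 1.0)

# Example usage with hypothetical parameters.
rng = np.random.default_rng(0)
T = 1 << 12
loss = multiscale_losses(T, epsilon=T ** (-1 / 3), sigma=T ** (-0.5),
                         better_arm=1, rng=rng)
print(loss[:5])

The point of the parent structure is that distinguishing the better arm requires aggregating information across many scales of the walk, which is what drives the information-theoretic lower bound sketched in the abstract.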