Direct Nash Optimization: Teaching language models to self-improve with general preferences
Microsoft Research
Published September 3, 2024, 18:59
Corby Rosset, Senior Researcher at Microsoft Research AI Frontiers, discusses teaching language models to self-improve using a preference oracle such as GPT-4. The problem is framed as a two-player game whose optimal policy lies at a Nash equilibrium, and the resulting models achieve state-of-the-art win rates against GPT-4 Turbo on benchmarks such as AlpacaEval and MT-Bench.
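For reference, the two-player game mentioned in the description can be sketched as follows. This is a rough sketch under assumed notation (not given in this page): $\mathcal{P}(y \succ y' \mid x)$ denotes the general preference oracle (e.g., GPT-4) returning the probability that response $y$ is preferred over $y'$ for prompt $x$, and $\rho$ denotes the prompt distribution. The self-improving policy is the Nash equilibrium of the game in which each player proposes a policy and the payoff is the expected preference:

$$
\pi^\star \;=\; \arg\max_{\pi}\, \min_{\pi'}\; \mathbb{E}_{x \sim \rho,\; y \sim \pi(\cdot \mid x),\; y' \sim \pi'(\cdot \mid x)} \big[\, \mathcal{P}(y \succ y' \mid x) \,\big].
$$

At equilibrium, no opponent policy $\pi'$ can be preferred over $\pi^\star$ more than half the time in expectation, which is what makes it a natural target for optimizing against general (possibly intransitive) preferences rather than a scalar reward.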
Microsoft Research Forum, September 3, 2024
See more at aka.ms/ResearchForum-Sep2024