Understanding Knowledge Distillation in Neural Sequence Generation

Published January 17, 2020, 18:47
Sequence-level knowledge distillation (KD) -- learning a student model with targets decoded from a pre-trained teacher model -- has been widely used in sequence generation applications, e.g. model compression, non-autoregressive translation (NAT), and low-resource translation. However, the underlying reasons for this success remain unclear. In this talk, we try to improve our understanding of KD in two scenarios in particular: (1) learning a weaker student from a strong teacher model while keeping the same parallel data used to train the teacher; and (2) learning a student from a teacher model of equal size, where the targets are generated from additional monolingual data.
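
To make the setup concrete, here is a minimal sketch of the sequence-level KD pipeline the abstract describes: decode the training sources with a pre-trained teacher, then train the student on the teacher's outputs with ordinary cross-entropy. The tiny GRU seq2seq models, random data, greedy (beam-1) decoding, and all names below are illustrative assumptions, not the setup used in the talk.

```python
# Minimal sequence-level KD sketch (toy models and random data; not the talk's actual setup).
import torch
import torch.nn as nn

VOCAB, PAD, BOS = 100, 0, 1

class TinySeq2Seq(nn.Module):
    """A toy encoder-decoder: embeddings + GRUs, just enough to show the pipeline."""
    def __init__(self, hidden=64):
        super().__init__()
        self.emb = nn.Embedding(VOCAB, hidden, padding_idx=PAD)
        self.enc = nn.GRU(hidden, hidden, batch_first=True)
        self.dec = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, VOCAB)

    def forward(self, src, tgt_in):
        _, h = self.enc(self.emb(src))           # encode source
        dec_out, _ = self.dec(self.emb(tgt_in), h)
        return self.out(dec_out)                 # logits over target vocabulary

    @torch.no_grad()
    def greedy_decode(self, src, max_len=20):
        """Greedy (beam-1) decoding, used here to produce the distilled targets."""
        _, h = self.enc(self.emb(src))
        tok = torch.full((src.size(0), 1), BOS, dtype=torch.long)
        out = []
        for _ in range(max_len):
            dec_out, h = self.dec(self.emb(tok), h)
            tok = self.out(dec_out).argmax(-1)   # most likely next token
            out.append(tok)
        return torch.cat(out, dim=1)

teacher = TinySeq2Seq()              # stand-in for a teacher already trained on parallel data
student = TinySeq2Seq(hidden=32)     # smaller student, as in scenario (1) above

# Step 1: re-decode the training sources with the teacher to build distilled targets.
src = torch.randint(3, VOCAB, (8, 15))          # stand-in for real source sentences
distilled = teacher.greedy_decode(src)

# Step 2: train the student on (source, teacher output) pairs with token-level
# cross-entropy. This is what makes it *sequence-level* KD: the teacher supplies
# whole output sequences rather than per-token soft probabilities.
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss(ignore_index=PAD)
tgt_in = torch.cat([torch.full((src.size(0), 1), BOS, dtype=torch.long),
                    distilled[:, :-1]], dim=1)  # shift targets right for teacher forcing
logits = student(src, tgt_in)
loss = loss_fn(logits.reshape(-1, VOCAB), distilled.reshape(-1))
opt.zero_grad()
loss.backward()
opt.step()
```

For scenario (2), the same loop would instead decode additional monolingual source data with an equal-sized teacher and train the student on those synthetic pairs.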

Talk slides: microsoft.com/en-us/research/u...

See more on this and other talks at Microsoft Research: microsoft.com/en-us/research/v...