Efficient Large-Scale AI Workshop | Session 2: Training and inference efficiency

1 908

24.5

Microsoft Research340 тыс

Следующее

04.11.22 – 1 1812:06:35

Efficient Large-Scale AI Workshop | Session 3: Aligning models with human intent

Популярные

94 дня – 2011:03:32

Quantum Lattice Enumeration in Limited Depth, Fernando Virdia

147 дней – 30 4383:25

Look Ma, no markers: holistic performance capture without the hassle

Опубликовано 4 ноября 2022, 20:12

This workshop was part of the Microsoft Research Summit 2022: microsoft.com/en-us/research/e...

To bring AI to more people, models need to be cheaper to train and run, in terms of both computational and human resources. Thus, we will focus on increasing efficiency across various parts of the training and inference pipeline.

Learn more about the Efficient Large-Scale AI Workshop: microsoft.com/en-us/research/e...

0:00 Efficient Vision Transformer
Song Han, Massachusetts Institute of Technology

38:30 Large Scale MoE Models into Cloud Scale Production with Highly Efficient Inference and Training
Young Jin Kim, Microsoft Translator
Hany Awadalla, Azure AI Cognitive Services

1:43:32 LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models
Mojan Javaheripi, Microsoft Research Redmond and University of California San Diego

Свежие видео