Efficient and Scalable Deep Learning

Published November 4, 2019, 11:26
In deep learning, researchers keep achieving higher performance by using larger models. However, two obstacles block the community from building larger models: (1) training larger models is more time-consuming, which slows down model design exploration, and (2) inference with larger models is also slow, which prevents their deployment in computation-constrained applications. In this talk, I will introduce some of our efforts to remove these obstacles. On the training side, we propose TernGrad, which reduces the communication bottleneck to scale up distributed deep learning; on the inference side, we propose structurally sparse neural networks, which remove redundant neural components for faster inference. At the end, I will very briefly introduce (1) my recent efforts to accelerate AutoML, and (2) future work to apply my research to overcome scaling issues in Natural Language Processing.
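The abstract only names the two techniques, so the following is an illustrative sketch rather than code from the talk. The first snippet shows the core idea behind ternary gradient quantization in the spirit of TernGrad: each worker sends gradients stochastically rounded to {-1, 0, +1} times a single scalar, which is an unbiased estimate of the full-precision gradient and needs roughly two bits per component. The helper name `ternarize_gradient` and the NumPy setting are assumptions made for illustration.

```python
import numpy as np

def ternarize_gradient(grad, rng=None):
    # Sketch only: stochastically quantize a gradient tensor to {-s, 0, +s},
    # where s = max |g|. Each component keeps its sign with probability
    # |g_i| / s, so the expectation equals the original gradient.
    rng = rng or np.random.default_rng()
    s = np.max(np.abs(grad))
    if s == 0:
        return np.zeros_like(grad), 0.0
    prob = np.abs(grad) / s                   # keep-probability per component
    mask = rng.random(grad.shape) < prob      # Bernoulli(|g_i| / s)
    ternary = np.sign(grad) * mask            # values in {-1, 0, +1}
    return ternary, s                         # transmit ternary tensor + one scalar

# Usage: a worker sends (ternary, s); the server reconstructs s * ternary and averages.
g = np.array([0.4, -0.1, 0.0, 0.9])
t, s = ternarize_gradient(g)
print(s * t)   # unbiased, low-bandwidth estimate of g
```

For the inference side, a minimal sketch of the group-lasso idea commonly used to learn structured sparsity: penalizing the l2 norm of each whole convolutional filter drives entire filters toward zero so they can be removed for faster inference. The function `group_lasso_penalty` and its parameters are hypothetical, not from the talk.

```python
def group_lasso_penalty(conv_weight, lam=1e-4):
    # conv_weight shape: (out_channels, in_channels, k, k).
    # One group per output filter; summing per-filter l2 norms encourages
    # whole filters to become exactly zero, i.e. structured sparsity.
    flat = conv_weight.reshape(conv_weight.shape[0], -1)   # one row per filter
    return lam * np.sum(np.linalg.norm(flat, axis=1))
```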

Talk slides: microsoft.com/en-us/research/u...

See more on this talk at Microsoft Research: microsoft.com/en-us/research/v...