Pruning AI Models for Peak Performance - NVIDIA DRIVE Labs Ep. 31

NVIDIA
Published September 19, 2023, 15:55
Check out HALP (Hardware-Aware Latency Pruning), a new method designed to adapt convolutional neural networks (CNNs) and #transformer-based architectures for real-time performance. HALP optimizes pre-trained models to maximize compute utilization. In testing with NVIDIA DRIVE Orin™ on the road, it consistently outperformed alternative approaches.
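
The description above does not spell out how HALP chooses what to prune. As a rough illustration only, one way to picture latency-aware pruning is as a budgeted selection problem: keep the set of prunable units (for example, channel groups) with the highest total importance whose combined latency fits a target. The sketch below solves that toy problem with a standard 0/1 knapsack. The function name `select_units_to_keep`, the importance scores, and the latency costs are all hypothetical and are not taken from NVIDIA's implementation.

```python
from typing import List


def select_units_to_keep(
    importance: List[float],
    latency: List[int],
    budget: int,
) -> List[int]:
    """Pick units to keep so total importance is maximal within a latency budget.

    Hypothetical helper for illustration; latency values are integers
    (e.g. microseconds measured on the target hardware).
    """
    n = len(importance)
    # best[i][b] = best total importance using the first i units within budget b
    best = [[0.0] * (budget + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost, gain = latency[i - 1], importance[i - 1]
        for b in range(budget + 1):
            best[i][b] = best[i - 1][b]  # option 1: prune unit i-1
            if cost <= b:
                candidate = best[i - 1][b - cost] + gain  # option 2: keep it
                if candidate > best[i][b]:
                    best[i][b] = candidate

    # Walk back through the table to recover which units were kept.
    kept = []
    b = budget
    for i in range(n, 0, -1):
        if best[i][b] != best[i - 1][b]:
            kept.append(i - 1)
            b -= latency[i - 1]
    return sorted(kept)


# Made-up scores and costs for four prunable units.
scores = [10.0, 5.0, 4.0, 2.0]
costs = [5, 3, 3, 1]
kept_units = select_units_to_keep(scores, costs, budget=7)
print("keep units:", kept_units)
```

In this toy setup, everything not returned by the function would be pruned. A real pipeline would measure per-layer latency on the target device and typically fine-tune the model after pruning.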

00:00:00 - Introducing Hardware-Aware Latency Pruning (HALP)
00:00:29 - Common Model Optimization
00:00:59 - DNN Pruning
00:01:21 - Hardware-Aware Latency Pruning
00:01:31 - Classification Tasks
00:01:37 - 3D Object Detection
00:02:04 - HALP with Transformers
00:03:09 - To learn more, visit our GitHub and project pages

GitHub: nvda.ws/3rlM7mo
Product page: nvda.ws/46961je
Watch the full series here: nvda.ws/3LsSgnH
Learn more about DRIVE Labs: nvda.ws/36r5c6t

Follow us on social:
Twitter: nvda.ws/3LRdkSs
LinkedIn: nvda.ws/3wI4kue
#NVIDIADRIVE