How to autoscale a TGI deployment on GKE

Published November 26, 2024, 17:00
Tutorial: Configure autoscaling for TGI on GKE → goo.gle/3Z9a7WK
Learn more about observability on GKE → goo.gle/4951bWY
Hugging Face TGI (Text Generation Inference) → goo.gle/4hXScLk

Text Generation Inference (TGI) is a toolkit developed by Hugging Face for deploying and serving LLMs. TGI is production-ready, with built-in support for observability and metrics. Watch along as Googlers Wietse Venema and Abdel Sghiouar demonstrate how to autoscale TGI workloads on Google Kubernetes Engine (GKE) using TGI queue size as the scaling signal.
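The approach shown in the video can be sketched as a HorizontalPodAutoscaler manifest. This is a minimal illustration, not the tutorial's exact configuration: it assumes TGI's `tgi_queue_size` metric is scraped and made available to the autoscaler via Google Cloud Managed Service for Prometheus, and the deployment name `tgi-server` and the target value are illustrative.

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: tgi-autoscaler
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: tgi-server        # illustrative name for the TGI deployment
  minReplicas: 1
  maxReplicas: 5
  metrics:
    - type: External
      external:
        metric:
          # TGI exposes a tgi_queue_size gauge; here it is assumed to be
          # exported through Google Cloud Managed Service for Prometheus
          name: prometheus.googleapis.com|tgi_queue_size|gauge
        target:
          type: AverageValue
          averageValue: "10"  # illustrative threshold: scale out when the
                              # average request queue per replica exceeds 10
```

Scaling on queue size rather than CPU or GPU utilization reacts directly to request backlog, which tracks LLM serving load more closely than hardware counters.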

More resources:
Learn more about the TGI architecture → goo.gle/3Oo8mzY
A deep dive into autoscaling LLM workloads on GKE → goo.gle/4fKpD2t

Watch more Google Cloud: Building with Hugging Face → goo.gle/BuildWithHuggingFace
Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #HuggingFace

Speakers: Wietse Venema, Abdel Sghiouar
Products Mentioned: Google Kubernetes Engine, Gemma