Deploy open models with TGI on Cloud Run

190
Предыдущее
5 часов – 67411:34
RAG with LangChain on Google Cloud
Популярные
Опубликовано 7 ноября 2024, 17:00
Tutorial: How to deploy Gemma 2 on Cloud Run with TGI → goo.gle/3Yoztjh
Get started with Cloud Run GPU → goo.gle/4ec7mJS
Docs: Text Generation Inference → goo.gle/4e7qusz

Start serving text generation inference with fast token speed and serve requests for a fraction of the cost of traditional methods. Watch along and learn how to deploy the Gemma 2 model to Cloud Run using Hugging Face TGI with Wietse Venema (Google) and Alvaro Bartolome (Hugging Face).

More resources:
Gemma 2 (9b) on the Hugging Face Hub → goo.gle/3C1vX6R
Hugging Face Deep Learning Containers for Google Cloud → goo.gle/3BPaYUM

Watch more Google Cloud: Building with Hugging Face → goo.gle/BuildWithHuggingFace
Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #HuggingFace

Speakers: Wietse Venema, Alvaro Bartolome
Products Mentioned: Gemma, Hugging Face, Cloud Run
Свежие видео
15 дней – 3 59110:49
Google Trends for Journalists
16 дней – 12 4871:41
AI Summit DC 2024 Highlights
21 день – 7854:36
Back to Basics: Secrets Management
Случайные видео
24.04.12 – 3 5055:10
F&N Holdings goes Google
автотехномузыкадетское