GKE Gemma 2 deployment with Hugging Face

Published November 15, 2024, 17:00
Tutorial: Serve Gemma on GKE with TGI → goo.gle/4fFKt2Q
Learn more about TGI (text generation inference) from Hugging Face → goo.gle/4e7qusz
Hugging Face Deep Learning containers for Google Cloud → goo.gle/3BPaYUM

Text Generation Inference (TGI) is a toolkit for deploying and serving Large Language Models (LLMs). TGI enables high-performance text generation for the most popular open LLMs. Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Watch along as Googlers Wietse Venema and Mofi Rahman demonstrate how to deploy Gemma 2 with 27 billion parameters on Google Kubernetes Engine using Hugging Face TGI.
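The deployment described above can be sketched as a Kubernetes manifest. This is an illustrative outline only: the container image tag, model ID, GPU count, and secret name below are assumptions for the sketch, not values confirmed here — see the tutorial at goo.gle/4fFKt2Q for the exact configuration.

```yaml
# Sketch: GKE Deployment serving Gemma 2 27B with the Hugging Face TGI
# Deep Learning Container. Image tag, secret name, and resource sizing
# are assumptions for illustration.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: tgi-gemma-2-27b
spec:
  replicas: 1
  selector:
    matchLabels:
      app: tgi-gemma
  template:
    metadata:
      labels:
        app: tgi-gemma
    spec:
      containers:
      - name: tgi
        # Hugging Face TGI container image (illustrative path/tag)
        image: us-docker.pkg.dev/deeplearning-platform-release/gcr.io/huggingface-text-generation-inference-cu121
        env:
        - name: MODEL_ID
          value: google/gemma-2-27b-it   # assumed instruction-tuned variant
        - name: HF_TOKEN                 # Gemma is gated; a Hugging Face token is required
          valueFrom:
            secretKeyRef:
              name: hf-secret            # assumed Secret holding the token
              key: hf_api_token
        resources:
          limits:
            nvidia.com/gpu: 2            # assumed GPU count for the 27B model
        ports:
        - containerPort: 8080
```

After `kubectl apply`, requests would typically reach TGI's HTTP API on port 8080 through a Service in front of this Deployment.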

Watch more Google Cloud: Building with Hugging Face → goo.gle/BuildWithHuggingFace
Subscribe to Google Cloud Tech → goo.gle/GoogleCloudTech

#GoogleCloud #HuggingFace

Speakers: Wietse Venema, Mofi Rahman
Products Mentioned: Gemma, Hugging Face Deep Learning containers, Google Kubernetes Engine