Introducing EmbeddingGemma: The Best-in-Class Open Model for On-Device Embeddings

116 436
8.6
Следующее
Популярные
Опубликовано 4 сентября 2025, 16:02
Discover EmbeddingGemma, a state-of-the-art 308 million parameter text embedding model designed to power generative AI experiences directly on your hardware. Ideal for mobile-first Al, EmbeddingGemma brings powerful capabilities to your applications, enabling features like semantic search, information retrieval, and custom classification – all while running efficiently on-device.

In this video, Alice Lisak and Lucas Gonzalez from the Gemma team introduce EmbeddingGemma and explain how it works. Learn how you can run this model on less than 200MB of RAM with quantization, customize its output dimensions with Matryoshka Representation Learning (MRL), and
build powerful offline Al features.

Resources:
Learn about EmbeddingGemma → developers.googleblog.com/en/i...
EmbeddingGemma documentation → ai.google.dev/gemma/docs/embed...
Gemma Cookbook → github.com/google-gemini/gemma...
Quickstart RAG notebook → github.com/google-gemini/gemma...
Discover Gemma models → deepmind.google/models/gemma


Chapters
0:00 - Intro
0:26 - Model overview
1:18 - Model features
2:29 - RAG
2:54 - Website embedding demo
3:23 - Tools and platforms
3:41 - Conclusion


Subscribe to Google for Developers → goo.gle/developers

Speaker:Alice Lisak Lucas Gonzalez
Products Mentioned: Google AI, Gemma,Generative AI
автотехномузыкадетское