Using Gemini Pro Vision for multimodal use cases with text, images, and videos

Published May 16, 2024, 14:10
What are the applications of multimodality with Gemini? This session will cover a variety of different multimodal use cases for text, images, and video, and provide some ideas on how to apply multimodality to practical business scenarios. You'll also gain experience with Gemini Pro Vision.

To complete this workshop, you will need a laptop and a Google Cloud Project.

Walk through an interactive notebook with multimodal use cases with Gemini → goo.gle/4b98tbY
Learn about multimodal prompts in the Gemini documentation → goo.gle/4aNzaTV
Try out multimodal capabilities in Gemini Pro Vision to create a retail recommendation system → goo.gle/49PRc6I

NOTE: Cloud Credits discussed in this session or workshop were for live audiences only

Speakers: Lavi Nigam, Katie Nguyen

Watch more:
Check out all the AI videos at Google I/O 2024 → goo.gle/io24-ai-yt

Subscribe to Google Developers → goo.gle/developers

#GoogleIO

Products Mentioned: Gemini
Event: Google I/O 2024