Introducing Lemonade Server: Local LLM Serving with GPU and NPU Acceleration

12 594
28.6
Следующее
276 дней – 2032:16
Shaping the Future of AI
Популярные
104 дня – 2 4092:06:00
AMD at CES® 2026
Опубликовано 14 июля 2025, 19:00
In this video, we introduce Lemonade Server—a powerful tool that lets you deploy local large language models (LLMs) directly on your PC. With support for industry-standard APIs, Lemonade Server easily connects to a wide range of applications, enabling you to replace cloud-based LLMs with fast, private, local alternatives.

🔧 What You’ll See

How to install and set up Lemonade Server
Downloading, managing, and prompting LLMs
Exploring key resources: GitHub repo, documentation, model details, and featured apps

🖥️ Test Setup
We demonstrate everything using an AMD Ryzen™ AI 395+ Mini PC with 128GB of RAM, showcasing the performance and flexibility of local inference.

Whether you're a developer, researcher, or enthusiast, this walkthrough will help you get started with local LLMs in minutes.

Links Referenced in the Video:
Lemonade Server: lemonade-server.ai
Local LLM Servers: lemonade-server.ai/docs/server...

Find the resources you need to develop using AMD products: amd.com/en/developer.html

Find Ryzen AI Software 1.5 documentation:
ryzenai.docs.amd.com/en/latest...

Have questions or ideas? Collaborate directly with developers and experts on the AMD Developer Community Discord:
discord.gg/2tYF7hqW

***

© 2025 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, EPYC, ROCm, and AMD Instinct and combinations thereof are trademarks of Advanced Micro Devices, Inc.
автотехномузыкадетское