Introducing Lemonade Server: Local LLM Serving with GPU and NPU Acceleration

14 823

28.2

AMD Developer Central30.6 тыс

Следующее

333 дня – 2042:16

Shaping the Future of AI

Популярные

33 дня – 10623:03

Accelerating Business Outcomes with AMD Enterprise AI Suite

218 дней – 25522:50

Fireside Chat with Ion Stoica

Опубликовано 14 июля 2025, 19:00

In this video, we introduce Lemonade Server—a powerful tool that lets you deploy local large language models (LLMs) directly on your PC. With support for industry-standard APIs, Lemonade Server easily connects to a wide range of applications, enabling you to replace cloud-based LLMs with fast, private, local alternatives.

🔧 What You’ll See

How to install and set up Lemonade Server
Downloading, managing, and prompting LLMs
Exploring key resources: GitHub repo, documentation, model details, and featured apps

🖥️ Test Setup
We demonstrate everything using an AMD Ryzen™ AI 395+ Mini PC with 128GB of RAM, showcasing the performance and flexibility of local inference.

Whether you're a developer, researcher, or enthusiast, this walkthrough will help you get started with local LLMs in minutes.

Links Referenced in the Video:
Lemonade Server: lemonade-server.ai
Local LLM Servers: lemonade-server.ai/docs/server...

Find the resources you need to develop using AMD products: amd.com/en/developer.html

Find Ryzen AI Software 1.5 documentation:
ryzenai.docs.amd.com/en/latest...

Have questions or ideas? Collaborate directly with developers and experts on the AMD Developer Community Discord:
discord.gg/2tYF7hqW

***

© 2025 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, EPYC, ROCm, and AMD Instinct and combinations thereof are trademarks of Advanced Micro Devices, Inc.

Свежие видео