Back to Basics: Understanding Retrieval Augmented Generation (RAG)

27 488
20.2
Опубликовано 26 января 2024, 1:11
As interest in Large Language Models (LLMs) grows, numerous developers and organizations are hard at work creating programs that take use of their potential. However, the topic of how to enhance the performance of the LLM application arises when the pre-trained LLMs do not function as anticipated or hoped for out of the box. At this stage model fine tuning or retrieval-augmented generation (RAG) to enhance the outcomes is needed. In this episode, join Nitin as he walks through what RAG is and best practices for implementation using AWS with Bedrock/FM and AWS services.

Additional Resources:
Amazon Bedrock: aws.amazon.com/bedrock
What is RAG?: aws.amazon.com/what-is/retriev...
What are vector databases?: aws.amazon.com/what-is/vector-...
Knowledge Bases for Amazon Bedrock: aws.amazon.com/bedrock/knowled...
Agents for Amazon Bedrock: aws.amazon.com/bedrock/agents
Amazon Titan Text Embeddings models: docs.aws.amazon.com/bedrock/la...
Amazon Titan Multimodal Embeddings model: docs.aws.amazon.com/bedrock/la...

Check out more resources for architecting in the #AWS cloud:
amzn.to/3qXIsWN

#AWS #AmazonWebServices #CloudComputing #BackToBasics #GenerativeAI #AmazonBedrock
автотехномузыкадетское