Improving complex RAG systems and achieving no regret lightning fast deployment iterations of LLMs

  Рет қаралды 45

Data Science Festival

Data Science Festival

Ай бұрын

A talk by Hannes Kindbom from 9fin.
This session covers Improving complex RAG systems and achieving no regret lightning fast deployment iterations of LLMs.
In November 2022, ChatGPT took the world by storm and Large Language Models (LLMs) have been a hot topic ever since. However, their limitations such as outdated training data, restricted context windows, latency and API rate limits have become clear. Retrieval Augmented Generation (RAG) has grown popular as an approach to circumvent some of these challenges but RAG systems are complex and the user experience is hard to test offline, making prod deployments scary.
In this talk, you’ll learn how to tackle these problems to achieve safe and lightning fast deployment iterations of LLM based applications by deploying to “shadow” and feature flagging beta versions using AWS lambda aliases. We’ll dive into how to leverage the unique data this gives us to evaluate our system in production. Target audience include ML/software engineers as well as data scientists. Participants would benefit from having some coding experience with LLMs, RAG and AWS lambdas or equivalent.
Technical Level: Technical practitioner
This session was part of the Data Science Festival MayDay event 2024. Find out more at datasciencefestival.com/event...
The Data Science Festival is the place for data-driven people to come together, share cutting-edge ideas, and solve real-world problems. We run monthly events, meet-ups, and the biggest free-to-attend data festivals in the UK. Join the community at datasciencefestival.com/

Пікірлер
Accelerating Machine Learning Serving with Distributed Caches
30:26
Data Science Festival
Рет қаралды 36
Lessons Learned on LLM RAG Solutions
34:31
Prolego
Рет қаралды 23 М.
لقد سرقت حلوى القطن بشكل خفي لأصنع مصاصة🤫😎
00:33
Cool Tool SHORTS Arabic
Рет қаралды 28 МЛН
Spot The Fake Animal For $10,000
00:40
MrBeast
Рет қаралды 194 МЛН
Turbocharge your Machine Learning with Open Data!
35:39
Data Science Festival
Рет қаралды 20
Haleon's LLM Translation Tool Development Story
23:37
Data Science Festival
Рет қаралды 50
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 853 М.
MLOPS in Financial Services
15:03
Data Science Festival
Рет қаралды 112
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 921 М.
Top 5 Most-Used Deployment Strategies
10:00
ByteByteGo
Рет қаралды 254 М.
High-performance RAG with LlamaIndex
59:37
AI Makerspace
Рет қаралды 17 М.
Evaluating LLM-based Applications
33:50
Databricks
Рет қаралды 23 М.
Why Agent Frameworks Will Fail (and what to use instead)
19:21
Dave Ebbelaar
Рет қаралды 34 М.