LLM Hallucinations in RAG QA - Thomas Stadelmann, deepset.ai

No video

LLM Hallucinations in RAG QA - Thomas Stadelmann, deepset.ai

Рет қаралды 6,685

deepset

Жыл бұрын

Hallucinations are one of the biggest challenges in running LLM-based applications as they can significantly undermine the trustworthiness of your application. Retrieval-augmented QA not only enables us to run LLMs on any data but also to mitigate hallucinations. However even with retrieval augmentation we cannot fully avoid them.
In this webinar Thomas will show you current approaches on how to systematically detect hallucinations, paving the way for automating this critical issue.
#ai #llm #generativeai #developer #deepset.ai

Пікірлер: 4

@vsrohit 11 ай бұрын

Can you please provide the links for the hallucination detection model?

@pythontok4192 9 ай бұрын

What does the hallucination detector model compare each sentence of the answer against? If you run it against the contexts , say if k=4 snippets retrieved by the rag, wouldn’t some of them not be relevant?

@davefar2964 7 ай бұрын

Thanks a lot for this presentation, the research papers on hallucinations as well as your BERTscore solutions were quite interesting. Another class of approaches (that causes high cost but no big latency if done in parallel) to detect and avoid hallucinations (see kzfaq.info/get/bejne/otmKdrmeqKi2nJc.htmlsi=-dYinw2SiAGw44df&t=1428) is getting multiple samples at test time, e.g. doing self-consistency or ensembling, deciding for the final answer by mayority voting or ranking.

@billykotsos4642

@billykotsos4642 Жыл бұрын

So even RAG can't be trusted 100% huh...

Enabling NLP for Enterprise Applications | Milos Rusic | deepset.ai

1:03:44

Enabling NLP for Enterprise Applications | Milos Rusic | deepset.ai

deepset

Рет қаралды 476

How to set up RAG - Retrieval Augmented Generation (demo)

19:52

How to set up RAG - Retrieval Augmented Generation (demo)

Don Woodlock

Рет қаралды 24 М.

Слепой наказал на дороге 🚘 @tv3_international #второезрение #детектив #расследование

00:51

Слепой наказал на дороге 🚘 @tv3_international #второезрение #детектив #расследование

ТВ3 - сериалы и шоу

Рет қаралды 2,6 МЛН

Joker can't swim!#joker #shorts

00:46

Joker can't swim!#joker #shorts

Untitled Joker

Рет қаралды 37 МЛН

WHOA, HUMAN! ARE YOU A FRIDGE-NINJA? SHOW ME THOSE FOOD STORING PAWS! 😺🗡️

00:14

WHOA, HUMAN! ARE YOU A FRIDGE-NINJA? SHOW ME THOSE FOOD STORING PAWS! 😺🗡️

What the Meowk!

Рет қаралды 9 МЛН

A teacher captured the cutest moment at the nursery #shorts

00:33

A teacher captured the cutest moment at the nursery #shorts

Fabiosa Stories

Рет қаралды 62 МЛН

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

15:21

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Entry Point AI

Рет қаралды 76 М.

LangChain "Hallucinations in Document Question-Answering" Webinar

58:31

LangChain "Hallucinations in Document Question-Answering" Webinar

LangChain

Рет қаралды 5 М.

How to Train a DE-licious Embedding Model

34:39

How to Train a DE-licious Embedding Model

Haystack

Рет қаралды 231

Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework

1:00:40

Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework

DeepLearningAI

Рет қаралды 22 М.

RAG Versus Fine Tuning-How to Efficiently Tailor an LLM to Your Domain Data

31:40

RAG Versus Fine Tuning-How to Efficiently Tailor an LLM to Your Domain Data

deepset

Рет қаралды 2,1 М.

“What's wrong with LLMs and what we should be building instead” - Tom Dietterich - #VSCF2023

49:47

“What's wrong with LLMs and what we should be building instead” - Tom Dietterich - #VSCF2023

valgrAI

Рет қаралды 151 М.

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

34:22

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

Google for Developers

Рет қаралды 51 М.

RAG But Better: Rerankers with Cohere AI

23:43

RAG But Better: Rerankers with Cohere AI

James Briggs

Рет қаралды 57 М.

Webinar: Fix Hallucinations in RAG Systems with Pinecone and Galileo

55:40

Webinar: Fix Hallucinations in RAG Systems with Pinecone and Galileo

Galileo

Рет қаралды 2,6 М.

How to detect prompt injections - Jasper Schwenzow, deepset.ai

40:15

How to detect prompt injections - Jasper Schwenzow, deepset.ai

deepset

Рет қаралды 1,1 М.

Слепой наказал на дороге 🚘 @tv3_international #второезрение #детектив #расследование

00:51

Слепой наказал на дороге 🚘 @tv3_international #второезрение #детектив #расследование

ТВ3 - сериалы и шоу

Рет қаралды 2,6 МЛН