37% Better Output with 15 Lines of Code - Llama 3 8B (Ollama) & 70B (Groq)

  Рет қаралды 13,667

All About AI

All About AI

Күн бұрын

To try everything Brilliant has to offer-free-for a full 30 days, visit brilliant.org/AllAboutAI . You’ll also get 20% off an annual premium subscription.
37% Better Output with 15 Lines of Code - Llama 3 8B (Ollama) & 70B (Groq)
GitHub Project:
github.com/AllAboutAI-YT/easy...
👊 Become a member and get access to GitHub and Code:
/ allaboutai
🤖 Great AI Engineer Course:
scrimba.com/learn/aiengineer?...
📧 Join the newsletter:
www.allabtai.com/newsletter/
🌐 My website:
www.allabtai.com
In this video I try to improve a known problem when using RAG in local model like Llama 3 8B on ollama. This local RAG system was improved by just adding around 15 lines of code. Feel free to share and rate on GitHub :)
00:00 Llama 3 Improved RAG Intro
02:01 Problem / Soulution
03:05 Brilliant.org
04:26 How this works
12:05 Llama 3 70B Groq
15:12 Conclusion

Пікірлер: 29
@AllAboutAI
@AllAboutAI 24 күн бұрын
Brilliant: To try everything Brilliant has to offer-free-for a full 30 days, visit brilliant.org/AllAboutAI . You’ll also get 20% off an annual premium subscription.
@pec8377
@pec8377 21 күн бұрын
@AllAboutAi the issue is it makes the assumption that the question is related to the content passed, which is not always the case in a conversation. Like suddenly you talk about something else, let's say "How are you", it will be rewritten to be aligned to the precedent context, which is not what you want.. then you need to implement some more mechanism or tweak your prompt to only rephrase when the question seems to be linked to the past. Many discussions about this..
@MattJonesYT
@MattJonesYT 24 күн бұрын
Another approach to this is to just ask for the simple llm to hallucinate an answer to the current chat. That answer will not be correct but it will probably have the phrases needed for the RAG system to find the needed excerpts. There's a technical term for this idea which I can't remember but I came across it on the TwoSetAI channel which has a lot of similar tricks
@robboerman9378
@robboerman9378 23 күн бұрын
HyDE, Hypothetical Document Embeddings. Works very well and easy to implement. Similarity search on a vector database using a hallucinated answer to the question instead of the question usually gives better similarity
@AllAboutAI
@AllAboutAI 23 күн бұрын
yes this is nice, thnx :)
@kenhtinhthuc
@kenhtinhthuc 23 күн бұрын
RAG is a bit too much of an exact match because it is based on concepts and similar concepts. Therefore no match, no return. HyDE makes the search a bit more fuzzy by expanding the query and introducing more concepts. It would be good to have an evaluator to check on the faithfuness of retrieval and the relevance of the ouputs to the original query.
@ASchnacky
@ASchnacky 24 күн бұрын
Dolphin-llama3 & Groq-llama3 are awesome! Well done!
@ByZaMo64
@ByZaMo64 19 күн бұрын
how are they different?
@akimezra7178
@akimezra7178 14 күн бұрын
Bruuuuuuh, just found this channel, you sure you're human?!?! Wish i had 5% of your brain.... thank you so much for your work! Im learning so much!!
@MarcShade
@MarcShade 24 күн бұрын
dolphin-llama3:8b-v2.9-fp16 is so good as an assistant!
@ASchnacky
@ASchnacky 24 күн бұрын
Dolphin-llama3 & Groq-llama3
@nic-ori
@nic-ori 24 күн бұрын
👍👍👍Thanks! Useful information.
@futureworldhealing
@futureworldhealing 24 күн бұрын
best AI python coding channel hands down
@AllAboutAI
@AllAboutAI 23 күн бұрын
thnx a lot :D
@Edoras5916
@Edoras5916 23 күн бұрын
direct, didactic almost verbatim in my book, explanation. excellent
@realorfake4765
@realorfake4765 20 күн бұрын
based on your experience, why is olama better than LMStudio?
@technolus5742
@technolus5742 24 күн бұрын
Great job
@AllAboutAI
@AllAboutAI 23 күн бұрын
thnx :)
@elsondasilva8636
@elsondasilva8636 24 күн бұрын
💎💎🌟💎💎💎💎
@iamisobe
@iamisobe 24 күн бұрын
first
@monstercameron
@monstercameron 24 күн бұрын
What about doing the same for the output? One pass is the internal voice, compare it to the promo to see if matches up and a second pass for any corections. Like giving LLMs an inner voice like we do.
@AllAboutAI
@AllAboutAI 23 күн бұрын
interesting
@buttpub
@buttpub 24 күн бұрын
the problem and solution is that your setup is stateless
@AllAboutAI
@AllAboutAI 23 күн бұрын
interesting, will look into
@buttpub
@buttpub 23 күн бұрын
@@AllAboutAIllms such as those built on transformer architectures, are fundamentally stateless, meaning they do not inherently maintain information about previous inputs across separate input sequences like recurrent neural networks. however; they can emulate state-like behavior through the use of positional and specialized embeddings that incorporate contextual information within a given sequence, processing data in a stateless manner, the autoregressive nature of many llms allows them to generate text by sequentially predicting the next token based on the accumualted outputs, mimicking a form of statefulness. allowing them to handle extensive and complex sequences effectively, tho each processing step inherently lacks a continuous internal state beyond its immediate inputs.
didn't want to let me in #tiktok
00:20
Анастасия Тарасова
Рет қаралды 11 МЛН
YouTube's Biggest Mistake..
00:34
Stokes Twins
Рет қаралды 69 МЛН
How to open a can? 🤪 lifehack
00:25
Mr.Clabik - Friends
Рет қаралды 14 МЛН
Llama 3 - 8B & 70B Deep Dive
23:54
Sam Witteveen
Рет қаралды 30 М.
GPT-4o Deep Dive & Hidden Abilities you should know about
28:11
How to Install and test LLaMA 3 Locally [2024]
10:36
CodewithBro
Рет қаралды 15 М.
LangGraph 101: it's better than LangChain
32:26
James Briggs
Рет қаралды 35 М.
Building a $100,000 PC for Minecraft
19:09
Linus Tech Tips
Рет қаралды 6 МЛН
Run 70Bn Llama 3 Inference on a Single 4GB GPU
8:18
Rohan-Paul-AI
Рет қаралды 10 М.
"okay, but I want Llama 3 for my specific use case" - Here's how
24:20
How I Made AI Assistants Do My Work For Me: CrewAI
19:21
Maya Akim
Рет қаралды 657 М.
host ALL your AI locally
24:20
NetworkChuck
Рет қаралды 506 М.
Пленка или защитное стекло: что лучше?
0:52
Слава 100пудово!
Рет қаралды 1,8 МЛН
Я Создал Новый Айфон!
0:59
FLV
Рет қаралды 3,3 МЛН
What % of charge do you have on phone?🔋
0:11
Diana Belitskay
Рет қаралды 365 М.