LARS Full Demo

  Рет қаралды 1,658

Abheek Gulati

Abheek Gulati

Ай бұрын

Over the past year, I’ve been relentlessly working single-handedly on an application that I’ve now named LARS, for the ‘LLM & Advanced Referencing Solution’. It enables you to run LLM's (Large Language Models) locally on your device, upload your own documents and engage in Q&A sessions where the LLM grounds its responses in your uploaded content. This grounding helps increase accuracy and reduce the common issue of AI-generated inaccuracies or "hallucinations." This technique is commonly known as "Retrieval Augmented Generation", or RAG.
However, LARS takes the concept of RAG much further by adding detailed citations to every response, supplying you with specific document names, page numbers, text-highlighting, and images relevant to your question, and even presenting a document reader right within the response window!
There are features baked into LARS solely focused on improving the user experience such as:
1. Chat history, to resume prior conversations
2. Per-response user ratings, to identify focus-areas for improvements and
3. Conversation memory, so the user may ask follow-up questions
I’m happy to connect and discuss technical details further, such as the LLM-backend, embedding models used (there are four supplied in LARS!), vector database, text-extraction techniques (comprising fully local or OCR techniques, combined with custom parsers for scanned and table-heavy documents), options built in to tune the LLM's response via advanced settings (temperature, top-k/p etc.) and the prompt-engineering and RAG-tweaking tools built into LARS.
Last but certainly not the least, LARS can utilize your Nvidia-CUDA GPU to dramatically speed up inferencing and allows you to specify the exact number of model-layers you’d like to offload to the GPU. This is useful for hybrid CPU+GPU inferencing in memory-limited scenarios.
I’m keen to discuss this more and delve into the technical details with fellow enthusiasts. Feel free to connect via email at abheekg@hotmail.com or drop a hello on LinkedIn at / abheek-gulati
#GenAI #GenerativeAI #LLMs #LargeLanguageModels #genai #llm #RAG #generativeai #RetrievalAugmentedGeneration

Пікірлер: 4
@stevenmichaelis-martin3215
@stevenmichaelis-martin3215 8 күн бұрын
Great solution to a complex problem. Looks like a super helpful capability that will support building solutions that ensure data privacy in a local environment. Nice work! 🏍
@gujjewjetha
@gujjewjetha 12 күн бұрын
Absolute Madlad stuff
@yasin6904
@yasin6904 11 күн бұрын
Where can we download and try?
@abheekgulati8551
@abheekgulati8551 11 күн бұрын
Via GitHub: github.com/abgulati/LARS/
Unlimited AI Agents running locally with Ollama & AnythingLLM
15:21
How to set up RAG - Retrieval Augmented Generation (demo)
19:52
Don Woodlock
Рет қаралды 13 М.
⬅️🤔➡️
00:31
Celine Dept
Рет қаралды 37 МЛН
Её Старший Брат Настоящий Джентельмен ❤️
00:18
Глеб Рандалайнен
Рет қаралды 8 МЛН
ROCK PAPER SCISSOR! (55 MLN SUBS!) feat @PANDAGIRLOFFICIAL #shorts
00:31
Robert Greene: A Process for Finding & Achieving Your Unique Purpose
3:11:18
WE MUST ADD STRUCTURE TO DEEP LEARNING BECAUSE...
1:49:11
Machine Learning Street Talk
Рет қаралды 80 М.
Hands-On Power BI Tutorial 📊Beginner to Pro [Full Course] ⚡
3:05:45
Pragmatic Works
Рет қаралды 2,1 МЛН
Has Generative AI Already Peaked? - Computerphile
12:48
Computerphile
Рет қаралды 802 М.
Discover Prompt Engineering | Google AI Essentials
30:30
Google Career Certificates
Рет қаралды 36 М.
Should You Use Open Source Large Language Models?
6:40
IBM Technology
Рет қаралды 339 М.
A GENIUS Way to use ChatGPT for Presentations!
7:38
Jeff Su
Рет қаралды 98 М.
Controlling Your Dopamine For Motivation, Focus & Satisfaction
2:16:32
Andrew Huberman
Рет қаралды 10 МЛН
5 New AI Tools You Should Try
9:18
Skill Leap AI
Рет қаралды 27 М.