Flowise Record Manager: Stop Duplicate Data Forever!

Views: 4,772

Leon van Zyl


In this Flowise Record Manager video, we look at one of the easiest ways to prevent duplicate and stale data in your vector stores.
🙏 Support My Channel:
Buy me a coffee ☕ : www.buymeacoffee.com/leonvanzyl
PayPal Donation: www.paypal.com/ncp/payment/EK...
💬 Chat with Like-Minded Individuals on Discord:
/ discord
🧠 I can build your chatbots for you!
www.cognaitiv.ai
🕒 TIMESTAMPS:
00:00 - Intro
02:23 - Prevent Duplicate with Record Manager
04:49 - Adding Record Manager to Flow
06:03 - Setting up Postgres Database
07:45 - Record Manager Additional Parameters
09:11 - Simple Text Doc Uploader
09:42 - Set SourceId Meta Data
11:35 - Cleanup Method NONE
13:12 - Cleanup Method INCREMENTAL
14:49 - Cleanup Method FULL
16:20 - Outro

Comments: 58
@leonvanzyl 2 months ago
Hi guys, one note on the None cleanup method. In the video, at 13:00, we only see 2 records. This was a bug. The None method will actually add the changed document to the vector store, but it will NOT delete the previous record. It will also not remove the deleted document "Cat". This means that you will see 3 records in total (Dog, Dog 2 and Cat). Thanks to the Flowise team for pointing this out.
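The behaviour described in this note can be illustrated with a small toy model. It is only a sketch of the three cleanup methods as they are described in the video and in the note above, not Flowise's actual implementation. `ToyRecordManager`, `doc_hash`, `demo`, and the `dog.txt` / `cat.txt` source names are all hypothetical, and a plain dictionary stands in for the vector store.

```python
import hashlib


def doc_hash(text, source):
    """Hash the content together with its source id (hypothetical scheme)."""
    return hashlib.sha256(f"{source}\n{text}".encode("utf-8")).hexdigest()


class ToyRecordManager:
    """Toy stand-in for a record manager plus vector store (assumption)."""

    def __init__(self):
        self.records = {}  # hash -> (source, number of the run that last saw it)
        self.store = {}    # hash -> text, standing in for the vector store
        self.run = 0

    def upsert(self, docs, cleanup="none"):
        """docs is a list of (text, source) pairs."""
        self.run += 1
        added = skipped = 0
        seen_sources = set()
        for text, source in docs:
            key = doc_hash(text, source)
            seen_sources.add(source)
            if key in self.records:
                skipped += 1            # identical content: nothing to embed
            else:
                self.store[key] = text  # new or changed content: add it
                added += 1
            self.records[key] = (source, self.run)

        # Decide which records from earlier runs are stale.
        stale = []
        for key, (source, last_seen) in self.records.items():
            if last_seen == self.run:
                continue
            if cleanup == "full":
                stale.append(key)       # anything not in this run goes
            elif cleanup == "incremental" and source in seen_sources:
                stale.append(key)       # only old versions of re-sent sources go
        for key in stale:
            del self.records[key]
            del self.store[key]
        return {"added": added, "skipped": skipped, "deleted": len(stale)}


def demo(cleanup):
    """Upload Dog and Cat, then change Dog to 'Dog 2' and drop Cat."""
    rm = ToyRecordManager()
    rm.upsert([("Dog", "dog.txt"), ("Cat", "cat.txt")], cleanup)
    rm.upsert([("Dog 2", "dog.txt")], cleanup)
    return sorted(rm.store.values())
```

Under these assumptions, `demo("none")` leaves Cat, Dog and Dog 2 in the store (the 3 records mentioned above), `demo("incremental")` leaves Cat and Dog 2, and `demo("full")` leaves only Dog 2.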
@HassanAllaham 2 months ago
Very useful, direct, and clear content, as usual. Thank you from the bottom of my heart. With every video related to Flowise, I get more and more sure that Flowise is the right way to get the best results out of LLMs. KZfaq asked me about this channel and my answer was: it is a life changer (5 stars). I have watched, and keep watching, many, many channels, and this channel is one of the best I have ever watched. Even though my notification bell is on "ALL", the first thing I do whenever I open my browser is come here to look for a new video. Believe it or not, one of the things I hate about KZfaq is that it does not let us add more than one like to a video. Sometimes I like to re-watch one or more of your videos, and every time I automatically hit the like button, only to find that I am mistakenly removing the old like. I hope that KZfaq will one day let us add more than one like. Again, thank you for the good, useful content. Have a nice day. 🌹🌹🌹
@leonvanzyl 2 months ago
I wish I could give this comment more than 1 heart ❤️❤️. You have absolutely made my day!
@HemaSherbeny 2 months ago
Please don't stop!!! I searched all over the internet to find an answer to "how to update a vector DB with new data" and never found a solution!! Really, thanks Leon
@leonvanzyl 2 months ago
You're welcome 🤗
@regman25 27 days ago
Each of your videos is amazingly simple to understand. Thanks for your work, Sir!
@Alvaro-cs7zs 2 months ago
Another great video!! Leon, I don't know if you accept suggestions, but some of us have asked in the Flowise Discord channel for a video about how to manage things when you have a lot of knowledge you want to include in your Flowise flows: how to use different retrievers with different sources of information with just one agent. A video about that would be highly appreciated!! Thanks Leon. Your work is amazing!
@leonvanzyl 2 months ago
You could just add more than one retrieval tool, one for each source?
@ThePandaGuitar 27 days ago
Man, your videos are so clear and well explained
@MladenKirilov 2 months ago
This was on my agenda to try today and you posted it 5 min ago. Thanks for all the content that you make!
@leonvanzyl 2 months ago
Great minds think alike 😉
@vskavo 2 months ago
I was going to look at how this works today too! Thanks Leon!!!
@maniecronje 2 months ago
Thanks Leon, always great to watch your videos. A small thank you, enjoy the coffee ☕️
@leonvanzyl 2 months ago
Thank you for the coffees Manie! ❤️
@nicolasgramnea1324 2 months ago
Thank you for this. I was looking for a good solution to update file data. Your videos are always clear and easy to follow.
@leonvanzyl 2 months ago
You're welcome 🤗
@GilbertMizrahi 2 months ago
Great video. Super helpful. Thanks for doing all these videos.
@leonvanzyl 2 months ago
You're welcome 🤗
@ziayounasch 2 months ago
Thanks a lot for such an informative video... Long awaited...
@WayneBruton 2 months ago
Great video Leon, Thanks
@leonvanzyl 2 months ago
You're welcome
@ricardofernandez2286 a month ago
Hello Leon, I hope you are doing great!! Theoretically perfect! Thank you very much for the explanation!! However, I have a "fun" situation I'd like to share. I have configured everything in full cleanup mode and I get no errors. I execute the upsert and my 58 embeddings from 3 txt files are added (shown as added in the result box, inserted into the vector database, Chroma, and inserted into my record manager database, MySQL, with the correct source metadata). So far, so good. Now I run the upsert again, with no changes to the data or files, and I get 58 skipped, 58 deleted. And indeed all the records are deleted from everywhere. I would have expected only 58 skipped: nothing to add, but all the files and data still present in the upsert. I run the upsert again: 58 added. Run it again: 58 skipped, 58 deleted. And so on... Your insights will be highly appreciated!!! Thank you very much!!
@sdcharly 2 months ago
Perfectly explained again, @Leon!! Loved it. Not sure what to do with all the junk already in Pinecone though, lol!!
@leonvanzyl 2 months ago
Clear the DB or use a new namespace 👍
@MsAntonio664 2 months ago
OOHH!! THANKS!!! INCREDIBLE.
@jimbob3823 2 months ago
This was perfect, thank you for explaining! Now I am stuck on how to apply a "source" value so I can use incremental cleanup on multiple documents or, say, scraped content. Do you use the same method as in the plain text input examples you demonstrated?
@rude_people_die_young 2 months ago
Next level content once again. When it's logical to do so, I'd love to see a fully local version so we don't sprawl across multiple cloud solutions.
@muhammadmursalin8915 29 days ago
Excellent boss😊😊
@AliKibao 2 months ago
Gems! Thank you.
@leonvanzyl 2 months ago
You're welcome 🤗
@MaliRasko 2 months ago
Legend!
@leonvanzyl 10 days ago
Thank you for the generous support! 💖
@drtunmyatwin a month ago
Thanks for your video. I am following what you teach in the video and it works well, but I want to know how to reduce the vector count. Is deleting the index the correct way to delete vectors? What I want is to delete only the vectors that were duplicated during testing, not to delete the index.
@mirensookdeo 2 months ago
Another good tutorial Leon. Does this work with LlamaIndex as well or is it only for the LangChain components?
@leonvanzyl 2 months ago
It should work as long as the node has an input for a Record Manager 👍.
@estevegraells 2 months ago
Amazing, Leon. Just one question if you have time, about the metadata for every file (I am uploading 300 PDFs): in the video you mention that we can use the "filename" as the source, which is a good idea. How could you pass the filename to the "source" node as a parameter? Thanks in advance.
@RobertoFabrizi 2 months ago
I don't understand how the Record Manager works if I have a PDF and add a paragraph somewhere in it. Depending on the text splitter and chunk_size/overlap, this newly added paragraph might have a cascading effect, grouping the following paragraphs (unchanged as far as their content and meaning go) in a different way and therefore generating completely different hashes. How can the Record Manager handle this?
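The cascade described in this question is easy to reproduce. The sketch below assumes a naive fixed-size character splitter with no overlap and assumes that one hash is compared per chunk. `split_fixed`, `chunk_hashes` and `diff_counts` are hypothetical helpers, not Flowise or LangChain functions, and the text is made of repeated letters purely so the chunk boundaries are easy to follow.

```python
import hashlib


def split_fixed(text, chunk_size):
    """Naive splitter: consecutive chunks of chunk_size characters."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]


def chunk_hashes(text, chunk_size):
    """One SHA-256 hash per chunk, collected in a set."""
    return {
        hashlib.sha256(chunk.encode("utf-8")).hexdigest()
        for chunk in split_fixed(text, chunk_size)
    }


def diff_counts(old_text, new_text, chunk_size=20):
    """Count chunks a hash-based comparison would skip, add and delete."""
    old = chunk_hashes(old_text, chunk_size)
    new = chunk_hashes(new_text, chunk_size)
    return {
        "skipped": len(old & new),  # same hash in both versions
        "added": len(new - old),    # hashes that only exist after the edit
        "deleted": len(old - new),  # hashes that no longer exist
    }


# Four 20-character "paragraphs", then the same text with 4 characters
# inserted after the first paragraph.
original = "A" * 20 + "B" * 20 + "C" * 20 + "D" * 20
edited = "A" * 20 + "new!" + "B" * 20 + "C" * 20 + "D" * 20
result = diff_counts(original, edited)
```

Here `result` is `{"skipped": 1, "added": 4, "deleted": 3}`: only the chunk before the insertion keeps its hash, and every chunk after it is re-embedded even though its meaning has not changed. The stored data still ends up correct; the cost is extra embedding work. A splitter that breaks on paragraph or section boundaries would likely limit how far the shift spreads, though that depends on the splitter and its settings.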
@sdcharly 2 months ago
Thanks!
@leonvanzyl 2 months ago
Thank you for the support ❤️
@alexkimen654 2 months ago
Thank you for the video. It's very useful. Could you help me with a question, though: how can I use Flowise to take a large text file, split it into fragments, and then use AI to summarize/translate/abstract each fragment (I've understood up to this step)? What I do not understand is how to then combine all these summaries/abstracts into one text file and save it to disk. Thanks for your help.
@estevegraells 2 months ago
Thank you!
@leonvanzyl 2 months ago
Thank you for the support ❤️
@RaghvendraSingh21 2 months ago
Hi Leon, thanks a lot. Just checking this out. Can you also suggest some embedding models or tools that can ingest documents with graphs and pictures?
@leonvanzyl 2 months ago
You need to use the unstructured document loader for that. Will create a tutorial on it 👍
@RaghvendraSingh21 2 months ago
@leonvanzyl Great, that would be awesome!
@wilschoy78 2 months ago
As always, sir Leon, very informative and useful. I hope you can help me with how to call an upsert method from external sources. Currently I am working on Moodle plugin development that calls the HTML chatbot, but my problem is that the uploading part should be done inside Moodle. Again, thanks, and keep uploading!!! God bless!!!
@leonvanzyl 2 months ago
Check out my API video. You can call the API from outside of Flowise to upsert documents.
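As a rough sketch of what calling the upsert API from an external application could look like: the code below assumes the vector upsert endpoint has the form `/api/v1/vector/upsert/<chatflow-id>` and accepts a JSON body with an `overrideConfig` object, which should be verified against the documentation for your Flowise version. The base URL, the chatflow ID, the API key and the helper names `build_upsert_request` and `send_upsert` are placeholders for illustration.

```python
import json
import urllib.request


def build_upsert_request(base_url, chatflow_id, override_config=None, api_key=None):
    """Build (but do not send) a POST request for the assumed upsert endpoint."""
    url = f"{base_url.rstrip('/')}/api/v1/vector/upsert/{chatflow_id}"
    body = json.dumps({"overrideConfig": override_config or {}}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Only needed if the chatflow is protected with an API key.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def send_upsert(request, timeout=60):
    """Send the request and decode the JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Placeholder values: replace with your own instance and chatflow ID.
request = build_upsert_request("http://localhost:3000/", "my-chatflow-id")
```

Calling `send_upsert(request)` against a running instance would trigger the upsert for that chatflow. Building the request separately from sending it makes the URL and payload easy to inspect before anything is written to the vector store.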
@cooperchien1 a month ago
I see that with a single document update, the Flowise record manager can still update the vector database with the new document content without any metadata configuration. I guess metadata is only important if we (1) update more than one document or (2) need to track which document must be updated. This means that to maintain the version of the data in the vector database, we must also keep track of all the metadata assigned to each document we upsert, right?
@lucaciaraffo111 a month ago
Thank you very much, really good and accurate explanation! Could you clear up one doubt for me? How would the Record Manager work with web pages? I tried hooking up a scraping tool (Cheerio) to load data from a web page. The first upsert loads the data; if I do it again, it skips the data. It seems to rely on the URL. But what if I change the data within the page? For example, if I update my price list on the page, would there be a way for the record manager to understand that the data has changed, so that it deletes the old data and adds the new data? Thanks a lot in advance!
@leonvanzyl a month ago
Thank you for the feedback! That is exactly what RM will do. When the website changes and you upsert again, RM will detect that the document has changed and will therefore delete the old record and insert the new doc.
@lucaciaraffo111 a month ago
@leonvanzyl Got it! Thank you Leon!
@williamx0 2 months ago
Is Flowise essentially a visual flow builder for the RunnableSequence method from LangChain?
@leonvanzyl 2 months ago
Basically, yes. It uses LangChain with LCEL under the hood. It also implements LlamaIndex, so you get the best of both worlds. Flowise used to be a LangChain prototyping platform, but it has since evolved into a full-blown AI app builder.
@KratomSyndicate a month ago
The chatbot needs to understand the current date; the records being added continuously can just have the dates on which specials are active.
@florentflote 2 months ago
@alexfratello9930 2 months ago
These are great videos. I would love it if you could show how to create a chatbot with Flowise that also integrates with WhatsApp. I would be grateful.
@leonvanzyl 2 months ago
I've been torn about this. I do want to create a WhatsApp tutorial and have done this for clients. The problem is that it is very expensive and complex. Not just technically: you also need a registered company to even consider this process. I'd be creating the video for a very small number of people who would realistically be in a position to follow along.
@jimbob3823 2 months ago
How about Telegram?