"okay, but I want Llama 3 for my specific use case" - Here's how

  Рет қаралды 83,277

David Ondrej

David Ondrej

Күн бұрын

If you want a personalized AI strategy to future-proof yourself and your business, join my community: www.skool.com/new-society
Follow me on Twitter - x.com/DavidOndrej1
Please Subscribe.
Major credit to @engineerprompt who beautifully explained the entire Google Colab.
Title heavily inspired by: @AIJasonZ
My Google Colab: colab.research.google.com/dri...
Unsloth GitHub: github.com/unslothai/unsloth?...
Dataset: huggingface.co/datasets/yahma...

Пікірлер: 101
@DavidOndrej
@DavidOndrej 26 күн бұрын
If you want a personalized AI strategy to future-proof yourself and your business, join my community: www.skool.com/new-society
@ThisIsSimonBorec
@ThisIsSimonBorec 26 күн бұрын
I highly recommend it, the community is fabulous!
@vishalsaichindepalli2798
@vishalsaichindepalli2798 26 күн бұрын
It would be great if you could make a video on how to create datasets for fine tuning using LLM's/Agents!
@d.d.z.
@d.d.z. 3 күн бұрын
It can be so helpful!
@christopheboucher127
@christopheboucher127 26 күн бұрын
yes dataset made by agents ! Thx for all your content !
@Matthew-tg4uk
@Matthew-tg4uk 6 күн бұрын
vicious circle. give llm a little data and say simulate. llm uses trained data to simulate. user takes simulated data and does the same in another llm. very little new data has been added to the system.
@AlphaCrucis
@AlphaCrucis 26 күн бұрын
This is the kind of content that I've been wanting to see that I haven't been able to find in an easily digestible form.
@laimi7
@laimi7 23 күн бұрын
Thank you for this video. The topic of fine-tuning was very interesting to me.
@jonathanholmes9219
@jonathanholmes9219 20 күн бұрын
Yes please. Team of Agents to create a fine tuning data set from your proprietary data.
@thiago.nobrega
@thiago.nobrega 26 күн бұрын
Keep up the amazing work bro. You provide us valuable knowledge.
@flavorbot
@flavorbot 25 күн бұрын
love the videos thanks a lot for taking the time to put them out
@dennisking8281
@dennisking8281 22 күн бұрын
Yes, please make a video on how to create the datasets for fine tuning AI - and Thanks for all you do.
@NB-qq8wo
@NB-qq8wo 25 күн бұрын
LOVE these empowering videos, thanks for sharing 🙏
@agenticmark
@agenticmark 17 сағат бұрын
your right headphone will be over your eye soon ;D Thanks for the great content David!
@alvinjamur1
@alvinjamur1 25 күн бұрын
why are some here trashing david ondrej? he is imparting knowledge in an easy to understand way for peeps that do not know. i wrote my first neural net from scratch in 1993 and i have been an ML practitioner since then. i can tell u that info back then was hard to come by. be grateful that u have easy access to it. if u dont like it better to move along rather than disperse caustic.
@jees__antony
@jees__antony 10 күн бұрын
Great work... Thanks for the tutorial ❤❤❤
@sethjchandler
@sethjchandler 26 күн бұрын
I have seen a lot of videos on fine-tuning and read a lot, and I have to say this is one of the most lucid, explanations. By making it very concrete and showing the code and, importantly, the training data you make very clear what is going on in fine tuning great job!
@gonzalodijoux5953
@gonzalodijoux5953 8 күн бұрын
hello, do you know if it's possible to fine tune with ebook pdf on a specific domain (financial, medical...) ?
@kylearnold9647
@kylearnold9647 26 күн бұрын
You're putting out some great content
@VaibhavPatil-rx7pc
@VaibhavPatil-rx7pc 26 күн бұрын
Excellent information ❤
@Chris-zc9bp
@Chris-zc9bp 26 күн бұрын
TY I finally trained my first model. Here's another vote for the how to create the fine tuning using LLM agents.
@jayhu6075
@jayhu6075 26 күн бұрын
This is a very useful topic, in the future we can train our datasets to specifically use them for different applications, particularly in healthcare or other institutions, benefiting people. Hopefully, a next topic will be about how to create your own datasets. Thanks for the explanation
@darkesco
@darkesco 25 күн бұрын
Very useful information!
@zeynelacikgoez
@zeynelacikgoez 26 күн бұрын
It would be an interesting topic for a video on how to use agents to generate data for fine-tuning.
@DrKnowitallKnows
@DrKnowitallKnows 26 күн бұрын
Hey I could be wrong but I believe you'll have better luck fine tuning with a less quantized version of the model. At least 8 or 16bit would be preferable to 4. I'm not an expert on quantized models, but you lose a lot of resolution when you quantize that much and that likely makes it more challenging for the LoRA to train. Definitely correct me if I'm wrong, folks, but I think this is the case.
@AlphaCrucis
@AlphaCrucis 26 күн бұрын
Nice to see you here!
@DrumAndSpaces
@DrumAndSpaces 26 күн бұрын
perfect timing i was just thinking about having multiple llama 3 versions fine tuned for specific coding projects instead of a broad coding language base. is this just a waste of time and im better off having a general coding version instead? i was considering having a few fine tuned models to imitate a development team with crew.
@UnSingeEnivre
@UnSingeEnivre 25 күн бұрын
I would love to see a dataset fine tuning tutorial!
@glorixx5974
@glorixx5974 7 күн бұрын
Great vid, it would be really awesome if you could make a video on how to make data sets for fine tuning! That would help a lot
@kamipls6790
@kamipls6790 26 күн бұрын
Hey Ondrej! I think this might be a stretch of the topic, but is it possible to use an llm like llama 3 and fine tune it to respond in another language or would it be necessary to train an llm from scratch for this?
@icesteel5855
@icesteel5855 26 күн бұрын
As I am , i need to know this
@user-xk6rg7nh8y
@user-xk6rg7nh8y 10 күн бұрын
Thanks alot !!! it is really helpful :)
@Balajik7-qh1pq
@Balajik7-qh1pq 7 күн бұрын
awesome David
@humanbeingmusic
@humanbeingmusic 24 күн бұрын
can you offer any advise about importing the ggufs into ollama, mine just spit out gibberish, I presume it has something to do with the modelcard but no idea
@user-ef4df8xp8p
@user-ef4df8xp8p 26 күн бұрын
Thank you...
@carnageasada5350
@carnageasada5350 26 күн бұрын
Please do make a video on creating datasets, both with and without the use of agents!
@zippytechnologies
@zippytechnologies 24 күн бұрын
If there a way to generate the dataset input output data from contextual data like emails and q&a from website forums?
@joseeduardobolisfortes
@joseeduardobolisfortes 15 күн бұрын
This video is exactly what I was looking for. Thank you. Now, I wish to know which hardware configuration I will need to install and use Llama 3 models locally in my own machine. Can you help me?
@PaulFishwick
@PaulFishwick 14 күн бұрын
This seems like a lot of work in forming the data prep rather than the RAG approach (eg. custom GPTs) where you embed N documents to “fine tune”. Thoughts on each approach?
@gnoppixlinux
@gnoppixlinux 25 күн бұрын
love the 3 primary colors at 10:12 :)
@SpicyMelonYT
@SpicyMelonYT 8 күн бұрын
Is the trained model able to be used with "ollama run trained_model_name"? Do I have to download it directly and put it some where for that to work? I currently have a python program setup that uses the ollama module and runs llama3. But I would like to use a fine tuned model instead as I am trying to make a Jarvis like personal assistant!
@kamruzzamanuzzal3764
@kamruzzamanuzzal3764 26 күн бұрын
any way to input images as well to fine tune on image and text at the same time?
@Jonathan-et4df
@Jonathan-et4df 23 күн бұрын
please make a video on how to create datasets!
@wetcel1236
@wetcel1236 25 күн бұрын
Hey David, thanx for this awesome served topic! Exactly what I need to get through this week 😅
@yongxing1848
@yongxing1848 22 күн бұрын
when are you going to make datasets for fine-tuning, I have currently data in mysql that I need to extract and create the datasets for fine-tuning llama.
@nasiksami2351
@nasiksami2351 19 күн бұрын
Hey David, great video and great explanation. Please make a tutorial on how to generate dataset using LLM. For my use case, I have a classification problem and the class imbalance is severe. for the minority classes, I want to generate more meaningful samples using LLM and then build an LLM model to do text classification on the dataset. Any suggestion on achieving this would be great!
@user-fe4qc7ot5d
@user-fe4qc7ot5d 18 күн бұрын
Where is fine tuning models stored and how can I find and download it for use?
@gileneusz
@gileneusz 25 күн бұрын
10:29 that would be great tbh, using agents to make dataset to finetune the model is just like inception, you can also make agents to prepare dataset for other agents to create dataset to finetune the model (inception level 2) or make agents to prepare dataset for agents to prepare dataset for finetuning the model which will be used to prepare dataset for agents to prepare dataset............
@siema32
@siema32 8 күн бұрын
Actually GPT-4 can be fine-tuned by the user, it's done within the openai API and of course used by it's API later on. It obviously has downsides, like the model is still invoked on the OpenAI servers and they are collecting all the data which goes through it (no privacy), but it is possible :)
@trueindian03
@trueindian03 21 күн бұрын
How to train a data set which is not in the form of instructions, input, output format, lets say I want to train the model using the data from a pdf, or any other means, how can we do that, please suggest some ideas. Thanks in advance.
@Will_669
@Will_669 8 күн бұрын
what's the dataset like if train for conversations? for example: in a conversation, we have one instruction, multi inputs, and multi outputs
@sourabhiitian
@sourabhiitian 13 күн бұрын
hi i have a question, what if i want to use my dataset json file into the cell instead of huggingface alpaca json. Can you give the part of the input code
@RemekKinas
@RemekKinas 26 күн бұрын
I am looking for tutorial how to generate dataset using Agents. There is no such tutorial (or I am not able to find it). It would be great to generate chat format (conversation) dataset as a response of task. So as an input you have list of task, question and then agents generate conversation to this topic.
@belu6914
@belu6914 10 күн бұрын
Did anyone get the example running? The copied notbook results in an error when starting the training. I already fixed the missing comma and set the max_steps to 60.
@josephtilly258
@josephtilly258 25 күн бұрын
Are local LLM really that local or just free ? Because I'm not really running it on my computer, more of a cloud base free and flexible llm ?
@theobgshow
@theobgshow 21 күн бұрын
Yea they are. You can run ollama on your computer then pull down a model, such as llama3, Mistral or Dolphin and run everything, completely locally
@shaigrustamov5115
@shaigrustamov5115 11 күн бұрын
it's a good video, thanks. But there are a lot of videos about fine tuning. It would be perfect if you would create a video on how to create own data sets for fine tuning. 👍
@jackderrida
@jackderrida 26 күн бұрын
3:37 He is 100% correct that already fine-tuned LLMs like GPT, Claude, and even Gemini 1.5 Pro with 1m+ context, are freaking awful at trying to emulate writing styles. Worst part about ChatGPT for this purpose is that no matter how much you tell it not to, it's filled with clauses like "On the other hand,", "Finally, ", or "As a consequence" and I'll explain to it again all the reasons those phrases don't belong in a rap song.
@nimesh.akalanka
@nimesh.akalanka 22 күн бұрын
Is there any free method to fine tune an large language model locall. I have a small workstation with 128GB DDR4 memory, Nvidia RTX A1000 X2 SLI VGA, AMD Threadripper process. I tried AutoTune-Advanced and LLaMA-Factory. They both failed on me. Autotrain say I dont have enough VRAM. LLaMA-Factory say I dont have CUDA. Please help me.
@andrelvcoelho
@andrelvcoelho 20 күн бұрын
Yeah, it would be nice if you could set up a video showing how to automatically generate datasets for fine-tuning LLMs… Tks
@strategy419
@strategy419 9 күн бұрын
did you try finetuning gpt3.5 on the playground?
@christiansroy
@christiansroy 15 күн бұрын
You can definitely fine-tune ChatGPT 3.5 and you can also ask open AI to invite you to their private waitlist to be able to fine tune GPT 4. So it is definitely possible.
@ScROnZjara
@ScROnZjara 25 күн бұрын
More content with lama!!! 🙏❤️
@richierosewall3035
@richierosewall3035 21 күн бұрын
Hey what about phi-3..?
@stanisd
@stanisd 19 күн бұрын
open AI has its own API for fine-tuning
@GreenStorm01
@GreenStorm01 22 күн бұрын
How about Fine-Tuning vs. RAG in those specific things?
@adilzahir9921
@adilzahir9921 26 күн бұрын
I want to use that for my work , i want to use it to find the best strategy for debts recoverts and to choose the debtors who will pay mostly and who don't ,how i should proceed ? Thanks
@ASchnacky
@ASchnacky 26 күн бұрын
I had same idea
@adilzahir9921
@adilzahir9921 25 күн бұрын
@@ASchnacky y'a that would be great if we can do that without coding ,good luck
@tekipeps
@tekipeps 22 күн бұрын
Nice, how to deploy the saved model?
@CodingScot
@CodingScot 26 күн бұрын
Do you ever sleep? Wow this is amazing 🎉👏
@AlejandroCastillo9
@AlejandroCastillo9 25 күн бұрын
I want to Create a Lama 3 legal Assistent. I would be happy in you can Show a data prep example
@alma4355
@alma4355 25 күн бұрын
I'm a subscriber, please make the making dataset video
@thedatascientist-lg4ls
@thedatascientist-lg4ls 17 күн бұрын
Yeah, a video on agents for finetuning datasets with a fine tuned LLMs, and used by agents for a real world application.
@brianmorin5547
@brianmorin5547 12 күн бұрын
If pushing to hugging face, no config.json so won’t work
@eldinmujovic8705
@eldinmujovic8705 26 күн бұрын
Can I do this in any language?
@harristengku7153
@harristengku7153 26 күн бұрын
Oh wow you managed to fix the fine-tuning issue? Its been a headache for the entire open source rn, because Llama 3 trained their models differently so every fine tune would end up way worse than the original base model.
@DavidOndrej
@DavidOndrej 26 күн бұрын
If you watch the video, you will see that I openly admit that I am not an expert when it comes to fine-tuning. In fact, making this video definitely was outside of my comfort zone.
@harristengku7153
@harristengku7153 26 күн бұрын
@@DavidOndrejay respect man. In fact I think you should include more fine tuning to your videos in the future. You can’t run away from fine tuning if you want A.I to move to commercial use. Llama 3 is probably the only exception in the industry rn that has everyone stumped
@brutely9718
@brutely9718 10 күн бұрын
NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs:
@samfisher92sc
@samfisher92sc 25 күн бұрын
Please bro make a video for create datasets
@Corteum
@Corteum 25 күн бұрын
What's an example of what you or someone else has created with this?
@MrAtomUniverse
@MrAtomUniverse 11 күн бұрын
its no longer april T.T
@kingKai2022
@kingKai2022 8 күн бұрын
colab doesnt work....
@zoranProCode
@zoranProCode 25 күн бұрын
Lava FRI?
@pythonholic
@pythonholic 26 күн бұрын
I really don't see the benefit in using an AI agent. I've tried to understand its purpose, but it seems like another way of avoiding using GPT and similar models. Can you give us a real example? Perhaps even instances of freelance use?
@HakaiKaien
@HakaiKaien 25 күн бұрын
AI agents are a bit different from chat bots. With chat bots, you have a large language model responding to your prompt. With agents, you have a bunch of models talking among themselves to accomplish a task you give them. You can think about Agents as a company of employees. You give each of them roles and functions. You can use agents to build an application or a game for example.
@jackgaleras
@jackgaleras 22 күн бұрын
Fine tune or RAG
@RedShipsofSpainAgain
@RedShipsofSpainAgain 22 күн бұрын
This guy's community is $77/month. There's 510 members. $77/month * 12 months = $924/yr. $924/yr * 510 members = $471,240/yr. So this guy's subscription is grossing nearly half a million USD annually.
@DavidOndrej
@DavidOndrej 22 күн бұрын
I wish… not everyone joined at 77
@user-jc6tj2xt1p
@user-jc6tj2xt1p 24 күн бұрын
Wanna be yer cmdmp 😊
@b6234
@b6234 16 күн бұрын
I stopped at "10x better" I will make my life 100 time better by not watching
@matthewm8289
@matthewm8289 26 күн бұрын
Its not Apache 2.0 licence, so you are very limited what you can do. Its not opensource!
@braadress
@braadress 25 күн бұрын
Yes, it's opensource. Llama 3 is released under the CreativeML Open RAIL-M license. This license allows for broad use, including commercial use, while imposing certain restrictions aimed at ensuring responsible usage and maintaining safety.
@my-financial-wealthblog4423
@my-financial-wealthblog4423 12 күн бұрын
I watched your video. Understood nothing.
@dennisdemers9880
@dennisdemers9880 26 күн бұрын
I joined the community but I don't know how to access it. Or when are the weekly meetings.? Trying to generate a python program as it turns out it's getting more and more sophisticated. Just llama three better at it than Chad gbt4
@DavidOndrej
@DavidOndrej 26 күн бұрын
You can access it with the same link. The weekly meetings are on Tuesday and Saturday - more details in the "Calendar" tab at the top www.skool.com/new-society/calendar
@ramezdemitry3249
@ramezdemitry3249 24 күн бұрын
NameError Traceback (most recent call last) in () 11 {}""" 12 get_ipython().system('pip install tokenizer') ---> 13 EOS_TOKEN = tokenizer.eos_token # do not forget this part! 14 def formatting_prompts_func(examples): 15 instructions = examples["instruction"] NameError: name 'tokenizer' is not defined what shall i do here?
@Sunshine-vd3bf
@Sunshine-vd3bf 21 күн бұрын
It would be great if you could make a video on how to create datasets for fine tuning using LLM's/Agents!
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 224 М.
КАРМАНЧИК 2 СЕЗОН 4 СЕРИЯ
24:05
Inter Production
Рет қаралды 667 М.
ШЕЛБИЛАР | bayGUYS
24:45
bayGUYS
Рет қаралды 633 М.
How did CatNap end up in Luca cartoon?🙀
00:16
LOL
Рет қаралды 7 МЛН
Don’t take steroids ! 🙏🙏
00:16
Tibo InShape
Рет қаралды 33 МЛН
host ALL your AI locally
24:20
NetworkChuck
Рет қаралды 500 М.
Build Anything with AI Agents, Here's How
29:49
David Ondrej
Рет қаралды 144 М.
Make Anything with ChatGPT, Here’s How
1:10:38
David Ondrej
Рет қаралды 9 М.
How to Install and test LLaMA 3 Locally [2024]
10:36
CodewithBro
Рет қаралды 15 М.
26 Incredible Use Cases for the New GPT-4o
21:58
The AI Advantage
Рет қаралды 161 М.
Meta Llama 3 Fine tuning, RAG, and Prompt Engineering for Drug Discovery
1:07:41
Unleash the power of Local LLM's with Ollama x AnythingLLM
10:15
Tim Carambat
Рет қаралды 85 М.
LLAMA-3 🦙: EASIET WAY To FINE-TUNE ON YOUR DATA 🙌
15:17
Prompt Engineering
Рет қаралды 37 М.
How I Made AI Assistants Do My Work For Me: CrewAI
19:21
Maya Akim
Рет қаралды 657 М.
I Analyzed My Finance With Local LLMs
17:51
Thu Vu data analytics
Рет қаралды 374 М.
КАРМАНЧИК 2 СЕЗОН 4 СЕРИЯ
24:05
Inter Production
Рет қаралды 667 М.