Run Llama 2 Web UI on Colab or LOCALLY!

Рет қаралды 37,332

11 ай бұрын

Llama 2 is latest model from Facebook and this tutorial teaches you how to run Llama 2 4-bit quantized model on Free Colab.
Camenduru's Repo github.com/camenduru/text-gen...
Colab used in the video - colab.research.google.com/git...
To run locally (on Linux):
Download the Colab as ipynb
Run all (make sure you have the GPU set up ready)
❤️ If you want to support the channel ❤️
Support here:
Patreon - / 1littlecoder
Ko-Fi - ko-fi.com/1littlecoder

Пікірлер: 44

@forwardatom 11 ай бұрын

Love your tutorials, keep them coming 🙂

@1littlecoder 10 ай бұрын

Thank you

@sonochord 11 ай бұрын

Thanks. I'd love to see how to fine-tune one of the LLaMa 2 Pretrained models via Colab if that were possible or Pytorch.

@SUGATORAY 11 ай бұрын

Thanks for making the video. 🎉

@darkreaper4990 10 ай бұрын

AMAZING! Thank you so much!

@gui-zx3di 11 ай бұрын

Now that it is open source, is it easier to embed Pdf and fine-tune the model to work with those ? Or another ai would be better suited ?

@tarun4705 11 ай бұрын

Very helpful. Thank you

@1ofallkind 11 ай бұрын

Thanks for sharing!. What is the best way to prepare dataset for tabular question answer?

@hannahcobb8380 11 ай бұрын

thank you so much ! could you possibly make a tutorial to show how we can train LLaMa on our own data?

@FatimaHABIB-jm4ji 11 ай бұрын

Thanks alot, can I use LLAMA model using my own PDF files and Langchain, to build a qustion answering system ? the same way as using openai models ?

@123arskas 11 ай бұрын

Thank you for posting this video. Hope you could share LangChain implementation of LLaMa 2

@DomenicoVotta 11 ай бұрын

Thx for your video, can you make a tutorial to explain how to train Llama 2 on your own custom dataset?

@ayushkumar2418 3 ай бұрын

Getting this error "'LlamaTokenizer' object has no attribute 'sp_model'", can anyone fix this

@christopherchilton-smith6482 11 ай бұрын

You are doing God's work. Of course by God I mean our future AI overlord.

@1littlecoder 10 ай бұрын

Haha

@OsakaRed 10 ай бұрын

Is there anybody who got Llama working on Windows? I tried to use the ipynb file and ran it with the Linux subsystem, but got lots of errors. Before that, I also tried following the official meta instructions from GitHub, but didn't get further than downloading the model and setting up the Conda environment with torch/cuda and running the installation script. Like, what do I do next? The Meta GitHub instructions are not particularly straightforward on how to use it 😅

@TheAmit4sun 10 ай бұрын

Any suggestion on how to use the private data embeddings with lama2?

@SirKingJax 11 ай бұрын

Hello, could you make a video about the cheapest server providers that supports running this? I keep running out of ram.

@uku4171 9 ай бұрын

How do I use the non-chat version/the base model? The restrictions on the chat version are really annoying. Thanks.

@mariocuezzo8027 10 ай бұрын

To run locally (on Linux): Download the Colab as ipynb this is not enough for running.... please explain more, what about content folder?

@ujjwalchetan4907 11 ай бұрын

What about the "training" tab that you didn't explain?

@tushardev5135 10 ай бұрын

will this collab notebook won't work on windows device servers

@jennilthiyam1261 7 ай бұрын

how to set up this on my local system. I have GPU.

@michaelkopec.5814 10 ай бұрын

what GPU do i need for largest llm model of Llama 2 to work with ? is 16GB enough ? i have RTX 3070 , i am looking to get intel's ARC A770 very informative video., thank you.

@michaelkopec.5814 10 ай бұрын

@@tejasundeep so multiple GPU will work too i guess, is there any benefit of going to intel's ARC A770 or should i stick with regular gaming GPUs ? e.g. 3090 TI/4090 ? Thank you.

@thehkmalhotra9714 9 ай бұрын

Hey thanks for the tutorial. I really love your content bro. But what if I want to add an API to this Llama2 collab. Like I want an API endpoint that is connected with Llama2 collab so that whenever I hit that endpoint it should give me the response from it. Will be waiting for your reply. Lot’s of love buddy ❤

@anuragsaxena674 4 ай бұрын

Did you get any solution

@toromanow 6 ай бұрын

How to invoke the deployed model prorgammatically?

@RyanSmith-rb1ch 11 ай бұрын

Is there anyway I could turn this free notebook into a api I could programmatically query?

@RyanSmith-rb1ch 11 ай бұрын

It looks like Text Generation Web UI has an api, but I get 403 when attempting to connect. Guessing the port must be opened in the notebook? I don't know how to do that.

@thehkmalhotra9714 9 ай бұрын

@@RyanSmith-rb1chHey you got any solution for this? Would love to know if you got any. I am also looking for the same. Thanks ❤

@dawndao4740 10 ай бұрын

how open de program of llama2? how do you can forget this

@ashishrathore7783 9 ай бұрын

Can you make a video on how to use Llama 2 through code just like how we use GPT models in code?

@tushaar9027 11 ай бұрын

Hi , can you please create a video of chatpdf like system using llama2 open source model

@fahnub 11 ай бұрын

I love open source.

@Kishor-ai 10 ай бұрын

ungakuda LinkedIn la eppadi connect panurathu solluga?

@starry1m 10 ай бұрын

how to run 70B on windows?

@ithanhunt3250 10 ай бұрын

*HOW TO DOWNLOAD THE WHOLE MODEL TO HAVE IT ON MY PC LOCALLY **#WITHOUT** INTERNET? YOU NEED TO EXPLAIN THAT*

@Ryan-yj4sd 11 ай бұрын

Make video on fine tuning and deploy to hugging face please

@QHawk7 9 ай бұрын

*You said use it commercially ? how? any ideas?* 1:12

@1littlecoder 9 ай бұрын

Please elaborate

@uku4171 9 ай бұрын

Doesn't work anymore.

@pyparul 10 ай бұрын

Hello, I am getting an error "ImportError: cannot import name 'is_npu_available' from 'accelerate.utils' (/usr/local/lib/python3.10/dist-packages/accelerate/utils/__init__.py)" can you please help me how can i fix this?

@1littlecoder 10 ай бұрын

Have you installed accelerate ?