The Best Way to Deploy AI Models (Inference Endpoints)

  Рет қаралды 11,834

VRSEN

VRSEN

Күн бұрын

Unlock your AI model's full potential with serverless deployment 🚀 Dive into our comprehensive guide on deploying open-source models with Hugging Face and shape the future of AI! 💡🤖
Notebook: colab.research.google.com/dri...
🤝For all sorts of projects, reach out to me via email on the "About" page of my channel.
📞Consulting: calendly.com/vrsen/ai-project...
🐦Twitter: / __vrsen__
Intro 00:00
Understanding the Tradeoffs: Different Deployment Options 00:44
Serverless Deployment: An Efficient Solution 02:32
A Practical Walkthrough: Deploying a Model from Hugging Face 03:33
Conclusion 04:57
About: Explore the ins and outs of AI model deployment in this comprehensive video tutorial. We'll cover popular options such as cloud-based, on-premise, edge, and serverless deployments, focusing on their trade-offs in cost, latency, and scalability. Learn how to optimally deploy open-source models from Hugging Face, harnessing serverless deployment's power to unlock your AI model's full potential. Understand the future trends in AI deployment and engage in a practical walkthrough for serverless model deployment using Hugging Face's inference endpoints. Ideal for AI enthusiasts seeking to enhance their knowledge in efficient model deployment.

Пікірлер: 36
@bjorginson
@bjorginson Жыл бұрын
Great editing and information delivery, keep it up!
@pequod4557
@pequod4557 11 ай бұрын
Very informative video & straight to the point. Thanks for making these!
@mykhailoshchuka
@mykhailoshchuka Жыл бұрын
Great video! The visual presentation of information is very creative and helps me get more out of this video!
@HeyFaheem
@HeyFaheem 5 ай бұрын
Highly informative yet underrated
@KelvinWKiger
@KelvinWKiger 8 ай бұрын
Excellent explanation, thank you.
@user-nr4jp7vi2y
@user-nr4jp7vi2y 9 ай бұрын
Great great video!!
@udaym4204
@udaym4204 9 ай бұрын
hats off to you Thank you.
@father_mihai
@father_mihai 9 ай бұрын
Solid video! I've been wondering whether hugging face charges per usage minutes, or whether it charges for non-stop uptime. I couldn't find anything online in layman terms, so I stayed away from it /9didnt want to spend 500$+ per month for a small app). This finally answered the question. Thanks man!
@evanscastonguay
@evanscastonguay Жыл бұрын
really good video as usual. :)
@vrsen
@vrsen Жыл бұрын
Thank you, Evans🙏 Next one will be better!
@user-ew8ld1cy4d
@user-ew8ld1cy4d Жыл бұрын
Good one VRSEN!
@vrsen
@vrsen Жыл бұрын
Thanks you🙏
@asdasdaa7063
@asdasdaa7063 6 ай бұрын
My question is i see, "After 15 minutes" it becomes serverless? Does that mean for those 15 minutes, before it becomes serverless, you will be charged? If the answer is yes then its better to use something like azure functions or something maybe?
@stefanvasilev8948
@stefanvasilev8948 Жыл бұрын
Simple and straight to the point
@karamjittech
@karamjittech 11 ай бұрын
Nice video, but hugging face charges for the defined minutes. Any video on replicate or any other options we have?
@MohanKumar-gj9th
@MohanKumar-gj9th Жыл бұрын
Cool video animation
@Ryan-yj4sd
@Ryan-yj4sd Жыл бұрын
nice
@vintagegenious
@vintagegenious Жыл бұрын
Awesome mate, I actually needed your videos for the project I am working on right now. Do you know if HuggingFace endpoints are "private", can you host models trained on personal GDPR data ?
@vrsen
@vrsen Жыл бұрын
Yeah, you can make it “protected,” than only you will be able to call it with your api key
@vintagegenious
@vintagegenious Жыл бұрын
@@vrsen Nice
@Ryan-yj4sd
@Ryan-yj4sd Жыл бұрын
How to do batch inference. I have 5mm prompts I want to run. Is it possible to for reasonable cost?
@vrsen
@vrsen Жыл бұрын
I think you will have to run them 1 by 1 in a loop, unfortunately.
@heski6847
@heski6847 8 ай бұрын
спасибо
@shaonsikder556
@shaonsikder556 11 ай бұрын
What are the cost of serverless deployment?
@admiralhyperspace0015
@admiralhyperspace0015 11 ай бұрын
I don't use inference endpoints because its not by the second charge, It charges as you said 15 minutes above when you stopped inference. That is fifteen minutes everytime I did not use my gpu. I have been looking at other providers just for this issue.
@vrsen
@vrsen 11 ай бұрын
Check out replicate
@The.Now.Network
@The.Now.Network Жыл бұрын
What do you use to edit your video's ?
@vrsen
@vrsen Жыл бұрын
My video editor😂
@vintagegenious
@vintagegenious Жыл бұрын
​@@vrsenThat's actually a good plan
@vintagegenious
@vintagegenious Жыл бұрын
What about RunPod serverless?
@vrsen
@vrsen Жыл бұрын
Haven’t tested it. Must be good. Hugging face is just super easy to deploy
@niteshgupta9331
@niteshgupta9331 11 ай бұрын
getting this ? "Inference API does not yet support transformers models for this pipeline type"
@edzynda
@edzynda Жыл бұрын
Who edits your videos?
@vrsen
@vrsen Жыл бұрын
www.upwork.com/freelancers/~01153421eb93b94914
@user-yo8yn7bh9f
@user-yo8yn7bh9f Жыл бұрын
Бро кто ты такой, пока Россия живет в 2021 ты уже в 2025
@vrsen
@vrsen Жыл бұрын
АрсенийGPT
Deploy ML model in 10 minutes. Explained
12:41
Danil Zherebtsov
Рет қаралды 14 М.
НЫСАНА КОНЦЕРТ 2024
2:26:34
Нысана театры
Рет қаралды 1,5 МЛН
A teacher captured the cutest moment at the nursery #shorts
00:33
Fabiosa Stories
Рет қаралды 55 МЛН
Mama vs Son vs Daddy 😭🤣
00:13
DADDYSON SHOW
Рет қаралды 50 МЛН
I'm Excited To see If Kelly Can Meet This Challenge!
00:16
Mini Katana
Рет қаралды 29 МЛН
How to Build Your First AI-Powered Web App
15:31
VRSEN
Рет қаралды 5 М.
Deploy models with Hugging Face Inference Endpoints
16:45
Julien Simon
Рет қаралды 15 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,3 МЛН
Fast LLM Serving with vLLM and PagedAttention
32:07
Anyscale
Рет қаралды 20 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 119 М.
Hugging Face LLMs with SageMaker + RAG with Pinecone
32:30
James Briggs
Рет қаралды 17 М.
vLLM on Kubernetes in Production
27:31
Kubesimplify
Рет қаралды 2,1 М.
Как противодействовать FPV дронам
44:34
Стратег Диванного Легиона
Рет қаралды 98 М.
КРУТОЙ ТЕЛЕФОН
0:16
KINO KAIF
Рет қаралды 7 МЛН
Лучший браузер!
0:27
Honey Montana
Рет қаралды 1,1 МЛН
Хакер взломал компьютер с USB кабеля. Кевин Митник.
0:58
Последний Оплот Безопасности
Рет қаралды 2,3 МЛН