In this video, I show you how to deploy Transformer models straight from the Hugging Face Hub to managed infrastructure on AWS in just a few clicks. Starting from a model that I already trained for image classification, I first deploy an endpoint protected by Hugging Face token authentication. Then, I deploy a second endpoint in a private subnet, and I show you how to access it securely from your AWS account thanks to AWS PrivateLink.
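For the private endpoint, traffic flows through an interface VPC endpoint created in your own AWS account. Here is a minimal sketch of the parameters involved; every ID below is a placeholder (the PrivateLink service name comes from your Inference Endpoint's details page), so substitute your own values before running anything.

```python
# Sketch: parameters for an interface VPC endpoint that connects your VPC
# to a private Inference Endpoint via AWS PrivateLink.
# All IDs below are placeholders -- replace them with your own values.
params = {
    "VpcEndpointType": "Interface",
    "VpcId": "vpc-0123456789abcdef0",              # your VPC
    "ServiceName": "com.amazonaws.vpce.us-east-1.vpce-svc-0123456789abcdef0",  # from the endpoint page
    "SubnetIds": ["subnet-0123456789abcdef0"],     # private subnet(s)
    "SecurityGroupIds": ["sg-0123456789abcdef0"],  # must allow HTTPS (port 443)
    "PrivateDnsEnabled": True,                     # resolve the endpoint hostname privately
}

# With boto3 installed and AWS credentials configured, this would create it:
# import boto3
# response = boto3.client("ec2").create_vpc_endpoint(**params)
```

Once the VPC endpoint is available, the same HTTP client code works unchanged from inside the private subnet.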
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️
⭐️⭐️⭐️ Want to buy me a coffee? I can always use more :) www.buymeacoffee.com/julsimon ⭐️⭐️⭐️
- Model: huggingface.co/juliensimon/au...
- Inference Endpoints: huggingface.co/inference-endp...
- Inference Endpoints documentation: huggingface.co/docs/inference...
- AWS PrivateLink documentation: docs.aws.amazon.com/vpc/lates...
Code:
import requests, json, os

# Your Inference Endpoint URL, copied from the endpoint's page
# (set it in your environment, e.g. export ENDPOINT_URL=https://...)
API_URL = os.getenv("ENDPOINT_URL")
MY_API_TOKEN = os.getenv("MY_API_TOKEN")

headers = {
    "Authorization": "Bearer " + MY_API_TOKEN,
    "Content-Type": "image/jpeg",  # standard MIME type for JPEG images
}

def query(filename):
    with open(filename, "rb") as f:
        data = f.read()
    response = requests.request("POST", API_URL, headers=headers, data=data)
    return json.loads(response.content.decode("utf-8"))

output = query("food.jpg")
print(output)
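For an image-classification model, the endpoint's JSON response is a list of label/score pairs, one per predicted class. The labels and scores below are made-up sample values for illustration; a short sketch of picking the top prediction:

```python
# Sample response shaped like an image-classification endpoint's output
# (labels and scores are invented for this example)
output = [
    {"label": "samosa", "score": 0.93},
    {"label": "spring_rolls", "score": 0.05},
    {"label": "nachos", "score": 0.02},
]

# Pick the prediction with the highest score
top = max(output, key=lambda p: p["score"])
print(f"{top['label']} ({top['score']:.2%})")  # prints "samosa (93.00%)"
```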