Deploying and Scaling AI Applications with the NVIDIA TensorRT Inference Server on Kubernetes

  Рет қаралды 6,666

NGINX

NGINX

Күн бұрын

The open source NVIDIA TensorRT Inference Server is production‑ready software that simplifies deployment of AI models for speech recognition, natural language processing, recommendation systems, object detection, and more. It integrates with NGINX, Kubernetes, and Kubeflow for a complete solution for real‑time and offline data center AI inference. It can run inference on GPUs and CPUs. It supports all popular AI frameworks and maximizes GPU utilization by serving multiple models per GPU and dynamically batching client requests, which is crucial to avoiding under‑ or over‑provisioning and managing costs.
In this session, Davide:
Shows how TRTIS simplifies AI deployment in production environments based in the data center, cloud, or edge.
Shares best practices and a sample deployment
Explores integration with Kubernetes, Kubeflow, Prometheus, Kubernetes autoscaling, gRPC, and the NGINX load balancer.
To learn more, go to www.nginx.com.

Пікірлер: 4
@MarxOrx
@MarxOrx Жыл бұрын
This is EXACTLY what I needed. Give this man a cookie 🎉
@belialvandals4284
@belialvandals4284 4 жыл бұрын
Fantastico!
@burnwalsourabh
@burnwalsourabh 2 жыл бұрын
Awesome
@calebnkosi8309
@calebnkosi8309 2 жыл бұрын
Nice
Do You Need a Service Mesh?
27:25
NGINX
Рет қаралды 638
ИРИНА КАЙРАТОВНА - АЙДАХАР (БЕКА) [MV]
02:51
ГОСТ ENTERTAINMENT
Рет қаралды 1,4 МЛН
Её Старший Брат Настоящий Джентельмен ❤️
00:18
Глеб Рандалайнен
Рет қаралды 8 МЛН
A pack of chips with a surprise 🤣😍❤️ #demariki
00:14
Demariki
Рет қаралды 35 МЛН
AI at the Edge  TensorFlow to TensorRT on Jetson
54:03
NVIDIA Developer
Рет қаралды 20 М.
Everything you Need to Know about using GPUs with Kubernetes - Rohit Agarwal, Google
31:33
CNCF [Cloud Native Computing Foundation]
Рет қаралды 8 М.
How Cookpad Leverages Triton Inference Server To Boost Their Model S... Jose Navarro & Prayana Galih
32:02
CNCF [Cloud Native Computing Foundation]
Рет қаралды 1,2 М.
Inference Optimization with NVIDIA TensorRT
36:28
NCSAatIllinois
Рет қаралды 10 М.
A Deep Dive on Supporting Multi-Instance GPUs in Containers and Kubernetes - Kevin Klues, NVIDIA
31:55
CNCF [Cloud Native Computing Foundation]
Рет қаралды 6 М.
ПОКУПКА ТЕЛЕФОНА С АВИТО?🤭
1:00
Корнеич
Рет қаралды 1,7 МЛН
DC Fast 🏃‍♂️ Mobile 📱 Charger
0:42
Tech Official
Рет қаралды 482 М.
How To Unlock Your iphone With Your Voice
0:34
요루퐁 yorupong
Рет қаралды 23 МЛН
i love you subscriber ♥️ #iphone #iphonefold #shortvideo
0:14
Si pamerR
Рет қаралды 3,1 МЛН
5 НЕЛЕГАЛЬНЫХ гаджетов, за которые вас посадят
0:59
Кибер Андерсон
Рет қаралды 1,6 МЛН