GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Artificial Intelligence and Deep Learning
Presentation
Media
TRTIS 深度剖析:在 NVIDIA GPU 上部署 BERT 实战
Abstract:
TensorRT Inference Server简称TRTIS,是NVIDIA开源的轻量级GPU在线服务部署框架,该Talk将会深入介绍TRTIS的丰富特性,比如多模型部署、多深度学习框架支持、多GPU负载均衡、流式服务部署以及GPU服务指标检测等等。并结合Demo演示如何在TRTIS上部署NVIDIA最新推出的BERT推理加速方案,高效部署BERT的在线GPU推理服务。
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC China
Year:
2019
Session ID:
CN9506
Streaming:
Download:
Share: