GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Artificial Intelligence and Deep Learning
Presentation
Media
Faster Transformer – Transformer 网络推理的高效实现
Abstract:
Faster Transformer是NVIDIA针对Transformer网络优化工作的开源项目。本次talk中,我们将会介绍如何通过CUDA和cuBLAS搭建高效的Transformer Encoder和Decoder推理网络,同时也会从Network Pruning的角度讲解如何结合算法以及Faster Transformer框架实现裁剪BERT网络的推理优化。
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC China
Year:
2019
Session ID:
CN9468
Streaming:
Download:
Share: