SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC On-Demand

AI Application Deployment and Inference
Presentation
Media
Quantized Neural Networks and QEngine
Abstract:
We'll discuss network quantization its background, methods, achievements, and the motivation behind it. Deep neural networks have achieved remarkable performance in a wide range of tasks. But DNNs are computationally intensive and resource-consuming, which hinders their use in embedded systems. We'll explain how we're working to alleviate this problem with quantized neural networks and a lightweight framework for efficient inference of these networks.
 
Topics:
AI Application Deployment and Inference
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9713
Streaming:
Download:
Share: