SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC On-Demand

AI Application Deployment and Inference
Presentation
Media
Optimizing Facebook AI Workloads for NVIDIA GPUs
Abstract:
We'll demonstrate the use of Nsight Systems to quickly identify bottlenecks and achieve significant speedups in production workflows at Facebook. We'll also describe how we use the CUPTI API for on-demand, customized timeline analysis of workflows running in production and collect detailed performance metrics across our GPU fleet at very low overhead.
 
Topics:
AI Application Deployment and Inference
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9866
Streaming:
Download:
Share: