SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Developer - Tools & Libraries
Presentation
Media
Analysis-Driven Performance Optimization
Speakers:
Paulius Micikevicius
- NVIDIA
Abstract:
The goal of this session is to demystify performance optimization by transforming it into an analysis-driven process. There are three fundamental limiters to kernel performance: instruction throughput, memory throughput, and latency. In this session we will describe: •how to use profiling tools and source code instrumentation to assess the significance of each limiter; •what optimizations to apply for each limiter; •how to determine when hardware limits are reached. Concepts will be illustrated with some examples and are equally applicable to both CUDA and OpenCL development. It is assumed that attendees are already familiar with the fundamental optimization techniques.
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2012
Download:
Share: