GTC ON-DEMAND

 

Performance Optimization
Deep Dive into Dynamic Parallelism Performance
Abstract:
Dynamic parallelism enables a CUDA kernel to create and synchronize new nested work by launching child kernels from the GPU. This nested-parallelism programming model maps directly to many real-world programming patterns, such as adaptive grids or tree-traversal-based computations. We'll systematically analyze the performance characteristics of dynamic parallelism through real-world application case studies and suggest programming guidelines for getting the best performance out of the feature. (This talk will be held in collaboration with Thejaswi Rao.)
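To make the idea of launching child kernels from the GPU concrete, here is a minimal sketch and not material from the talk itself. The kernel names (`scaleSegments`, `scaleSegment`), the segment layout, and the host helper `runAdaptiveScale` are all hypothetical and chosen only for illustration. Each parent thread owns one variable-length segment and launches a child grid sized to that segment, which is the kind of irregular, data-dependent work the abstract refers to. The sketch assumes a device of compute capability 3.5 or higher and a build with relocatable device code (`nvcc -rdc=true`, linked against `cudadevrt`).

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Child kernel (hypothetical name): scales one segment of the array.
// It is launched from device code, not from the host.
__global__ void scaleSegment(float *data, int count, float factor)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < count) data[i] *= factor;
}

// Parent kernel (hypothetical name): one thread per segment. Each thread
// sizes a child grid to its own segment length and launches it.
__global__ void scaleSegments(float *data, const int *offsets,
                              int numSegments, float factor)
{
    int seg = blockIdx.x * blockDim.x + threadIdx.x;
    if (seg >= numSegments) return;

    int begin = offsets[seg];
    int count = offsets[seg + 1] - begin;
    if (count == 0) return;  // nothing to do, so skip the launch entirely

    const int threads = 128;
    int blocks = (count + threads - 1) / threads;
    // Device-side launch: the parent grid is not considered complete
    // until every child grid it launched has finished.
    scaleSegment<<<blocks, threads>>>(data + begin, count, factor);
}

// Host helper (hypothetical): returns the number of elements that were not
// scaled as expected, or -1 if the CUDA runtime reported an error.
int runAdaptiveScale()
{
    // Assumed example layout: four segments of lengths 3, 0, 997, and 500.
    const std::vector<int> offsets = {0, 3, 3, 1000, 1500};
    const int numSegments = static_cast<int>(offsets.size()) - 1;
    const int n = offsets.back();
    std::vector<float> host(n, 1.0f);

    float *dData = nullptr;
    int *dOffsets = nullptr;
    cudaMalloc(&dData, n * sizeof(float));
    cudaMalloc(&dOffsets, offsets.size() * sizeof(int));
    cudaMemcpy(dData, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(dOffsets, offsets.data(), offsets.size() * sizeof(int),
               cudaMemcpyHostToDevice);

    scaleSegments<<<1, 32>>>(dData, dOffsets, numSegments, 2.0f);
    // Waiting on the host covers the parent grid and all of its children.
    cudaError_t err = cudaDeviceSynchronize();
    if (err == cudaSuccess)
        err = cudaMemcpy(host.data(), dData, n * sizeof(float),
                         cudaMemcpyDeviceToHost);
    cudaFree(dData);
    cudaFree(dOffsets);
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return -1;
    }

    int mismatches = 0;
    for (float v : host)
        if (v != 2.0f) ++mismatches;
    return mismatches;
}

int main()
{
    printf("mismatches: %d\n", runAdaptiveScale());
    return 0;
}
```

The sketch deliberately skips the launch for empty segments. Every device-side launch carries overhead, so launching many tiny child grids is a plausible place for performance to suffer; whether that holds for a given workload is something to measure rather than assume.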
 
Topics: Performance Optimization
Type: Talk
Event: GTC Silicon Valley
Year: 2016
Session ID: S6807