GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Performance Optimization
Presentation
Media
Featured Talk: Memory Management Tips, Tricks and Techniques
Abstract:
GPUs can push teraflops of mathematical power, but feeding the SMs with data can often be harder than optimising your algorithm. A well-designed program must take into account both access of data from within the GPU as well as allocation and transfer of data between CPU and GPU. This talk will cover techniques including sub-allocation, shared memory management, and parallel memory structures such as stacks, queues and ring-buffers which can greatly improve the throughput of your algorithms. 75% of programs are limited by memory bandwidth and not compute power, so careful memory management is critical to a high-performance program.
 
Topics:
Performance Optimization, Programming Languages, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2015
Session ID:
S5530
Streaming:
Download:
Share: