SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Graphics and AI
Presentation
Media
OpenCL Optimization
Abstract:

In this session, we will discuss how to optimize OpenCL programs on NVIDIA GPUs. Three main aspects are discussed: memory, execution configuration, and instruction throughput. On memory optimization, we will discuss how to increase bandwidth by global memory coalescing and using local memory. Then we will discuss the concept of occupancy and various considerations in specifying the execution configuration of a kernel. Finally, we discuss techniques for improving instruction throughput.

 
Topics:
Graphics and AI, Tools & Libraries, Professional Visualisation, Medical Imaging & Radiology
Type:
Talk
Event:
GTC Silicon Valley
Year:
2009
Session ID:
S09068
Streaming:
Download:
Share:
Tools & Libraries
Presentation
Media
OpenCL Optimization
Speakers:
Peng Wang
- NVIDIA
Abstract:
Learn how to optimize your OpenCL application to achieve maximum performance on NVIDIA GPUs. We will first briefly discuss how the OpenCL programming model maps onto NVIDIA GPU's architecture. We will then talk about memory, instruction, and NDRange optimization techniques, illustrating each with small code samples.
 
Topics:
Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09068
Download:
Share: