GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Performance Optimization1
Presentation
Media
Exploiting CUDA Dynamic Parallelism for Low-Power ARM-Based Prototypes
Abstract:
Learn to exploit CUDA features for saving energy and thus your pockets. This session briefs about the Pedraforca prototype developed at Barcelona Supercomputing Centre under the Mont-Blanc project. The prototype is based on NVIDIA® Tegra® and NVIDIA® Tesla® platforms and aims at reducing the raw power footprint of the HPC clusters. This session describes in depth how to exploit CUDA dynamic parallelism and CUDA streams for GPU applications to be ported on low power ARM based prototypes. Also includes architectural description of the prototype, power budget comparisons, and various example codes for improving the programming skills of CUDA users.
 
Topics:
Performance Optimization1, Intelligent Machines, IoT & Robotics, HPC and Supercomputing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2015
Session ID:
S5384
Streaming:
Download:
Share: