GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

HPC and Supercomputing
Presentation
Media
Optimizing a LBM code for Compute Clusters with Kepler GPUs
Abstract:
To fully utilize a GPU Cluster the single GPU code as well as the inter GPU communication needs to be efficient. In this session a LBM code applying a D2Q37 model is used as a case study to explain by example how both targets can be met. The compute intensive collide kernel of the LBM code is optimized for Kepler specifically considering the large amount of state needed per thread due to the complex D2Q37 model. To gain efficient inter GPU communication CUDA-aware MPI was used. We explain how this was done and present performance results on a Infiniband Cluster with GPUDirect RDMA.
 
Topics:
HPC and Supercomputing, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2014
Session ID:
S4186
Streaming:
Download:
Share: