GTC ON-DEMAND

 

HPC and Supercomputing
Latest Advances in MVAPICH2 MPI Library for NVIDIA GPU Clusters with InfiniBand
Abstract:
Learn about the latest developments in the MVAPICH2 library, which simplifies the task of porting Message Passing Interface (MPI) applications to supercomputing clusters with NVIDIA GPUs. MVAPICH2 supports MPI communication directly from GPU device memory and optimizes it using various features offered by the CUDA toolkit, providing optimized performance on different GPU node configurations. These optimizations are integrated transparently under the standard MPI API for better programmability. Recent advances in MVAPICH2 include designs for MPI-3 RMA using GPUDirect RDMA, a framework for MPI datatype processing using CUDA kernels, support for heterogeneous clusters with GPU and non-GPU nodes, and more. We use the popular OSU micro-benchmark suite and example applications to demonstrate how developers can effectively take advantage of MVAPICH2 in applications using MPI and CUDA/OpenACC. We also provide guidance on issues such as processor affinity to the GPU and network, which can significantly affect the performance of MPI applications that use MVAPICH2.
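The affinity point in the abstract can be illustrated with a small sketch. It is not taken from the talk: it assumes a simple round-robin policy for binding each node-local MPI rank to a GPU, and it assumes the launcher exports the node-local rank in the MV2_COMM_WORLD_LOCAL_RANK environment variable. The helper names select_gpu and local_rank_from_env are hypothetical, and real deployments may prefer a topology-aware mapping that also accounts for which GPU sits closest to the InfiniBand adapter.

```python
import os


def select_gpu(local_rank: int, gpus_per_node: int) -> int:
    """Map a node-local MPI rank to a GPU index (assumed round-robin policy)."""
    if gpus_per_node <= 0:
        raise ValueError("gpus_per_node must be positive")
    if local_rank < 0:
        raise ValueError("local_rank must be non-negative")
    # Ranks beyond the GPU count wrap around and share a device.
    return local_rank % gpus_per_node


def local_rank_from_env(env=os.environ) -> int:
    """Read the node-local rank from the environment.

    Assumption: the launcher sets MV2_COMM_WORLD_LOCAL_RANK.
    Falls back to 0 when the variable is absent (e.g. a single-process run).
    """
    return int(env.get("MV2_COMM_WORLD_LOCAL_RANK", "0"))


if __name__ == "__main__":
    # Hypothetical node with 2 GPUs; adjust to the actual hardware.
    gpu = select_gpu(local_rank_from_env(), gpus_per_node=2)
    print(f"local rank {local_rank_from_env()} would use GPU {gpu}")
```

In an MPI and CUDA application, the index returned here would typically be handed to a device-selection call such as cudaSetDevice early in startup, so that each process on a node consistently drives one GPU.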
 
Topics:
HPC and Supercomputing, Performance Optimization, Programming Languages
Type:
Talk
Event:
GTC Silicon Valley
Year:
2014
Session ID:
S4517