GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Presentation
Media
Abstract:
The parallel algorithms that were introduced in C++17 were designed to support GPU parallel programming. We have implemented these parallel algorithms in the PGI C++ compiler for NVIDIA GPUs, making it possible in some cases to run standard C++ on GPUs with no directives, pragmas, or annotations, and with performance similar to other GPU programming models. We will share our experiences and performance results, and explain the capabilities of the PGI implementation.
The parallel algorithms that were introduced in C++17 were designed to support GPU parallel programming. We have implemented these parallel algorithms in the PGI C++ compiler for NVIDIA GPUs, making it possible in some cases to run standard C++ on GPUs with no directives, pragmas, or annotations, and with performance similar to other GPU programming models. We will share our experiences and performance results, and explain the capabilities of the PGI implementation.  Back
 
Topics:
HPC and Supercomputing
Type:
Talk
Event:
Supercomputing
Year:
2019
Session ID:
SC1936
Streaming:
Download:
Share:
 
Abstract:
We'll discuss the C++17 parallel algorithms, which were designed to support GPU parallel programming. They include parallel versions of many existing algorithms, and a few new algorithms designed for efficient parallel execution of scans and reductions. The PGI C++ compiler has implemented these parallel algorithms for NVIDIA GPUs, making it possible in some cases to run standard C++ on GPUs with no directives, pragmas, or annotations. We will share our experiences and performance results for several of the parallel algorithms. We'll also explain the capabilities of the PGI implementation relative to CUDA, Thrust, and OpenACC.
We'll discuss the C++17 parallel algorithms, which were designed to support GPU parallel programming. They include parallel versions of many existing algorithms, and a few new algorithms designed for efficient parallel execution of scans and reductions. The PGI C++ compiler has implemented these parallel algorithms for NVIDIA GPUs, making it possible in some cases to run standard C++ on GPUs with no directives, pragmas, or annotations. We will share our experiences and performance results for several of the parallel algorithms. We'll also explain the capabilities of the PGI implementation relative to CUDA, Thrust, and OpenACC.  Back
 
Topics:
Programming Languages, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9770
Streaming:
Download:
Share:
 
 
Previous
  • Amazon Web Services
  • IBM
  • Cisco
  • Dell EMC
  • Hewlett Packard Enterprise
  • Inspur
  • Lenovo
  • SenseTime
  • Supermicro Computers
  • Synnex
  • Autodesk
  • HP
  • Linear Technology
  • MSI Computer Corp.
  • OPTIS
  • PNY
  • SK Hynix
  • vmware
  • Abaco Systems
  • Acceleware Ltd.
  • ASUSTeK COMPUTER INC
  • Cray Inc.
  • Exxact Corporation
  • Flanders - Belgium
  • Google Cloud
  • HTC VIVE
  • Liqid
  • MapD
  • Penguin Computing
  • SAP
  • Sugon
  • Twitter
Next