Abstract:
Learn how to use performance analysis tools to find the bottlenecks in your OpenACC applications. With the right performance information and feedback from the compiler, you can tune your application and improve its overall performance. Live demonstrations will use PGI's pgprof, NVIDIA's Visual Profiler and command-line nvprof, and additional tools available to the parallel computing community.