GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Algorithms & Numerical Techniques
Presentation
Media
CUDA Implementation of Modern Preconditioning Techniques for Iterative Solvers of Linear Systems
Abstract:
<div> Learn how to implement state-of-the-art preconditioners for iterative solvers of large-scale linear systems in CUDA. Previously most preconditioners were set up on CPUs because this task was not considered suitable for fine-grain parallelization. We&#39;ll show how it&#39;s possible to implement efficient CUDA kernels for techniques like the adaptive factorized sparse approximate inverse by adopting an approach that dramatically reduces the amount of memory required to run in parallel. We&#39;ll describe how our GPU-only preconditioners and solvers can be used to solve real-world problems in science and engineering. We&#39;ll provide single and multi-GPU implementations. Our method makes it possible to obtain about an order-of-magnitude speedup on high-end multi-core CPUs like the Intel Xeon Platinum 8176.</div> <div> &nbsp;</div>
 
Topics:
Algorithms & Numerical Techniques, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9192
Streaming:
Download:
Share: