GTC ON-DEMAND

 

Abstract:
We will introduce a new FFT library that extends and complements cuFFT. The new library makes it possible to go beyond the performance limits imposed by the current cuFFT API. During our talk, we will walk through new API concepts, showcase performance, and discuss planned future developments.

Specifically, the new library aims to improve performance and flexibility by allowing FFT calculations to be merged with other code at convenient levels of abstraction. Examples are: device functions that can be inlined in user CUDA kernels; functions that can generate custom FFT kernels at a higher level; and functions that can optimize the sequence of FFT transforms at the top level.

Software handling FFT calculations would be able to leverage compile-time information from the user, enabling more efficient code generation and better integration with the user code.

While the initial release of this new library is planned to directly provide only a subset of functionality of the cuFFT
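
The idea of merging an FFT with surrounding user code, with the transform size known at compile time, can be sketched in plain host-side C++. This is only an illustration of the concept under stated assumptions, not the API of the library described in the talk: the names `fft_inplace` and `scaled_fft` are hypothetical, the code runs on the CPU rather than in a CUDA kernel, and it uses a textbook radix-2 algorithm.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <utility>

using cplx = std::complex<double>;

// Hypothetical helper (not a cuFFT function): in-place radix-2 FFT whose
// size N is a template parameter, so loop bounds are compile-time constants.
template <std::size_t N>
inline void fft_inplace(std::array<cplx, N>& a) {
    static_assert(N > 0 && (N & (N - 1)) == 0, "N must be a power of two");
    const double pi = std::acos(-1.0);

    // Reorder the input into bit-reversed index order.
    for (std::size_t i = 1, j = 0; i < N; ++i) {
        std::size_t bit = N >> 1;
        for (; j & bit; bit >>= 1) j ^= bit;
        j ^= bit;
        if (i < j) std::swap(a[i], a[j]);
    }

    // Butterfly stages for sub-transform lengths 2, 4, ..., N.
    for (std::size_t len = 2; len <= N; len <<= 1) {
        const cplx wlen = std::polar(1.0, -2.0 * pi / static_cast<double>(len));
        for (std::size_t i = 0; i < N; i += len) {
            cplx w(1.0, 0.0);
            for (std::size_t k = 0; k < len / 2; ++k) {
                const cplx u = a[i + k];
                const cplx v = a[i + k + len / 2] * w;
                a[i + k] = u + v;
                a[i + k + len / 2] = u - v;
                w *= wlen;
            }
        }
    }
}

// Hypothetical "user code": pre-scaling, the FFT, and normalization live in
// one function, so no intermediate buffer is handed to a separate FFT call.
template <std::size_t N>
std::array<cplx, N> scaled_fft(const std::array<double, N>& x, double gain) {
    std::array<cplx, N> a;
    for (std::size_t i = 0; i < N; ++i) a[i] = cplx(gain * x[i], 0.0);  // pre-processing
    fft_inplace(a);                                                     // inlined transform
    for (auto& v : a) v /= static_cast<double>(N);                      // post-processing
    return a;
}
```

For example, `scaled_fft(std::array<double, 4>{1, 1, 1, 1}, 1.0)` returns approximately `{1, 0, 0, 0}`. Because `N` is a template parameter, the compiler is free to unroll and specialize the loops for each size, which is the CPU analogue of the compile-time information the abstract mentions.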
 
Topics: Tools & Libraries, Performance Optimization, Algorithms & Numerical Techniques
Type: Talk
Event: GTC Silicon Valley
Year: 2019
Session ID: S9257