GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Performance Optimization
Presentation
Media
Fast Convolutions Via the Overlap-and-Save Method Using Shared Memory FFT
Abstract:
We will present optimizations that increase performance of overlap-and-save calculations of linear convolution using shared memory FFT. The overlap-and-save method is used when convolution of a long signal with many filters is required. We'll explain how we implemented custom FFT, which uses shared memory, to eliminate most of the device memory transfers normally required when calculating convolution. We'll show how we achieved significant impact for certain problem sizes.
 
Topics:
Performance Optimization, Accelerated Data Science
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9352
Streaming:
Download:
Share: