SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC On-Demand

AI Application Deployment and Inference
Presentation
Media
Autoregressive Wavenet Inference on Volta GPUs
Autoregressive wavenets have demonstrated extremely high quality real-time speech synthesis results.  However, the compute requirements and tight latency bounds have made them impractical for deployment on traditional CPU-only systems.  In this talk we demonstrate that Volta GPUs provide excellent real-time inference performance on these networks, making practical deployments possible.  We discuss several alternative implementation techniques and demonstrate their achieved performance on a V100 GPU.
Autoregressive wavenets have demonstrated extremely high quality real-time speech synthesis results.  However, the compute requirements and tight latency bounds have made them impractical for deployment on traditional CPU-only systems.  In this talk we demonstrate that Volta GPUs provide excellent real-time inference performance on these networks, making practical deployments possible.  We discuss several alternative implementation techniques and demonstrate their achieved performance on a V100 GPU.  Back
 
Keywords:
AI Application Deployment and Inference, Speech and Language Processing, GTC Silicon Valley 2018 - ID S8968
Streaming:
Share: