GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Presentation
Media
Abstract:
You may already use NVIDIA's cuDNN library to accelerate your deep neural network inference, but are you getting the most out of it to truly unleash the tremendous performance of NVIDIA's newest GPU architectures, Volta and Turing? We'll discuss how to avoid the most common pitfalls in porting your CPU-based inference to the GPU and demonstrate best practices in a step-by-step optimization of an example network. Learn how to deploy your deep neural network inference in both the fastest and most memory-efficient way, using cuDNN and Tensor Cores, NVIDIA's revolutionary technology that delivers groundbreaking performance in FP16, INT8 and INT4 inference on Volta and Turing.
You may already use NVIDIA's cuDNN library to accelerate your deep neural network inference, but are you getting the most out of it to truly unleash the tremendous performance of NVIDIA's newest GPU architectures, Volta and Turing? We'll discuss how to avoid the most common pitfalls in porting your CPU-based inference to the GPU and demonstrate best practices in a step-by-step optimization of an example network. Learn how to deploy your deep neural network inference in both the fastest and most memory-efficient way, using cuDNN and Tensor Cores, NVIDIA's revolutionary technology that delivers groundbreaking performance in FP16, INT8 and INT4 inference on Volta and Turing.  Back
 
Topics:
AI Application, Deployment & Inference, Deep Learning & AI Frameworks
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9644
Streaming:
Download:
Share:
 
Abstract:
We'll show how a headset-mounted depth camera can be used to enable scalable real-time scene reconstruction for immersive mixed reality applications. We'll discuss and profile optimized CUDA kernels that leverage the tremendous performance of an NVIDIA Quadro GP100. Furthermore, we'll show how knowledge of the headset's position and orientation in space can be leveraged to improve and make more robust the reconstruction process.
We'll show how a headset-mounted depth camera can be used to enable scalable real-time scene reconstruction for immersive mixed reality applications. We'll discuss and profile optimized CUDA kernels that leverage the tremendous performance of an NVIDIA Quadro GP100. Furthermore, we'll show how knowledge of the headset's position and orientation in space can be leveraged to improve and make more robust the reconstruction process.  Back
 
Topics:
Performance Optimization, Virtual Reality & Augmented Reality
Type:
Talk
Event:
GTC Silicon Valley
Year:
2018
Session ID:
S8698
Streaming:
Download:
Share:
 
 
Previous
  • Amazon Web Services
  • IBM
  • Cisco
  • Dell EMC
  • Hewlett Packard Enterprise
  • Inspur
  • Lenovo
  • SenseTime
  • Supermicro Computers
  • Synnex
  • Autodesk
  • HP
  • Linear Technology
  • MSI Computer Corp.
  • OPTIS
  • PNY
  • SK Hynix
  • vmware
  • Abaco Systems
  • Acceleware Ltd.
  • ASUSTeK COMPUTER INC
  • Cray Inc.
  • Exxact Corporation
  • Flanders - Belgium
  • Google Cloud
  • HTC VIVE
  • Liqid
  • MapD
  • Penguin Computing
  • SAP
  • Sugon
  • Twitter
Next