GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Presentation
Media
Abstract:
This talk was designed to illustrate how having CUDA knowledge can help DL developers understand and tune their deep learning applications. We explain how to implement Tensorflow custom operations to utilize GPU more efficiently in running DL workloads, esp. BERT Inference for SQuAD. We also deliver the key insights on why the techniques introduced here can achieve better performance by discerning the profiling result.
This talk was designed to illustrate how having CUDA knowledge can help DL developers understand and tune their deep learning applications. We explain how to implement Tensorflow custom operations to utilize GPU more efficiently in running DL workloads, esp. BERT Inference for SQuAD. We also deliver the key insights on why the techniques introduced here can achieve better performance by discerning the profiling result.   Back
 
Topics:
HPC and AI
Type:
Talk
Event:
AI Conference Korea
Year:
2019
Session ID:
SKR9108
Download:
Share:
 
 
Previous
  • Amazon Web Services
  • IBM
  • Cisco
  • Dell EMC
  • Hewlett Packard Enterprise
  • Inspur
  • Lenovo
  • SenseTime
  • Supermicro Computers
  • Synnex
  • Autodesk
  • HP
  • Linear Technology
  • MSI Computer Corp.
  • OPTIS
  • PNY
  • SK Hynix
  • vmware
  • Abaco Systems
  • Acceleware Ltd.
  • ASUSTeK COMPUTER INC
  • Cray Inc.
  • Exxact Corporation
  • Flanders - Belgium
  • Google Cloud
  • HTC VIVE
  • Liqid
  • MapD
  • Penguin Computing
  • SAP
  • Sugon
  • Twitter
Next