GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Artificial Intelligence and Deep Learning
Presentation
Media
An Efficient CUDA-Accelerated Machine Learning Inference for 4G and 5G Radio Networks
Abstract:
We describe the design of scalable CUDA-based service framework for ML model inference tasks to efficiently distribute such workloads across a cluster of dedicated GPU-based servers. These servers can also be easily integrated with existing telecom cloud infrastructure. In telecom data centres, ML models are increasingly being deployed for use cases such as automation, analytics and anomaly detection. Handling diverse datatypes and request rates ranging between hours and milliseconds can become a challenge with a legacy CPU-dominated cloud environment.
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Europe
Year:
2018
Session ID:
E8421
Streaming:
Download:
Share: