SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC On-Demand

AI Application Deployment and Inference
Presentation
Media
Scalable, Responsive, and Cost-Effective Object Detection Service for Web-Scale Images
We''ll introduce how Bing built a scalable, responsive, and economical object detection API based on NVIDIA GPUs and Azure cloud platforms. Object detection is an important image understanding technique as the entry point or dispatcher to guide users to more specific scenarios. However, it is very challenging to provide object detection services on web-scale images because it is intrinsically a compute-intensive task and thus resource demanding. We''ll also introduce how to use NVIDIA''s CUDA profiling toolchain and cuDNN to make the system even more cost-effective. The system currently supports billion-level traffic, covering Bing''s entire index.
We''ll introduce how Bing built a scalable, responsive, and economical object detection API based on NVIDIA GPUs and Azure cloud platforms. Object detection is an important image understanding technique as the entry point or dispatcher to guide users to more specific scenarios. However, it is very challenging to provide object detection services on web-scale images because it is intrinsically a compute-intensive task and thus resource demanding. We''ll also introduce how to use NVIDIA''s CUDA profiling toolchain and cuDNN to make the system even more cost-effective. The system currently supports billion-level traffic, covering Bing''s entire index.  Back
 
Keywords:
AI Application Deployment and Inference, Performance Optimization, GTC Silicon Valley 2018 - ID S8620
Streaming:
Download:
Share: