SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC On-Demand

Presentation
Media
Abstract:
We'll share our experience building an audio cognition platform that extracts non-verbal information such as speech, music, and environmental sounds. Cochlear.ai's Sense platform leverages acoustic event detection, scene classification, human gender/age estimation, music analysis, and more with near real-time analysis from audio data. Everything is optimized for audio processing, including the cloud backend, API design and management, and deep learning architecture. We'll detail our learnings and challenges in developing our audio cognition service.
We'll share our experience building an audio cognition platform that extracts non-verbal information such as speech, music, and environmental sounds. Cochlear.ai's Sense platform leverages acoustic event detection, scene classification, human gender/age estimation, music analysis, and more with near real-time analysis from audio data. Everything is optimized for audio processing, including the cloud backend, API design and management, and deep learning architecture. We'll detail our learnings and challenges in developing our audio cognition service.  Back
 
Topics:
AI Application Deployment and Inference, AI and DL Research, Deep Learning and AI Frameworks
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9625
Streaming:
Download:
Share:
 
Abstract:
We'll explain the concept and the importance of audio recognition, which aims to understand literally all the information contained in the audio, not limiting its scope to speech recognition. It includes the introduction of various types of non-verbal information contained in the audio such as acoustic scenes/events, speech, and music. This session is helpful to the people who are not familiar with audio processing but are interested in the context-aware system. Also, it might be inspiring for someone who develops AI applications such as AI home assistant, a humanoid robot, and self-driving cars. It also covers the potential use-cases and creative applications, including a video demonstration of the audio context-aware system applied to media-art performance for real-time music generation.
We'll explain the concept and the importance of audio recognition, which aims to understand literally all the information contained in the audio, not limiting its scope to speech recognition. It includes the introduction of various types of non-verbal information contained in the audio such as acoustic scenes/events, speech, and music. This session is helpful to the people who are not familiar with audio processing but are interested in the context-aware system. Also, it might be inspiring for someone who develops AI applications such as AI home assistant, a humanoid robot, and self-driving cars. It also covers the potential use-cases and creative applications, including a video demonstration of the audio context-aware system applied to media-art performance for real-time music generation.  Back
 
Topics:
AI and DL Research, Speech and Language Processing, NVIDIA Inception Program, GIS
Type:
Talk
Event:
GTC Silicon Valley
Year:
2018
Session ID:
S8696
Streaming:
Download:
Share: