SEARCH SESSIONS
SEARCH SESSIONS

Search All
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

5G & Edge
Presentation
Media
Abstract:
Learn how NVIDIA's GPUs are used to accelerate unified communications (UC) analytics processing by mathematically classifying UC call flows. We'll discuss how we're leveraging NVIDIA GPU parallelization technology to support classification and bas ...Read More
Abstract:
Learn how NVIDIA's GPUs are used to accelerate unified communications (UC) analytics processing by mathematically classifying UC call flows. We'll discuss how we're leveraging NVIDIA GPU parallelization technology to support classification and baselining of UC call flows, protect UC against fraudulent attacks, and establish predictive UC forecasting models. We'll explain how this allows us to more accurately identify and forecast deviations that may represent malicious use of UC against a baseline of normal traffic.  Back
 
Topics:
5G & Edge
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9385
Streaming:
Download:
Share:
Accelerated Data Science
Presentation
Media
Abstract:
We'll discuss the Bayesian statistical paradigm and Markov Chain Monte Carlo (MCMC) algorithms - the cornerstone of modern Bayesian computation. Scalable MCMC for big datasets and complex models is currently an open research question. Using GPUs pro ...Read More
Abstract:
We'll discuss the Bayesian statistical paradigm and Markov Chain Monte Carlo (MCMC) algorithms - the cornerstone of modern Bayesian computation. Scalable MCMC for big datasets and complex models is currently an open research question. Using GPUs provides a promising and largely unexplored avenue for accelerating these algorithms, but is nontrivial, because MCMC is inherently sequential and has traditionally been considered difficult to parallelize. We'll show how Gibbs sampling, a widely used MCMC algorithm, can be effectively parallelized on GPUs for a large class of exchangeable hierarchical Bayesian models. Participants will learn the mathematical and hardware/software challenges in bringing GPUs to the Bayesian community. Background in Bayesian statistics or MCMC is not assumed.  Back
 
Topics:
Accelerated Data Science, Federal, Algorithms & Numerical Techniques
Type:
Talk
Event:
GTC Silicon Valley
Year:
2017
Session ID:
S7263
Download:
Share:
Algorithms & Numerical Techniques
Presentation
Media
Abstract:
Have you heard about the world's biggest eye ever built? Are you interested in scientific simulations running on NVIDIA DGX-1? Come and learn how combining these powerful computing devices dramatically leaps forward the computational astronomy commu ...Read More
Abstract:
Have you heard about the world's biggest eye ever built? Are you interested in scientific simulations running on NVIDIA DGX-1? Come and learn how combining these powerful computing devices dramatically leaps forward the computational astronomy community in designing major, multimillion-dollar optical instruments for the European Extremely Large Telescope. Starting from the mathematical model up to the high-performance implementation on DGX-1, we'll explain how the resulting matrix computations associated with an efficient task-based programming model help design the next generation of telescope instruments and, eventually, demonstrate a pathfinder for the discovery of new galaxies.  Back
 
Topics:
Algorithms & Numerical Techniques, Astronomy & Astrophysics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2018
Session ID:
S8231
Streaming:
Download:
Share:
Astronomy & Astrophysics
Presentation
Media
Abstract:
Have you heard about the largest ground-based telescope ever built? Are you interested in the newest NVIDIA DGX-1 hardware accelerator? Come and learn how the DGX-1 architecture dramatically leaps forward the computational astronomy community in ...Read More
Abstract:

Have you heard about the largest ground-based telescope ever built? Are you interested in the newest NVIDIA DGX-1 hardware accelerator? Come and learn how the DGX-1 architecture dramatically leaps forward the computational astronomy community in designing major, multimillion-dollar optical instruments for the European Extremely Large Telescope. Starting from the mathematical model up to the high-performance implementation on distributed-memory systems with hardware accelerators, we'll explain how the resulting matrix computations associated with an efficient task-based programming model help design the next generation of telescope instruments.

  Back
 
Topics:
Astronomy & Astrophysics, Tools & Libraries, Federal
Type:
Talk
Event:
GTC Silicon Valley
Year:
2017
Session ID:
S7153
Download:
Share:
Autonomous Vehicles
Presentation
Media
Abstract:
The greatest challenge regarding simulation is finding the matrix that compares it to real life. In this session, Danny will present a mathematical matrix that defines this relation and show how a proper simulation can be constructed based on de ...Read More
Abstract:

The greatest challenge regarding simulation is finding the matrix that compares it to real life. In this session, Danny will present a mathematical matrix that defines this relation and show how a proper simulation can be constructed based on deep learning techniques. He will also supply live examples of the Cognata simulation platform.

  Back
 
Topics:
Autonomous Vehicles, Artificial Intelligence and Deep Learning, Virtual Reality & Augmented Reality
Type:
Talk
Event:
GTC Israel
Year:
2018
Session ID:
SIL8122
Streaming:
Share:
Climate, Weather & Ocean Modeling
Presentation
Media
Abstract:
When used as predictive tools in natural disasters such as tsunamis, numerical models require extremely fast computations. Just a few years ago, real-time computing in Tsunami Early Warning Systems (TEWS) was unthinkable. Nevertheless, the EDANYA Gro ...Read More
Abstract:
When used as predictive tools in natural disasters such as tsunamis, numerical models require extremely fast computations. Just a few years ago, real-time computing in Tsunami Early Warning Systems (TEWS) was unthinkable. Nevertheless, the EDANYA Group has revolutionized tsunami science paradigms. With the goal of saving lives in the framework of TEWS, our group has developed Tsunami-HySEA, a GPU-based numerical model aimed at producing numerical simulations of tsunami events that are faster than ever. Based on highly efficient, robust mathematical algorithms, together with the computational power of NVIDIA GPUs, Tsunami-HySEA is able to simulate a tsunami event in only a few minutes. Nowadays, one of the main challenges in tsunami science is producing accurate assessments of tsunami wave impacts and just a few minutes after the generating earthquake is triggered. This timely prediction would save many lives in a tsunami scenario. When the response is needed only in a few minutes, the resulting scenario is challenging. The required characteristics are difficult to combine in a single numerical tool: robustness, low-dissipation, large domains, and an extremely fast response  Back
 
Topics:
Climate, Weather & Ocean Modeling
Type:
Talk
Event:
GTC Silicon Valley
Year:
2018
Session ID:
S81003
Streaming:
Download:
Share:
Computational Biology & Chemistry
Presentation
Media
Abstract:
We'll discuss cuTENSOR, a high-performance CUDA library for tensor operations that efficiently handles the ubiquitous presence of high-dimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor ...Read More
Abstract:
We'll discuss cuTENSOR, a high-performance CUDA library for tensor operations that efficiently handles the ubiquitous presence of high-dimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor operations such as tensor contractions (a generalization of matrix-matrix multiplications), point-wise tensor operations such as tensor permutations, and tensor decompositions (a generalization of matrix decompositions). While providing high performance, cuTENSOR also allows users to express their mathematical equations for tensors in a straightforward way that hides the complexity of dealing with these high-dimensional objects behind an easy-to-use API. CUDA 10.1 enables CUDA programmers to utilize Tensor Cores directly with the new mma.sync instruction. In this presentation, we describe the functionality of mma.sync and present strategies for implementing efficient matrix multiply computations in CUDA that maximize performance on NVIDIA Volta GPUs. We then describe how CUTLASS 1.3 provides reusable components embodying these strategies. CUTLASS 1.3 demonstrates a median 44% speedup of CUDA kernels executing layers from real-world Deep Learning workloads.  Back
 
Topics:
Computational Biology & Chemistry, Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9593
Streaming:
Download:
Share:
Finance - Quantitative Risk & Derivative Calculations
Presentation
Media
Abstract:
The goal of this lab is to give a guided tour through the essentials of CUDA parallelization in mathematical finance. We consider pricing a bullet option under LV (Local Volatility) model using either MC (Monte Carlo) or an implicit discretizati ...Read More
Abstract:

The goal of this lab is to give a guided tour through the essentials of CUDA parallelization in mathematical finance. We consider pricing a bullet option under LV (Local Volatility) model using either MC (Monte Carlo) or an implicit discretization scheme for PDE (Partial Differential Equation). The considered example is close to a real banking application with an LV model derived from the implicit SVI model of Gatheral & Jacquier using the Andersen & Brotherton-Ratcliffe expression based on Dupire equation. Several optimizations are studied like the judicious memory storage in shared and registers for two discretization scales in MC. Thanks to a simple trick proposed in Abbas-Turki & Graillat, we see also the use of PCR (Parallel Cyclic Reduction) to solve tridiagonal systems of any size.

  Back
 
Topics:
Finance - Quantitative Risk & Derivative Calculations
Type:
Instructor-Led Lab
Event:
GTC Europe
Year:
2017
Session ID:
53028
Download:
Share:
HPC and AI
Presentation
Media
Abstract:
We'll discuss our work using neural networks to fit the interatomic potential function and describe how we tested the network's potential function in atomic simulation software. This method has lower computational cost than traditional density func ...Read More
Abstract:
We'll discuss our work using neural networks to fit the interatomic potential function and describe how we tested the network's potential function in atomic simulation software. This method has lower computational cost than traditional density functional theory methods. We'll show how our work is applicable to different atom types and architectures and how it avoids relying on the physical model. Instead, it uses a purely mathematical representation, which reduces the need for human intervention.  Back
 
Topics:
HPC and AI, Computational Physics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2019
Session ID:
S9843
Streaming:
Download:
Share:
HPC and Supercomputing
Presentation
Media
Abstract:
We introduce cuTENSOR, a high-performance CUDA library for tensor operations that efficiently handles the ubiquitous presence of high-dimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor o ...Read More
Abstract:
We introduce cuTENSOR, a high-performance CUDA library for tensor operations that efficiently handles the ubiquitous presence of high-dimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor operations such as tensor contractions (a generalization of matrix-matrix multiplications), element-wise tensor operations such as tensor permutations, and tensor reductions. While providing high performance, cuTENSOR also allows users to express their mathematical equations for tensors in a straight-forward way that hides the complexity of dealing with these high-dimensional objects behind an easy-to-use API.  Back
 
Topics:
HPC and Supercomputing
Type:
Talk
Event:
Supercomputing
Year:
2019
Session ID:
SC1933
Download:
Share:
Intelligent Machines, IoT & Robotics
Presentation
Media
Abstract:
The talk presents the current developments of the NUMBERS project highly interdisciplinary research program in Cognitive Developmental Robotics to construct a novel artificial cognitive model of mathematical cognition that imitates human-like le ...Read More
Abstract:

The talk presents the current developments of the NUMBERS project highly interdisciplinary research program in Cognitive Developmental Robotics to construct a novel artificial cognitive model of mathematical cognition that imitates human-like learning approaches for developing number understanding.
The project aims to provide a proof-of-concept and the scientific and technological bases for novel robots capable of abstract and symbolic processing, which is required for improving their cognitive performance and their social interaction with human beings.
During the talk, the current experimental results will be reviewed to give evidence of improved performance thanks to the embodiment.

  Back
 
Topics:
Intelligent Machines, IoT & Robotics
Type:
Talk
Event:
GTC Europe
Year:
2018
Session ID:
E8502
Streaming:
Download:
Share:
 
Abstract:
What do we need to achieve artificial general intelligence? How do we distribute intelligence over the internet-of-things? We'll dive deep into the heart of the matter, which is machine reasoning. Following recent advances in mathematical foundation ...Read More
Abstract:
What do we need to achieve artificial general intelligence? How do we distribute intelligence over the internet-of-things? We'll dive deep into the heart of the matter, which is machine reasoning. Following recent advances in mathematical foundations and homotopy-type theory, we conclude that the crux is to formally separate intents from implementations. We can teach neural networks to understand these intents and to use a divide-and-conquer method for compiling these intents into implementations. Our goal is to outline a distributed strategy for accomplishing this moonshot.  Back
 
Topics:
Intelligent Machines, IoT & Robotics, Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2017
Session ID:
S7239
Download:
Share: