SEARCH SESSIONS

Search All
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Acoustics and Audio Processing
Presentation
Media
Speakers:
Nicolas Tsingos
Abstract:
Learn how to leverage GPUs for interactive audio rendering. This session will give a short overview of the architecture of current GPUs, emphasizing some key differences between GPU and CPUs programming models for audio processing. We will illus ...Read More
Abstract:

Learn how to leverage GPUs for interactive audio rendering. This session will give a short overview of the architecture of current GPUs, emphasizing some key differences between GPU and CPUs programming models for audio processing. We will illustrate the benefits of GPU-accelerated audio rendering with results from 3D audio processing and sound scattering simulations. Finally, we will discuss best practices for GPU implementations as well as future opportunities for audio rendering on massively parallel architectures.

  Back
 
Topics:
Acoustics and Audio Processing, Rendering & Ray Tracing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2042
Streaming:
Download:
Share:
 
Speakers:
Giancarlo Del Sordo
Abstract:
Learn how to implement a commercial software library that exploits CUDA for audio applications. We focus on the overall threading architecture and the underlying math for implementing general purpose audio processing in CUDA devices. Covers the ...Read More
Abstract:

Learn how to implement a commercial software library that exploits CUDA for audio applications. We focus on the overall threading architecture and the underlying math for implementing general purpose audio processing in CUDA devices. Covers the use of inter-process communication to make a plug-in implementation loadable in 32 bit hosts installed in 64 bit systems, distributing the GPU load on remote servers, and creating a CUDA network for high-end purposes such as a big recording facility.

  Back
 
Topics:
Acoustics and Audio Processing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102076
Streaming:
Download:
Share:
 
Abstract:
Learn how a synthesis of 3D sound scenes can be achieved using a peer-to-peer music streaming environment and GPU. We will discuss the technical and cost benefits to this approach, while noting that it frees the CPU for other tasks. ...Read More
Abstract:

Learn how a synthesis of 3D sound scenes can be achieved using a peer-to-peer music streaming environment and GPU. We will discuss the technical and cost benefits to this approach, while noting that it frees the CPU for other tasks.

  Back
 
Topics:
Acoustics and Audio Processing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102116
Streaming:
Download:
Share:
 
Speakers:
Jike Chong
Abstract:
We explore two contending recognition network representations for speech inference engines: the linear lexical model (LLM) and the weighted finite state transducer (WFST) on NVIDIA GTX285 and GTX480 GPUs. We demonstrate that while an inference e ...Read More
Abstract:

We explore two contending recognition network representations for speech inference engines: the linear lexical model (LLM) and the weighted finite state transducer (WFST) on NVIDIA GTX285 and GTX480 GPUs. We demonstrate that while an inference engine using the simpler LLM representation evaluates 22x more transitions per second than the advanced WFST representation, the simple structure of the LLM representation allows 4.7-6.4x faster evaluation and 53-65x faster operands gathering for each state transition. We illustrate that the performance of a speech inference engine based on the LLM representation is competitive with the WFST representation on highly parallel GPUs.

  Back
 
Topics:
Acoustics and Audio Processing
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10C01
Download:
Share:
 
Speakers:
Jike Chong
Abstract:
Automatic speech recognition (ASR) technology is emerging as a critical component in data analytics for a wealth of media data being generated everyday. ASR-based applications contain fine-grained concurrency that has great potential to be explo ...Read More
Abstract:

Automatic speech recognition (ASR) technology is emerging as a critical component in data analytics for a wealth of media data being generated everyday. ASR-based applications contain fine-grained concurrency that has great potential to be exploited on the GPU. However, the state-of-art ASR algorithm involves a highly parallel graph traversal on an irregular graph with millions of states and arcs, making efficient parallel implementations highly challenging. We present four generalizable techniques including: dynamic data-gather buffer, find-unique, lock-free data structures using atomics, and hybrid global/local task queues. When used together, these techniques can effectively resolve ASR implementation challenges on an NVIDIA GPU.

  Back
 
Topics:
Acoustics and Audio Processing
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10C02
Download:
Share:
Artificial Intelligence and Deep Learning
Presentation
Media
Speakers:
Jike Chong
Abstract:
Learn about how the GPU is able to meet the challenges of implementing automatic speech recognition (ASR), gain insights into the data-parallel implementation techniques that can provide 10x faster performance compared to sequentially processing ...Read More
Abstract:

Learn about how the GPU is able to meet the challenges of implementing automatic speech recognition (ASR), gain insights into the data-parallel implementation techniques that can provide 10x faster performance compared to sequentially processing ASR on a CPU. The state-of-art algorithm for ASR performs a graph traversal on a large, irregular graph with millions of states and arcs, guided by speech input only known at runtime. We present four generalizable techniques including: dynamic data-gather buffer, find-unique, lock-free data structures using atomics, and hybrid global/local task queues. When used together, these techniques can effectively resolve ASR implementation challenges on a GPU.

  Back
 
Topics:
Artificial Intelligence and Deep Learning, Acoustics and Audio Processing, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10C02
Streaming:
Download:
Share:
 
Speakers:
Dr.Alan Peters
Abstract:
Universal Robotics is using GPUs for real-time visual sensing in the reactive control of industrial robots. For a robot to work in a complex dynamic environment to achieve a more loosely specified goal, such as moving arbitrary boxes from a pall ...Read More
Abstract:

Universal Robotics is using GPUs for real-time visual sensing in the reactive control of industrial robots. For a robot to work in a complex dynamic environment to achieve a more loosely specified goal, such as moving arbitrary boxes from a pallet to a conveyor, requires reactivity. Reactive control requires intensive, concurrent, low-latency computation for motion planning, exception handling, and sensing. We describe and demonstrate how GPU-based computation enables visual servoing and box moving. We also discuss the potential of the GPU to solve more difficult sensory problems such as multi-robot cooperation, multimodal sensor binding, attention, sensitization, and habituation.

  Back
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102091
Streaming:
Download:
Share:
 
Speakers:
Avi Bleiweiss
Abstract:
A Zero-Sum game is a match for which the gain of one results in loss of the other. Tic-Tac-Toe, Checkers and Chess are Zero-Sum board game examples. For realizing the best player move, the game is abstracted as a tree, often quite deep, consisti ...Read More
Abstract:

A Zero-Sum game is a match for which the gain of one results in loss of the other. Tic-Tac-Toe, Checkers and Chess are Zero-Sum board game examples. For realizing the best player move, the game is abstracted as a tree, often quite deep, consisting of all possible configurations. We present an efficient GPU implementation of the Mini-Max search algorithm, enhanced with Alpha-Beta pruning. We highlight challenges for deploying non-tail recursion of a highly irregular algorithm on GPUs, proposing a hybrid of compiler and user managed stack. We demonstrate superior performance for running many thousands of 3D Tic-Tac-Toe matches, simultaneously.

  Back
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102207
Streaming:
Download:
Share:
 
Speakers:
Andrew Hershberger
Abstract:
CUDA Creatures applies parallel algorithms to the iterated Prisoner''s Dilemma, a classic study of the evolution of cooperation. We bring interactivity to parameter space exploration by achieving 600x to 800x speedups on GTX 260. ...Read More
Abstract:

CUDA Creatures applies parallel algorithms to the iterated Prisoner''s Dilemma, a classic study of the evolution of cooperation. We bring interactivity to parameter space exploration by achieving 600x to 800x speedups on GTX 260.

  Back
 
Topics:
Artificial Intelligence and Deep Learning
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10L01
Download:
Share:
Astronomy & Astrophysics
Presentation
Media
Speakers:
Alessandra Mastrobuono, Roberto Capuzzo-Dolcetta
- Sapienza Univ. of Roma
Abstract:
Astrophysics is a field where super computing is a must to obtain new scientific results. in particular, the study of the interaction among massive black holes and surrounding stars is a hot topic, ...Read More
Abstract:
Astrophysics is a field where super computing is a must to obtain new scientific results. in particular, the study of the interaction among massive black holes and surrounding stars is a hot topic, which requires heavy computations to have good representation of what happens in the inner regions of galaxies. We present the results obtained with our high precisioned N-body code, NBSymple, which exploits the joint power of a multi core CPU system together with the high performance NVIDIA Tesla C1060 GPUs. The code is available at the website: astrowww.phys.uniroma1.it/dolcetta/nbsymple.html  Back
 
Topics:
Astronomy & Astrophysics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102000
Streaming:
Download:
Share:
 
Speakers:
Matt Sinclair
Abstract:
Learn how to use the hidden computation capability of GPU texture units for general purpose computation. We describe GRASSY, a system for stellar spectral synthesis where the core problem is interpolation between pre-computed intensity value. We ...Read More
Abstract:

Learn how to use the hidden computation capability of GPU texture units for general purpose computation. We describe GRASSY, a system for stellar spectral synthesis where the core problem is interpolation between pre-computed intensity value. We map these pre-computed tables to the GPU''s texture memory. Interpolation then becomes a texture lookup where the hardware automatically performs the interpolation, albeit at very low precision. Our mathematical framework reasons about the impact of this precision and our performance results show 500X speedups. This work generalizes the GPU texture units as computation engines and opens up new problems for GPU acceleration.

  Back
 
Topics:
Astronomy & Astrophysics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10044
Download:
Share:
 
Speakers:
Richard Townsend
- University of Wisconsin-Madison
Abstract:
Standard FFT algorithms cannot be applied to spectral analysis of unevenly sampled data. Alternative approaches scale as O(N^2), making them an ideal target for harnessing the raw computing power of GPUs. ...Read More
Abstract:
Standard FFT algorithms cannot be applied to spectral analysis of unevenly sampled data. Alternative approaches scale as O(N^2), making them an ideal target for harnessing the raw computing power of GPUs. To this end, I have developed CU-LSP, a CUDA spectral analysis code based on the Lomb-Scargle periodogram. Preliminary benchmarking indicates impressive speed-ups, on the order of 400 relative to a single core of a modern CPU. An initial application of CU-LSP will be the analysis of time-series data from planet-search and asteroseismology satellites.   Back
 
Topics:
Astronomy & Astrophysics, Developer - Algorithms, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102082
Streaming:
Download:
Share:
 
Speakers:
Dominique Aubert
- Strasbourg University
Abstract:
Cosmological simulations aim at reproducing the physical processes which occur on the largest scales of the Universe since the Big-Bang by means of numerical calculations on supercomputers. ...Read More
Abstract:
Cosmological simulations aim at reproducing the physical processes which occur on the largest scales of the Universe since the Big-Bang by means of numerical calculations on supercomputers. Using CUDA, I have implemented standard cosmological techniques on GPU architecture (PM N-Body solver, Hydrodynamics & moment-based radiative transfer) and designed them to run on supercomputing facilities by means of MPI+CUDA mixed programming. These applications are able to run on 100 or more graphics devices with typical scalar x50 accelerations and with a communication overhead limited to 15%. It allow to explore physical regimes which were out of reach of current simulations.   Back
 
Topics:
Astronomy & Astrophysics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102099
Streaming:
Download:
Share:
 
Speakers:
Abdul Mroue
- CITA, Univ. Of Toronto
Abstract:
Get the latest information on how to evolve binary black holes simulations on GPUs.
 
Topics:
Astronomy & Astrophysics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102108
Streaming:
Download:
Share:
 
Speakers:
Mark Cheung
- Lockheed Martin Solar & Astrophysics Laboratory
Abstract:
Learn how GPU computing is enabling astrophysicists to study our closest star. NASA''s recently launched Solar Dynamics Observatory is continuously streaming full-disk images of the Sun at visible, UV and EUV wavelengths. ...Read More
Abstract:
Learn how GPU computing is enabling astrophysicists to study our closest star. NASA''s recently launched Solar Dynamics Observatory is continuously streaming full-disk images of the Sun at visible, UV and EUV wavelengths. This presentation will discuss ways that GPU computing is helping scientists cope with the analysis of the immense data volumes as well as in numerical modeling of the Sun.  Back
 
Topics:
Astronomy & Astrophysics, Computational Fluid Dynamics, Computer Vision, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102178
Streaming:
Download:
Share:
 
Speakers:
Rainer Spurzem
- National Astronomical Obersvatories, Chinese Academy of Sciences
Abstract:
Many, if not all galaxies harbour supermassive black holes. If galaxies merge, which is quite common in the process of hierarchical structure formation in the universe, ...Read More
Abstract:
Many, if not all galaxies harbour supermassive black holes. If galaxies merge, which is quite common in the process of hierarchical structure formation in the universe, their black holes sink to the centre of the merger remnant and form a tight binary. Depending on initial conditions and time supermassive black hole binaries are prominent gravitational wave sources, if they ultimately come close together and coalesce. We model such systems as gravitating N-body systems (stars) with two or more massive bodies (black holes), including if necessary relativistic corrections to the classical Newtonian gravitational forces (Kupi et al. 2006, Berentzen et al.2009).  Back
 
Topics:
Astronomy & Astrophysics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10B01
Download:
Share:
Autonomous Vehicles
Presentation
Media
Speakers:
Kshitij Gupta
Abstract:
In this presentation, we will talk about our experiences of implementing an end-to-end automatic speech recognition system that runs in faster than real-time on embedded GPUs, targeted towards small form-factor consumer devices. Focusing specifi ...Read More
Abstract:

In this presentation, we will talk about our experiences of implementing an end-to-end automatic speech recognition system that runs in faster than real-time on embedded GPUs, targeted towards small form-factor consumer devices. Focusing specifically on some of the challenges encountered during the design process, a major portion of our talk will focus on giving insights into modifications we made to well-established speech algorithms to fit well within the GPU programming model. We will show how these changes helped us in realizing a highly optimized system on platforms with limited memory bandwidth and compute resources.

  Back
 
Topics:
Autonomous Vehicles, Acoustics and Audio Processing, Mobile Applications, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102175
Streaming:
Download:
Share:
 
Speakers:
Theo Valich
Abstract:
Explore how advanced SoC technologies are transforming the world of automotive industry. Learn on how using nVidia Tegra increased the available range while pushing the envelope on next-gen driver experience. Sharing the lessons learned in the w ...Read More
Abstract:

Explore how advanced SoC technologies are transforming the world of automotive industry. Learn on how using nVidia Tegra increased the available range while pushing the envelope on next-gen driver experience. Sharing the lessons learned in the world of electric cars and challenges in constructing a mass production electric vehicle.

  Back
 
Topics:
Autonomous Vehicles, Graphics and AI, Computer Vision, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102303
Streaming:
Download:
Share:
 
Speakers:
Theo Valich
Abstract:
Learn how GPU technologies broke speed limits in automotive development. By using GPU-accelerated tools, small team of engineers created a complete certifiable vehicle in only two years, using fraction of the budget used in conventional industry ...Read More
Abstract:

Learn how GPU technologies broke speed limits in automotive development. By using GPU-accelerated tools, small team of engineers created a complete certifiable vehicle in only two years, using fraction of the budget used in conventional industry. Talk will cover tools and techniques used in creation of XD concept, as well as how to overcome challenges moving a product from concept to mass production stage.

  Back
 
Topics:
Autonomous Vehicles, Graphics and AI, Computer Vision
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102304
Streaming:
Download:
Share:
 
Speakers:
Vladimir Glavtchev
Abstract:
We investigate the use of differentGPU-based implementations for performing real-time speed limit sign recognition on a resource-constrainedembedded system. The system recognized US and European Union speed-limits at over 88% accuracy while runn ...Read More
Abstract:

We investigate the use of differentGPU-based implementations for performing real-time speed limit sign recognition on a resource-constrainedembedded system. The system recognized US and European Union speed-limits at over 88% accuracy while running in real-time. The system is hardware-accelerated using CUDA and OpenGL. It introduces a novel technique for detecting speed-limit signs which is only possible with the aid of GPU processing.

  Back
 
Topics:
Autonomous Vehicles
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10H01
Download:
Share:
 
Speakers:
Marius Vasiliu
Abstract:
NVIDIA GPU architecture becomes a very interesting hardware target for complex automotive application. We implemented the same automotive application on several different hardware targets and analyzed the maximum frame rate and the effective CPU ...Read More
Abstract:

NVIDIA GPU architecture becomes a very interesting hardware target for complex automotive application. We implemented the same automotive application on several different hardware targets and analyzed the maximum frame rate and the effective CPU charge. This paper shows how real-time applications like pedestrian detection and driving assistance take benefits from a massively parallel "central" architecture like GPU/CUDA. Real-time performance and zero-delay transfers can be achieved using a full asynchronous implementation. The same approach can really multiply the application performance by the number of GPU devices present on the embedded system, at a reasonable power consumption.

  Back
 
Topics:
Autonomous Vehicles
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10H02
Download:
Share:
Cloud Visualization
Presentation
Media
Speakers:
Abstract:
We present MatCloud (www.mat-cloud.com), a cloud infrastructure and service for scientific computing using state-of-the-art GPU clusters. MatCloud is a service infrastructure exposed by a simple web terminal interface to run Matlab-like commands ...Read More
Abstract:

We present MatCloud (www.mat-cloud.com), a cloud infrastructure and service for scientific computing using state-of-the-art GPU clusters. MatCloud is a service infrastructure exposed by a simple web terminal interface to run Matlab-like commands/scripts. Join us to see how GPU technology can not only be applied to cloud computing community, but also boost the adoption of cloud computing for its dramatic performance gains over traditional cloud infrastructures.MatCloud is an in-progress academic project and is under active development.

  Back
 
Topics:
Cloud Visualization, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S1020260
Streaming:
Download:
Share:
 
Speakers:
Tad Brockway
- Microsoft
Abstract:
Learn about Microsoft''s upcoming GPU Virtualization feature, RemoteFX, which will ship in Windows Server 2008 R2 SP1. ...Read More
Abstract:
Learn about Microsoft''s upcoming GPU Virtualization feature, RemoteFX, which will ship in Windows Server 2008 R2 SP1. Microsoft RemoteFX enables GPUs to be hosted in the datacenter as a service that can be shared by multiple users for streaming the real-time and complete Windows 7 desktop experience to ultra-lightweight client devices anywhere on the corporate network. With Microsoft RemoteFX, users will be able to work remotely in a Windows Aero desktop environment, watch full-motion video, enjoy Silverlight animations, and run 3D applications - all with the fidelity of local-like performance.  Back
 
Topics:
Cloud Visualization, Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102243
Streaming:
Download:
Share:
Computational Fluid Dynamics
Presentation
Media
Speakers:
Jonathan Cohen
- NVIDIA Research
Abstract:
OpenCurrent is an open source library with support for structured 3D grids and various PDE solvers that operate on them, including a multigrid Poisson solver and an incompressible Navier-Stokes solver. ...Read More
Abstract:
OpenCurrent is an open source library with support for structured 3D grids and various PDE solvers that operate on them, including a multigrid Poisson solver and an incompressible Navier-Stokes solver. It also includes extensions for splitting grids across multiple GPUs. This talk will provide a basic introduction to the code base and its design principles.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2022
Download:
Share:
 
Speakers:
Vivien Clauzon
Abstract:
Hear why and how Numtech, a french SME working in the field of atmospheric dispersion and expertise of meteorological events, is benchmarking GPGPU for its futures applications. ...Read More
Abstract:
Hear why and how Numtech, a french SME working in the field of atmospheric dispersion and expertise of meteorological events, is benchmarking GPGPU for its futures applications. A compressible and an incompressible interactive flow solvers are described.  Back
 
Topics:
Computational Fluid Dynamics, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2037
Streaming:
Download:
Share:
 
Speakers:
Matthieu Lefebvre
- ONERA
Abstract:
Hear how we are improving our elsA and CEDRE computational fluid dynamics software by working on solving the Euler equations set on the GPU. ...Read More
Abstract:
Hear how we are improving our elsA and CEDRE computational fluid dynamics software by working on solving the Euler equations set on the GPU. We discuss how our implementation considers the associated Riemann problem and the Roe-Pike differencing scheme at several orders in space while also introducing immerse boundary conditions. Covers the significant speedup obtained through algorithmic and computational optimizations.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2045
Streaming:
Download:
Share:
 
Speakers:
Kees Vuik, Rohit Gupta
- Delft University Of Technology
Abstract:
Explore how to use deflation as a second level preconditioning technique to speed up Block Incomplete Cholesky Preconditioned Conjugate Gradient Method. ...Read More
Abstract:
Explore how to use deflation as a second level preconditioning technique to speed up Block Incomplete Cholesky Preconditioned Conjugate Gradient Method. We use it to solve the Pressure correction equation involved in the solution of the Two-Phase Fluid Flow problem. Our implementation reaches speedup factors between 25-30, for more than 260,000 unknowns, when compared to the CPU.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2049
Streaming:
Download:
Share:
 
Speakers:
Con Caris, John Taylor, Tomasz Bednarz
- CSIRO
Abstract:
Learn step-by-step procedures to write an explicit CFD solver based on final difference methods with staggered grid allocations and boundary fitted coordinates. ...Read More
Abstract:
Learn step-by-step procedures to write an explicit CFD solver based on final difference methods with staggered grid allocations and boundary fitted coordinates. We will discuss the derivation of the mathematical model, discretization of the model equations, development of the algorithms, and parallelization and visualization of the computed data using OpenCL and OpenGL. Compares case studies of natural convection, driven cavity, scaling analysis, and magneto-thermal convection computed using CSIRO''s CPU/GPU supercomputer cluster to known analytical and experimental solutions.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI, HPC and AI, Physics Simulation, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2058
Streaming:
Download:
Share:
 
Speakers:
Timothy Warburton
- Rice University
Abstract:
In the last three years we have demonstrated how GPU accelerated discontinuous Galerkin methods have enabled simulation of time-dependent, electromagnetic scattering from airplanes and helicopters. ...Read More
Abstract:
In the last three years we have demonstrated how GPU accelerated discontinuous Galerkin methods have enabled simulation of time-dependent, electromagnetic scattering from airplanes and helicopters. In this talk we will discuss how we have extended these techniques to enable GPU accelerated simulation of supersonic airflow as well.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102078
Download:
Share:
 
Speakers:
David M., Patrice Castonguay
- Stanford University
Abstract:
We will describe a scalable and efficient high-order unstructured compressible flow solver for GPUs. ...Read More
Abstract:
We will describe a scalable and efficient high-order unstructured compressible flow solver for GPUs. The solver allows the achievement of arbitrary order of accuracy for flows over complex geometries. High-order solvers require more operations per degree of freedom, thus making them highly suitable for massively parallel processors. Preliminary results indicate speed-ups up to 70x with the Tesla C1060 compared to the Intel i7 CPU. Memory access was optimized using shared and texture memory.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102079
Streaming:
Download:
Share:
 
Speakers:
Peter Zaspel
- University of Bonn
Abstract:
This demonstrates the potential of GPUs for solving complex free surface flow problems using level set methods. ...Read More
Abstract:
This demonstrates the potential of GPUs for solving complex free surface flow problems using level set methods. These methods are capable of producing complex surface deformations, and therefore are used widely in computer graphics, as well as engineering applications. This work demonstrates that GPUs can be used to accelerate the most computationally expensive part of free surface flow calculations, and therefore allows much larger problems to be solved on workstation machines than was previously possible. These techniques will be exemplified by our current project to port our in-house fluid solver NaSt3DGPF to the GPU.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102083
Download:
Share:
 
Speakers:
Allan Peter
- Technical University of Denmark
Abstract:
This work is concerned with the development of an efficient high-throughput scalable model for simulation of fully nonlinear water waves (OceanWave3D) applicable to solve and analyze large-scale problems in coastal engineering. ...Read More
Abstract:
This work is concerned with the development of an efficient high-throughput scalable model for simulation of fully nonlinear water waves (OceanWave3D) applicable to solve and analyze large-scale problems in coastal engineering. The goal can be achieved through algorithm redesign and parallelization of an optimized sequential single-CPU algorithm based on a flexible-order Finite Difference Method. High performance is pursued by utilizing many-core processing in the model focusing on GPUs for acceleration of code execution. This involves combining analytical methods with an algorithm redesign of the current numerical model.   Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102103
Download:
Share:
 
Speakers:
Issei Masaie
- Prometech Software, Inc.
Abstract:
Prometech Software, Inc. is an university launched technology venture in Japan and has been working in the field of particle-based computational fluid dynamics for several years. ...Read More
Abstract:
Prometech Software, Inc. is an university launched technology venture in Japan and has been working in the field of particle-based computational fluid dynamics for several years. Through collaboratinos with major automotive and material companies in Japan, Prometech has implemented our Particle technology on Multi-GPU and delivered as a CAE software, "Particleworks". In this session, we will discuss the theoretical background of our simulation (MPS; Moving Particle Simulation method), Multi GPU programming techniques of sparse matrix solver, performance results of Particleworks and the analysis examples of the Auto and Material.  Back
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102106
Streaming:
Download:
Share:
 
Speakers:
Christopher Stone
- Intelligent Light
Abstract:
Dive deep as we present the details of a new CUDA-based algorithm for accurate rotorcraft wake simulations. ...Read More
Abstract:
Dive deep as we present the details of a new CUDA-based algorithm for accurate rotorcraft wake simulations. We use a vortex particle method, accelerated with a multipole tree algorithm, combined with a traditional grid-based CFD code. This CUDA algorithm can evaluate the velocity and velocity-gradient with an effective throughput approaching 300 billion interactions per second on a C1060. This gives 10x speed-up and 2.5x better accuracy compared to the parallel CPU version.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102110
Streaming:
Download:
Share:
 
Speakers:
Tobias Brandvik
- University of Cambridge
Abstract:
This talk describes a strategy for implementing structured grid PDE solvers on GPUs. Techniques covered include the use of source-to-source compilation and the use of sparse matrix vector multiplications for complicated boundary conditions. ...Read More
Abstract:
This talk describes a strategy for implementing structured grid PDE solvers on GPUs. Techniques covered include the use of source-to-source compilation and the use of sparse matrix vector multiplications for complicated boundary conditions. A new production-quality solver for flows in turbomachines called Turbostream that uses these techniques is presented. The impact of the use of GPUs on the turbomachinery design process is demonstrated by two 64-GPU simulations that have recently been performed on the University of Cambridge''s GPU cluster.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102118
Streaming:
Download:
Share:
 
Speakers:
Jonas Toelke
- Ingrain
Abstract:
Learn how a very efficient implementation of multiphase lattice Boltzmann methods (LBM) based on CUDA delivers significant benefits for predictions of properties in rocks. ...Read More
Abstract:
Learn how a very efficient implementation of multiphase lattice Boltzmann methods (LBM) based on CUDA delivers significant benefits for predictions of properties in rocks. This simulator on NVIDIA hardware enables us to perform pore scale multi-phase (oil-water-matrix) simulations in natural porous media and to predict important rock properties like absolute permeability, relative permeabilites, and capillary pressure. We will show videos of these simulations in complex real world porous media and rocks.  Back
 
Topics:
Computational Fluid Dynamics, Seismic & Geosciences
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102170
Streaming:
Download:
Share:
 
Speakers:
Daniel Gaudlitz
- FluiDyna
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102206
Streaming:
Download:
Share:
 
Speakers:
Andrew Corrigan, Keith Obenschain
- Naval Research Lab Code 6440
Abstract:
Explore how a code written to run in parallel using OpenMP and on a single GPU was modified to run across multiple GPUs and nodes on a multi-CPU, multi-GPU cluster installed at the Naval Research Laboratory. ...Read More
Abstract:
Explore how a code written to run in parallel using OpenMP and on a single GPU was modified to run across multiple GPUs and nodes on a multi-CPU, multi-GPU cluster installed at the Naval Research Laboratory. We will discuss the performance of this code running in parallel using MPI/OpenMP and MPI/CUDA.  Back
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102234
Streaming:
Download:
Share:
 
Speakers:
Dan Bailey
Abstract:
Explore how a less efficient, but highly parallel algorithm can still be a superior alternative to a sequential CPU method. This talk will present a simple CUDA-based Poisson solver to the conjugate gradient method designed for solving well-cond ...Read More
Abstract:

Explore how a less efficient, but highly parallel algorithm can still be a superior alternative to a sequential CPU method. This talk will present a simple CUDA-based Poisson solver to the conjugate gradient method designed for solving well-conditioned matrices such as those that arise from the pressure projection stage of a Navier-Stokes fluid solver. In contrast to other active areas of research in this field, we show that a more brute force approach can still significantly out-perform the best CPU alternatives by sacrificing a high convergence rate in place of achieving much faster iterations.

  Back
 
Topics:
Computational Fluid Dynamics, Graphics and AI, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102239
Download:
Share:
 
Speakers:
Arun Somani, Lizandro Solano, Z.J. Wang
- Iowa State University
Abstract:
A discontinuous high-order formulation named the Correction Procedure via Reconstruction (CPR) is recently implemented on Nvidia GPUs. ...Read More
Abstract:
A discontinuous high-order formulation named the Correction Procedure via Reconstruction (CPR) is recently implemented on Nvidia GPUs. The CPR formulation is related to the discontinuous Galerkin (DG) method, and unifies several methods such as the DG, spectral volume and spectral difference into a single framework efficient for hybrid meshes. In preliminary 2D inviscid flow computations, a single GPU has been able to deliver a speedup of 44 over a CPU of the same generation. Extension is being made for viscous flow computation, and results will be presented at the final presentation.   Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102292
Streaming:
Download:
Share:
 
Speakers:
Takayuki Aoki
- Tokyo Institute of Technology
Abstract:
Many CFD applications have been successfully accelerated on GPUs, but for large-scale simulations that require memory beyond a single GPU, communication is required between GPUs over cluster nodes through PCI-Express and interconnects. ...Read More
Abstract:
Many CFD applications have been successfully accelerated on GPUs, but for large-scale simulations that require memory beyond a single GPU, communication is required between GPUs over cluster nodes through PCI-Express and interconnects. To overcome performance bottlenecks and preserve parallel scalability, an overlapping technique between computation and communication is essential. This work presents results of an LBM for incompressible flow, and a Tsunami simulation solving the shallow water equation for simulations on the NVIDIA Tesla-based TSUBAME supercomputer of Tokyo Tech. In addition results will be presented on a complete GPU implementation of a production-level weather prediction code developed by the JMA that achieves 15 TFLOPS for an 80-fold speedup.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102295
Streaming:
Download:
Share:
 
Speakers:
Patrice Castonguay
- Stanford University
Abstract:
The objective of this project is to develop a scalable and efficient high-order unstructured compressible flow solver for GPUs. ...Read More
Abstract:
The objective of this project is to develop a scalable and efficient high-order unstructured compressible flow solver for GPUs. The solver allows the achievement of arbitrary order of accuracy for flows over complex geometries. High-order solvers require more operations per degree of freedom, thus making them highly suitable for massively parallel processors. Preliminary results indicate speed-ups up to 70x with the Tesla C1060 compared to the Intel i7 CPU. Memory access was optimized using shared and texture memory.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D01
Download:
Share:
 
Speakers:
Dana Jacobsen
- Boise State University
Abstract:
An investigation of the performance and scalability of a multigrid pressure Poisson equation solver running on a GPU cluster.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D02
Download:
Share:
 
Speakers:
Gilles Civario
- Irish Centre for High-End Computing
Abstract:
In this work, the acceleration of a mesh-free Computational Fluid Dynamics (CFD) code is performed using CUDA.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D03
Download:
Share:
 
Speakers:
Sean Lovett
- University of Cambridge
Abstract:
We used NVIDIA Tesla GPUs to accelerate the solution of hyperbolic partial differential equations, with application to modelling airblast generated by industrial bench mining operations. ...Read More
Abstract:
We used NVIDIA Tesla GPUs to accelerate the solution of hyperbolic partial differential equations, with application to modelling airblast generated by industrial bench mining operations. Parallelisation over multiple GPUs was achieved using MPI.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D04
Download:
Share:
 
Speakers:
Z.J. Wang
- Iowa State University
Abstract:
This poster describes our implementation of adaptive high-order CFD methods on GPUs. A speedup factor of up to 44 has been achieved for 2D flow problems.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D05
Download:
Share:
 
Speakers:
Long Wang
- Supercomputing Center, Chinese Academy of Sciences
Abstract:
Computational Fluid Dynamics, an important branch in HPC field, has a history of seeking and requiring higher computational performance. ...Read More
Abstract:
Computational Fluid Dynamics, an important branch in HPC field, has a history of seeking and requiring higher computational performance. The traditional way to satisfy this quest is to use faster machines or supercomputers. Yet these approaches seem inconvenient and costly to many individual researchers. We investigated the use of GPU to accelerate CFD codes and tested the performances on CUDA and OpenCL platform. We have ported 2D cave flow, 2D Riemann, and 2D flow over a RAE2882 airfoil to the GPU and explored some GPU-specific optimization strategies. In most cases, approximately 16 to 63 x speed up can be achieved.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D06
Download:
Share:
Computer Vision
Presentation
Media
Speakers:
Tom Dean
Abstract:
Join us for a discussion on applying commodity-server-based clusters and GPU-based clusters to simulating computer vision algorithms at a scale that approaches that of biological vision. We consider the limitations of each technology, survey app ...Read More
Abstract:

Join us for a discussion on applying commodity-server-based clusters and GPU-based clusters to simulating computer vision algorithms at a scale that approaches that of biological vision. We consider the limitations of each technology, survey approaches taken thus far, and suggest new hybrid models and programming frameworks to overcome current limitations and substantially improve performance.

  Back
 
Topics:
Computer Vision, Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102132
Download:
Share:
 
Speakers:
Abstract:
Learn how to use CUDA and GPGPU to perform large scale face search for both forensics as well as CCTV face recognition.
 
Topics:
Computer Vision, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102173
Streaming:
Download:
Share:
 
Abstract:
Biologically-inspired computer vision algorithms - those that aim to mirror the computations performed by the brain''s visual system - have emerged as exceptionally promising candidates in object and face recognition research, achieving ...Read More
Abstract:

Biologically-inspired computer vision algorithms - those that aim to mirror the computations performed by the brain''s visual system - have emerged as exceptionally promising candidates in object and face recognition research, achieving performance on a range of object and face recognition tasks. Recently, we have begun harnessing the newly-available power of NVIDIA GPUs to tackle the problem of biologically-inspired model selection within a largescale model search framework, drawing inspiration from high-throughput screening approaches in molecular biology and genetics where a large number of organisms are screened in parallel for a given property of interest. As the available computational power provided by massively paralleltechnology from NVIDIA continues to expand, w e hope that this research will hold great potential for new social networking applications in addition to rapidly accelerating progress in artificial vision, and for generating new, experimentally testable hypotheses for the study of biological vision.

  Back
 
Topics:
Computer Vision, Artificial Intelligence and Deep Learning, HPC and AI, Neuroscience
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102204
Download:
Share:
 
Speakers:
James Fung
- NVIDIA
Abstract:
GPUS have evolved from fixed function to general purpose, and continue to evolve with new features being added in every generation. ...Read More
Abstract:
GPUS have evolved from fixed function to general purpose, and continue to evolve with new features being added in every generation. This talk will discuss how to exploit the new features introduced by the Fermi architecture (such as concurrent kernel execution, writes to texture) to accelerate computer vision algorithms.  Back
 
Topics:
Computer Vision, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102209
Streaming:
Download:
Share:
 
Speakers:
Joe Stam
Abstract:
OpenCV is a widely popular computer vision library, with millions of downloads and hundreds of thousands of users. Applications span many industries including robotics, industrial machine vision, automotive, film & broadcast, medical, and co ...Read More
Abstract:

OpenCV is a widely popular computer vision library, with millions of downloads and hundreds of thousands of users. Applications span many industries including robotics, industrial machine vision, automotive, film & broadcast, medical, and consumer applications. NVIDIA and the OpenCV development team are collaborating to provide CUDA implementations of the most demanding algorithms, thus enabling a new level of real-time capability and higher quality results. This talk with introduce OpenCV, and summarize the new CUDA enabled capabilities, and provide an overview of future plans.

  Back
 
Topics:
Computer Vision, Stereoscopic 3D, Video & Image Processing, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102215
Streaming:
Download:
Share:
 
Speakers:
Peter Lu
- Harvard University
Abstract:
Ever since the 1827 discovery of Brownian motion by observing pollen grains, quantifying motion under the microscope has led to breakthroughs in physics, biology and engineering. ...Read More
Abstract:
Ever since the 1827 discovery of Brownian motion by observing pollen grains, quantifying motion under the microscope has led to breakthroughs in physics, biology and engineering. Here, I present methods we have developed using confocal microscopy to deduce 3D structure and dynamics from 2D image sequences. We analyze the motion of diffusing colloidal particles and swarms of bacteria free to swim in 3D, which we observe at the single-organism level. We rely heavily on GPU computing to process our large data sets, making extensive use of NPP, CuFFT and optical-flow CUDA algorithms originally developed for machine vision in automobiles.  Back
 
Topics:
Computer Vision, Life & Material Science, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102242
Streaming:
Download:
Share:
 
Speakers:
Mahesh Khadtare
- CRL India
Abstract:
Explores the GPU porting and performance analysis of the image quality assessment algorithm based on structural similarity index(SSI). ...Read More
Abstract:
Explores the GPU porting and performance analysis of the image quality assessment algorithm based on structural similarity index(SSI). This index is a powerful tool for image quality assessment and the algorithm is highly suitable for GPU architecture, offering a rapid image quality assessment in many image restoration applications.  Back
 
Topics:
Computer Vision, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102298
Streaming:
Download:
Share:
 
Speakers:
Hannes Fassold
Abstract:
Discover how post-production tasks can be accelerated by taking advantage of GPU-based algorithms. In this talk we present computer vision algorithms for corner detection, feature point tracking, image warping and image inpainting, and their eff ...Read More
Abstract:

Discover how post-production tasks can be accelerated by taking advantage of GPU-based algorithms. In this talk we present computer vision algorithms for corner detection, feature point tracking, image warping and image inpainting, and their efficient implementation on GPUs using CUDA. We also show how to use these algorithms to do real-time stabilization and temporal re-sampling (re-timing) of high definition video sequences, both common tasks in post-production. Benchmarking of the GPU implementations against optimized CPU algorithms demonstrates a speedup of approximately an order of magnitude.

  Back
 
Topics:
Computer Vision, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2029
Streaming:
Download:
Share:
 
Speakers:
Abstract:
To measure three-dimensional shape data of objects, we build up a measurement system that assigns three-dimensional coordinates to the position of projected measurement labels in a camera image. ...Read More
Abstract:
To measure three-dimensional shape data of objects, we build up a measurement system that assigns three-dimensional coordinates to the position of projected measurement labels in a camera image. To achieve high measurement accuracy across high amounts of measurement points, we need a very quick routine to localize measurement labels with high precision. To speed up the computation, we evaluate the fits using the CUDA architecture. The final implementation speeds up the fitting of 104 two-dimensional Gauss functions by a factor of 90.  Back
 
Topics:
Computer Vision, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102065
Streaming:
Download:
Share:
 
Speakers:
Kento Tarui
Abstract:
We propose a real time HOG based object detector implemented on GPU. To accelerate the detection process, the proposed method uses two serially-cascaded HOG detectors. The first low dimensional HOG detector discards detection windows obviously n ...Read More
Abstract:

We propose a real time HOG based object detector implemented on GPU. To accelerate the detection process, the proposed method uses two serially-cascaded HOG detectors. The first low dimensional HOG detector discards detection windows obviously not showing target objects. It reduces the computational cost of the second high dimensional HOG detector. This method tested on 640x480 color image and the same size movie. The computation time decreases to 70ms per image. That is 4 times faster than a case of single detector. This method provides real time performance even on middle end GPUs such as GeForce GTS 250.

  Back
 
Topics:
Computer Vision, Artificial Intelligence and Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102114
Streaming:
Download:
Share:
 
Speakers:
Ryan Ismert
- Sportvision, Inc.
Abstract:
This talk will take a detailed look at Sportvision''s "First and 10" system, perhaps the most widely experienced example of AR ever, with 106 million viewers during the 2010 Superbowl alone. ...Read More
Abstract:
This talk will take a detailed look at Sportvision''s "First and 10" system, perhaps the most widely experienced example of AR ever, with 106 million viewers during the 2010 Superbowl alone. We''ll examine the current implementation and the GPU features that enable low latency, video-rate performance.  Back
 
Topics:
Computer Vision
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102123
Streaming:
Download:
Share:
 
Speakers:
Bor-Yiing Su
- University of California, Berkeley
Abstract:
We have developed a parallel object recognition system using CUDA, achieving 70x-80x speedup against the original serial implementation. ...Read More
Abstract:
We have developed a parallel object recognition system using CUDA, achieving 70x-80x speedup against the original serial implementation. In order to optimize our implementation, we evaluated the performance of different parallelization strategies on some key computations in the object recognition system. Finally we concluded that the parallel implementation performance is sensitive to input data properties. Therefore, we should dynamically adjust the parallelization strategy at runtime to optimize key computations.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F01
Download:
Share:
 
Speakers:
Narayanan Sundaram
- University of California, Berkeley
Abstract:
In this poster we discuss a method for computing point trajectories based on a fast parallel implementation of a recent optical flow algorithm that tolerates fast motion. ...Read More
Abstract:
In this poster we discuss a method for computing point trajectories based on a fast parallel implementation of a recent optical flow algorithm that tolerates fast motion. The parallel implementation of large displacement optical flow runs about 78x faster than the serial C++ version. We use this implementation is a point tracking application. Our resulting technique tracks up to three orders of magnitude more points and is 46% more accurate than the Kanade-Lucas-Tomasi tracker. Compared to the Particle Video tracker, we achieve 66% better accuracy while retaining the ability to handle large displacements while running an order of magnitude faster.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F02
Download:
Share:
 
Speakers:
Steven Brumby
- Los Alamos National Laboratory
Abstract:
Los Alamos National Laboratory''s Petascale Synthetic Visual Cognition project is exploring full-scale, real-time functional models of human visual cortex to understand how human vision achieves its accuracy, robustness and speed. ...Read More
Abstract:
Los Alamos National Laboratory''s Petascale Synthetic Visual Cognition project is exploring full-scale, real-time functional models of human visual cortex to understand how human vision achieves its accuracy, robustness and speed. Commercial-off-the-shelf hardware to support this modeling is rapidly improving, e.g., a teraflop GPGPU card costs ~$500 and is ~size of mouse cortex. We present results demonstrating image classification on UAV aerial video with a visual cortex model running on a 240-core NVIDIA GeForce GTX285, and see >x10 speed-up. As this technology continues to improve, cortical modeling on GPGPU devices has the potential to revolutionize computer vision.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F03
Download:
Share:
 
Speakers:
Melvin Wong
- Institute for Infocomm Research
Abstract:
Background subtraction is one of the important image processing steps for video surveillance and many computer vision problems such as tracking & recognition. ...Read More
Abstract:
Background subtraction is one of the important image processing steps for video surveillance and many computer vision problems such as tracking & recognition. However, robust background subtraction that adapts well to variable environment changes is highly computational and consumed large amount of memory. Thus, its practical application is often limited. Here, we aimed to expand its usage and tackle vision problems that requires high frame rate camera such as real-time sports analysis, real-time object detection and recognition. Using recent advances in accelerator hardware - NVIDIA Fermi Architecture and taking advantage of heterogeneous computing , we are able to gain good performance that allows to use in these practical applications.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F04
Download:
Share:
 
Speakers:
Nicolas Pinto
- Massachusetts Institute of Technology
Abstract:
The construction of artificial vision systems and the study of biological vision are naturally intertwined as they represent simultaneous efforts to forward- and reverse-engineer systems with similar goals. ...Read More
Abstract:
The construction of artificial vision systems and the study of biological vision are naturally intertwined as they represent simultaneous efforts to forward- and reverse-engineer systems with similar goals. Here, we present a high-throughput approach to more expansively explore biologically-inspired models by leveraging GPUs. We show that this approach can yield significant gains in performance on object and face recognition (including "Labeled Faces in the Wild" challenge and faces from Facebook), consistently outperforming the state-of-the-art. We highlight how the application of flexible programming tools, such as high-level scripting, template metaprogramming/auto-tuning, can enable large performance gains, while managing complexity for the developer.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F05
Download:
Share:
 
Speakers:
Salman Ul Haq
- TunaCode
Abstract:
CUVI Lib (CUDA for Vision and Imaging Library) is a software library that provides a set of GPU accelerated computer vision and image processing functions. ...Read More
Abstract:
CUVI Lib (CUDA for Vision and Imaging Library) is a software library that provides a set of GPU accelerated computer vision and image processing functions. CUVI can both be utilized as an add-on library for the NVIDIA''s NPP (NVIDIA Performance Primitives) as it compliments the functionality present in NPP as well as it can be used as a standalone library ready to be plugged into end-user C/C++ applications.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F06
Download:
Share:
 
Speakers:
In Kyu Park
- Inha University
Abstract:
We present a new surfel (surface element) based multi-view stereo algorithm which runs entirely on GPU. ...Read More
Abstract:
We present a new surfel (surface element) based multi-view stereo algorithm which runs entirely on GPU. We utilize flexibility of surfel-based 3D shape representation and global optimization by graph cuts in a same framework.The orientation of the constructed surfel candidates imposes an effective constraint that reduces the effect of the minimal surface bias. The entire processing pipeline is implemented on the latest GPU to speed up the processing significantly. Experimental results show that the proposed approach reconstructs the 3D shape of an object accurately and efficiently, which runs more than 100 times faster than on CPU.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F07
Download:
Share:
 
Speakers:
Jayadeep Vijayan
- NeST Software
Abstract:
A GPU based implementation of a face recognition solution using PCA with Eigenfaces algorithm.
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F08
Download:
Share:
 
Speakers:
Jan-Michael Frahm
- University of North Carolina, Chapel Hill
Abstract:
We present a system to reconstruct dense 3D models from community photo collections. First images are described using GIST and are clustered using hamming distances. ...Read More
Abstract:
We present a system to reconstruct dense 3D models from community photo collections. First images are described using GIST and are clustered using hamming distances. Each of these clusters is geometrically verified and connected using Geotags. Connected clusters are bundle adjusted and the obtained registration is used to estimate depthmaps that are finally fused to obtain dense 3D models. Each of the above steps, except Bundle Adjustment, is implemented in CUDA and runs on multiple GPUs . The performance of our pipeline is two order of magnitude faster on one order more images compared to state of the art method.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F09
Download:
Share:
 
Speakers:
Chloe Vaniet
- Imperial College London
Abstract:
Vision enhancement systems is an alternative visual aid device to enhance the remaining vision for visual impairment subjects. ...Read More
Abstract:
Vision enhancement systems is an alternative visual aid device to enhance the remaining vision for visual impairment subjects. Our aim is to develop a mobile central vision enhancement system for macular degeneration patients. Three different types of enhancement algorithms have been developed and their efficiency was tested on low vision patients. These three algorithms have been implemented on a portable low power devic. The Nvidia system-on-a-chip Tegra has been chosen for this implementation.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F10
Download:
Share:
 
Speakers:
Esubalew Bekele
- Universal Robotics Inc.
Abstract:
A dense stereo vision for a material handling dual-arm industrial robot have been implemented with the Rectification, Stereo Correspondence and 3D Pose from depth are ported out to GPU using CUDA.
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F11
Download:
Share:
 
Speakers:
Hendrik Dahlkamp
- Stanford University
Abstract:
We present a flexible, parallelized method for fusing information from optical and range sensors based on an accelerated high-dimensional filtering approach. ...Read More
Abstract:
We present a flexible, parallelized method for fusing information from optical and range sensors based on an accelerated high-dimensional filtering approach. Our system takes as input a sequence of monocular camera images as well as a stream of sparse range measurements as obtained from a laser or other sensor system. Our method produces a dense, high-resolution depth map of the scene, automatically generating confidence values for every interpolated depth point. We describe how to integrate priors on object shape, motion and appearance and how to achieve an efficient implementation using parallel processing hardware such as GPUs.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F12
Download:
Share:
 
Speakers:
Varun Ganapathi
- Stanford University
Abstract:
In this work, we derive an efficient filtering algorithm for tracking human pose at 4-10 frames per second using a stream of monocular depth images. ...Read More
Abstract:
In this work, we derive an efficient filtering algorithm for tracking human pose at 4-10 frames per second using a stream of monocular depth images. The key idea is to combine an accurate generative model-which is achievable in this setting using state of the art GPU hardware-with a discriminative model that feeds data-driven evidence about body part locations. We describe a novel algorithm for propagating the noisy evidence about body part locations up the kinematic chain using the unscented transform.We provide extensive experimental results on 28 real-world sequences using automatic ground-truth annotations from a commercial motion capture system.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F13
Download:
Share:
 
Speakers:
Tim Llewellynn
- nViso / EPFL
Abstract:
Active Appearance Models (AAM) is a powerful tool for modeling and matching objects under shape deformations and texture variations. ...Read More
Abstract:
Active Appearance Models (AAM) is a powerful tool for modeling and matching objects under shape deformations and texture variations. It learns characteristics of objects by building a compact statistical model from applying Principal Component Analysis (PCA) to a set of labeled data. Although AAM has been widely applied in the fields of computer vision, due to its flexible framework, it still cannot satisfy the requirement of real-time situations. To alleviate this problem, we address the computational complexity of the fitting procedure by running the AAM optimization algorithm on a GPU using a hybrid CPU / GPU block processing architecture.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F14
Download:
Share:
 
Speakers:
Anatoly Baksheev
- ITEEZ
Abstract:
OpenCV is a free open source library of computer vision algorithms. Recently a new module consisting of functions implemented on GPU was introduced in OpenCV. ...Read More
Abstract:
OpenCV is a free open source library of computer vision algorithms. Recently a new module consisting of functions implemented on GPU was introduced in OpenCV. It consists of several methods for calculating stereo correspondence between two images that is used to reconstruct a 3D scene. A simple block-matching algorithm works up to 10x faster compared to a CPU implementation in OpenCV providing real-time processing of HD stereo pairs on Tesla cards. Belief propagation-based algorithms show 20-50x speedup compared to a CPU implementation.  Back
 
Topics:
Computer Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F15
Download:
Share:
Databases, Data Mining, Business Intelligence
Presentation
Media
Speakers:
Ren Wu
Abstract:
Join us and learn why GPU computing is a game changer for business intelligence (BI). We will discuss how GPUs can be used to accelerate BI analytics at much lower cost, higher performance, and better power efficiency than other alternatives.

Read More
Abstract:

Join us and learn why GPU computing is a game changer for business intelligence (BI). We will discuss how GPUs can be used to accelerate BI analytics at much lower cost, higher performance, and better power efficiency than other alternatives.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence, Finance, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102069
Streaming:
Download:
Share:
 
Speakers:
Abstract:
In a large-scale database installation where data tables are distributed across multiple servers, computational throughput can be optimized by using GPUs on each server and integrating database management with GPU resources. In the Department of ...Read More
Abstract:

In a large-scale database installation where data tables are distributed across multiple servers, computational throughput can be optimized by using GPUs on each server and integrating database management with GPU resources. In the Department of Physics and Astronomy at The Johns Hopkins University, we are experimenting with a set of software tools that closely couple SQL statements with GPU functionality. While still under development, the new framework is now routinely used in our research projects, e.g., to study the spatial clustering of galaxies as well as genomics.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence, Tools & Libraries, Astronomy & Astrophysics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102092
Streaming:
Download:
Share:
 
Speakers:
Abstract:
In this research session, we present an approach using NVIDIA GPUs as massively parallel coprocessors for in-memory OLAP computations. Early tests have shown speedup factors of more than 40x compared to optimized sequential algorithms on a CPU. ...Read More
Abstract:

In this research session, we present an approach using NVIDIA GPUs as massively parallel coprocessors for in-memory OLAP computations. Early tests have shown speedup factors of more than 40x compared to optimized sequential algorithms on a CPU. In addition to the data structures and algorithms involved, we describe a method to extend the approach to systems with more than one GPU in order to scale it to larger data sets.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102237
Streaming:
Download:
Share:
 
Speakers:
Peter Volk
Abstract:
With an increasing amount of data and user demands for fast query processing, the optimization of database operations continues to be a challenging task. A common optimization method is to leverage parallel hardware architectures. With the intro ...Read More
Abstract:

With an increasing amount of data and user demands for fast query processing, the optimization of database operations continues to be a challenging task. A common optimization method is to leverage parallel hardware architectures. With the introduction of general-purpose GPU computing, massively parallel hardware has become available within commodity hardware. To efficiently exploit this technology, we introduce the method of speculative query processing. This speculative query processing works on index structures to efficiently support heavily used database operations. To show the benefits and opportunities of our approach, we present a fine and coarse grain implementation for multidimensional queries.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10G02
Download:
Share:
 
Speakers:
Henry Cook
Abstract:
We propose a mechanism to provide the benefits of a software-managed memory hierarchy on top of a hierarchy of hardware-managed caches. A virtual local store (VLS) is mapped into the virtual address space of a process and backed by physical main ...Read More
Abstract:

We propose a mechanism to provide the benefits of a software-managed memory hierarchy on top of a hierarchy of hardware-managed caches. A virtual local store (VLS) is mapped into the virtual address space of a process and backed by physical main memory, but is stored in a partition of the hardware-managed cache when active. This reduces context switch cost, and allows VLSs to migrate with their process thread. The partition allocated to the VLS can be rapidly reconfigured without flushing the cache, allowing programmers to selectively use VLS in a library routine with low overhead.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10G03
Download:
Share:
Developer - Algorithms
Presentation
Media
Speakers:
Nikolai Sakharnykh
Abstract:
Learn about new techniques to efficiently implement the Alternating Direction Implicit method on GPU for large 2D and 3D domains with complex boundaries. A novel tridiagonal solver for systems with variable sizes and a new hybrid approach will b ...Read More
Abstract:

Learn about new techniques to efficiently implement the Alternating Direction Implicit method on GPU for large 2D and 3D domains with complex boundaries. A novel tridiagonal solver for systems with variable sizes and a new hybrid approach will be covered in detail. Comprehensive performance analysis and key Fermi optimizations will be explored. Various applications of tridiagonal solvers such as 3D direct numerical fluid simulation and a 2D depth-of-field effect for games will be briefly discussed.

  Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10015
Streaming:
Download:
Share:
 
Abstract:
Learn how to accelerate marching cubes on the GPU by taking advantage of the GPU's high memory bandwidth and fast on-chip shared memory in a data expansion algorithm that can extract the complete iso-surface mesh from (dynamic) volume data w ...Read More
Abstract:

Learn how to accelerate marching cubes on the GPU by taking advantage of the GPU's high memory bandwidth and fast on-chip shared memory in a data expansion algorithm that can extract the complete iso-surface mesh from (dynamic) volume data without requiring any data transfers back to the CPU.

  Back
 
Topics:
Developer - Algorithms, Medical Imaging & Radiology, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2020
Download:
Share:
 
Speakers:
Allan Rasmusson, Gernot Ziegler
- University of Aarhus, NVIDIA
Abstract:
Explore a new technique in the detection of common regions in a 2D/3D data array. Connected components along the axes are linked before actual label propagation starts. ...Read More
Abstract:
Explore a new technique in the detection of common regions in a 2D/3D data array. Connected components along the axes are linked before actual label propagation starts. The algorithm is completely gather-based, which allows for several optimizations in the CUDA C implementation. It enables real-time frame rates for the analysis of typical 2D images and interactive frame rates for the analysis of typical volume data.  Back
 
Topics:
Developer - Algorithms, Computer Vision, Medical Imaging & Radiology, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2021
Streaming:
Download:
Share:
 
Speakers:
Robert Strzodka
- Max Planck Institut Informatik
Abstract:
Learn how to switch between array of structs (AoS) and struct of arrays (SoA) storage without having to change the data access syntax. ...Read More
Abstract:
Learn how to switch between array of structs (AoS) and struct of arrays (SoA) storage without having to change the data access syntax. A few changes to the struct and container definitions will enable you to evaluate the performance of AoS vs. SoA on your existing AoS code. We present a simple abstraction that retains the more intuitive AoS syntax array[index]component, yet allows you to switch between AoS and SoA storage with a single template parameter at class definition.  Back
 
Topics:
Developer - Algorithms, Tools & Libraries, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2038
Streaming:
Download:
Share:
 
Speakers:
Nachiket Gokhale
- Weidlinger Associates Inc
Abstract:
Explicit finite element codes are widely used to simulate the response of structures and mechanical equipment subjected to shock, blast and wave propagation phenomena. ...Read More
Abstract:
Explicit finite element codes are widely used to simulate the response of structures and mechanical equipment subjected to shock, blast and wave propagation phenomena. High resolution models require run times ranging from a few seconds to a few months are common and hence the payoff from GPU acceleration is tremendous. We describe the acceleration of our commercial finite element code NLFLEX using CUDA. We developed GPU kernels in CUDA based on our production code NLFLEX, for linear elasticity, explosives, elasto-plasticity and large deformation elasticity. We attained order of magnitude (10X) acceleration in single precision and approximately (5X) in double precision mode.   Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102061
Streaming:
Download:
Share:
 
Speakers:
Doris Chen
- University of Toronto
Abstract:
FPGA technology mapping is an algorithm that is heavily data parallel, but contains many features that make it unattractive for GPU implementation. ...Read More
Abstract:
FPGA technology mapping is an algorithm that is heavily data parallel, but contains many features that make it unattractive for GPU implementation. The algorithm uses data in irregular ways since it is a graph-based algorithm. It also makes heavy use of constructs like recursion which is not supported by GPU hardware. In this paper, we take a state-of-the-art FPGA technology mapping algorithm within Berkeley's ABC package and attempt to parallelize it on a GPU. We show that runtime gains of 3.1x are achievable while maintaining identical quality as demonstrated by running these netlists through Altera's Quartus II place-and-route tool.  Back
 
Topics:
Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102068
Streaming:
Download:
Share:
 
Speakers:
Mark Harris
- NVIDIA
Abstract:
Learn about the importance of optimized data-parallel algorithm primitives as building blocks for efficient real-world applications. ...Read More
Abstract:
Learn about the importance of optimized data-parallel algorithm primitives as building blocks for efficient real-world applications. Fundamental parallel algorithms like sorting, parallel reduction, and parallel scan are key components in a wide range of applications from video games to serious science. This session will cover the state of the art in data-parallel primitive algorithms for GPUs. Starting with an explanation of the purpose and applications of the algorithms, we will discuss key algorithm design principles, demonstrate current open source algorithm libraries for GPUs (CUDPP and Thrust), describe optimizations using new features in the Fermi architecture, and explore future directions.  Back
 
Topics:
Developer - Algorithms, Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102084
Streaming:
Download:
Share:
 
Speakers:
Andrew Davidson, Yao Zhang
- University of California, Davis
Abstract:
In this presentation, we will discuss and analyze the performance of three optimization techniques for tridiagonal solvers. ...Read More
Abstract:
In this presentation, we will discuss and analyze the performance of three optimization techniques for tridiagonal solvers. We first present a hybrid Parallel Cyclic Reduction(PCR)-Gaussian Elimination(GE) tridiagonal solver, which combines work-efficient and step-efficient algorithms for high performance. We further discuss an auto-tuned variant of this technique which selects the optimal switching point between algorithms on a per-machine basis. Next, we present a technique to handle large systems, where shared memory constraints prohibit previous work to solve these systems directly. Finally, we will discuss optimizations on a cyclic reduction technique that avoid bank conflicts on current hardware.  Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102085
Streaming:
Download:
Share:
 
Speakers:
Holger Dammertz
- Ulm University
Abstract:
Learn how to select the best and fastest pseudo random number generator for your massively parallel Monte Carlo simulation. ...Read More
Abstract:
Learn how to select the best and fastest pseudo random number generator for your massively parallel Monte Carlo simulation.Pseudo random numbers generators (PRNG) are a fundamental building block of these simulations and it is thus required to select suitable PRNGs with regard to the specific problem at hand while considering the parallel hardware architecture. Recent developments in random number generations provide a wide variety of choices, each with different properties and trade-offs. We provide a comprehensive survey of the current state of the art for massively parallel PRNG and show a broad range of applications.  Back
 
Topics:
Developer - Algorithms, Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102136
Streaming:
Download:
Share:
 
Speakers:
Shawn Brown
Abstract:
Learn how to adapt a kd-tree spatial data structure for efficient nearest neighbor (NN) searches on a GPU. Although the kd-tree is not a natural fit for GPU implementation, it can still be effective with the right engineering decisions. By bound ...Read More
Abstract:

Learn how to adapt a kd-tree spatial data structure for efficient nearest neighbor (NN) searches on a GPU. Although the kd-tree is not a natural fit for GPU implementation, it can still be effective with the right engineering decisions. By bounding the maximum height of the kd-tree, minimizing the memory footprint of data structures, and optimizing the GPU kernel code, multi-core GPU NN searches with tens of thousands to tens of millions of points run 10-40 times faster than the equivalent single-core CPU NN searches.

  Back
 
Topics:
Developer - Algorithms, Artificial Intelligence and Deep Learning, Databases, Data Mining, Business Intelligence
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102140
Download:
Share:
 
Speakers:
Amanda Peters
- Harvard University
Abstract:
Learn how GPUs are being used to accelerate the study of the emergence of cooperative behavior in biology, from the interactions of humans to viruses to bacteria. ...Read More
Abstract:
Learn how GPUs are being used to accelerate the study of the emergence of cooperative behavior in biology, from the interactions of humans to viruses to bacteria. The work presented here achieves a speedup of 209x on a cluster of 4 Tesla GPUs.  Back
 
Topics:
Developer - Algorithms, Life & Material Science
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102163
Streaming:
Download:
Share:
 
Speakers:
Lorena Barba
- Boston University
Abstract:
The first wave of successful GPU accelerations has been crowded with highly-parallel methods that adapted well to the hardware. ...Read More
Abstract:
The first wave of successful GPU accelerations has been crowded with highly-parallel methods that adapted well to the hardware. But the easy-pickings are now running out. The truly challenging applications require "going back to the algorithmic drawing board." To develop new versions of the most effective fast algorithms, such that our science can most benefit, an ideal environment is created by the open software model, where efforts can be shared. We will describe one area of application --electrostatics of biomolecules in solution-- where we see at work the triad of extreme computing: fast algorithms, open software, and heterogeneous computing.  Back
 
Topics:
Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102166
Streaming:
Download:
Share:
 
Speakers:
Adarsh Krishnamurthy, Sara McMains
- University of California Berkeley
Abstract:
The broad objective of our research is to develop mechanical Computer-Aided Design tools that provide interactive feedback to the designer. ...Read More
Abstract:
The broad objective of our research is to develop mechanical Computer-Aided Design tools that provide interactive feedback to the designer. We have developed GPU algorithms for fundamental CAD operations (NURBS evaluation, surface-surface intersection, separation distance computation, moment computation, etc.) that are one to two orders of magnitude faster, and often more accurate, than current commercial CPU implementations. We will touch on strategies we have employed to meet GPU programming challenges, such as the separation of CPU/GPU operations, imposing artificial structure on computations, and transforming problem definitions to suit GPU-computation models.  Back
 
Topics:
Developer - Algorithms, Tools & Libraries, Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102171
Streaming:
Download:
Share:
 
Speakers:
Pavel Emeliyanenko
- Max-Planck Institute for Informatics
Abstract:
We present the first implementation of a complete modular resultant algorithm on the graphics hardware. ...Read More
Abstract:
We present the first implementation of a complete modular resultant algorithm on the graphics hardware. Our recent developments taking advantage of new NVidia Fermi GPU architecture and instruction set allowed us to achieve about 150x speed-up over a modular resultant algorithm from Maple 13.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A02
Download:
Share:
 
Speakers:
Hartmut Ruhl
- Ludwig-Maximilians-University
Abstract:
Particle-In-Cell simulations represent an important technique in the field of kinetic plasma simulations. ...Read More
Abstract:
Particle-In-Cell simulations represent an important technique in the field of kinetic plasma simulations. 2D particle pushing and conserved current aggregation has been implemented in CUDA. On a TESLA C1060 the CUDA code is 4 times faster than SSE2 optimized code on a quad core INTEL XEON processor.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A03
Download:
Share:
 
Speakers:
Octavian Nitica
- University of Delaware
Abstract:
The Ant Colony Optimization (ACO) Algorithm is a metaheuristic that is used to find shortest paths in graphs. ...Read More
Abstract:
The Ant Colony Optimization (ACO) Algorithm is a metaheuristic that is used to find shortest paths in graphs. By using CUDA to implement an ACO algorithm, we achieved significant improvement in performance over a highly-tuned sequential CPU implementation. The construction step of the ACO algorithm consists of each ant creating an independent solution, and this step is where most of the computation is spent. Since the construction step is the same for most ACO variations, parallelizing this step will also allow for easy adaptation to different pheromone updating functions. Currently, our research tests this hypothesis on the travelling salesmen problem.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A04
Download:
Share:
 
Speakers:
Duane Merrill
- University of Virginia
Abstract:
The need to rank and order data is pervasive, and sorting operations are fundamental to many algorithms. ...Read More
Abstract:
The need to rank and order data is pervasive, and sorting operations are fundamental to many algorithms. This poster presents a very efficient method for sorting large sequences of fixed-length keys (and values) using GPU stream processors. Compared to the state-of-the-art, our implementation demonstrates multiple factors of speedup (up to 3.8x) for all NVIDIA GPGPUs. For this domain of sorting problems, we believe our sorting primitive to be the fastest available for any fully-programmable microarchitecture: our stock NVIDIA GTX480 sorting results exceed the 1G keys/sec average sorting rate (i.e., one billion 32-bit keys sorted per second).  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A05
Download:
Share:
 
Speakers:
Stanley Tzeng
- University of California, Davis
Abstract:
We explore software mechanisms for managing irregular tasks on graphics processing units. Traditional GPU programming guidelines teaches us how to efficiently program the GPU for data parallel pipelines with regular input and output. ...Read More
Abstract:
We explore software mechanisms for managing irregular tasks on graphics processing units. Traditional GPU programming guidelines teaches us how to efficiently program the GPU for data parallel pipelines with regular input and output. We present a strategy for solving task parallel pipelines which can handle irregular workloads on the GPU. We demonstrate that dynamic scheduling and efficient memory management are critical problems in achieving high efficiency on irregular workloads. We showcase our results on a real time Reyes rendering pipeline.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A06
Download:
Share:
 
Speakers:
Yao Zhang
- University of California, Davis
Abstract:
Tridiagonal linear systems are of importance to many problems in numerical analysis and computational fluid dynamics, as well as to computer graphics applications in video games and computer-animated films. ...Read More
Abstract:
Tridiagonal linear systems are of importance to many problems in numerical analysis and computational fluid dynamics, as well as to computer graphics applications in video games and computer-animated films. This poster presents our study on the performance of multiple tridiagonal algorithms on a GPU. We design a novel hybrid algorithm that combines a work-efficient algorithm with a step-efficient algorithm in a way well-suited for a GPU architecture. Our hybrid solver achieves 8x and 2x speedup respectively in single precision and double precision over a multi-threaded highly-optimized CPU solver and a 2x speedup over a basic GPU solver.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A07
Download:
Share:
 
Speakers:
Hans Henrik B. Soerensen
- Technical University of Denmark
Abstract:
A GPU competence center and laboratory for research and collaboration within academia and partners in industry has been established in 2008 at section for Scientific Computing, DTU informatics, Technical University of Denmark. ...Read More
Abstract:
A GPU competence center and laboratory for research and collaboration within academia and partners in industry has been established in 2008 at section for Scientific Computing, DTU informatics, Technical University of Denmark. In GPULab we focus on the utilization of GPUs for high-performance computing applications and software tools in science and engineering, inverse problems, visualization, imaging, dynamic optimization. This poster illustrates the latest and most interesting projects that have been developed at our center.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A08
Download:
Share:
 
Speakers:
David Whittaker
- University of Alabama at Birmingham
Abstract:
This poster describes a new technique for performing binary prefix sums using Fermi''s new __ballot() and __popc() functions. ...Read More
Abstract:
This poster describes a new technique for performing binary prefix sums using Fermi''s new __ballot() and __popc() functions. These instructions greatly increase intra-warp communication, allowing for an 80% speedup over standard GPU methods in applications like Radix Sort. It also points to future research that will enable suffix array construction, Burrows-Wheeler Transform, and the BZIP algorithm to take advantage of these instructions for efficient GPU compression.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A09
Download:
Share:
 
Speakers:
James Malcolm
- Accelereyes
Abstract:
This poster presents an approach to derive parallelism in algorithms that involve building sparse matrix that represents relationships between inter-dependent data fields and enhancing its performance on the GPU. ...Read More
Abstract:
This poster presents an approach to derive parallelism in algorithms that involve building sparse matrix that represents relationships between inter-dependent data fields and enhancing its performance on the GPU. This work compares the algorithm performance on the GPU to its CPU variant that employs the traditional sparse matrix-vector multiplication (SpMV) approach. We have also compared our algorithm performance with CUSP SpMV on GPU. The softwares used in this work are MATLAB and Jacket - GPU engine for MATLAB  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A10
Download:
Share:
 
Speakers:
Wen Zheng
- Stanford University
Abstract:
The particle level set is widely used as an accurate interface tracking tool in simulation, computer vision and other related fields. ...Read More
Abstract:
The particle level set is widely used as an accurate interface tracking tool in simulation, computer vision and other related fields. However, high computation cost prevents applying this method to real-time and interactive scenarios. This work intensively used parallel design patterns that are implemented in the thrust library, like compaction, reduction and scattering, to parallelize the particle level set method in order to attain real-time performance.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A11
Download:
Share:
 
Speakers:
Sungpack Hong
- Stanford University
Abstract:
Graphs are powerful data representations favored in many computational domains. GPUs have showed promising results in this domain, but their performance when the graph is highly irregular. ...Read More
Abstract:
Graphs are powerful data representations favored in many computational domains. GPUs have showed promising results in this domain, but their performance when the graph is highly irregular. In this study, we propose three general schemes to accelerate graph algorithms on a modern GPU architecture: (i) deferred processing of outliers, (ii) efficient dynamic workload balancing and (iii) warp-based execution exploiting threads in a SIMD-like manner. Our evaluation reveals that our schemes exhibit up to 9x speedup over previous GPU algorithms and 23x over single CPU execution on irregular graphs.They also yield up to 30% improvement,even for regular graphs  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A12
Download:
Share:
 
Speakers:
Daniel Faircloth
- Georgia Tech Research Institute
Abstract:
The Method of Moments is a popular computational method for solving integral equations in electromagnetics. ...Read More
Abstract:
The Method of Moments is a popular computational method for solving integral equations in electromagnetics. However, it suffers from high computational and memory costs since it requires the solution of a dense linear system. The Adaptive Cross Approximation (ACA) is an effective technique for compressing the system matrix thereby reducing the necessary storage as well as the number of operations required to solve the system. Acceleration of the ACA MoM with NVIDIA GPUs can finally enable the solution of "real world" scattering problems on a personal workstation in a practical timeframe.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A13
Download:
Share:
 
Speakers:
Zhaosong Ma
- Institute of Mechanics, Chinese Academy of Sciences
Abstract:
The Continuum-based Distinct Element Method (CDEM) is the combination of Finite Element Method (FEM) and Discrete Element Method (DEM), which is mainly used in general structural analyses, as well as landslide stability evaluations, ...Read More
Abstract:
The Continuum-based Distinct Element Method (CDEM) is the combination of Finite Element Method (FEM) and Discrete Element Method (DEM), which is mainly used in general structural analyses, as well as landslide stability evaluations, coal and gas outburst analyses. By means of CUDA and a GTX-285 VGA card, the GPU version achieves hundreds times speedup ratio.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A14
Download:
Share:
 
Speakers:
Adarsh Krishnamurthy
- University of California, Berkeley
Abstract:
We present GPU algorithms and strategies for accelerating distance queries and clearance computations on models made of trimmed NURBS surfaces. ...Read More
Abstract:
We present GPU algorithms and strategies for accelerating distance queries and clearance computations on models made of trimmed NURBS surfaces. We provide a generalized framework for using GPUs as co-processors in accelerating CAD operations. The accuracy of our algorithm is based on the model space precision, unlike earlier graphics algorithms that were based only on image space precision. Our algorithms are at least an order of magnitude faster and about two orders of magnitude more accurate than the commercial solid modeling kernel ACIS.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A15
Download:
Share:
 
Speakers:
Debapriya Chatterjee
- University of Michigan
Abstract:
This poster describes my research work on how to leverage the GP-GPU execution parallelism to achieve high performance in the time consuming problem of gate-level simulation of digital hardware designs.
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A16
Download:
Share:
 
Speakers:
Vincent Natoli
- Stone Ridge Technology
Abstract:
Impressive speedups up to 100x using GPUs compared to CPUs are achieved by taking advantage data parallelism, increased bandwidth and the ability to hide latency. ...Read More
Abstract:
Impressive speedups up to 100x using GPUs compared to CPUs are achieved by taking advantage data parallelism, increased bandwidth and the ability to hide latency. We have implemented a Monte Carlo valuation of a barrier option modeled by a standard diffusion process with a jump diffusion term obeying an underlying Poisson process to account for rare events. In addition, a Brownian Bridge is incorporated to account for barrier crossings in between diffusion trajectories and to reduce bias. This option is representative of exotic options which lack a closed-form solution and are amenable to Monte Carlo type methods for valuation.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A17
Download:
Share:
Emerging Companies Summit
Presentation
Media
Speakers:
Jensen Huang
Abstract:
Jensen Huang was joined in a fireside chat by Quentin Hardy, National Editor at Forbes Magazine. They discussed the rise of GPUs, current trends in visual and parallel computing, and the transformational changes ahead for the industry. ...Read More
Abstract:

Jensen Huang was joined in a fireside chat by Quentin Hardy, National Editor at Forbes Magazine. They discussed the rise of GPUs, current trends in visual and parallel computing, and the transformational changes ahead for the industry.

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S104006
Download:
Share:
 
Speakers:
Jeff Herbst
Abstract:
The Emerging Companies Summit is a unique forum for startup companies to showcase innovative applications that leverage the GPU to solve visual and compute-intensive problems. The Opening Address includes an overview of NVIDIA''s GPU eco ...Read More
Abstract:

The Emerging Companies Summit is a unique forum for startup companies to showcase innovative applications that leverage the GPU to solve visual and compute-intensive problems. The Opening Address includes an overview of NVIDIA''s GPU ecosystem development activities and an interaction on stage with selected companies building groundbreaking applications on top of the GPU platform. The ECS is a great opportunity to discover new players in the GPU ecosystem, find great investments, explore partnership opportunities, network/ build relationships, and discuss the future of an industry that is reshaping computing.

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
ECS0900
Download:
Share:
 
Speakers:
Sam Blackman
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Elemental Technologies - covering the field of video processing. Find this session at 5 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001A
Download:
Share:
 
Speakers:
Rob Balgley
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001B
Download:
Share:
 
Speakers:
Chris Doran
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001C
Download:
Share:
 
Speakers:
Chris Blewitt
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features miGenius - covering the field of cloud computing. Find this session at 5 minutes into the video.Panelists for this session include Dan'l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002A
Download:
Share:
 
Speakers:
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002B
Download:
Share:
 
Speakers:
Philip Lunn
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002C
Download:
Share:
 
Speakers:
Jon Peddie
Abstract:
Moderated by Jon Peddie (President, Jon Peddie Research) The GPU (graphics processing unit) runs advanced applications which are transforming existing industries and creating new ones. Join our panel of leading industry experts as they discuss t ...Read More
Abstract:

Moderated by Jon Peddie (President, Jon Peddie Research) The GPU (graphics processing unit) runs advanced applications which are transforming existing industries and creating new ones. Join our panel of leading industry experts as they discuss the latest technology advances in the usage of GPU for Computer Vision, they will cover facial, gesture, human motion, and biometrics recognition, augmented reality, robotic computing and more. Panelists: Joe Stam (Sr. Applications Engineer, NVIDIA) Yoram Yaacovi (CTO & General Manager, Technologies at Microsoft Israel, R&D Center) Sam Cox (CEO, Milabra) Janko Mrsic-Flogel (CTO, Mirriad) Tom Dean (Research Scientist, Google)

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S104003
Download:
Share:
 
Speakers:
Michael Hummel
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features empulse GmbH - covering the field of databases & data mining. Find this session at 5 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004A
Download:
Share:
 
Speakers:
Natan Peterfreund
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Playcast Media Systems - covering the field of video processing. Find this session at 20 minutes into the video.Panelists will include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004B
Download:
Share:
 
Speakers:
Austin Shoemaker
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Cooliris - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004C
Download:
Share:
 
Speakers:
Michel Tombroff
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Softkinetic - covering the field of computer vision. Find this session at 5 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005A
Download:
Share:
 
Speakers:
Uri Tal
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Rocketick - covering the field of high performance computing. Find this session at 20 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005B
Download:
Share:
 
Speakers:
Kristian Raue
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Jedox AG - covering the field of databases & data mining. Find this session at 35 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005C
Download:
Share:
 
Speakers:
Andrew Jamison
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Scalable Display Technologies - covering the field of imaging. Find this session at 5 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007A
Download:
Share:
 
Speakers:
Jeroen Snepvangers
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features RTT - covering the field of computer graphics. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007B
Download:
Share:
 
Speakers:
Michael Zeitlin
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007C
Download:
Share:
 
Speakers:
Jules Urbach
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features OTOY - covering the field of cloud computing. Find this session at 5 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008A
Download:
Share:
 
Speakers:
David Peters
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Universal Robotics - covering the field of machine learning & artificial intelligence. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008B
Download:
Share:
 
Speakers:
David Hayes
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features ICD - covering the field of mobile devices. Find this session at 35 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008C
Download:
Share:
 
Speakers:
Yoram Burg
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features OptiTex - covering the field of physics simulation. Find this session at 5 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010A
Download:
Share:
 
Speakers:
Sylvain Ordureau
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features RTT - covering the field of computer graphics. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010B
Download:
Share:
 
Speakers:
Torsten Reil
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features NaturalMotion Limited - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010C
Download:
Share:
 
Speakers:
Jeff Han
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Perceptive Pixel - covering the field of imaging. Find this session at 5 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4011A
Download:
Share:
 
Speakers:
Lance Maurer
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Cinnafilm - covering the field of film. Find this session at 20 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4011B
Download:
Share:
Finance
Presentation
Media
Speakers:
Pierre Spatz
- Murex SAS
Abstract:
Murex will share its practical experience using GPUs to accelerate high-performance analytics based on GPU-enabled Monte Carlo and PDE methods. ...Read More
Abstract:
Murex will share its practical experience using GPUs to accelerate high-performance analytics based on GPU-enabled Monte Carlo and PDE methods. We will also briefly describe Murex's experience developing a high-level payoff scripting language that allows user-definable payoffs for single and cross-asset instruments.  Back
 
Topics:
Finance, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2032
Streaming:
Download:
Share:
 
Speakers:
Scott Donovan
- Citadel Investment Group
Abstract:
Join Citadel to explore our three year undertaking on the feasibility of GPGPU computing for option pricing. ...Read More
Abstract:
Join Citadel to explore our three year undertaking on the feasibility of GPGPU computing for option pricing. We will discuss our 140X performance boost and the hurdles we had to overcome to integrate GPUs into our existing infrastructure. Please note that our talk will not get into the details of the model (that's proprietary information), but we will share our innovative solution to drive a grid of virtual GPUs.  Back
 
Topics:
Finance, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2033
Streaming:
Download:
Share:
 
Speakers:
Peter Decrem
Abstract:
Learn how to compute traditional end of day computations in real time through the use of a hybrid GPU/CPU computing environment. We will detail how computing intensive tasks are delegated to the GPU while interface issues are dealt with by the C ...Read More
Abstract:

Learn how to compute traditional end of day computations in real time through the use of a hybrid GPU/CPU computing environment. We will detail how computing intensive tasks are delegated to the GPU while interface issues are dealt with by the CPU. We will discuss our methodology consisting of the following three components: (1) valuations; (2) by tenor risk measures; and (3) full distributions allowing for more complex analytics such as exotic options products valuation and counterparty value adjustments calculation.

  Back
 
Topics:
Finance, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10040
Streaming:
Download:
Share:
 
Speakers:
Ian Reid
- NAG
Abstract:
Last year NAG presented spectacular results for Monte Carlo techniques on GPUs using NAG's GPU library. ...Read More
Abstract:
Last year NAG presented spectacular results for Monte Carlo techniques on GPUs using NAG's GPU library. This year we will talk about new projects in the areas of Monte Carlo and PDE techniques, delivering additional benefits to the finance industry for real-world problems, including credit modeling.  Back
 
Topics:
Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102063
Download:
Share:
 
Speakers:
Thomas Bradley
- NVIDIA
Abstract:
Learn how the GPU can be deployed to generated correlated paths for Monte Carlo simulation. Using Asian Basket options as an example, ...Read More
Abstract:
Learn how the GPU can be deployed to generated correlated paths for Monte Carlo simulation. Using Asian Basket options as an example, the session shows the generation of correlated paths with a local volatility model for each of the underlying assets. Once the paths have been computed, the payoff in each scenario is computed and reduced to determine the expected value, all on the GPU.  Back
 
Topics:
Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102064
Streaming:
Download:
Share:
 
Speakers:
Philippe Stephan
- RMS
Abstract:
RMS will describe our experience leveraging GPUs and simple software architectural principles to deliver both spectacular performance gains and enhanced flexibility in next generation portfolio risk management applications.
 
Topics:
Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102077
Streaming:
Download:
Share:
 
Speakers:
Jike Chong, Matthew Dixon
- UC Davis, Parasians, LLC
Abstract:
Learn how financial market risk managers can increase their ability to preempt exposure limit breaching and tighten risk control to increase investor confidence. ...Read More
Abstract:
Learn how financial market risk managers can increase their ability to preempt exposure limit breaching and tighten risk control to increase investor confidence. Gain insight into the techniques for obtaining high performance Monte-Carlo based market value-at-risk (VaR) estimates over a hierarchy of risk aggregation levels. This session will focus on how the new Fermi platform can be used by financial institutions to enable on-demand estimates of the market VaR, and discuss important software architecture decisions, the benefits of the new GigaThread Engine and Parallel DataCache, as well as the guiding principles for constructing efficient algorithms on GPUs.  Back
 
Topics:
Finance, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102098
Streaming:
Download:
Share:
 
Speakers:
Lokman A.
- Paris-Est University
Abstract:
This presentation focuses on the challenging problem of Pricing High-Dimensional American Options (PHAO) and how GPUs can be involved in this task. ...Read More
Abstract:
This presentation focuses on the challenging problem of Pricing High-Dimensional American Options (PHAO) and how GPUs can be involved in this task. On the one hand, we present a method based on Malliavin calculus which is effective for parallel architecture. On the other hand, we compare this method with Longstaff & Schwartz method which is more dedicated to sequential architecture. We will conclude with some ideas about the parallelization of the former method on a cluster of machines and finally we will discuss this method considering it as a reformulation of a non-linear parabolic problem using BSDEs.  Back
 
Topics:
Finance, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102101
Streaming:
Download:
Share:
GPU Accelerated Internet
Presentation
Media
Speakers:
Chris Pedersen
Abstract:
Join us for a whirlwind demo-punctuated tour of up-and-coming technologies that promise to bring GPU acceleration to the Worldwide Web. We''ll cover 2D graphics, 3D graphics and video. In addition to summarizing the emerging standards an ...Read More
Abstract:

Join us for a whirlwind demo-punctuated tour of up-and-coming technologies that promise to bring GPU acceleration to the Worldwide Web. We''ll cover 2D graphics, 3D graphics and video. In addition to summarizing the emerging standards and technologies, performance test results showing how they scale on various GPUs will be presented, along with recommendations for how to design for best performance. Finally, adoption trends and ecosystem dynamics will be summarized. Attendees should leave with a richer understanding of the possibilities enabled by the GPU-Accelerated Web, and new insights into when and how it will matter.

  Back
 
Topics:
GPU Accelerated Internet, Stereoscopic 3D, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10019
Streaming:
Download:
Share:
 
Speakers:
Lee Thomason
Abstract:
Explore the Flash rendering architecture including the challenges of mapping from an animated software vector rendering model to a GPU. We will also discuss how the landscape of mobile, desktop, devices, drivers, and APIs impacts the design and ...Read More
Abstract:

Explore the Flash rendering architecture including the challenges of mapping from an animated software vector rendering model to a GPU. We will also discuss how the landscape of mobile, desktop, devices, drivers, and APIs impacts the design and deployment of a GPU based Flash Player.

  Back
 
Topics:
GPU Accelerated Internet
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10060
Download:
Share:
 
Speakers:
Vladimir Vukicevic
- Mozilla Corporation
Abstract:
WebGL is a newly-emerging standard for 3D graphics and visual computing on the web. Supported and developed by major web browser vendors, WebGL enables rich interactive 3D graphics delivered through a web browser, ...Read More
Abstract:
WebGL is a newly-emerging standard for 3D graphics and visual computing on the web. Supported and developed by major web browser vendors, WebGL enables rich interactive 3D graphics delivered through a web browser, on both desktop and mobile platforms. This session will contain an introduction to WebGL, and will focus application development issues unique to the web platform, optimization concerns, and how web technologies such as offline app support, HTML5 video and audio, File and WebSockets integrate with WebGL. Experienced OpenGL developers will learn how to transition their knowledge to WebGL development.  Back
 
Topics:
GPU Accelerated Internet, Tools & Libraries, Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102113
Streaming:
Download:
Share:
 
Speakers:
Jason Weber
- Microsoft
Abstract:
Internet Explorer 9 is bringing the power of modern GPUs to Web. Thanks to hardware accelerated graphics, the websites that you use every day become faster and developers can create new classes of web applications which were previously not possible. ...Read More
Abstract:
Internet Explorer 9 is bringing the power of modern GPUs to Web. Thanks to hardware accelerated graphics, the websites that you use every day become faster and developers can create new classes of web applications which were previously not possible. This session will provide an inside look into how Internet Explorer was redesigned to leverage the GPU. We'll show detailed performance results, discuss our architectural approach, and look at the impact of the GPU on HTML5. A session by engineers for engineers with lots of fun demos.  Back
 
Topics:
GPU Accelerated Internet
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102274
Streaming:
Download:
Share:
General Interest
Presentation
Media
Speakers:
Jensen Huang
Abstract:
The opening keynote, features Jensen Huang, CEO and Co-Founder of NVIDIA and special guests. Hear about what''s next in computing and graphics, and preview disruptive technologies and exciting demonstrations from across industries. ...Read More
Abstract:

The opening keynote, features Jensen Huang, CEO and Co-Founder of NVIDIA and special guests. Hear about what''s next in computing and graphics, and preview disruptive technologies and exciting demonstrations from across industries.

  Back
 
Topics:
General Interest
Type:
Keynote
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S091001
Download:
Share:
 
Speakers:
Klaus Schluten
Abstract:
How does the H1N1 "Swine Flu" virus avoid drugs while attacking our cells? What can we learn about solar energy by studying biological photosynthesis? How do our cells read the genetic code? What comes next in computational biology? Co ...Read More
Abstract:

How does the H1N1 "Swine Flu" virus avoid drugs while attacking our cells? What can we learn about solar energy by studying biological photosynthesis? How do our cells read the genetic code? What comes next in computational biology? Computational biology is approaching a new and exciting frontier: the ability to simulate structures and processes in living cells. Come learn about the "computational microscope," a new research instrument that scientists can use to simulate biomolecules at nearly infinite resolution. The computational microscope complements the most advanced physical microscopes to guide today's biomedical research. In this keynote address, computational biology pioneer Dr. Klaus Schulten of the University of Illinois, Urbana-Champaign, will introduce the computational microscope, showcase the widely used software underlying it, and highlight major discoveries made with the aid of the computational microscope ranging from viewing protein folding, translating the genetic code in cells, and harvesting solar energy in photosynthesis. He will also look towards a future when cell tomography and computing will establish atom-by-atom views of entire life forms.

  Back
 
Topics:
General Interest, Life & Material Science
Type:
Keynote
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10002
Download:
Share:
 
Speakers:
Sebastien Thrun
Abstract:
What really causes accidents and congestion on our roadways? How close are we to fully autonomous cars? In his keynote address, Stanford Professor and Google Distinguished Engineer, Dr. Sebastian Thrun, will show how his two autonomous vehicles, ...Read More
Abstract:

What really causes accidents and congestion on our roadways? How close are we to fully autonomous cars? In his keynote address, Stanford Professor and Google Distinguished Engineer, Dr. Sebastian Thrun, will show how his two autonomous vehicles, Stanley (DARPA Grand Challenge winner), and Junior (2nd Place in the DARPA Urban Challenge) demonstrate how close yet how far away we are to fully autonomous cars. Using computer vision combined with lasers, radars, GPS sensors, gyros, accelerometers, and wheel velocity, the vehicle control systems are able to perceive and plan the routes to safely navigate Stanley and Junior through the courses. However, these closed courses are a far cry from everyday driving. Find out what the team will do next to get one step closer to the "holy grail" of computer vision, and a huge leap forward toward the concept of fully autonomous vehicles.

  Back
 
Topics:
General Interest, Artificial Intelligence and Deep Learning, Computer Vision
Type:
Keynote
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10003
Download:
Share:
 
Speakers:
Will Ramey
Abstract:
Get a head start on the conference with this first-day introduction to key technologies for GPU Computing. This 90-minute tutorial session will cover the key features and differences between the major programming languages, APIs and development ...Read More
Abstract:

Get a head start on the conference with this first-day introduction to key technologies for GPU Computing. This 90-minute tutorial session will cover the key features and differences between the major programming languages, APIs and development tools available today. Attendees will also learn several high level design patterns for consumer, professional and HPC applications, with practical programming considerations for each.

  Back
 
Topics:
General Interest, Programming Languages, Tools & Libraries
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09412
Streaming:
Download:
Share:
 
Abstract:
Starting with a background in C or C++, learn everything you need to know in order to start programming in CUDA C. Beginning with a "Hello, World" CUDA C program, explore parallel programming with CUDA through a number of hands-on code ...Read More
Abstract:

Starting with a background in C or C++, learn everything you need to know in order to start programming in CUDA C. Beginning with a "Hello, World" CUDA C program, explore parallel programming with CUDA through a number of hands-on code examples. Examine more deeply the various APIs available to CUDA applications and learn the best (and worst) ways in which to employ them in applications. Master the first half of the book "CUDA by Example" as taught by the author, pointing you on a trajectory to complete the second half on your own after course completion.

  Back
 
Topics:
General Interest, Programming Languages
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102131
Streaming:
Download:
Share:
 
Abstract:
OpenCL is Khronos' new open standard for parallel programming of heterogeneous systems. This tutorial session will introduce the main concepts behind the standard and illustrate them with some simple code walkthrough. Attendees will also lea ...Read More
Abstract:

OpenCL is Khronos' new open standard for parallel programming of heterogeneous systems. This tutorial session will introduce the main concepts behind the standard and illustrate them with some simple code walkthrough. Attendees will also learn how to make efficient use of the API to achieve good performance on the GPU.

  Back
 
Topics:
General Interest, Tools & Libraries
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09409
Streaming:
Download:
Share:
 
Speakers:
Cem Cebenoyan
Abstract:
This presentation gives an overview of the DirectX 11 pipeline and how it extends previous DirectX versions to enable stunning visual effects in real-time graphics applications.
 
Topics:
General Interest, Graphics and AI, General Interest
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09408
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Learn how to to use the DirectCompute API to solve GPU computing problems. This tutorial will introduce the DirectCompute API, cover the recommended best practices for GPU programming, and go over examples of how to use this API efficiently and ...Read More
Abstract:

Learn how to to use the DirectCompute API to solve GPU computing problems. This tutorial will introduce the DirectCompute API, cover the recommended best practices for GPU programming, and go over examples of how to use this API efficiently and effectively to solve compute-intensive problems.

  Back
 
Topics:
General Interest, Programming Languages
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09411
Streaming:
Download:
Share:
 
Speakers:
Mark Kilgard
Abstract:
This session will discuss the latest OpenGL features offered by NVIDIA for both Quadro and Geforce line of products. Learn more about OpenGL 4 as well as NVIDIA specific OpenGL extensions.
 
Topics:
General Interest, Programming Languages
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102127
Streaming:
Download:
Share:
 
Speakers:
Kumar Iyer
Abstract:
NVIDIA Parallel Nsight provides access to the power of the GPU from within the familiar environment of Microsoft Visual Studio. In this session, you will learn how to use Parallel Nsight to develop GPU computing and graphics applications. Learn ...Read More
Abstract:

NVIDIA Parallel Nsight provides access to the power of the GPU from within the familiar environment of Microsoft Visual Studio. In this session, you will learn how to use Parallel Nsight to develop GPU computing and graphics applications. Learn how to use the powerful Parallel Nsight debugger to identify errors in CUDA C/C++ kernels and HLSL shaders using GPU breakpoints and direct memory and variable inspection. See how Parallel Nsight displays system-wide performance characteristics, allowing you to create efficient GPU algorithms.

  Back
 
Topics:
General Interest, Tools & Libraries
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102245
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Come learn of the software engines NVIDIA freely provides to application developers to rapidly leverage new GPU capabilities and dramatically reduce the time it takes to bring compelling features to end users.
 
Topics:
General Interest, Programming Languages, Computer Vision, Rendering & Ray Tracing
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2024
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Learn how to use NVIDIA OptiX to quickly develop high performance ray tracing applications for interactive rendering, offline rendering, or scientific visualization. This session will explore the latest available OptiX version.
 
Topics:
General Interest, Graphics and AI, HPC and AI, Rendering & Ray Tracing
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102261
Streaming:
Download:
Share:
 
Abstract:
This tutorial session teaches attendees how to program the NVIDIA Quadro Digital Video Pipeline with OpenGL. It will go in-depth into the techniques and recommended practices.
 
Topics:
General Interest, Programming Languages, Graphics and AI, Video & Image Processing
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102158
Streaming:
Download:
Share:
 
Abstract:
Learn how to program the NVIDIA Quadro Digital Video pipeline using Direct3D. This session will provide an overview of the SDK, discuss device control, data transfers, performance measuring and tuning, ancillary data and application design consi ...Read More
Abstract:

Learn how to program the NVIDIA Quadro Digital Video pipeline using Direct3D. This session will provide an overview of the SDK, discuss device control, data transfers, performance measuring and tuning, ancillary data and application design considerations.

  Back
 
Topics:
General Interest, Programming Languages, Graphics and AI, Video & Image Processing
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102159
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Let''s dive into the 3rd dimension. This talk presents a comprehensive technical overview of NVIDIA's stereo technology and tools. After a complete introduction to NVIDIA's stereo technology, we will then explore in more detail p ...Read More
Abstract:

Let''s dive into the 3rd dimension. This talk presents a comprehensive technical overview of NVIDIA's stereo technology and tools. After a complete introduction to NVIDIA's stereo technology, we will then explore in more detail production techniques for the new artistic space of effects and creativity offered by 3D stereo. The take away of this session will be a solid understanding of NVIDIA's stereo technology and how to take best advantage of it.

  Back
 
Topics:
General Interest, Programming Languages, Stereoscopic 3D
Type:
Tutorial
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2010
Streaming:
Download:
Share:
 
Speakers:
Yangdong Deng
- Tsinghua University
Abstract:
Modern Internet routers must meet two conflicting objectives, high performance and good programmability, to satisfy the ever-increasing bandwidth requirements under fast changing network protocols. ...Read More
Abstract:
Modern Internet routers must meet two conflicting objectives, high performance and good programmability, to satisfy the ever-increasing bandwidth requirements under fast changing network protocols. A few recent works prove that GPUs have great potential to serve as the packet processing engine for software routers. However, current GPU's batched execution model cannot guarantee quality-of-service (QoS) requirement. In this work, we show how to convert a GPU into an effective packet processor through minimal changes in both hardware architecture and scheduling mechanism. Experimental results proved that the new GPU architecture could meet stringent QoS requirements, but maintain a high processing throughput.  Back
 
Topics:
General Interest, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102081
Streaming:
Download:
Share:
 
Speakers:
Joseph Rios
- NASA
Abstract:
Learn about twenty-four hour, fast-time simulations of traffic in the National Airspace System, which use GPU technology to help perform key steps in the trajectory prediction of flights. ...Read More
Abstract:
Learn about twenty-four hour, fast-time simulations of traffic in the National Airspace System, which use GPU technology to help perform key steps in the trajectory prediction of flights. GPUs enabled us to improve the runtime by up to two orders of magnitude versus the previously required tens of minutes per execution. We will present a brief overview of the problem domain and a description of how the GPU has opened doors to uncharted research areas.  Back
 
Topics:
General Interest, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102214
Streaming:
Download:
Share:
 
Speakers:
Hanspeter Pfister, Jeffrey Vetter
- Harvard University, Georgia Tech and ORNL
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. ...Read More
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. Researchers from these top institutions will survey cutting-edge research that is advancing the state of the art in GPU computing and dozens of application fields across science and engineering. In this session we will hear from Professor Hanspeter Pfister of Harvard University and Professor Jeff Vetter of Georgia Tech and Oak Ridge National Laboratory.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102262
Streaming:
Download:
Share:
 
Speakers:
Amitabh Varshney, Stan Tomov, Wei Ge
- University of Tennessee, University of Maryland, Institute of Process Engineering, Chinese Academy of Sciences
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. ...Read More
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. Researchers from these top institutions will survey cutting-edge research that is advancing the state of the art in GPU computing and dozens of application fields across science and engineering. In this session we will hear from Dr. Wei Ge at the Chinese Academy of Science, Professor Amitabh Varshney at the University of Maryland, and Adjunct Assistant Professor Stan Tomov at the University of Tennessee - Knoxville.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102263
Streaming:
Download:
Share:
 
Speakers:
Charles Hansen, Wen-mei Hwu, Yangdong Deng
- Tsinghua University, University of Utah, University of Illinois
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. ...Read More
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. Researchers from these top institutions will survey cutting-edge research that is advancing the state of the art in GPU computing and dozens of application fields across science and engineering. In this session we will hear from Dr. Wen-mei Hwu at the University of Illinois at Urbana - Champaign, Professor Yangdong Deng at Tsinghua University and Dr. Charles D. Hansen at the University of Utah.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102264
Streaming:
Download:
Share:
 
Speakers:
Paul Calleja, Satoshi Matsuoka, Ting-Wai Chiu
- University of Cambridge, National Taiwan University, Tokyo Institute of Technology
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. ...Read More
Abstract:
Come hear about the groundbreaking research taking place at the CUDA Centers of Excellence, an elite group of world-renown research universities that are pushing the frontier of massively parallel computing using CUDA. Researchers from these top institutions will survey cutting-edge research that is advancing the state of the art in GPU computing and dozens of application fields across science and engineering. In this session we will hear from Professor Ting-wai Chiu at National Taiwan University, Dr. Satoshi Matsuoka at Tokyo Tech and Dr. Paul Calleja at the University of Cambridge.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102265
Streaming:
Download:
Share:
 
Speakers:
Gallagher Pryor
- AccelerEyes
Abstract:
Discover and leverage parallelism inherent in pre-existing codes. Often times, parallelism is hidden in seemingly serial programs. ...Read More
Abstract:
Discover and leverage parallelism inherent in pre-existing codes. Often times, parallelism is hidden in seemingly serial programs. This is due obfuscation via indexing or looping wherein the parallelism is seemingly non-existent. Several real-world examples of seemingly serial code demonstrate simple, yet surprisingly effective rules for detecting potential parallelism. For each example, learn how to express the code at a higher, more concise level in M by vectorizing computations. We give several canned techniques of vectorization for many common, and sometimes very difficult, use cases. Learn how such vectorization concisely brings the parallelism of code to the forefront and transforms programs that might have been originally difficult to run on a SIMT device very suitable for execution on the GPU. GPU speedups will be shown utilizing Jacket.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102268
Streaming:
Download:
Share:
 
Speakers:
Ian Buck
- NVIDIA
Abstract:
Learn how the GPU evolved from its humble beginning as a "VGA Accelerator" to become a massively parallel general purpose accelerator for heterogeneous computing systems. ...Read More
Abstract:
Learn how the GPU evolved from its humble beginning as a "VGA Accelerator" to become a massively parallel general purpose accelerator for heterogeneous computing systems. This talk will focus on significant milestones in GPU hardware architecture and software programming models, covering several key concepts that demonstrate why advances in GPU parallel processing performance and power efficiency will continue to outpace CPUs.  Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102275
Streaming:
Download:
Share:
 
Speakers:
Mark Govett
- NOAA Earth System Research Laboratory
Abstract:
We are using GPUs to run a new weather model being developed at NOAA's Earth System Research Laboratory (ESRL) called the Non-hydrostatic Icosahedral Model (NIM). ...Read More
Abstract:
We are using GPUs to run a new weather model being developed at NOAA's Earth System Research Laboratory (ESRL) called the Non-hydrostatic Icosahedral Model (NIM). NIM is slated to run at high resolution (4km global scale) within two years. This presentation will highlight work required to parallelize and run the NIM. We will describe progress running on multiple GPUs, report on our evaluation of two FORTRAN GPU compilers, and give performance updates of NIM using Fermi. We will also discuss special challenges developing and running operational weather models on GPUs.  Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
SC1024
Streaming:
Download:
Share:
 
Speakers:
Debapriya Chatterjee
- University of Michigan
Abstract:
Logic simulation is a critical component of the digital design tool flow. It is used from high-level descriptions down to gate-level to validate several aspects of the design, particularly functional correctness. ...Read More
Abstract:
Logic simulation is a critical component of the digital design tool flow. It is used from high-level descriptions down to gate-level to validate several aspects of the design, particularly functional correctness. Despite development houses investing vast resources in the simulation task it is still far from achieving the performance demands of validating complex modern designs at gate-level. We developed a GP-GPU accelerated gate-level simulator using NVIDIA CUDA. We leverage novel algorithms for circuit netlist partitioning and found that our experimental prototype could handle large, industrial scale designs comprised of millions of gates while delivering 13x speedup on average over a typical commercial simulator.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A16
Streaming:
Download:
Share:
 
Speakers:
Jimmy Chung
- Samsung Semiconductor Inc.
Abstract:
High-end graphics memory has been an essential ingredient in designing PC cards for many years, just as mobile DRAM has been a part of virtually all mobile devices since they were first developed. ...Read More
Abstract:
High-end graphics memory has been an essential ingredient in designing PC cards for many years, just as mobile DRAM has been a part of virtually all mobile devices since they were first developed. In the face of increasing upward pressures on power consumption, Green GDDR5 and Low Power mobile DDR2 (or LPDDR2) provide outstanding performance at exceptionally low power levels, for a greater return on investment in designing desktop and mobile devices, respectively. This Samsung presentation will provide an overview of Green GDDR5's and Green LPDDR2's power savings compared to other much less energy efficient alternatives. The presenter also will take a close look at how GDDR5 and LPDDR2 work to improve performance and extend battery life, while helping to substantially reduce electricity usage worldwide.   Back
 
Topics:
General Interest
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102309
Download:
Share:
Graphics and AI
Presentation
Media
Speakers:
Abstract:
Hear about the ongoing revolution in the production of photorealistic imagery being powered by GPUs. We will explore the algorithms and concepts behind iray - a CUDA accelerated software library from mental images/NVIDIA that provides an interac ...Read More
Abstract:

Hear about the ongoing revolution in the production of photorealistic imagery being powered by GPUs. We will explore the algorithms and concepts behind iray - a CUDA accelerated software library from mental images/NVIDIA that provides an interactive, push-button, fast synthetic digital camera in software to a variety of OEM applications and platforms. We will demonstrate iray embedded in commercial CAD and Digital Content Creation applications as well as in 3D cloud computing platforms.

  Back
 
Topics:
Graphics and AI, Cloud Visualization, Rendering & Ray Tracing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10013
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to cr ...Read More
Abstract:

Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to create compelling 3D video content is challenging. This session is intended to provide a pragmatic guide to creating prosumer 3D video content and how the GPU greatly assists and speeds up this process. The intended audience is anyone interested in how to create compelling 3D movies at a prosumer level.

  Back
 
Topics:
Graphics and AI, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102222
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to cr ...Read More
Abstract:

Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to create compelling 3D video content is challenging. This session is intended to provide a pragmatic guide to creating prosumer 3D video content and how the GPU greatly assists and speeds up this process. The intended audience is anyone interested in how to create compelling 3D movies at a prosumer level.

  Back
 
Topics:
Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102279
Download:
Share:
 
Abstract:
The arrival of fully programable GPUs is now changing the visual effects industry, which traditionally relied on CPU computation to create their spectacular imagery. Implementing the complex image processing algorithms used by VFX is a challenge ...Read More
Abstract:

The arrival of fully programable GPUs is now changing the visual effects industry, which traditionally relied on CPU computation to create their spectacular imagery. Implementing the complex image processing algorithms used by VFX is a challenge, but the payoffs in terms of interactivity and throughput can be enormous. Hear how The Foundry''s novel image processing architecture simplifies the implementation of GPU-enabled VFX software and eases the transition from a CPU based infrastructure to a GPU based one.

  Back
 
Topics:
Graphics and AI, Tools & Libraries, Video & Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102125
Download:
Share:
 
Speakers:
Tristan Lorach
- NVIDIA
Abstract:
Dive into the details of using CgFX - Cg's effect framework - to combine ray-tracing with real-time rendering and enable the next generation of complex high-quality rendering. ...Read More
Abstract:
Dive into the details of using CgFX - Cg's effect framework - to combine ray-tracing with real-time rendering and enable the next generation of complex high-quality rendering. You will learn how to use CgFX to create complex rendering effects in a concise and elegant fashion by: Blending material-level and scene-level effects in a consistent way,- Seamlessly integrating CUDA-based data processing within the CgFX rendering pipeline,Mixing OptiX-based rendering with CgFX and OpenGL.   Back
 
Topics:
Graphics and AI, Rendering & Ray Tracing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2056
Streaming:
Download:
Share:
 
Speakers:
Steve Nash
- NVIDIA
Abstract:
The unprecedented realism that is possible today allows for visualization at an ever larger scale. This talk will walk through several case studies from high resolution single displays to completely immersive environments. ...Read More
Abstract:
The unprecedented realism that is possible today allows for visualization at an ever larger scale. This talk will walk through several case studies from high resolution single displays to completely immersive environments. Details will be shared on how to architect and implement these installations, with attention to the typical issues encountered. It will cover how to implement stereo 3D in OpenGL, Direct3D, as well as how that relates to the different display technologies (projectors, multi-display, CAVEs, etc.)  Back
 
Topics:
Graphics and AI, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102071
Streaming:
Download:
Share:
 
Speakers:
Charles Loop
- Microsoft Research
Abstract:
See how the new DirectX 11 Hardware Tessellation and Compute Shader can be used to implement an adaptive Catmull-Clark subdivision surface renderer. ...Read More
Abstract:
See how the new DirectX 11 Hardware Tessellation and Compute Shader can be used to implement an adaptive Catmull-Clark subdivision surface renderer. We use a table driven approach to performing Catmull-Clark subdivision in parallel utilizing one thread per output mesh vertex.  Back
 
Topics:
Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102129
Streaming:
Download:
Share:
 
Speakers:
Rajeev Surati
- Scalable Display Technologies
Abstract:
We''ll go over the challenges we have overcome in building 100 million pixel seamless displays. One customer requirement involves interactive changes of the eyepoint as a person moves, relative to the screen, ...Read More
Abstract:
We''ll go over the challenges we have overcome in building 100 million pixel seamless displays. One customer requirement involves interactive changes of the eyepoint as a person moves, relative to the screen, yet the distortions computed are quite non-linear. We discuss our use of a gpu to implement this procedure.  Back
 
Topics:
Graphics and AI, HPC and AI, Medical Imaging & Radiology
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102134
Streaming:
Download:
Share:
 
Speakers:
Evan Hart, Johannes van Waveren
- NVIDIA, id Software
Abstract:
A virtual texture implementation allows applications the ability to manage gigantic amounts of texture data for rendering complex data sets. ...Read More
Abstract:
A virtual texture implementation allows applications the ability to manage gigantic amounts of texture data for rendering complex data sets. However, practical utilization involves feeding it adequate data. The GPU offers a powerful engine capable of accelerating the transcoding of efficient storage formats into formats useful for rendering. This session will demonstrate a virtual texturing implementation and the steps needed to GPU accelerate the non-rendering portions of managing and loading the virtual texture data.  Back
 
Topics:
Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102152
Streaming:
Download:
Share:
 
Abstract:
We present a discussion of ideas and techniques behind programmable graphics pipelines on modern GPUs, specifically the example design of a real-time Reyes renderer. Walking through this example, we address the philosophy beneath programmable GP ...Read More
Abstract:

We present a discussion of ideas and techniques behind programmable graphics pipelines on modern GPUs, specifically the example design of a real-time Reyes renderer. Walking through this example, we address the philosophy beneath programmable GPU graphics, the broad strategy for the specific pipeline, and algorithmic and implementation-level details for key rendering stages. We cover several issues concerning GPU efficiency, including those involving work scheduling, parallelization of traditional stages, and balancing of rendering workloads. We expect the audience to gain an in-depth exposure of the state of research in programmable graphics, and an insight into efficient pipeline design for irregular workloads.

  Back
 
Topics:
Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102162
Streaming:
Download:
Share:
 
Speakers:
Ken Pimentel
Abstract:
Learn how GPU technologies are transforming the making of pixels. This talk will cover GPU-centric rendering techniques that leverage both the raw computational capabilities of NVIDIA's GPUs and advanced pixel-shading techniques for interact ...Read More
Abstract:

Learn how GPU technologies are transforming the making of pixels. This talk will cover GPU-centric rendering techniques that leverage both the raw computational capabilities of NVIDIA's GPUs and advanced pixel-shading techniques for interactive visualization and rendering.

  Back
 
Topics:
Graphics and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102165
Streaming:
Download:
Share:
 
Speakers:
Philippe Rollin
- NVIDIA
Abstract:
The new generation of accelerated graphics is elevating visual computing to new heights. Tessellation, one of its most anticipated features, is already used in many scenarios to bring 3D graphics to an unprecedented level of realism. ...Read More
Abstract:
The new generation of accelerated graphics is elevating visual computing to new heights. Tessellation, one of its most anticipated features, is already used in many scenarios to bring 3D graphics to an unprecedented level of realism. This talk will introduce tessellation using OpenGL 4.0. We will also describe how an existing application can be adapted to efficiently take advantage of this new feature and also how to overcome some of the challenges.   Back
 
Topics:
Graphics and AI, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102227
Download:
Share:
 
Speakers:
Brian Harrison, Michael Morrison
- NVIDIA
Abstract:
Learn how NVIDIA SceniX is a rapid start to building state of the art, realtime 3D applications, and how raytracing can be combined with raster graphics for new levels of interactive realism.
 
Topics:
Graphics and AI, Computer Vision, Rendering & Ray Tracing, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102308
Streaming:
Download:
Share:
 
Speakers:
Nathan Andrysco
- Purdue University
Abstract:
We propose a new way to represent trees that allows for faster algorithms, that are simple to implement (especially on the GPU), and with a lower memory overhead than previous approaches. ...Read More
Abstract:
We propose a new way to represent trees that allows for faster algorithms, that are simple to implement (especially on the GPU), and with a lower memory overhead than previous approaches. Using our data structure, we have seen significant improvements in both volume ray casting and ray tracing applications over previous state-of-the-art methods.  Back
 
Topics:
Graphics and AI
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10E01
Download:
Share:
 
Speakers:
Anjul Patney
- University of California, Davis
Abstract:
In this poster, we describe our recent work in the area of programmable graphics pipelines by presenting a fragment-parallel formulation of an A-buffer-style composite and filter equation, and describe its implementation on a modern GPU.
 
Topics:
Graphics and AI
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10E02
Download:
Share:
HPC and AI
Presentation
Media
Speakers:
Long Wang, Wei Ge, Xiaowei Wang, Yunquan Zhang
- Institute of Process Engineering, Chinese Academy of Sciences, Institute of Process Engineering, Institute of Software, CAS, Super Computing Center, Institute of Computer Network Information of CAS
Abstract:
China now holds three spots in the June 2010 Top500 list of GPU-based supercomputers, and two of them, using NVIDIA GPUs, are related to CAS. ...Read More
Abstract:
China now holds three spots in the June 2010 Top500 list of GPU-based supercomputers, and two of them, using NVIDIA GPUs, are related to CAS. Efficient use of these systems is more important than peak or Linpack performance. This session will cover some of the large-scale multi-GPU applications in CAS, ranging from molecular dynamics below nano-scale to complex flows on meter-scale and porous media on geological scales, as well as fundamental linear algebra and data/image analysis. The idea of keeping high-efficiency and generality of the computation platform by maintaining a consistency among the target physical system, the computational model and algorithm, and the computer hardware will be explained in detail and demonstrated through a number of super-computing applications in the chemical, oil, mining, metallurgical and biological industries.  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102286
Streaming:
Download:
Share:
 
Speakers:
Mark Fernandez
- Dell
Abstract:
We have benchmarked the real performance impact on a series of GPU accelerated applications to understand the benefits and drawbacks of different system level configurations. ...Read More
Abstract:
We have benchmarked the real performance impact on a series of GPU accelerated applications to understand the benefits and drawbacks of different system level configurations. Come hear about the effects on performance of GPUs in shared slots and of GPUs that are externally connected.  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102287
Streaming:
Download:
Share:
 
Speakers:
Don Clegg
- Super Micro Computer, Inc.
Abstract:
Find out how Supermicro scales up and scales out GPU performance by using Twin architecture. In this session, we outline Supermicro''s Twin architecture advantages across 1U/2U GPU servers and the design of personal supercomputer, ...Read More
Abstract:
Find out how Supermicro scales up and scales out GPU performance by using Twin architecture. In this session, we outline Supermicro''s Twin architecture advantages across 1U/2U GPU servers and the design of personal supercomputer, and how we are able to scale and optimize GPU technology for datacenter environment and for professional workstation.  Back
 
Topics:
HPC and AI, Computer Vision
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102293
Streaming:
Download:
Share:
 
Speakers:
John Taylor
- CSIRO
Abstract:
We propose holding a research roundtable focussed on using GPU clusters to support scientific research. ...Read More
Abstract:
We propose holding a research roundtable focussed on using GPU clusters to support scientific research. The roundtable will bring together researchers that have recently deployed or are interested in deploying GPU clusters to enable scientific research. At the research roundtable they will be able to share their experiences in deploying this new technology and discuss the future of this technology in supporting research to tackle the world's most challenging scientific problems. To open discussion we will provide a brief presentation about deployment of the CSIRO''s latest supercomputer cluster, which is among the world''s first to combine traditional CPUs with more powerful NVIDIA GPUs, that is providing a world class computational and simulation science facility to advance priority CSIRO science.   Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102301
Streaming:
Download:
Share:
 
Speakers:
Xiaohui Cui
- Oak Ridge National Laboratory
Abstract:
Power consumption has become the leading design constraint for large scale computing systems. In order to achieve exascale computing, system energy efficiency must be improved significantly. ...Read More
Abstract:
Power consumption has become the leading design constraint for large scale computing systems. In order to achieve exascale computing, system energy efficiency must be improved significantly. Our approach will focus on investigating software methodologies to achieve energy efficient computing on heterogeneous systems accelerated with GPUs.   Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2052
Streaming:
Download:
Share:
 
Speakers:
Everett Phillips, Massimiliano Fatica
- NVIDIA
Abstract:
This talk will illustrate the use of GPUs to accelerate the LINPACK benchmark on clusters with GPUs, where both the CPUs and the GPUs are used in synergy. ...Read More
Abstract:
This talk will illustrate the use of GPUs to accelerate the LINPACK benchmark on clusters with GPUs, where both the CPUs and the GPUs are used in synergy. The acceleration is obtained executing DGEMM (matrix multiply) and DTRSM (for the solution of triangular systems) calls simultaneously on both GPU and CPU cores. Details of the implementation will be presented together with results that shows how effective the solution is, both for performance and power efficiency.  Back
 
Topics:
HPC and AI, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2057
Streaming:
Download:
Share:
 
Speakers:
Guido Juckeland, Jeremy Meredith
- TU Dresden - ZIH, Oak Ridge National Laboratory
Abstract:
Learn how applications can be executed over multiple GPUs located in multiple hosts, what the challenges are to scale one application to a 20 PFLOP/s machine and why tool support is a necessity. ...Read More
Abstract:
Learn how applications can be executed over multiple GPUs located in multiple hosts, what the challenges are to scale one application to a 20 PFLOP/s machine and why tool support is a necessity. Receive an overview on the available performance analysis tools that support CUDA developers in generating applications with outstanding speedups.  Back
 
Topics:
HPC and AI, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102089
Streaming:
Download:
Share:
 
Speakers:
Nolan Davis
- SAIC
Abstract:
Large linear algebra problems may be solved using recursive block decomposition in which GPUs efficiently compute the sub-blocks and multicore CPUs put the sub-blocks back together within a large shared memory space. ...Read More
Abstract:
Large linear algebra problems may be solved using recursive block decomposition in which GPUs efficiently compute the sub-blocks and multicore CPUs put the sub-blocks back together within a large shared memory space. This talk will present benchmark results for such a hybrid approach, implemented in Matlab® and using Jacket® to access the GPU compute power.  Back
 
Topics:
HPC and AI, Developer - Algorithms, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102100
Streaming:
Download:
Share:
 
Speakers:
Guillaume Saupin
- Atomic and Alternative Energies Commission (CEA)
Abstract:
Radiation poisoning is an everpresent danger for intervention teams that must visit nuclear sites. Virtual reality can help teams prepare for intervention, but efficient computation of radiation dosage is critical to study complex scenarios. ...Read More
Abstract:
Radiation poisoning is an everpresent danger for intervention teams that must visit nuclear sites. Virtual reality can help teams prepare for intervention, but efficient computation of radiation dosage is critical to study complex scenarios. Radiation protection research often uses codes based on the straight line attenuation method. As with other approaches, geometrical computations (finding all the interactions radiation rays/objects intersection) remain the simulation bottleneck. This talk will describe how we have used the Thrust high-level library for CUDA C/C++ to quickly prototype innovative algorithms and achieve a significant speed up.  Back
 
Topics:
HPC and AI, Developer - Algorithms, Physics Simulation, Rendering & Ray Tracing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102104
Streaming:
Download:
Share:
 
Speakers:
Francois Bodin
- CAPS entreprise
Abstract:
GPGPU is a tremendous opportunity to many application fields. Migrating legacy software to GPGPU is a complex process that requires mastering the technological risks (e. ...Read More
Abstract:
GPGPU is a tremendous opportunity to many application fields. Migrating legacy software to GPGPU is a complex process that requires mastering the technological risks (e.g. loss of code portability, extensive code restructuration, debugging complexity) as well as costs. In this talk, we present a methodology based on HMPP (Heterogeneous Multicore Parallel Programming), allowing incremental processes that reduce the cost and risks of porting codes to GPGPU.  Back
 
Topics:
HPC and AI, Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102117
Streaming:
Download:
Share:
 
Speakers:
Robert Farber
Abstract:
Hear the latest on how supercomputing for the masses is changing the world. We will look at some of the one- to three-orders of magnitude faster killer apps and see how they do it. We will discuss specific mapping to GPGPU hardware and technique ...Read More
Abstract:

Hear the latest on how supercomputing for the masses is changing the world. We will look at some of the one- to three-orders of magnitude faster killer apps and see how they do it. We will discuss specific mapping to GPGPU hardware and techniques for high performance and near-linear scalability both within and across multiple GPGPUs. We will also consider software investment and the decades long longevity of some successful massively parallel Investments in multithreaded software, scalability, balance metrics, lack of consensus on programming models, and lifecycle considerations.

  Back
 
Topics:
HPC and AI, Artificial Intelligence and Deep Learning, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
SC1022
Streaming:
Download:
Share:
 
Speakers:
Fabrizio Zanella
- CST of America
Abstract:
3D Full Wave Electromagnetic simulations of RF components, antennas, printed circuit boards, can be quite time consuming. ...Read More
Abstract:
3D Full Wave Electromagnetic simulations of RF components, antennas, printed circuit boards, can be quite time consuming. Computer Simulation Technology (CST) toolsuite includes the capability to activate GPU Computing. Examples will be shown of using Tesla C1060 and S1070 configurations to provide significant performance improvement of complex simulations.  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102133
Streaming:
Download:
Share:
 
Speakers:
Andrew Washbrook, Philip Clark
- University of Edinburgh
Abstract:
Learn how GPUs could be adopted by the ATLAS detector at the Large Hadron Collider (LHC) at CERN. The detector, located at one of the collision points, must trigger on unprecedented data acquisition rates (PB/s), ...Read More
Abstract:
Learn how GPUs could be adopted by the ATLAS detector at the Large Hadron Collider (LHC) at CERN. The detector, located at one of the collision points, must trigger on unprecedented data acquisition rates (PB/s), to decide whether to record the event, or lose it forever. In the beginning, we introduce the ATLAS experiment and the computational challenges it faces. The second part will focus on how GPUs can be used for algorithm acceleration - using two critical algorithms as exemplars. Finally, we will outline how GPGPU acceleration could be exploited and incorporated into the future ATLAS computing framework.   Back
 
Topics:
HPC and AI, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102135
Download:
Share:
 
Speakers:
Hatem Ltaief, Stan Tomov
- University of Tennessee
Abstract:
Learn how to develop faster, cheaper and better linear algebra software for GPUs through a hybridization methodology that is built on (1) Representing linear algebra algorithms as directed acyclic graphs where nodes correspond ...Read More
Abstract:
Learn how to develop faster, cheaper and better linear algebra software for GPUs through a hybridization methodology that is built on (1) Representing linear algebra algorithms as directed acyclic graphs where nodes correspond to tasks and edges to dependencies among them, and (2) Scheduling the execution of the tasks over hybrid architectures of GPUs and multicore. Examples will be given using MAGMA, a new generation of linear algebra libraries that extends the sequential LAPACK-style algorithms to the highly parallel GPU and multicore heterogeneous architectures.  Back
 
Topics:
HPC and AI, Tools & Libraries, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102138
Streaming:
Download:
Share:
 
Speakers:
John Humphrey
- EM Photonics, Inc
Abstract:
Get the latest information on CULA, an library of hybrid GPU/CPU linear algebra routines optimized for NVIDIA GPUs. ...Read More
Abstract:
Get the latest information on CULA, an library of hybrid GPU/CPU linear algebra routines optimized for NVIDIA GPUs. CULA launched at GTC2009 and has since received large speedups and many new features. We will cover all the features, performance, inner workings, and how users can integrate CULA into their applications. New features for 2010 and 2011 will be in the spotlight, with exciting new developments for sparse matrices including general direct sparse solvers, iterative sparse solvers, and specialized block tridiagonal solvers. Learn how your existing linear algebra applications can benefit from a high quality library. Much more information is available at www.culatools.com and at our presentation and booth.  Back
 
Topics:
HPC and AI, Tools & Libraries, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102153
Streaming:
Download:
Share:
 
Speakers:
NULL
- EM Photonics, Inc.
Abstract:
Discover how different branches of the U.S. military are utilizing GPU accelerated solutions in mission-critical operations. ...Read More
Abstract:
Discover how different branches of the U.S. military are utilizing GPU accelerated solutions in mission-critical operations. This session will detail GPU-related projects that the engineers at EM Photonics have developed specifically for military applications. An image processing example will discuss how GPUs are being used to accelerate long-range battlefield surveillance to protect soldiers. Other military examples include low-power embedded GPU solutions utilized by UAVs and CFD simulations used to model complex interactions between vehicles at sea.  Back
 
Topics:
HPC and AI, Tools & Libraries, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102154
Streaming:
Download:
Share:
 
Speakers:
Matthew Curry
- Sandia National Laboratories and the University of Alabama at Birmingham
Abstract:
While RAID is the prevailing method of creating reliable secondary storage infrastructure, many users desire more flexibility than offered by current implementations. ...Read More
Abstract:
While RAID is the prevailing method of creating reliable secondary storage infrastructure, many users desire more flexibility than offered by current implementations. To attain needed performance, customers have often sought after hardware-based RAID solutions. This talk describes a RAID system that offloads erasure correction coding calculations to GPUs, allowing increased reliability by supporting new RAID levels while maintaining high performance.  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102205
Download:
Share:
 
Speakers:
Chris Mason
- Acceleware
Abstract:
Learn about Acceleware''s and Dassault Systemes'' integrated solution that performs an LDL^T factorization on GPUs within the Abaqus software package. ...Read More
Abstract:
Learn about Acceleware''s and Dassault Systemes'' integrated solution that performs an LDL^T factorization on GPUs within the Abaqus software package. We will discuss efficient GPU parallelization of the factorization algorithm and enabling the CPU and GPU to overlap their computations and data transfers. Includes an end user simulation case study and GPU performance measurements including 300 GFlops in single precision and 145 GFlops in double precision on NVIDIA Tesla C2050.  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102208
Download: