SEARCH SESSIONS

Search All
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Acoustics and Audio Processing
Presentation
Media
Speakers:
Nicolas Tsingos
Abstract:
Learn how to leverage GPUs for interactive audio rendering. This session will give a short overview of the architecture of current GPUs, emphasizing some key differences between GPU and CPUs programming models for audio processing. We will illus ...Read More
Abstract:

Learn how to leverage GPUs for interactive audio rendering. This session will give a short overview of the architecture of current GPUs, emphasizing some key differences between GPU and CPUs programming models for audio processing. We will illustrate the benefits of GPU-accelerated audio rendering with results from 3D audio processing and sound scattering simulations. Finally, we will discuss best practices for GPU implementations as well as future opportunities for audio rendering on massively parallel architectures.

  Back
 
Topics:
Acoustics and Audio Processing, Rendering and Ray Tracing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2042
Streaming:
Download:
Share:
 
Speakers:
Giancarlo Del Sordo
Abstract:
Learn how to implement a commercial software library that exploits CUDA for audio applications. We focus on the overall threading architecture and the underlying math for implementing general purpose audio processing in CUDA devices. Covers the ...Read More
Abstract:

Learn how to implement a commercial software library that exploits CUDA for audio applications. We focus on the overall threading architecture and the underlying math for implementing general purpose audio processing in CUDA devices. Covers the use of inter-process communication to make a plug-in implementation loadable in 32 bit hosts installed in 64 bit systems, distributing the GPU load on remote servers, and creating a CUDA network for high-end purposes such as a big recording facility.

  Back
 
Topics:
Acoustics and Audio Processing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102076
Streaming:
Download:
Share:
 
Abstract:
Learn how a synthesis of 3D sound scenes can be achieved using a peer-to-peer music streaming environment and GPU. We will discuss the technical and cost benefits to this approach, while noting that it frees the CPU for other tasks. ...Read More
Abstract:

Learn how a synthesis of 3D sound scenes can be achieved using a peer-to-peer music streaming environment and GPU. We will discuss the technical and cost benefits to this approach, while noting that it frees the CPU for other tasks.

  Back
 
Topics:
Acoustics and Audio Processing, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102116
Streaming:
Download:
Share:
 
Speakers:
Jike Chong
Abstract:
We explore two contending recognition network representations for speech inference engines: the linear lexical model (LLM) and the weighted finite state transducer (WFST) on NVIDIA GTX285 and GTX480 GPUs. We demonstrate that while an inference e ...Read More
Abstract:

We explore two contending recognition network representations for speech inference engines: the linear lexical model (LLM) and the weighted finite state transducer (WFST) on NVIDIA GTX285 and GTX480 GPUs. We demonstrate that while an inference engine using the simpler LLM representation evaluates 22x more transitions per second than the advanced WFST representation, the simple structure of the LLM representation allows 4.7-6.4x faster evaluation and 53-65x faster operands gathering for each state transition. We illustrate that the performance of a speech inference engine based on the LLM representation is competitive with the WFST representation on highly parallel GPUs.

  Back
 
Topics:
Acoustics and Audio Processing
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10C01
Download:
Share:
 
Speakers:
Jike Chong
Abstract:
Automatic speech recognition (ASR) technology is emerging as a critical component in data analytics for a wealth of media data being generated everyday. ASR-based applications contain fine-grained concurrency that has great potential to be explo ...Read More
Abstract:

Automatic speech recognition (ASR) technology is emerging as a critical component in data analytics for a wealth of media data being generated everyday. ASR-based applications contain fine-grained concurrency that has great potential to be exploited on the GPU. However, the state-of-art ASR algorithm involves a highly parallel graph traversal on an irregular graph with millions of states and arcs, making efficient parallel implementations highly challenging. We present four generalizable techniques including: dynamic data-gather buffer, find-unique, lock-free data structures using atomics, and hybrid global/local task queues. When used together, these techniques can effectively resolve ASR implementation challenges on an NVIDIA GPU.

  Back
 
Topics:
Acoustics and Audio Processing
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10C02
Download:
Share:
Astronomy and Astrophysics
Presentation
Media
Speakers:
Alessandra Mastrobuono, Roberto Capuzzo-Dolcetta
- Sapienza Univ. of Roma
Abstract:
Astrophysics is a field where super computing is a must to obtain new scientific results. in particular, the study of the interaction among massive black holes and surrounding stars is a hot topic, ...Read More
Abstract:
Astrophysics is a field where super computing is a must to obtain new scientific results. in particular, the study of the interaction among massive black holes and surrounding stars is a hot topic, which requires heavy computations to have good representation of what happens in the inner regions of galaxies. We present the results obtained with our high precisioned N-body code, NBSymple, which exploits the joint power of a multi core CPU system together with the high performance NVIDIA Tesla C1060 GPUs. The code is available at the website: astrowww.phys.uniroma1.it/dolcetta/nbsymple.html  Back
 
Topics:
Astronomy and Astrophysics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102000
Streaming:
Download:
Share:
 
Speakers:
Matt Sinclair
Abstract:
Learn how to use the hidden computation capability of GPU texture units for general purpose computation. We describe GRASSY, a system for stellar spectral synthesis where the core problem is interpolation between pre-computed intensity value. We ...Read More
Abstract:

Learn how to use the hidden computation capability of GPU texture units for general purpose computation. We describe GRASSY, a system for stellar spectral synthesis where the core problem is interpolation between pre-computed intensity value. We map these pre-computed tables to the GPU''s texture memory. Interpolation then becomes a texture lookup where the hardware automatically performs the interpolation, albeit at very low precision. Our mathematical framework reasons about the impact of this precision and our performance results show 500X speedups. This work generalizes the GPU texture units as computation engines and opens up new problems for GPU acceleration.

  Back
 
Topics:
Astronomy and Astrophysics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10044
Download:
Share:
 
Speakers:
Richard Townsend
- University of Wisconsin-Madison
Abstract:
Standard FFT algorithms cannot be applied to spectral analysis of unevenly sampled data. Alternative approaches scale as O(N^2), making them an ideal target for harnessing the raw computing power of GPUs. ...Read More
Abstract:
Standard FFT algorithms cannot be applied to spectral analysis of unevenly sampled data. Alternative approaches scale as O(N^2), making them an ideal target for harnessing the raw computing power of GPUs. To this end, I have developed CU-LSP, a CUDA spectral analysis code based on the Lomb-Scargle periodogram. Preliminary benchmarking indicates impressive speed-ups, on the order of 400 relative to a single core of a modern CPU. An initial application of CU-LSP will be the analysis of time-series data from planet-search and asteroseismology satellites.   Back
 
Topics:
Astronomy and Astrophysics, Developer - Algorithms, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102082
Streaming:
Download:
Share:
 
Speakers:
Dominique Aubert
- Strasbourg University
Abstract:
Cosmological simulations aim at reproducing the physical processes which occur on the largest scales of the Universe since the Big-Bang by means of numerical calculations on supercomputers. ...Read More
Abstract:
Cosmological simulations aim at reproducing the physical processes which occur on the largest scales of the Universe since the Big-Bang by means of numerical calculations on supercomputers. Using CUDA, I have implemented standard cosmological techniques on GPU architecture (PM N-Body solver, Hydrodynamics & moment-based radiative transfer) and designed them to run on supercomputing facilities by means of MPI+CUDA mixed programming. These applications are able to run on 100 or more graphics devices with typical scalar x50 accelerations and with a communication overhead limited to 15%. It allow to explore physical regimes which were out of reach of current simulations.   Back
 
Topics:
Astronomy and Astrophysics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102099
Streaming:
Download:
Share:
 
Speakers:
Abdul Mroue
- CITA, Univ. Of Toronto
Abstract:
Get the latest information on how to evolve binary black holes simulations on GPUs.
 
Topics:
Astronomy and Astrophysics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102108
Streaming:
Download:
Share:
 
Speakers:
Mark Cheung
- Lockheed Martin Solar & Astrophysics Laboratory
Abstract:
Learn how GPU computing is enabling astrophysicists to study our closest star. NASA''s recently launched Solar Dynamics Observatory is continuously streaming full-disk images of the Sun at visible, UV and EUV wavelengths. ...Read More
Abstract:
Learn how GPU computing is enabling astrophysicists to study our closest star. NASA''s recently launched Solar Dynamics Observatory is continuously streaming full-disk images of the Sun at visible, UV and EUV wavelengths. This presentation will discuss ways that GPU computing is helping scientists cope with the analysis of the immense data volumes as well as in numerical modeling of the Sun.  Back
 
Topics:
Astronomy and Astrophysics, Computational Fluid Dynamics, Computer Vision and Machine Vision, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102178
Streaming:
Download:
Share:
 
Speakers:
Rainer Spurzem
- National Astronomical Obersvatories, Chinese Academy of Sciences
Abstract:
Many, if not all galaxies harbour supermassive black holes. If galaxies merge, which is quite common in the process of hierarchical structure formation in the universe, ...Read More
Abstract:
Many, if not all galaxies harbour supermassive black holes. If galaxies merge, which is quite common in the process of hierarchical structure formation in the universe, their black holes sink to the centre of the merger remnant and form a tight binary. Depending on initial conditions and time supermassive black hole binaries are prominent gravitational wave sources, if they ultimately come close together and coalesce. We model such systems as gravitating N-body systems (stars) with two or more massive bodies (black holes), including if necessary relativistic corrections to the classical Newtonian gravitational forces (Kupi et al. 2006, Berentzen et al.2009).  Back
 
Topics:
Astronomy and Astrophysics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10B01
Download:
Share:
Climate, Weather, Ocean Modeling
Presentation
Media
Speakers:
Takayuki Aoki
- Global Scientific Information and Computing Center (GSIC) of Tokyo Institute of Technology (Tokyo Tech)
 
Topics:
Climate, Weather, Ocean Modeling
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1014
Download:
Share:
 
Speakers:
Thomas Schulthess
- Swiss National Supercomputing Centre
 
Topics:
Climate, Weather, Ocean Modeling
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1010
Download:
Share:
Cloud Visualization
Presentation
Media
Speakers:
Tamrat Belayneh, Paul Arden
- mental images
 
Topics:
Cloud Visualization
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1008
Download:
Share:
 
Speakers:
Dale Southard
- NVIDIA
 
Topics:
Cloud Visualization
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1007
Download:
Share:
 
Speakers:
Abstract:
We present MatCloud (www.mat-cloud.com), a cloud infrastructure and service for scientific computing using state-of-the-art GPU clusters. MatCloud is a service infrastructure exposed by a simple web terminal interface to run Matlab-like commands ...Read More
Abstract:

We present MatCloud (www.mat-cloud.com), a cloud infrastructure and service for scientific computing using state-of-the-art GPU clusters. MatCloud is a service infrastructure exposed by a simple web terminal interface to run Matlab-like commands/scripts. Join us to see how GPU technology can not only be applied to cloud computing community, but also boost the adoption of cloud computing for its dramatic performance gains over traditional cloud infrastructures.MatCloud is an in-progress academic project and is under active development.

  Back
 
Topics:
Cloud Visualization, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S1020260
Streaming:
Download:
Share:
 
Speakers:
Tad Brockway
- Microsoft
Abstract:
Learn about Microsoft''s upcoming GPU Virtualization feature, RemoteFX, which will ship in Windows Server 2008 R2 SP1. ...Read More
Abstract:
Learn about Microsoft''s upcoming GPU Virtualization feature, RemoteFX, which will ship in Windows Server 2008 R2 SP1. Microsoft RemoteFX enables GPUs to be hosted in the datacenter as a service that can be shared by multiple users for streaming the real-time and complete Windows 7 desktop experience to ultra-lightweight client devices anywhere on the corporate network. With Microsoft RemoteFX, users will be able to work remotely in a Windows Aero desktop environment, watch full-motion video, enjoy Silverlight animations, and run 3D applications - all with the fidelity of local-like performance.  Back
 
Topics:
Cloud Visualization, Computer Graphics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102243
Streaming:
Download:
Share:
Computational Fluid Dynamics
Presentation
Media
Speakers:
Jonathan Cohen
- NVIDIA Research
Abstract:
OpenCurrent is an open source library with support for structured 3D grids and various PDE solvers that operate on them, including a multigrid Poisson solver and an incompressible Navier-Stokes solver. ...Read More
Abstract:
OpenCurrent is an open source library with support for structured 3D grids and various PDE solvers that operate on them, including a multigrid Poisson solver and an incompressible Navier-Stokes solver. It also includes extensions for splitting grids across multiple GPUs. This talk will provide a basic introduction to the code base and its design principles.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2022
Download:
Share:
 
Speakers:
Vivien Clauzon
Abstract:
Hear why and how Numtech, a french SME working in the field of atmospheric dispersion and expertise of meteorological events, is benchmarking GPGPU for its futures applications. ...Read More
Abstract:
Hear why and how Numtech, a french SME working in the field of atmospheric dispersion and expertise of meteorological events, is benchmarking GPGPU for its futures applications. A compressible and an incompressible interactive flow solvers are described.  Back
 
Topics:
Computational Fluid Dynamics, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2037
Streaming:
Download:
Share:
 
Speakers:
Matthieu Lefebvre
- ONERA
Abstract:
Hear how we are improving our elsA and CEDRE computational fluid dynamics software by working on solving the Euler equations set on the GPU. ...Read More
Abstract:
Hear how we are improving our elsA and CEDRE computational fluid dynamics software by working on solving the Euler equations set on the GPU. We discuss how our implementation considers the associated Riemann problem and the Roe-Pike differencing scheme at several orders in space while also introducing immerse boundary conditions. Covers the significant speedup obtained through algorithmic and computational optimizations.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2045
Streaming:
Download:
Share:
 
Speakers:
Kees Vuik, Rohit Gupta
- Delft University Of Technology
Abstract:
Explore how to use deflation as a second level preconditioning technique to speed up Block Incomplete Cholesky Preconditioned Conjugate Gradient Method. ...Read More
Abstract:
Explore how to use deflation as a second level preconditioning technique to speed up Block Incomplete Cholesky Preconditioned Conjugate Gradient Method. We use it to solve the Pressure correction equation involved in the solution of the Two-Phase Fluid Flow problem. Our implementation reaches speedup factors between 25-30, for more than 260,000 unknowns, when compared to the CPU.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2049
Streaming:
Download:
Share:
 
Speakers:
Con Caris, John Taylor, Tomasz Bednarz
- CSIRO
Abstract:
Learn step-by-step procedures to write an explicit CFD solver based on final difference methods with staggered grid allocations and boundary fitted coordinates. ...Read More
Abstract:
Learn step-by-step procedures to write an explicit CFD solver based on final difference methods with staggered grid allocations and boundary fitted coordinates. We will discuss the derivation of the mathematical model, discretization of the model equations, development of the algorithms, and parallelization and visualization of the computed data using OpenCL and OpenGL. Compares case studies of natural convection, driven cavity, scaling analysis, and magneto-thermal convection computed using CSIRO''s CPU/GPU supercomputer cluster to known analytical and experimental solutions.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI, HPC and AI, Physics Simulation, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2058
Streaming:
Download:
Share:
 
Speakers:
Timothy Warburton
- Rice University
Abstract:
In the last three years we have demonstrated how GPU accelerated discontinuous Galerkin methods have enabled simulation of time-dependent, electromagnetic scattering from airplanes and helicopters. ...Read More
Abstract:
In the last three years we have demonstrated how GPU accelerated discontinuous Galerkin methods have enabled simulation of time-dependent, electromagnetic scattering from airplanes and helicopters. In this talk we will discuss how we have extended these techniques to enable GPU accelerated simulation of supersonic airflow as well.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102078
Download:
Share:
 
Speakers:
David M., Patrice Castonguay
- Stanford University
Abstract:
We will describe a scalable and efficient high-order unstructured compressible flow solver for GPUs. ...Read More
Abstract:
We will describe a scalable and efficient high-order unstructured compressible flow solver for GPUs. The solver allows the achievement of arbitrary order of accuracy for flows over complex geometries. High-order solvers require more operations per degree of freedom, thus making them highly suitable for massively parallel processors. Preliminary results indicate speed-ups up to 70x with the Tesla C1060 compared to the Intel i7 CPU. Memory access was optimized using shared and texture memory.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102079
Streaming:
Download:
Share:
 
Speakers:
Peter Zaspel
- University of Bonn
Abstract:
This demonstrates the potential of GPUs for solving complex free surface flow problems using level set methods. ...Read More
Abstract:
This demonstrates the potential of GPUs for solving complex free surface flow problems using level set methods. These methods are capable of producing complex surface deformations, and therefore are used widely in computer graphics, as well as engineering applications. This work demonstrates that GPUs can be used to accelerate the most computationally expensive part of free surface flow calculations, and therefore allows much larger problems to be solved on workstation machines than was previously possible. These techniques will be exemplified by our current project to port our in-house fluid solver NaSt3DGPF to the GPU.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, HPC and AI, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102083
Download:
Share:
 
Speakers:
Allan Peter
- Technical University of Denmark
Abstract:
This work is concerned with the development of an efficient high-throughput scalable model for simulation of fully nonlinear water waves (OceanWave3D) applicable to solve and analyze large-scale problems in coastal engineering. ...Read More
Abstract:
This work is concerned with the development of an efficient high-throughput scalable model for simulation of fully nonlinear water waves (OceanWave3D) applicable to solve and analyze large-scale problems in coastal engineering. The goal can be achieved through algorithm redesign and parallelization of an optimized sequential single-CPU algorithm based on a flexible-order Finite Difference Method. High performance is pursued by utilizing many-core processing in the model focusing on GPUs for acceleration of code execution. This involves combining analytical methods with an algorithm redesign of the current numerical model.   Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102103
Download:
Share:
 
Speakers:
Issei Masaie
- Prometech Software, Inc.
Abstract:
Prometech Software, Inc. is an university launched technology venture in Japan and has been working in the field of particle-based computational fluid dynamics for several years. ...Read More
Abstract:
Prometech Software, Inc. is an university launched technology venture in Japan and has been working in the field of particle-based computational fluid dynamics for several years. Through collaboratinos with major automotive and material companies in Japan, Prometech has implemented our Particle technology on Multi-GPU and delivered as a CAE software, "Particleworks". In this session, we will discuss the theoretical background of our simulation (MPS; Moving Particle Simulation method), Multi GPU programming techniques of sparse matrix solver, performance results of Particleworks and the analysis examples of the Auto and Material.  Back
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102106
Streaming:
Download:
Share:
 
Speakers:
Christopher Stone
- Intelligent Light
Abstract:
Dive deep as we present the details of a new CUDA-based algorithm for accurate rotorcraft wake simulations. ...Read More
Abstract:
Dive deep as we present the details of a new CUDA-based algorithm for accurate rotorcraft wake simulations. We use a vortex particle method, accelerated with a multipole tree algorithm, combined with a traditional grid-based CFD code. This CUDA algorithm can evaluate the velocity and velocity-gradient with an effective throughput approaching 300 billion interactions per second on a C1060. This gives 10x speed-up and 2.5x better accuracy compared to the parallel CPU version.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102110
Streaming:
Download:
Share:
 
Speakers:
Tobias Brandvik
- University of Cambridge
Abstract:
This talk describes a strategy for implementing structured grid PDE solvers on GPUs. Techniques covered include the use of source-to-source compilation and the use of sparse matrix vector multiplications for complicated boundary conditions. ...Read More
Abstract:
This talk describes a strategy for implementing structured grid PDE solvers on GPUs. Techniques covered include the use of source-to-source compilation and the use of sparse matrix vector multiplications for complicated boundary conditions. A new production-quality solver for flows in turbomachines called Turbostream that uses these techniques is presented. The impact of the use of GPUs on the turbomachinery design process is demonstrated by two 64-GPU simulations that have recently been performed on the University of Cambridge''s GPU cluster.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102118
Streaming:
Download:
Share:
 
Speakers:
Jonas Toelke
- Ingrain
Abstract:
Learn how a very efficient implementation of multiphase lattice Boltzmann methods (LBM) based on CUDA delivers significant benefits for predictions of properties in rocks. ...Read More
Abstract:
Learn how a very efficient implementation of multiphase lattice Boltzmann methods (LBM) based on CUDA delivers significant benefits for predictions of properties in rocks. This simulator on NVIDIA hardware enables us to perform pore scale multi-phase (oil-water-matrix) simulations in natural porous media and to predict important rock properties like absolute permeability, relative permeabilites, and capillary pressure. We will show videos of these simulations in complex real world porous media and rocks.  Back
 
Topics:
Computational Fluid Dynamics, Energy Exploration
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102170
Streaming:
Download:
Share:
 
Speakers:
Daniel Gaudlitz
- FluiDyna
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102206
Streaming:
Download:
Share:
 
Speakers:
Andrew Corrigan, Keith Obenschain
- Naval Research Lab Code 6440
Abstract:
Explore how a code written to run in parallel using OpenMP and on a single GPU was modified to run across multiple GPUs and nodes on a multi-CPU, multi-GPU cluster installed at the Naval Research Laboratory. ...Read More
Abstract:
Explore how a code written to run in parallel using OpenMP and on a single GPU was modified to run across multiple GPUs and nodes on a multi-CPU, multi-GPU cluster installed at the Naval Research Laboratory. We will discuss the performance of this code running in parallel using MPI/OpenMP and MPI/CUDA.  Back
 
Topics:
Computational Fluid Dynamics, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102234
Streaming:
Download:
Share:
 
Speakers:
Dan Bailey
- Double Negative
Abstract:
Explore how a less efficient, but highly parallel algorithm can still be a superior alternative to a sequential CPU method. ...Read More
Abstract:
Explore how a less efficient, but highly parallel algorithm can still be a superior alternative to a sequential CPU method. This talk will present a simple CUDA-based Poisson solver to the conjugate gradient method designed for solving well-conditioned matrices such as those that arise from the pressure projection stage of a Navier-Stokes fluid solver. In contrast to other active areas of research in this field, we show that a more brute force approach can still significantly out-perform the best CPU alternatives by sacrificing a high convergence rate in place of achieving much faster iterations.  Back
 
Topics:
Computational Fluid Dynamics, Developer - Algorithms, Film
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102239
Download:
Share:
 
Speakers:
Arun Somani, Lizandro Solano, Z.J. Wang
- Iowa State University
Abstract:
A discontinuous high-order formulation named the Correction Procedure via Reconstruction (CPR) is recently implemented on Nvidia GPUs. ...Read More
Abstract:
A discontinuous high-order formulation named the Correction Procedure via Reconstruction (CPR) is recently implemented on Nvidia GPUs. The CPR formulation is related to the discontinuous Galerkin (DG) method, and unifies several methods such as the DG, spectral volume and spectral difference into a single framework efficient for hybrid meshes. In preliminary 2D inviscid flow computations, a single GPU has been able to deliver a speedup of 44 over a CPU of the same generation. Extension is being made for viscous flow computation, and results will be presented at the final presentation.   Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102292
Streaming:
Download:
Share:
 
Speakers:
Takayuki Aoki
- Tokyo Institute of Technology
Abstract:
Many CFD applications have been successfully accelerated on GPUs, but for large-scale simulations that require memory beyond a single GPU, communication is required between GPUs over cluster nodes through PCI-Express and interconnects. ...Read More
Abstract:
Many CFD applications have been successfully accelerated on GPUs, but for large-scale simulations that require memory beyond a single GPU, communication is required between GPUs over cluster nodes through PCI-Express and interconnects. To overcome performance bottlenecks and preserve parallel scalability, an overlapping technique between computation and communication is essential. This work presents results of an LBM for incompressible flow, and a Tsunami simulation solving the shallow water equation for simulations on the NVIDIA Tesla-based TSUBAME supercomputer of Tokyo Tech. In addition results will be presented on a complete GPU implementation of a production-level weather prediction code developed by the JMA that achieves 15 TFLOPS for an 80-fold speedup.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102295
Streaming:
Download:
Share:
 
Speakers:
Patrice Castonguay
- Stanford University
Abstract:
The objective of this project is to develop a scalable and efficient high-order unstructured compressible flow solver for GPUs. ...Read More
Abstract:
The objective of this project is to develop a scalable and efficient high-order unstructured compressible flow solver for GPUs. The solver allows the achievement of arbitrary order of accuracy for flows over complex geometries. High-order solvers require more operations per degree of freedom, thus making them highly suitable for massively parallel processors. Preliminary results indicate speed-ups up to 70x with the Tesla C1060 compared to the Intel i7 CPU. Memory access was optimized using shared and texture memory.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D01
Download:
Share:
 
Speakers:
Dana Jacobsen
- Boise State University
Abstract:
An investigation of the performance and scalability of a multigrid pressure Poisson equation solver running on a GPU cluster.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D02
Download:
Share:
 
Speakers:
Gilles Civario
- Irish Centre for High-End Computing
Abstract:
In this work, the acceleration of a mesh-free Computational Fluid Dynamics (CFD) code is performed using CUDA.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D03
Download:
Share:
 
Speakers:
Sean Lovett
- University of Cambridge
Abstract:
We used NVIDIA Tesla GPUs to accelerate the solution of hyperbolic partial differential equations, with application to modelling airblast generated by industrial bench mining operations. ...Read More
Abstract:
We used NVIDIA Tesla GPUs to accelerate the solution of hyperbolic partial differential equations, with application to modelling airblast generated by industrial bench mining operations. Parallelisation over multiple GPUs was achieved using MPI.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D04
Download:
Share:
 
Speakers:
Z.J. Wang
- Iowa State University
Abstract:
This poster describes our implementation of adaptive high-order CFD methods on GPUs. A speedup factor of up to 44 has been achieved for 2D flow problems.
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D05
Download:
Share:
 
Speakers:
Long Wang
- Supercomputing Center, Chinese Academy of Sciences
Abstract:
Computational Fluid Dynamics, an important branch in HPC field, has a history of seeking and requiring higher computational performance. ...Read More
Abstract:
Computational Fluid Dynamics, an important branch in HPC field, has a history of seeking and requiring higher computational performance. The traditional way to satisfy this quest is to use faster machines or supercomputers. Yet these approaches seem inconvenient and costly to many individual researchers. We investigated the use of GPU to accelerate CFD codes and tested the performances on CUDA and OpenCL platform. We have ported 2D cave flow, 2D Riemann, and 2D flow over a RAE2882 airfoil to the GPU and explored some GPU-specific optimization strategies. In most cases, approximately 16 to 63 x speed up can be achieved.  Back
 
Topics:
Computational Fluid Dynamics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10D06
Download:
Share:
Computer Graphics
Presentation
Media
Speakers:
Tristan Lorach
- NVIDIA
Abstract:
Dive into the details of using CgFX - Cg's effect framework - to combine ray-tracing with real-time rendering and enable the next generation of complex high-quality rendering. ...Read More
Abstract:
Dive into the details of using CgFX - Cg's effect framework - to combine ray-tracing with real-time rendering and enable the next generation of complex high-quality rendering. You will learn how to use CgFX to create complex rendering effects in a concise and elegant fashion by: Blending material-level and scene-level effects in a consistent way,- Seamlessly integrating CUDA-based data processing within the CgFX rendering pipeline,Mixing OptiX-based rendering with CgFX and OpenGL.   Back
 
Topics:
Computer Graphics, Rendering and Ray Tracing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2056
Streaming:
Download:
Share:
 
Speakers:
Steve Nash
- NVIDIA
Abstract:
The unprecedented realism that is possible today allows for visualization at an ever larger scale. This talk will walk through several case studies from high resolution single displays to completely immersive environments. ...Read More
Abstract:
The unprecedented realism that is possible today allows for visualization at an ever larger scale. This talk will walk through several case studies from high resolution single displays to completely immersive environments. Details will be shared on how to architect and implement these installations, with attention to the typical issues encountered. It will cover how to implement stereo 3D in OpenGL, Direct3D, as well as how that relates to the different display technologies (projectors, multi-display, CAVEs, etc.)  Back
 
Topics:
Computer Graphics, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102071
Streaming:
Download:
Share:
 
Speakers:
Charles Loop
- Microsoft Research
Abstract:
See how the new DirectX 11 Hardware Tessellation and Compute Shader can be used to implement an adaptive Catmull-Clark subdivision surface renderer. ...Read More
Abstract:
See how the new DirectX 11 Hardware Tessellation and Compute Shader can be used to implement an adaptive Catmull-Clark subdivision surface renderer. We use a table driven approach to performing Catmull-Clark subdivision in parallel utilizing one thread per output mesh vertex.  Back
 
Topics:
Computer Graphics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102129
Streaming:
Download:
Share:
 
Speakers:
Rajeev Surati
- Scalable Display Technologies
Abstract:
We''ll go over the challenges we have overcome in building 100 million pixel seamless displays. One customer requirement involves interactive changes of the eyepoint as a person moves, relative to the screen, ...Read More
Abstract:
We''ll go over the challenges we have overcome in building 100 million pixel seamless displays. One customer requirement involves interactive changes of the eyepoint as a person moves, relative to the screen, yet the distortions computed are quite non-linear. We discuss our use of a gpu to implement this procedure.  Back
 
Topics:
Computer Graphics, HPC and AI, Medical Imaging and Radiology
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102134
Streaming:
Download:
Share:
 
Speakers:
Evan Hart, Johannes van Waveren
- NVIDIA, id Software
Abstract:
A virtual texture implementation allows applications the ability to manage gigantic amounts of texture data for rendering complex data sets. ...Read More
Abstract:
A virtual texture implementation allows applications the ability to manage gigantic amounts of texture data for rendering complex data sets. However, practical utilization involves feeding it adequate data. The GPU offers a powerful engine capable of accelerating the transcoding of efficient storage formats into formats useful for rendering. This session will demonstrate a virtual texturing implementation and the steps needed to GPU accelerate the non-rendering portions of managing and loading the virtual texture data.  Back
 
Topics:
Computer Graphics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102152
Streaming:
Download:
Share:
 
Speakers:
Anjul Patney, Stanley Tzeng
- University of California, Davis
Abstract:
We present a discussion of ideas and techniques behind programmable graphics pipelines on modern GPUs, specifically the example design of a real-time Reyes renderer. ...Read More
Abstract:
We present a discussion of ideas and techniques behind programmable graphics pipelines on modern GPUs, specifically the example design of a real-time Reyes renderer. Walking through this example, we address the philosophy beneath programmable GPU graphics, the broad strategy for the specific pipeline, and algorithmic and implementation-level details for key rendering stages. We cover several issues concerning GPU efficiency, including those involving work scheduling, parallelization of traditional stages, and balancing of rendering workloads. We expect the audience to gain an in-depth exposure of the state of research in programmable graphics, and an insight into efficient pipeline design for irregular workloads.  Back
 
Topics:
Computer Graphics, Film
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102162
Streaming:
Download:
Share:
 
Speakers:
Ken Pimentel
- Autodesk
Abstract:
Learn how GPU technologies are transforming the making of pixels. This talk will cover GPU-centric rendering techniques that leverage both the raw computational capabilities of NVIDIA's GPUs ...Read More
Abstract:
Learn how GPU technologies are transforming the making of pixels. This talk will cover GPU-centric rendering techniques that leverage both the raw computational capabilities of NVIDIA's GPUs and advanced pixel-shading techniques for interactive visualization and rendering.  Back
 
Topics:
Computer Graphics, Film
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102165
Streaming:
Download:
Share:
 
Speakers:
Philippe Rollin
- NVIDIA
Abstract:
The new generation of accelerated graphics is elevating visual computing to new heights. Tessellation, one of its most anticipated features, is already used in many scenarios to bring 3D graphics to an unprecedented level of realism. ...Read More
Abstract:
The new generation of accelerated graphics is elevating visual computing to new heights. Tessellation, one of its most anticipated features, is already used in many scenarios to bring 3D graphics to an unprecedented level of realism. This talk will introduce tessellation using OpenGL 4.0. We will also describe how an existing application can be adapted to efficiently take advantage of this new feature and also how to overcome some of the challenges.   Back
 
Topics:
Computer Graphics, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102227
Download:
Share:
 
Speakers:
Brian Harrison, Michael Morrison
- NVIDIA
Abstract:
Learn how NVIDIA SceniX is a rapid start to building state of the art, realtime 3D applications, and how raytracing can be combined with raster graphics for new levels of interactive realism.
 
Topics:
Computer Graphics, Computer Vision and Machine Vision, Rendering and Ray Tracing, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102308
Streaming:
Download:
Share:
 
Speakers:
Nathan Andrysco
- Purdue University
Abstract:
We propose a new way to represent trees that allows for faster algorithms, that are simple to implement (especially on the GPU), and with a lower memory overhead than previous approaches. ...Read More
Abstract:
We propose a new way to represent trees that allows for faster algorithms, that are simple to implement (especially on the GPU), and with a lower memory overhead than previous approaches. Using our data structure, we have seen significant improvements in both volume ray casting and ray tracing applications over previous state-of-the-art methods.  Back
 
Topics:
Computer Graphics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10E01
Download:
Share:
 
Speakers:
Anjul Patney
- University of California, Davis
Abstract:
In this poster, we describe our recent work in the area of programmable graphics pipelines by presenting a fragment-parallel formulation of an A-buffer-style composite and filter equation, and describe its implementation on a modern GPU.
 
Topics:
Computer Graphics
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10E02
Download:
Share:
Computer Vision and Machine Vision
Presentation
Media
Speakers:
Tom Dean
- Google Inc.
Abstract:
Join us for a discussion on applying commodity-server-based clusters and GPU-based clusters to simulating computer vision algorithms at a scale that approaches that of biological vision. ...Read More
Abstract:
Join us for a discussion on applying commodity-server-based clusters and GPU-based clusters to simulating computer vision algorithms at a scale that approaches that of biological vision. We consider the limitations of each technology, survey approaches taken thus far, and suggest new hybrid models and programming frameworks to overcome current limitations and substantially improve performance.  Back
 
Topics:
Computer Vision and Machine Vision, Machine Learning & Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102132
Download:
Share:
 
Speakers:
Abstract:
Learn how to use CUDA and GPGPU to perform large scale face search for both forensics as well as CCTV face recognition.
 
Topics:
Computer Vision and Machine Vision, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102173
Streaming:
Download:
Share:
 
Speakers:
Abbas Bigdeli, Ben Lever
- MIT, Harvard University
Abstract:
Biologically-inspired computer vision algorithms - those that aim to mirror the computations performed by the brain''s visual system - have emerged as exceptionally promising candidates in object and face recognition research, ...Read More
Abstract:
Biologically-inspired computer vision algorithms - those that aim to mirror the computations performed by the brain''s visual system - have emerged as exceptionally promising candidates in object and face recognition research, achieving performance on a range of object and face recognition tasks. Recently, we have begun harnessing the newly-available power of NVIDIA GPUs to tackle the problem of biologically-inspired model selection within a largescale model search framework, drawing inspiration from high-throughput screening approaches in molecular biology and genetics where a large number of organisms are screened in parallel for a given property of interest. As the available computational power provided by massively paralleltechnology from NVIDIA continues to expand, w e hope that this research will hold great potential for new social networking applications in addition to rapidly accelerating progress in artificial vision, and for generating new, experimentally testable hypotheses for the study of biological vision.   Back
 
Topics:
Computer Vision and Machine Vision, HPC and AI, Machine Learning & Deep Learning, Neuroscience
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102204
Download:
Share:
 
Speakers:
James Fung
- NVIDIA
Abstract:
GPUS have evolved from fixed function to general purpose, and continue to evolve with new features being added in every generation. ...Read More
Abstract:
GPUS have evolved from fixed function to general purpose, and continue to evolve with new features being added in every generation. This talk will discuss how to exploit the new features introduced by the Fermi architecture (such as concurrent kernel execution, writes to texture) to accelerate computer vision algorithms.  Back
 
Topics:
Computer Vision and Machine Vision, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102209
Streaming:
Download:
Share:
 
Speakers:
Joe Stam
Abstract:
OpenCV is a widely popular computer vision library, with millions of downloads and hundreds of thousands of users. Applications span many industries including robotics, industrial machine vision, automotive, film & broadcast, medical, and co ...Read More
Abstract:

OpenCV is a widely popular computer vision library, with millions of downloads and hundreds of thousands of users. Applications span many industries including robotics, industrial machine vision, automotive, film & broadcast, medical, and consumer applications. NVIDIA and the OpenCV development team are collaborating to provide CUDA implementations of the most demanding algorithms, thus enabling a new level of real-time capability and higher quality results. This talk with introduce OpenCV, and summarize the new CUDA enabled capabilities, and provide an overview of future plans.

  Back
 
Topics:
Computer Vision and Machine Vision, Stereoscopic 3D, Video and Image Processing, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102215
Streaming:
Download:
Share:
 
Speakers:
Peter Lu
- Harvard University
Abstract:
Ever since the 1827 discovery of Brownian motion by observing pollen grains, quantifying motion under the microscope has led to breakthroughs in physics, biology and engineering. ...Read More
Abstract:
Ever since the 1827 discovery of Brownian motion by observing pollen grains, quantifying motion under the microscope has led to breakthroughs in physics, biology and engineering. Here, I present methods we have developed using confocal microscopy to deduce 3D structure and dynamics from 2D image sequences. We analyze the motion of diffusing colloidal particles and swarms of bacteria free to swim in 3D, which we observe at the single-organism level. We rely heavily on GPU computing to process our large data sets, making extensive use of NPP, CuFFT and optical-flow CUDA algorithms originally developed for machine vision in automobiles.  Back
 
Topics:
Computer Vision and Machine Vision, Life & Material Science, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102242
Streaming:
Download:
Share:
 
Speakers:
Mahesh Khadtare
- CRL India
Abstract:
Explores the GPU porting and performance analysis of the image quality assessment algorithm based on structural similarity index(SSI). ...Read More
Abstract:
Explores the GPU porting and performance analysis of the image quality assessment algorithm based on structural similarity index(SSI). This index is a powerful tool for image quality assessment and the algorithm is highly suitable for GPU architecture, offering a rapid image quality assessment in many image restoration applications.  Back
 
Topics:
Computer Vision and Machine Vision, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102298
Streaming:
Download:
Share:
 
Speakers:
Hannes Fassold
Abstract:
Discover how post-production tasks can be accelerated by taking advantage of GPU-based algorithms. In this talk we present computer vision algorithms for corner detection, feature point tracking, image warping and image inpainting, and their eff ...Read More
Abstract:

Discover how post-production tasks can be accelerated by taking advantage of GPU-based algorithms. In this talk we present computer vision algorithms for corner detection, feature point tracking, image warping and image inpainting, and their efficient implementation on GPUs using CUDA. We also show how to use these algorithms to do real-time stabilization and temporal re-sampling (re-timing) of high definition video sequences, both common tasks in post-production. Benchmarking of the GPU implementations against optimized CPU algorithms demonstrates a speedup of approximately an order of magnitude.

  Back
 
Topics:
Computer Vision and Machine Vision, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2029
Streaming:
Download:
Share:
 
Speakers:
Abstract:
To measure three-dimensional shape data of objects, we build up a measurement system that assigns three-dimensional coordinates to the position of projected measurement labels in a camera image. ...Read More
Abstract:
To measure three-dimensional shape data of objects, we build up a measurement system that assigns three-dimensional coordinates to the position of projected measurement labels in a camera image. To achieve high measurement accuracy across high amounts of measurement points, we need a very quick routine to localize measurement labels with high precision. To speed up the computation, we evaluate the fits using the CUDA architecture. The final implementation speeds up the fitting of 104 two-dimensional Gauss functions by a factor of 90.  Back
 
Topics:
Computer Vision and Machine Vision, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102065
Streaming:
Download:
Share:
 
Speakers:
Kento Tarui
- AquaCast Corporation
Abstract:
We propose a real time HOG based object detector implemented on GPU. To accelerate the detection process, the proposed method uses two serially-cascaded HOG detectors. ...Read More
Abstract:
We propose a real time HOG based object detector implemented on GPU. To accelerate the detection process, the proposed method uses two serially-cascaded HOG detectors. The first low dimensional HOG detector discards detection windows obviously not showing target objects. It reduces the computational cost of the second high dimensional HOG detector. This method tested on 640x480 color image and the same size movie. The computation time decreases to 70ms per image. That is 4 times faster than a case of single detector. This method provides real time performance even on middle end GPUs such as GeForce GTS 250.  Back
 
Topics:
Computer Vision and Machine Vision, Machine Learning & Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102114
Streaming:
Download:
Share:
 
Speakers:
Ryan Ismert
- Sportvision, Inc.
Abstract:
This talk will take a detailed look at Sportvision''s "First and 10" system, perhaps the most widely experienced example of AR ever, with 106 million viewers during the 2010 Superbowl alone. ...Read More
Abstract:
This talk will take a detailed look at Sportvision''s "First and 10" system, perhaps the most widely experienced example of AR ever, with 106 million viewers during the 2010 Superbowl alone. We''ll examine the current implementation and the GPU features that enable low latency, video-rate performance.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102123
Streaming:
Download:
Share:
 
Speakers:
Bor-Yiing Su
- University of California, Berkeley
Abstract:
We have developed a parallel object recognition system using CUDA, achieving 70x-80x speedup against the original serial implementation. ...Read More
Abstract:
We have developed a parallel object recognition system using CUDA, achieving 70x-80x speedup against the original serial implementation. In order to optimize our implementation, we evaluated the performance of different parallelization strategies on some key computations in the object recognition system. Finally we concluded that the parallel implementation performance is sensitive to input data properties. Therefore, we should dynamically adjust the parallelization strategy at runtime to optimize key computations.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F01
Download:
Share:
 
Speakers:
Narayanan Sundaram
- University of California, Berkeley
Abstract:
In this poster we discuss a method for computing point trajectories based on a fast parallel implementation of a recent optical flow algorithm that tolerates fast motion. ...Read More
Abstract:
In this poster we discuss a method for computing point trajectories based on a fast parallel implementation of a recent optical flow algorithm that tolerates fast motion. The parallel implementation of large displacement optical flow runs about 78x faster than the serial C++ version. We use this implementation is a point tracking application. Our resulting technique tracks up to three orders of magnitude more points and is 46% more accurate than the Kanade-Lucas-Tomasi tracker. Compared to the Particle Video tracker, we achieve 66% better accuracy while retaining the ability to handle large displacements while running an order of magnitude faster.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F02
Download:
Share:
 
Speakers:
Steven Brumby
- Los Alamos National Laboratory
Abstract:
Los Alamos National Laboratory''s Petascale Synthetic Visual Cognition project is exploring full-scale, real-time functional models of human visual cortex to understand how human vision achieves its accuracy, robustness and speed. ...Read More
Abstract:
Los Alamos National Laboratory''s Petascale Synthetic Visual Cognition project is exploring full-scale, real-time functional models of human visual cortex to understand how human vision achieves its accuracy, robustness and speed. Commercial-off-the-shelf hardware to support this modeling is rapidly improving, e.g., a teraflop GPGPU card costs ~$500 and is ~size of mouse cortex. We present results demonstrating image classification on UAV aerial video with a visual cortex model running on a 240-core NVIDIA GeForce GTX285, and see >x10 speed-up. As this technology continues to improve, cortical modeling on GPGPU devices has the potential to revolutionize computer vision.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F03
Download:
Share:
 
Speakers:
Melvin Wong
- Institute for Infocomm Research
Abstract:
Background subtraction is one of the important image processing steps for video surveillance and many computer vision problems such as tracking & recognition. ...Read More
Abstract:
Background subtraction is one of the important image processing steps for video surveillance and many computer vision problems such as tracking & recognition. However, robust background subtraction that adapts well to variable environment changes is highly computational and consumed large amount of memory. Thus, its practical application is often limited. Here, we aimed to expand its usage and tackle vision problems that requires high frame rate camera such as real-time sports analysis, real-time object detection and recognition. Using recent advances in accelerator hardware - NVIDIA Fermi Architecture and taking advantage of heterogeneous computing , we are able to gain good performance that allows to use in these practical applications.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F04
Download:
Share:
 
Speakers:
Nicolas Pinto
- Massachusetts Institute of Technology
Abstract:
The construction of artificial vision systems and the study of biological vision are naturally intertwined as they represent simultaneous efforts to forward- and reverse-engineer systems with similar goals. ...Read More
Abstract:
The construction of artificial vision systems and the study of biological vision are naturally intertwined as they represent simultaneous efforts to forward- and reverse-engineer systems with similar goals. Here, we present a high-throughput approach to more expansively explore biologically-inspired models by leveraging GPUs. We show that this approach can yield significant gains in performance on object and face recognition (including "Labeled Faces in the Wild" challenge and faces from Facebook), consistently outperforming the state-of-the-art. We highlight how the application of flexible programming tools, such as high-level scripting, template metaprogramming/auto-tuning, can enable large performance gains, while managing complexity for the developer.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F05
Download:
Share:
 
Speakers:
Salman Ul Haq
- TunaCode
Abstract:
CUVI Lib (CUDA for Vision and Imaging Library) is a software library that provides a set of GPU accelerated computer vision and image processing functions. ...Read More
Abstract:
CUVI Lib (CUDA for Vision and Imaging Library) is a software library that provides a set of GPU accelerated computer vision and image processing functions. CUVI can both be utilized as an add-on library for the NVIDIA''s NPP (NVIDIA Performance Primitives) as it compliments the functionality present in NPP as well as it can be used as a standalone library ready to be plugged into end-user C/C++ applications.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F06
Download:
Share:
 
Speakers:
In Kyu Park
- Inha University
Abstract:
We present a new surfel (surface element) based multi-view stereo algorithm which runs entirely on GPU. ...Read More
Abstract:
We present a new surfel (surface element) based multi-view stereo algorithm which runs entirely on GPU. We utilize flexibility of surfel-based 3D shape representation and global optimization by graph cuts in a same framework.The orientation of the constructed surfel candidates imposes an effective constraint that reduces the effect of the minimal surface bias. The entire processing pipeline is implemented on the latest GPU to speed up the processing significantly. Experimental results show that the proposed approach reconstructs the 3D shape of an object accurately and efficiently, which runs more than 100 times faster than on CPU.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F07
Download:
Share:
 
Speakers:
Jayadeep Vijayan
- NeST Software
Abstract:
A GPU based implementation of a face recognition solution using PCA with Eigenfaces algorithm.
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F08
Download:
Share:
 
Speakers:
Jan-Michael Frahm
- University of North Carolina, Chapel Hill
Abstract:
We present a system to reconstruct dense 3D models from community photo collections. First images are described using GIST and are clustered using hamming distances. ...Read More
Abstract:
We present a system to reconstruct dense 3D models from community photo collections. First images are described using GIST and are clustered using hamming distances. Each of these clusters is geometrically verified and connected using Geotags. Connected clusters are bundle adjusted and the obtained registration is used to estimate depthmaps that are finally fused to obtain dense 3D models. Each of the above steps, except Bundle Adjustment, is implemented in CUDA and runs on multiple GPUs . The performance of our pipeline is two order of magnitude faster on one order more images compared to state of the art method.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F09
Download:
Share:
 
Speakers:
Chloe Vaniet
- Imperial College London
Abstract:
Vision enhancement systems is an alternative visual aid device to enhance the remaining vision for visual impairment subjects. ...Read More
Abstract:
Vision enhancement systems is an alternative visual aid device to enhance the remaining vision for visual impairment subjects. Our aim is to develop a mobile central vision enhancement system for macular degeneration patients. Three different types of enhancement algorithms have been developed and their efficiency was tested on low vision patients. These three algorithms have been implemented on a portable low power devic. The Nvidia system-on-a-chip Tegra has been chosen for this implementation.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F10
Download:
Share:
 
Speakers:
Esubalew Bekele
- Universal Robotics Inc.
Abstract:
A dense stereo vision for a material handling dual-arm industrial robot have been implemented with the Rectification, Stereo Correspondence and 3D Pose from depth are ported out to GPU using CUDA.
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F11
Download:
Share:
 
Speakers:
Hendrik Dahlkamp
- Stanford University
Abstract:
We present a flexible, parallelized method for fusing information from optical and range sensors based on an accelerated high-dimensional filtering approach. ...Read More
Abstract:
We present a flexible, parallelized method for fusing information from optical and range sensors based on an accelerated high-dimensional filtering approach. Our system takes as input a sequence of monocular camera images as well as a stream of sparse range measurements as obtained from a laser or other sensor system. Our method produces a dense, high-resolution depth map of the scene, automatically generating confidence values for every interpolated depth point. We describe how to integrate priors on object shape, motion and appearance and how to achieve an efficient implementation using parallel processing hardware such as GPUs.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F12
Download:
Share:
 
Speakers:
Varun Ganapathi
- Stanford University
Abstract:
In this work, we derive an efficient filtering algorithm for tracking human pose at 4-10 frames per second using a stream of monocular depth images. ...Read More
Abstract:
In this work, we derive an efficient filtering algorithm for tracking human pose at 4-10 frames per second using a stream of monocular depth images. The key idea is to combine an accurate generative model-which is achievable in this setting using state of the art GPU hardware-with a discriminative model that feeds data-driven evidence about body part locations. We describe a novel algorithm for propagating the noisy evidence about body part locations up the kinematic chain using the unscented transform.We provide extensive experimental results on 28 real-world sequences using automatic ground-truth annotations from a commercial motion capture system.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F13
Download:
Share:
 
Speakers:
Tim Llewellynn
- nViso / EPFL
Abstract:
Active Appearance Models (AAM) is a powerful tool for modeling and matching objects under shape deformations and texture variations. ...Read More
Abstract:
Active Appearance Models (AAM) is a powerful tool for modeling and matching objects under shape deformations and texture variations. It learns characteristics of objects by building a compact statistical model from applying Principal Component Analysis (PCA) to a set of labeled data. Although AAM has been widely applied in the fields of computer vision, due to its flexible framework, it still cannot satisfy the requirement of real-time situations. To alleviate this problem, we address the computational complexity of the fitting procedure by running the AAM optimization algorithm on a GPU using a hybrid CPU / GPU block processing architecture.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F14
Download:
Share:
 
Speakers:
Anatoly Baksheev
- ITEEZ
Abstract:
OpenCV is a free open source library of computer vision algorithms. Recently a new module consisting of functions implemented on GPU was introduced in OpenCV. ...Read More
Abstract:
OpenCV is a free open source library of computer vision algorithms. Recently a new module consisting of functions implemented on GPU was introduced in OpenCV. It consists of several methods for calculating stereo correspondence between two images that is used to reconstruct a 3D scene. A simple block-matching algorithm works up to 10x faster compared to a CPU implementation in OpenCV providing real-time processing of HD stereo pairs on Tesla cards. Belief propagation-based algorithms show 20-50x speedup compared to a CPU implementation.  Back
 
Topics:
Computer Vision and Machine Vision
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10F15
Download:
Share:
Databases, Data Mining, Business Intelligence
Presentation
Media
Speakers:
Ren Wu
Abstract:
Join us and learn why GPU computing is a game changer for business intelligence (BI). We will discuss how GPUs can be used to accelerate BI analytics at much lower cost, higher performance, and better power efficiency than other alternatives.

Read More
Abstract:

Join us and learn why GPU computing is a game changer for business intelligence (BI). We will discuss how GPUs can be used to accelerate BI analytics at much lower cost, higher performance, and better power efficiency than other alternatives.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence, Finance, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102069
Streaming:
Download:
Share:
 
Speakers:
Abstract:
In a large-scale database installation where data tables are distributed across multiple servers, computational throughput can be optimized by using GPUs on each server and integrating database management with GPU resources. In the Department of ...Read More
Abstract:

In a large-scale database installation where data tables are distributed across multiple servers, computational throughput can be optimized by using GPUs on each server and integrating database management with GPU resources. In the Department of Physics and Astronomy at The Johns Hopkins University, we are experimenting with a set of software tools that closely couple SQL statements with GPU functionality. While still under development, the new framework is now routinely used in our research projects, e.g., to study the spatial clustering of galaxies as well as genomics.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence, Astronomy and Astrophysics, Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102092
Streaming:
Download:
Share:
 
Speakers:
Abstract:
In this research session, we present an approach using NVIDIA GPUs as massively parallel coprocessors for in-memory OLAP computations. Early tests have shown speedup factors of more than 40x compared to optimized sequential algorithms on a CPU. ...Read More
Abstract:

In this research session, we present an approach using NVIDIA GPUs as massively parallel coprocessors for in-memory OLAP computations. Early tests have shown speedup factors of more than 40x compared to optimized sequential algorithms on a CPU. In addition to the data structures and algorithms involved, we describe a method to extend the approach to systems with more than one GPU in order to scale it to larger data sets.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102237
Streaming:
Download:
Share:
 
Speakers:
Peter Volk
Abstract:
With an increasing amount of data and user demands for fast query processing, the optimization of database operations continues to be a challenging task. A common optimization method is to leverage parallel hardware architectures. With the intro ...Read More
Abstract:

With an increasing amount of data and user demands for fast query processing, the optimization of database operations continues to be a challenging task. A common optimization method is to leverage parallel hardware architectures. With the introduction of general-purpose GPU computing, massively parallel hardware has become available within commodity hardware. To efficiently exploit this technology, we introduce the method of speculative query processing. This speculative query processing works on index structures to efficiently support heavily used database operations. To show the benefits and opportunities of our approach, we present a fine and coarse grain implementation for multidimensional queries.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10G02
Download:
Share:
 
Speakers:
Henry Cook
Abstract:
We propose a mechanism to provide the benefits of a software-managed memory hierarchy on top of a hierarchy of hardware-managed caches. A virtual local store (VLS) is mapped into the virtual address space of a process and backed by physical main ...Read More
Abstract:

We propose a mechanism to provide the benefits of a software-managed memory hierarchy on top of a hierarchy of hardware-managed caches. A virtual local store (VLS) is mapped into the virtual address space of a process and backed by physical main memory, but is stored in a partition of the hardware-managed cache when active. This reduces context switch cost, and allows VLSs to migrate with their process thread. The partition allocated to the VLS can be rapidly reconfigured without flushing the cache, allowing programmers to selectively use VLS in a library routine with low overhead.

  Back
 
Topics:
Databases, Data Mining, Business Intelligence
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10G03
Download:
Share:
Developer - Algorithms
Presentation
Media
Speakers:
Paul Navratil
- Texas Advanced Computing Center
 
Topics:
Developer - Algorithms
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1013
Download:
Share:
 
Speakers:
Nikolai Sakharnykh
Abstract:
Learn about new techniques to efficiently implement the Alternating Direction Implicit method on GPU for large 2D and 3D domains with complex boundaries. A novel tridiagonal solver for systems with variable sizes and a new hybrid approach will b ...Read More
Abstract:

Learn about new techniques to efficiently implement the Alternating Direction Implicit method on GPU for large 2D and 3D domains with complex boundaries. A novel tridiagonal solver for systems with variable sizes and a new hybrid approach will be covered in detail. Comprehensive performance analysis and key Fermi optimizations will be explored. Various applications of tridiagonal solvers such as 3D direct numerical fluid simulation and a 2D depth-of-field effect for games will be briefly discussed.

  Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10015
Streaming:
Download:
Share:
 
Abstract:
Learn how to accelerate marching cubes on the GPU by taking advantage of the GPU's high memory bandwidth and fast on-chip shared memory in a data expansion algorithm that can extract the complete iso-surface mesh from (dynamic) volume data w ...Read More
Abstract:

Learn how to accelerate marching cubes on the GPU by taking advantage of the GPU's high memory bandwidth and fast on-chip shared memory in a data expansion algorithm that can extract the complete iso-surface mesh from (dynamic) volume data without requiring any data transfers back to the CPU.

  Back
 
Topics:
Developer - Algorithms, Medical Imaging and Radiology, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2020
Download:
Share:
 
Speakers:
Allan Rasmusson, Gernot Ziegler
- University of Aarhus, NVIDIA
Abstract:
Explore a new technique in the detection of common regions in a 2D/3D data array. Connected components along the axes are linked before actual label propagation starts. ...Read More
Abstract:
Explore a new technique in the detection of common regions in a 2D/3D data array. Connected components along the axes are linked before actual label propagation starts. The algorithm is completely gather-based, which allows for several optimizations in the CUDA C implementation. It enables real-time frame rates for the analysis of typical 2D images and interactive frame rates for the analysis of typical volume data.  Back
 
Topics:
Developer - Algorithms, Computer Vision and Machine Vision, Medical Imaging and Radiology, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2021
Streaming:
Download:
Share:
 
Speakers:
Robert Strzodka
- Max Planck Institut Informatik
Abstract:
Learn how to switch between array of structs (AoS) and struct of arrays (SoA) storage without having to change the data access syntax. ...Read More
Abstract:
Learn how to switch between array of structs (AoS) and struct of arrays (SoA) storage without having to change the data access syntax. A few changes to the struct and container definitions will enable you to evaluate the performance of AoS vs. SoA on your existing AoS code. We present a simple abstraction that retains the more intuitive AoS syntax array[index]component, yet allows you to switch between AoS and SoA storage with a single template parameter at class definition.  Back
 
Topics:
Developer - Algorithms, Developer - Algorithms, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2038
Streaming:
Download:
Share:
 
Speakers:
Nachiket Gokhale
- Weidlinger Associates Inc
Abstract:
Explicit finite element codes are widely used to simulate the response of structures and mechanical equipment subjected to shock, blast and wave propagation phenomena. ...Read More
Abstract:
Explicit finite element codes are widely used to simulate the response of structures and mechanical equipment subjected to shock, blast and wave propagation phenomena. High resolution models require run times ranging from a few seconds to a few months are common and hence the payoff from GPU acceleration is tremendous. We describe the acceleration of our commercial finite element code NLFLEX using CUDA. We developed GPU kernels in CUDA based on our production code NLFLEX, for linear elasticity, explosives, elasto-plasticity and large deformation elasticity. We attained order of magnitude (10X) acceleration in single precision and approximately (5X) in double precision mode.   Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102061
Streaming:
Download:
Share:
 
Speakers:
Doris Chen
- University of Toronto
Abstract:
FPGA technology mapping is an algorithm that is heavily data parallel, but contains many features that make it unattractive for GPU implementation. ...Read More
Abstract:
FPGA technology mapping is an algorithm that is heavily data parallel, but contains many features that make it unattractive for GPU implementation. The algorithm uses data in irregular ways since it is a graph-based algorithm. It also makes heavy use of constructs like recursion which is not supported by GPU hardware. In this paper, we take a state-of-the-art FPGA technology mapping algorithm within Berkeley's ABC package and attempt to parallelize it on a GPU. We show that runtime gains of 3.1x are achievable while maintaining identical quality as demonstrated by running these netlists through Altera's Quartus II place-and-route tool.  Back
 
Topics:
Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102068
Streaming:
Download:
Share:
 
Speakers:
Mark Harris
- NVIDIA
Abstract:
Learn about the importance of optimized data-parallel algorithm primitives as building blocks for efficient real-world applications. ...Read More
Abstract:
Learn about the importance of optimized data-parallel algorithm primitives as building blocks for efficient real-world applications. Fundamental parallel algorithms like sorting, parallel reduction, and parallel scan are key components in a wide range of applications from video games to serious science. This session will cover the state of the art in data-parallel primitive algorithms for GPUs. Starting with an explanation of the purpose and applications of the algorithms, we will discuss key algorithm design principles, demonstrate current open source algorithm libraries for GPUs (CUDPP and Thrust), describe optimizations using new features in the Fermi architecture, and explore future directions.  Back
 
Topics:
Developer - Algorithms, Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102084
Streaming:
Download:
Share:
 
Speakers:
Andrew Davidson, Yao Zhang
- University of California, Davis
Abstract:
In this presentation, we will discuss and analyze the performance of three optimization techniques for tridiagonal solvers. ...Read More
Abstract:
In this presentation, we will discuss and analyze the performance of three optimization techniques for tridiagonal solvers. We first present a hybrid Parallel Cyclic Reduction(PCR)-Gaussian Elimination(GE) tridiagonal solver, which combines work-efficient and step-efficient algorithms for high performance. We further discuss an auto-tuned variant of this technique which selects the optimal switching point between algorithms on a per-machine basis. Next, we present a technique to handle large systems, where shared memory constraints prohibit previous work to solve these systems directly. Finally, we will discuss optimizations on a cyclic reduction technique that avoid bank conflicts on current hardware.  Back
 
Topics:
Developer - Algorithms, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102085
Streaming:
Download:
Share:
 
Speakers:
Holger Dammertz
- Ulm University
Abstract:
Learn how to select the best and fastest pseudo random number generator for your massively parallel Monte Carlo simulation. ...Read More
Abstract:
Learn how to select the best and fastest pseudo random number generator for your massively parallel Monte Carlo simulation.Pseudo random numbers generators (PRNG) are a fundamental building block of these simulations and it is thus required to select suitable PRNGs with regard to the specific problem at hand while considering the parallel hardware architecture. Recent developments in random number generations provide a wide variety of choices, each with different properties and trade-offs. We provide a comprehensive survey of the current state of the art for massively parallel PRNG and show a broad range of applications.  Back
 
Topics:
Developer - Algorithms, Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102136
Streaming:
Download:
Share:
 
Speakers:
Shawn Brown
Abstract:
Learn how to adapt a kd-tree spatial data structure for efficient nearest neighbor (NN) searches on a GPU. Although the kd-tree is not a natural fit for GPU implementation, it can still be effective with the right engineering decisions. By bound ...Read More
Abstract:

Learn how to adapt a kd-tree spatial data structure for efficient nearest neighbor (NN) searches on a GPU. Although the kd-tree is not a natural fit for GPU implementation, it can still be effective with the right engineering decisions. By bounding the maximum height of the kd-tree, minimizing the memory footprint of data structures, and optimizing the GPU kernel code, multi-core GPU NN searches with tens of thousands to tens of millions of points run 10-40 times faster than the equivalent single-core CPU NN searches.

  Back
 
Topics:
Developer - Algorithms, Databases, Data Mining, Business Intelligence, Machine Learning & Deep Learning
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102140
Download:
Share:
 
Speakers:
Amanda Peters
- Harvard University
Abstract:
Learn how GPUs are being used to accelerate the study of the emergence of cooperative behavior in biology, from the interactions of humans to viruses to bacteria. ...Read More
Abstract:
Learn how GPUs are being used to accelerate the study of the emergence of cooperative behavior in biology, from the interactions of humans to viruses to bacteria. The work presented here achieves a speedup of 209x on a cluster of 4 Tesla GPUs.  Back
 
Topics:
Developer - Algorithms, Life & Material Science
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102163
Streaming:
Download:
Share:
 
Speakers:
Lorena Barba
- Boston University
Abstract:
The first wave of successful GPU accelerations has been crowded with highly-parallel methods that adapted well to the hardware. ...Read More
Abstract:
The first wave of successful GPU accelerations has been crowded with highly-parallel methods that adapted well to the hardware. But the easy-pickings are now running out. The truly challenging applications require "going back to the algorithmic drawing board." To develop new versions of the most effective fast algorithms, such that our science can most benefit, an ideal environment is created by the open software model, where efforts can be shared. We will describe one area of application --electrostatics of biomolecules in solution-- where we see at work the triad of extreme computing: fast algorithms, open software, and heterogeneous computing.  Back
 
Topics:
Developer - Algorithms, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102166
Streaming:
Download:
Share:
 
Speakers:
Adarsh Krishnamurthy, Sara McMains
- University of California Berkeley
Abstract:
The broad objective of our research is to develop mechanical Computer-Aided Design tools that provide interactive feedback to the designer. ...Read More
Abstract:
The broad objective of our research is to develop mechanical Computer-Aided Design tools that provide interactive feedback to the designer. We have developed GPU algorithms for fundamental CAD operations (NURBS evaluation, surface-surface intersection, separation distance computation, moment computation, etc.) that are one to two orders of magnitude faster, and often more accurate, than current commercial CPU implementations. We will touch on strategies we have employed to meet GPU programming challenges, such as the separation of CPU/GPU operations, imposing artificial structure on computations, and transforming problem definitions to suit GPU-computation models.  Back
 
Topics:
Developer - Algorithms, Computer Graphics, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102171
Streaming:
Download:
Share:
 
Speakers:
Pavel Emeliyanenko
- Max-Planck Institute for Informatics
Abstract:
We present the first implementation of a complete modular resultant algorithm on the graphics hardware. ...Read More
Abstract:
We present the first implementation of a complete modular resultant algorithm on the graphics hardware. Our recent developments taking advantage of new NVidia Fermi GPU architecture and instruction set allowed us to achieve about 150x speed-up over a modular resultant algorithm from Maple 13.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A02
Download:
Share:
 
Speakers:
Hartmut Ruhl
- Ludwig-Maximilians-University
Abstract:
Particle-In-Cell simulations represent an important technique in the field of kinetic plasma simulations. ...Read More
Abstract:
Particle-In-Cell simulations represent an important technique in the field of kinetic plasma simulations. 2D particle pushing and conserved current aggregation has been implemented in CUDA. On a TESLA C1060 the CUDA code is 4 times faster than SSE2 optimized code on a quad core INTEL XEON processor.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A03
Download:
Share:
 
Speakers:
Octavian Nitica
- University of Delaware
Abstract:
The Ant Colony Optimization (ACO) Algorithm is a metaheuristic that is used to find shortest paths in graphs. ...Read More
Abstract:
The Ant Colony Optimization (ACO) Algorithm is a metaheuristic that is used to find shortest paths in graphs. By using CUDA to implement an ACO algorithm, we achieved significant improvement in performance over a highly-tuned sequential CPU implementation. The construction step of the ACO algorithm consists of each ant creating an independent solution, and this step is where most of the computation is spent. Since the construction step is the same for most ACO variations, parallelizing this step will also allow for easy adaptation to different pheromone updating functions. Currently, our research tests this hypothesis on the travelling salesmen problem.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A04
Download:
Share:
 
Speakers:
Duane Merrill
- University of Virginia
Abstract:
The need to rank and order data is pervasive, and sorting operations are fundamental to many algorithms. ...Read More
Abstract:
The need to rank and order data is pervasive, and sorting operations are fundamental to many algorithms. This poster presents a very efficient method for sorting large sequences of fixed-length keys (and values) using GPU stream processors. Compared to the state-of-the-art, our implementation demonstrates multiple factors of speedup (up to 3.8x) for all NVIDIA GPGPUs. For this domain of sorting problems, we believe our sorting primitive to be the fastest available for any fully-programmable microarchitecture: our stock NVIDIA GTX480 sorting results exceed the 1G keys/sec average sorting rate (i.e., one billion 32-bit keys sorted per second).  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A05
Download:
Share:
 
Speakers:
Stanley Tzeng
- University of California, Davis
Abstract:
We explore software mechanisms for managing irregular tasks on graphics processing units. Traditional GPU programming guidelines teaches us how to efficiently program the GPU for data parallel pipelines with regular input and output. ...Read More
Abstract:
We explore software mechanisms for managing irregular tasks on graphics processing units. Traditional GPU programming guidelines teaches us how to efficiently program the GPU for data parallel pipelines with regular input and output. We present a strategy for solving task parallel pipelines which can handle irregular workloads on the GPU. We demonstrate that dynamic scheduling and efficient memory management are critical problems in achieving high efficiency on irregular workloads. We showcase our results on a real time Reyes rendering pipeline.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A06
Download:
Share:
 
Speakers:
Yao Zhang
- University of California, Davis
Abstract:
Tridiagonal linear systems are of importance to many problems in numerical analysis and computational fluid dynamics, as well as to computer graphics applications in video games and computer-animated films. ...Read More
Abstract:
Tridiagonal linear systems are of importance to many problems in numerical analysis and computational fluid dynamics, as well as to computer graphics applications in video games and computer-animated films. This poster presents our study on the performance of multiple tridiagonal algorithms on a GPU. We design a novel hybrid algorithm that combines a work-efficient algorithm with a step-efficient algorithm in a way well-suited for a GPU architecture. Our hybrid solver achieves 8x and 2x speedup respectively in single precision and double precision over a multi-threaded highly-optimized CPU solver and a 2x speedup over a basic GPU solver.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A07
Download:
Share:
 
Speakers:
Hans Henrik B. Soerensen
- Technical University of Denmark
Abstract:
A GPU competence center and laboratory for research and collaboration within academia and partners in industry has been established in 2008 at section for Scientific Computing, DTU informatics, Technical University of Denmark. ...Read More
Abstract:
A GPU competence center and laboratory for research and collaboration within academia and partners in industry has been established in 2008 at section for Scientific Computing, DTU informatics, Technical University of Denmark. In GPULab we focus on the utilization of GPUs for high-performance computing applications and software tools in science and engineering, inverse problems, visualization, imaging, dynamic optimization. This poster illustrates the latest and most interesting projects that have been developed at our center.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A08
Download:
Share:
 
Speakers:
David Whittaker
- University of Alabama at Birmingham
Abstract:
This poster describes a new technique for performing binary prefix sums using Fermi''s new __ballot() and __popc() functions. ...Read More
Abstract:
This poster describes a new technique for performing binary prefix sums using Fermi''s new __ballot() and __popc() functions. These instructions greatly increase intra-warp communication, allowing for an 80% speedup over standard GPU methods in applications like Radix Sort. It also points to future research that will enable suffix array construction, Burrows-Wheeler Transform, and the BZIP algorithm to take advantage of these instructions for efficient GPU compression.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A09
Download:
Share:
 
Speakers:
James Malcolm
- Accelereyes
Abstract:
This poster presents an approach to derive parallelism in algorithms that involve building sparse matrix that represents relationships between inter-dependent data fields and enhancing its performance on the GPU. ...Read More
Abstract:
This poster presents an approach to derive parallelism in algorithms that involve building sparse matrix that represents relationships between inter-dependent data fields and enhancing its performance on the GPU. This work compares the algorithm performance on the GPU to its CPU variant that employs the traditional sparse matrix-vector multiplication (SpMV) approach. We have also compared our algorithm performance with CUSP SpMV on GPU. The softwares used in this work are MATLAB and Jacket - GPU engine for MATLAB  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A10
Download:
Share:
 
Speakers:
Wen Zheng
- Stanford University
Abstract:
The particle level set is widely used as an accurate interface tracking tool in simulation, computer vision and other related fields. ...Read More
Abstract:
The particle level set is widely used as an accurate interface tracking tool in simulation, computer vision and other related fields. However, high computation cost prevents applying this method to real-time and interactive scenarios. This work intensively used parallel design patterns that are implemented in the thrust library, like compaction, reduction and scattering, to parallelize the particle level set method in order to attain real-time performance.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A11
Download:
Share:
 
Speakers:
Sungpack Hong
- Stanford University
Abstract:
Graphs are powerful data representations favored in many computational domains. GPUs have showed promising results in this domain, but their performance when the graph is highly irregular. ...Read More
Abstract:
Graphs are powerful data representations favored in many computational domains. GPUs have showed promising results in this domain, but their performance when the graph is highly irregular. In this study, we propose three general schemes to accelerate graph algorithms on a modern GPU architecture: (i) deferred processing of outliers, (ii) efficient dynamic workload balancing and (iii) warp-based execution exploiting threads in a SIMD-like manner. Our evaluation reveals that our schemes exhibit up to 9x speedup over previous GPU algorithms and 23x over single CPU execution on irregular graphs.They also yield up to 30% improvement,even for regular graphs  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A12
Download:
Share:
 
Speakers:
Daniel Faircloth
- Georgia Tech Research Institute
Abstract:
The Method of Moments is a popular computational method for solving integral equations in electromagnetics. ...Read More
Abstract:
The Method of Moments is a popular computational method for solving integral equations in electromagnetics. However, it suffers from high computational and memory costs since it requires the solution of a dense linear system. The Adaptive Cross Approximation (ACA) is an effective technique for compressing the system matrix thereby reducing the necessary storage as well as the number of operations required to solve the system. Acceleration of the ACA MoM with NVIDIA GPUs can finally enable the solution of "real world" scattering problems on a personal workstation in a practical timeframe.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A13
Download:
Share:
 
Speakers:
Zhaosong Ma
- Institute of Mechanics, Chinese Academy of Sciences
Abstract:
The Continuum-based Distinct Element Method (CDEM) is the combination of Finite Element Method (FEM) and Discrete Element Method (DEM), which is mainly used in general structural analyses, as well as landslide stability evaluations, ...Read More
Abstract:
The Continuum-based Distinct Element Method (CDEM) is the combination of Finite Element Method (FEM) and Discrete Element Method (DEM), which is mainly used in general structural analyses, as well as landslide stability evaluations, coal and gas outburst analyses. By means of CUDA and a GTX-285 VGA card, the GPU version achieves hundreds times speedup ratio.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A14
Download:
Share:
 
Speakers:
Adarsh Krishnamurthy
- University of California, Berkeley
Abstract:
We present GPU algorithms and strategies for accelerating distance queries and clearance computations on models made of trimmed NURBS surfaces. ...Read More
Abstract:
We present GPU algorithms and strategies for accelerating distance queries and clearance computations on models made of trimmed NURBS surfaces. We provide a generalized framework for using GPUs as co-processors in accelerating CAD operations. The accuracy of our algorithm is based on the model space precision, unlike earlier graphics algorithms that were based only on image space precision. Our algorithms are at least an order of magnitude faster and about two orders of magnitude more accurate than the commercial solid modeling kernel ACIS.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A15
Download:
Share:
 
Speakers:
Debapriya Chatterjee
- University of Michigan
Abstract:
This poster describes my research work on how to leverage the GP-GPU execution parallelism to achieve high performance in the time consuming problem of gate-level simulation of digital hardware designs.
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A16
Download:
Share:
 
Speakers:
Vincent Natoli
- Stone Ridge Technology
Abstract:
Impressive speedups up to 100x using GPUs compared to CPUs are achieved by taking advantage data parallelism, increased bandwidth and the ability to hide latency. ...Read More
Abstract:
Impressive speedups up to 100x using GPUs compared to CPUs are achieved by taking advantage data parallelism, increased bandwidth and the ability to hide latency. We have implemented a Monte Carlo valuation of a barrier option modeled by a standard diffusion process with a jump diffusion term obeying an underlying Poisson process to account for rare events. In addition, a Brownian Bridge is incorporated to account for barrier crossings in between diffusion trajectories and to reduce bias. This option is representative of exotic options which lack a closed-form solution and are amenable to Monte Carlo type methods for valuation.  Back
 
Topics:
Developer - Algorithms
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10A17
Download:
Share:
Developer - Programming Languages
Presentation
Media
Speakers:
Geoff Gerfin
- NVIDIA
 
Topics:
Developer - Programming Languages
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1019
Download:
Share:
 
Speakers:
Ben Bergen
- Los Alamos National Laboratory
 
Topics:
Developer - Programming Languages
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1001
Download:
Share:
 
Speakers:
, ,
Abstract:
Explore a new automatic Fortran translator which has been developed and used to port the numerical subroutines of FEFLO , a general-purpose legacy Computational Fluid Dynamics code operating on unstructured grids, to run on the GPU. ...Read More
Abstract:
Explore a new automatic Fortran translator which has been developed and used to port the numerical subroutines of FEFLO , a general-purpose legacy Computational Fluid Dynamics code operating on unstructured grids, to run on the GPU. Data transfer to the CPU is minimized throughout the course of a CFD run. Benchmarks of large-scale production runs will be presented.  Back
 
Topics:
Developer - Programming Languages, Developer - Algorithms, Computational Fluid Dynamics, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2005
Streaming:
Download:
Share:
 
Speakers:
Paulius Micikevicius
- NVIDIA
Abstract:
This presentation covers the major CUDA optimizations. Topics will include: maximizing memory throughput, kernel launch configuration, using shared memory, and improving GPU/CPU interaction. ...Read More
Abstract:
This presentation covers the major CUDA optimizations. Topics will include: maximizing memory throughput, kernel launch configuration, using shared memory, and improving GPU/CPU interaction. While C for CUDA is used for illustration, the concepts covered will apply equally to programs written with OpenCL and DirectCompute APIs.  Back
 
Topics:
Developer - Programming Languages, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2011
Streaming:
Download:
Share:
 
Speakers:
Jonathan Cohen
Abstract:
I will describe tricks for building APIs using C++ metaprogramming that generate custom kernels for complex manipulation of device-side arrays in CUDA. Using a variation of Expression Templates, multiple operations can be fused into a single ker ...Read More
Abstract:

I will describe tricks for building APIs using C++ metaprogramming that generate custom kernels for complex manipulation of device-side arrays in CUDA. Using a variation of Expression Templates, multiple operations can be fused into a single kernel that executes with reasonable efficiency.

  Back
 
Topics:
Developer - Programming Languages, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S1020230
Streaming:
Download:
Share:
 
Speakers:
Ulises Cervantes-Pimentel
- Wolfram Research
Abstract:
Mathematica is widely used in scientific, engineering, mathematical fields and education. In this session, new tools for general GPU programming in the next release of Mathematica are presented. ...Read More
Abstract:
Mathematica is widely used in scientific, engineering, mathematical fields and education. In this session, new tools for general GPU programming in the next release of Mathematica are presented. These tools build on top of Mathematica's technology which provides a simple, yet powerful, interface to the large base of compiling tools. Applications of CUDA and OpenCL from within Mathematica will be presented. These examples will provide a general overview of the powerful development environment for GPU programming that Mathematica can offer not just for researchers but for anybody with basic knowledge of Mathematica and GPU programming.  Back
 
Topics:
Developer - Programming Languages, Developer - Algorithms, Developer - Tools & Libraries, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2028
Download:
Share:
 
Speakers:
Huiyang Zhou, Yi Yang
- North Carolina State University
Abstract:
Attend this session to learn and share code optimizations to achieve high performance GPU computing. ...Read More
Abstract:
Attend this session to learn and share code optimizations to achieve high performance GPU computing. We will cover code transformations for memory coalesing, workload management at both thread and thread-block levels, and different ways to handle memory partition conflicts. We will also discuss Integration of code optimizations into a compiler.  Back
 
Topics:
Developer - Programming Languages
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102067
Streaming:
Download:
Share:
 
Speakers:
Christopher Rossbach, Emmett Witchel
- Microsoft Research, University of Texas at Austin
Abstract:
GPGPU frameworks such as CUDA improve programmability, but GPU parallelism remains inaccessible in many application domains. ...Read More
Abstract:
GPGPU frameworks such as CUDA improve programmability, but GPU parallelism remains inaccessible in many application domains. This session argues that poor OS support causes this problem. OSes do not provide the kind of high-level abstractions for GPUs that applications expect for other resources like CPUs and file systems. We advocate reorganizing kernel abstractions to support GPUs as first-class computing resources, with traditional guarantees such as fairness and isolation. We demonstrate shortcomings in Windows 7 GPU support, and show that better OS abstractions can accelerate interactive workloads like gesture recognition by a factor of 10X over a CUDA implementation.   Back
 
Topics:
Developer - Programming Languages, Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102124
Streaming:
Download:
Share:
 
Speakers:
Simon Barrett
- NVIDIA
Abstract:
Parallel Nsight is NVIDIA''s new development environment for graphics and GPU computing. In this advanced session, you will learn how Parallel Nsight can accelerate debugging and profiling of Direct3D 11 applications. ...Read More
Abstract:
Parallel Nsight is NVIDIA''s new development environment for graphics and GPU computing. In this advanced session, you will learn how Parallel Nsight can accelerate debugging and profiling of Direct3D 11 applications. Attendees will learn how to debug Direct3D frames and HLSL shaders using Parallel Nsight''s powerful Graphics Inspector and Debugger which allows developers to inspect Direct3D resources and state, set breakpoints in HLSL shaders, examine shader variables, and see which graphics primitives are live on the GPU. Attendees will also learn how to use the Frame Profiler to capture and mine performance information, and easily pinpoint bottlenecked GPU units.  Back
 
Topics:
Developer - Programming Languages, Computer Graphics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102212
Streaming:
Download:
Share:
 
Speakers:
Jack Pappas
- TidePowerd
Abstract:
Join TidePowerd for a demonstration of GPU.NET, our innovative new product which dramatically cuts the time needed to develop and maintain a GPU-based application by extending Microsoft''s . ...Read More
Abstract:
Join TidePowerd for a demonstration of GPU.NET, our innovative new product which dramatically cuts the time needed to develop and maintain a GPU-based application by extending Microsoft''s .NET Framework onto GPUs. With GPU.NET, your device-accelerated code can be written in any .NET-supported language (e.g., C#, F#, IronPython) and called like any other method - so it''s easy to create new GPU-based applications without having to retrain your developers. You''ll learn how to use GPU.NET to quickly develop a financial calculator in C#, use the built-in Visual Studio unit-testing tools to ensure the correctness of the code, and seamlessly deploy the application into a mixed Windows / Linux environment. We''ll also discuss how GPU.NET expands the frontiers of GPU computing into lucrative new markets such business intelligence, database processing, and data visualization.  Back
 
Topics:
Developer - Programming Languages
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102294
Download:
Share:
 
Speakers:
Jeff Stuart
- University of California, Davis
Abstract:
Our poster outlines GPU-to-CPU callbacks, a method for the GPU to request work from the CPU. We give some motivation, demonstrate the code architecture, and give samples of CPU and GPU code that show callbacks being executed.
 
Topics:
Developer - Programming Languages
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10P01
Download:
Share:
 
Speakers:
Jike Chong
- Parasians, LLC
Abstract:
Data layout, data placement, and synchronization processes are not usually part of a speech application expert''s daily concerns. ...Read More
Abstract:
Data layout, data placement, and synchronization processes are not usually part of a speech application expert''s daily concerns. Yet failure to carefully take these concerns into account in a highly parallel implementation on the graphics processing units (GPU) could mean an order of magnitude of loss in application performance. We present an application framework for parallel programming of automatic speech recognition (ASR) applications that allows a speech application expert to effectively implement speech applications on the GPU, and demonstrate how the ASR application framework has enabled a Matlab/Java programmer to achieve a 20x speedup in application performance on a GPU.  Back
 
Topics:
Developer - Programming Languages
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10R01
Download:
Share:
 
Speakers:
Rami Mukhtar
- NICTA
Abstract:
We are developing a domain specific language for computer vision algorithms that facilitates rapid implementation of algorithms that are scalable and portable across CPU-GPU architectures. ...Read More
Abstract:
We are developing a domain specific language for computer vision algorithms that facilitates rapid implementation of algorithms that are scalable and portable across CPU-GPU architectures. The presented approach significantly lowers the barrier of implementation of computer vision algorithms for heterogeneous CPU-GPU architectures, and enables a single implementation to automatically scale to use additional hardware as it becomes available.  Back
 
Topics:
Developer - Programming Languages
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10R02
Download:
Share:
 
Speakers:
Albert Sidelnik
- University of Illinois at Urbana-Champaign
Abstract:
GPGPU architectures offer large performance gains over their traditional CPU counterparts for many applications. ...Read More
Abstract:
GPGPU architectures offer large performance gains over their traditional CPU counterparts for many applications. However, current GPU programming models present numerous challenges to the programmer: lower-level languages, explicit data movement, loss of portability, and performance optimization challenges. In this paper, we present novel methods and compiler transformations that increase productivity by enabling users to easily program GPUs using the high productivity programming language Chapel.  Back
 
Topics:
Developer - Programming Languages
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10R03
Download:
Share:
Developer - Tools & Libraries
Presentation
Media
Speakers:
Nathan Bell
- NVIDIA
 
Topics:
Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1017
Download:
Share:
 
Speakers:
Michael Wolfe
- The Portland Group
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1021
Download:
Share:
 
Speakers:
Jack Dongarra
- University of Tennessee
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1002
Download:
Share:
 
Speakers:
Rafael Campana
- NVIDIA
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1018
Download:
Share:
 
Speakers:
Mark Govett
- National Oceanic and Atmospheric Administration
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
Supercomputing
Year:
2010
Session ID:
SC1024
Download:
Share:
 
Speakers:
David Lecomber
- Allinea Software
Abstract:
Discover how a debugger can help you fix those hard to find bugs in your GPU software, with this introduction to the special CUDA features in Allinea DDT.
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2039
Streaming:
Download:
Share:
 
Speakers:
Andreas Kloeckner
Abstract:
Explore PyCUDA, a robust, open-source toolkit that lets you control your GPU from the comfort of Python, a Matlab-like scripting language. Learn about Fermi tuning with PyCUDA, the new interfaces for CUBLAS and CUFFT, the ecosystem of third-part ...Read More
Abstract:

Explore PyCUDA, a robust, open-source toolkit that lets you control your GPU from the comfort of Python, a Matlab-like scripting language. Learn about Fermi tuning with PyCUDA, the new interfaces for CUBLAS and CUFFT, the ecosystem of third-party libraries built on PyCUDA, and examples illustrating PyCUDA''s benefits to large-scale applications.

  Back
 
Topics:
Developer - Tools & Libraries, Computational Fluid Dynamics, Physics Simulation
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10041
Streaming:
Download:
Share:
 
Speakers:
Bryan Catanzaro
- University of California, Berkeley
Abstract:
Learn how to write Python programs that execute highly efficiently on GPUs using Copperhead, a data-parallel Python runtime. ...Read More
Abstract:
Learn how to write Python programs that execute highly efficiently on GPUs using Copperhead, a data-parallel Python runtime. Using standard Python constructs like map and reduce, we will see how to construct data-parallel computations and embed them in Python programs that interoperate with numerical and visualization libraries such as NumPy, SciPy and Matplotlib. We will examine how to express computations using Copperhead, explore the performance of Copperhead programs running on GPUs, and discuss Copperhead''s runtime model, which enables data-parallel execution from within Python.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2050
Streaming:
Download:
Share:
 
Speakers:
Bob Archer
- Adobe Systems Inc
Abstract:
Examine the challenges and advantages of building the Pixel Bender domain specific language for image processing for the GPU. ...Read More
Abstract:
Examine the challenges and advantages of building the Pixel Bender domain specific language for image processing for the GPU. We will examine how Pixel Bender was made to work within several Adobe applications across a wide range of hardware systems and platforms.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2053
Streaming:
Download:
Share:
 
Speakers:
Maxim Naumov
- NVIDIA
Abstract:
The CUSPARSE library can impact and enable software solutions for computational science and engineering problems in the fields of energy exploration, physical simulations and life sciences among many others. ...Read More
Abstract:
The CUSPARSE library can impact and enable software solutions for computational science and engineering problems in the fields of energy exploration, physical simulations and life sciences among many others. It provides sparse linear algebra primitives that can be used to implement iterative linear system and eigenvalue solvers and can also serve as a building block for the state-of-the-art sparse direct solvers. CUSPARSE library is implemented using CUDA parallel programming model and provides sparse analogs to BLAS level-1,2,3 operations, such as matrix-vector multiplication, triangular solve and format conversion routines.   Back
 
Topics:
Developer - Tools & Libraries, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102070
Streaming:
Download:
Share:
 
Speakers:
Huynh Phung
- A*STAR Institute of High Performance Computing
Abstract:
We describe our journey of migrating a legacy direct solver library for Poisson equations written in Fortran77 to CUDA in order to harness the computational power provided by the Tesla device ("Fermi"). ...Read More
Abstract:
We describe our journey of migrating a legacy direct solver library for Poisson equations written in Fortran77 to CUDA in order to harness the computational power provided by the Tesla device ("Fermi"). This legacy library is still widely used today as it is the most complete library that can deal with three different boundary conditions (Dirchlet, Neumann and Cyclic) and two grid configurations (staggered and centered) independently in any of the three dimensions (x, y, z); giving a total of over 200 configurations.  Back
 
Topics:
Developer - Tools & Libraries, Computational Fluid Dynamics
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102109
Streaming:
Download:
Share:
 
Speakers:
Domokos Vermes
Abstract:
Data analysis is the art and the science of getting the correct quantitative models and their numerical parameters from the observed data. In this talk, we report on a project to integrate CUDA into the open source data analysis environment R. T ...Read More
Abstract:

Data analysis is the art and the science of getting the correct quantitative models and their numerical parameters from the observed data. In this talk, we report on a project to integrate CUDA into the open source data analysis environment R. The combined use of the CPU and GPU resources can efficiently exploit the significant amount of data parallelism inherent in most data analysis problems and methods. This makes interactive analysis possible even for large, compute-intensive problems. The implementation and the achievable performance gains will be demonstrated on a concrete example from quantitative finance.

  Back
 
Topics:
Developer - Tools & Libraries, Databases, Data Mining, Business Intelligence, Finance, Life & Material Science
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102111
Streaming:
Download:
Share:
 
Speakers:
Kumar Iyer
- NVIDIA
Abstract:
NVIDIA Parallel Nsight provides access to the power of the GPU from within the familiar environment of Microsoft Visual Studio. ...Read More
Abstract:
NVIDIA Parallel Nsight provides access to the power of the GPU from within the familiar environment of Microsoft Visual Studio. This session is an entry level overview of the GPU computing and graphics development features of Parallel Nsight as well as a glimpse into the future of this powerful tool.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102149
Streaming:
Download:
Share:
 
Speakers:
Sebastien Domine
- NVIDIA
Abstract:
Data parallel algorithms that provide real-time financial options pricing or identification of hidden oil reserves are utilizing the massively parallel nature of the GPU for industry changing performance gains. ...Read More
Abstract:
Data parallel algorithms that provide real-time financial options pricing or identification of hidden oil reserves are utilizing the massively parallel nature of the GPU for industry changing performance gains. Developers require industry standard development tools to create the software that accomplishes these parallel tasks. NVIDIA Parallel Nsight delivers the power of the GPU within the familiar environment of Microsoft Visual Studio. In this session, you will learn advanced techniques for debugging CUDA C/C++ and DirectCompute code using Parallel Nsight, including conditional and data breakpoints as well as out of bound GPU memory access detection.   Back
 
Topics:
Developer - Tools & Libraries, Developer - Programming Languages
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102150
Streaming:
Download:
Share:
 
Speakers:
Sebastien Domine
- NVIDIA
Abstract:
Life altering products that provide early detection of breast cancer or simulate molecular behavior, accelerating drug discovery, are becoming reality thanks to the power of the GPU. ...Read More
Abstract:
Life altering products that provide early detection of breast cancer or simulate molecular behavior, accelerating drug discovery, are becoming reality thanks to the power of the GPU. As these technologies become mainstream, mainstream tools are required to support these development efforts. NVIDIA Parallel Nsight delivers the power of the GPU within the familiar environment of Microsoft Visual Studio. In this session, you will learn advanced techniques for visualizing your application''s workloads and performance characteristics across the CPU, GPU, and operating system, and explore the depths of Parallel Nsight profilers, including GPU performance counters and how to use them.   Back
 
Topics:
Developer - Tools & Libraries, Developer - Programming Languages
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102151
Streaming:
Download:
Share:
 
Speakers:
Isaac Gelado
- Universitat Politecnica de Catalunya
Abstract:
Learn how to use GMAC, a novel run-time for CUDA GPUs. GMAC unifies the host and device memories into a unified virtual address space, enabling the host code to directly access the device memory, ...Read More
Abstract:
Learn how to use GMAC, a novel run-time for CUDA GPUs. GMAC unifies the host and device memories into a unified virtual address space, enabling the host code to directly access the device memory, and removing the need for data transfers between host and device memories. Moreover, GMAC also allows pointers to be used by both, the host and device code indistinctly. This session will present the GMAC run-time and show how to use it in current applications. This session will cover from the basics of GMAC to multi-threaded applications using POSIX threads, OpenMP and MPI.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102156
Streaming:
Download:
Share:
 
Speakers:
Cedric Augonnet
- INRIA
Abstract:
See how StarPU provides task scheduling facilities for a hybrid platform and a powerful data management library that transparently takes care of data across the entire machine. ...Read More
Abstract:
See how StarPU provides task scheduling facilities for a hybrid platform and a powerful data management library that transparently takes care of data across the entire machine. We will discuss the significant performance improvements resulting from its flexible scheduler as well as its ability to mix parallel CPU kernels (eg. written in OpenMP or TBB) with CUDA/OpenCL and MPI.  Back
 
Topics:
Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102160
Download:
Share:
 
Speakers:
David Cox, Nicolas Pinto
- MIT, Harvard University
Abstract:
Learn how to let the computer optimize your CUDA and OpenCL code for you with easy GPU Meta-programming and Scripting (e. ...Read More
Abstract:
Learn how to let the computer optimize your CUDA and OpenCL code for you with easy GPU Meta-programming and Scripting (e.g. PyCUDA). We will present a case study in which we consider the step-wise optimization of a 3D filter bank convolution, using a suite of open-source tools.   Back
 
Topics:
Developer - Tools & Libraries, Computer Vision and Machine Vision, HPC and AI, Neuroscience
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102176
Streaming:
Download:
Share:
 
Speakers:
Hassan Chafi, HyoukJoong Lee
- Stanford University
Abstract:
Explore a new approach in parallel programming which leverages Domain Specific Languages (DSLs) to simplify programming heterogeneous systems (multi-core processors and GPUs). ...Read More
Abstract:
Explore a new approach in parallel programming which leverages Domain Specific Languages (DSLs) to simplify programming heterogeneous systems (multi-core processors and GPUs). This approach allows DSL users to take advantage of the power of GPUs without having working knowledge of lower level programming models such as CUDA. Topics will cover the advantages of the DSL approach in parallel programming, and the runtime implementation details with optimizations to have the performance benefits of using GPUs.  Back
 
Topics:
Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102177
Download:
Share:
 
Speakers:
Christopher Brown
- Decision Patterns
Abstract:
Come learn about the GPU R package. R is the widely popular open source statistical programming language. ...Read More
Abstract:
Come learn about the GPU R package. R is the widely popular open source statistical programming language. The GPU package extends R by providing GPU-based types, classes and methods implementing GPU versions of R vectors, matrices, lists and data frames. Subsequent operations with these are executed on the GPU. Users are not required to create special bindings or implement special syntax, nor do they need copy objects between CPU and GPU. The GPU packages allows programmers access to the computational power of GPUs with little modification to existing code.   Back
 
Topics:
Developer - Tools & Libraries, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102179
Streaming:
Download:
Share:
 
Speakers:
Benoit Meister
- Reservoir Labs
Abstract:
Discover our automatic C-to-CUDA mapper prototype, and how it optimizes execution and data movement for a broad class of loop codes. ...Read More
Abstract:
Discover our automatic C-to-CUDA mapper prototype, and how it optimizes execution and data movement for a broad class of loop codes. Coupled with our powerful mapper, C as an input language does not only offer portability but also performance and performance portability. Learn about our optimizations and some of the performance obtained through different uses of the mapper.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102202
Streaming:
Download:
Share:
 
Speakers:
Andrew Kerr, Gregory Diamos, Sudhakar Yalamanchili
- Georgia Institute of Technology
Abstract:
Learn how to debug and profile CUDA applications using GPU-Ocelot. Ocelot is a compilation and emulation framework for CUDA that includes debugging and profiling tools as well as backend compilers for NVIDIA GPUs and x86 CPUs. ...Read More
Abstract:
Learn how to debug and profile CUDA applications using GPU-Ocelot. Ocelot is a compilation and emulation framework for CUDA that includes debugging and profiling tools as well as backend compilers for NVIDIA GPUs and x86 CPUs. We will present examples of applications developed on x86 CPUs and deployed on NVIDIA GPUs. We will also discuss memory checking, race detection, and deadlock detection tools available within Ocelot.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102210
Streaming:
Download:
Share:
 
Speakers:
Danl Pierce
- Access Analytics Int'l, LLC
Abstract:
Hear product architects and developers describe the algorithmic depths and high level breath of the use of GPUs that have been employed to create BCSLIB-GPU, the GPU enablement of the industry standard sparse matrix software suite, BCSLIB-EXT. ...Read More
Abstract:
Hear product architects and developers describe the algorithmic depths and high level breath of the use of GPUs that have been employed to create BCSLIB-GPU, the GPU enablement of the industry standard sparse matrix software suite, BCSLIB-EXT. We provide a range of comparison data with Tesla and Fermi compared with multi-core CPU only systems and for a wide range of realisitic demanding real world test problems.  Back
 
Topics:
Developer - Tools & Libraries, Developer - Algorithms, Embedded & Automotive, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102213
Download:
Share:
 
Speakers:
Elif Albuz, Frank Jargstorff, Nathan Whitehead, Philippe Vandermersch, Ujval Kapasi
- NVIDIA
Abstract:
Learn about NVIDIA's CUDA libraries and meet the engineers that develop them. Lead developers will cover the capabilities, performance and future directions for NVIDIA's CUFFT, CUBLAS, CURAND, ...Read More
Abstract:
Learn about NVIDIA's CUDA libraries and meet the engineers that develop them. Lead developers will cover the capabilities, performance and future directions for NVIDIA's CUFFT, CUBLAS, CURAND, and NPP libraries (other libraries such as CUSPARSE and open source Thrust are covered in other talks). After the presentation, NVIDIA developers will remain in the room to chat and answer questions during the lunch break.   Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102216
Download:
Share:
 
Speakers:
Nathan Bell
- NVIDIA Research
Abstract:
Thrust is a parallel template library for developing CUDA applications. Modeled after the C++ Standard Template Library (STL), Thrust brings a familiar abstraction layer to the realm of GPU computing. ...Read More
Abstract:
Thrust is a parallel template library for developing CUDA applications. Modeled after the C++ Standard Template Library (STL), Thrust brings a familiar abstraction layer to the realm of GPU computing. Thrust provides host and device variants of the STL vector container to simplify memory management and facilitate data transfers. These containers are complemented with a large collection of generic data-parallel algorithms and a suite of useful iterator adaptors. Together, these features form a flexible high-level interface for GPU programming that greatly enhances developer productivity. In this session we''ll discuss Thrust''s features and explain the basic design philosophy of the library.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102219
Streaming:
Download:
Share:
 
Speakers:
Jared Hoberock
- NVIDIA
Abstract:
Thrust is a parallel template library for developing CUDA applications which is modeled after the C++ Standard Template Library (STL). ...Read More
Abstract:
Thrust is a parallel template library for developing CUDA applications which is modeled after the C++ Standard Template Library (STL). In this session we''ll show how to implement decompose problems into the algorithms provided by Thrust. We''ll also discuss the performance implications of "kernel fusion" and "array of structs" vs. "structure of arrays" memory layouts and how they relate to Thrust. Lastly, we''ll present evidence that Thrust implementations are fast, while remaining concise and readable.   Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102220
Download:
Share:
 
Speakers:
Andrew Iles, Peter Buckingham
- NVIDIA
Abstract:
Learn about the suite of tools NVIDIA provides to manage large installations of GPUs from the NVIDIA Tesla Series. ...Read More
Abstract:
Learn about the suite of tools NVIDIA provides to manage large installations of GPUs from the NVIDIA Tesla Series. The presentation will cover cluster management - tool and library -, as well as the GPUDirect technology that enables GPUs to communicate faster across the network.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102225
Download:
Share:
 
Speakers:
Satish Salian
- NVIDIA
Abstract:
Boost your development speed by mastering the CUDA debugging tools NVIDIA provides. In this session you will learn the basics of cuda-gdb and cuda-memcheck, as well as their more advanced features with live demonstrations on Linux and MacOS. ...Read More
Abstract:
Boost your development speed by mastering the CUDA debugging tools NVIDIA provides. In this session you will learn the basics of cuda-gdb and cuda-memcheck, as well as their more advanced features with live demonstrations on Linux and MacOS.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2002
Streaming:
Download:
Share:
 
Speakers:
Peng Wang
- NVIDIA
Abstract:
Learn how to optimize your OpenCL application to achieve maximum performance on NVIDIA GPUs. We will first briefly discuss how the OpenCL programming model maps onto NVIDIA GPU's architecture. ...Read More
Abstract:
Learn how to optimize your OpenCL application to achieve maximum performance on NVIDIA GPUs. We will first briefly discuss how the OpenCL programming model maps onto NVIDIA GPU's architecture. We will then talk about memory, instruction, and NDRange optimization techniques, illustrating each with small code samples.  Back
 
Topics:
Developer - Tools & Libraries, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S09068
Download:
Share:
 
Speakers:
Paulius Micikevicius
- NVIDIA
Abstract:
The goal of this session is to demystify performance optimization by transforming it into an analysis-driven process. ...Read More
Abstract:
The goal of this session is to demystify performance optimization by transforming it into an analysis-driven process. There are three fundamental limiters to kernel performance: instruction throughput, memory throughput, and latency. In this session we will describe: how to use profiling tools and source code instrumentation to assess the significance of each limiter; what optimizations to apply for each limiter; how to determine when hardware limits are reached. Concepts will be illustrated with some examples and are equally applicable to both CUDA and OpenCL development. It is assumed that attendees are already familiar with the fundamental optimization techniques.   Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2012
Download:
Share:
 
Speakers:
Loren Dean
- MathWorks
Abstract:
MATLAB is a widely used tool for scientific, engineering and financial applications. As the popularity of GPUs has grown, there is strong interest from engineers and scientists who solve computationally intensive problems to be able to leverage GPU ...Read More
Abstract:
MATLAB is a widely used tool for scientific, engineering and financial applications. As the popularity of GPUs has grown, there is strong interest from engineers and scientists who solve computationally intensive problems to be able to leverage GPUs within MATLAB and other products from MathWorks. This talk will discuss how MathWorks tools can help engineers and scientist to take advantage of GPU resources while continuing to work in the familiar MATLAB environment. A range of capabilities will be discussed and demonstrated.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102267
Streaming:
Download:
Share:
 
Speakers:
Peter Decrem
- Quantifi
Abstract:
Quantifi will demo its xLDevelopment environment, which provide developers with an easy to use development environment which allows cuda functionality to be in Microsoft Excel. ...Read More
Abstract:
Quantifi will demo its xLDevelopment environment, which provide developers with an easy to use development environment which allows cuda functionality to be in Microsoft Excel. With as little as four lines, one will also select the position of the function in the menu bar, xml markup language will display in the excel help functionality, and objects can be easily added to the object cache. These objects can then be inspected by the end user or developer. Performance information can also be displayed in the object cache. The environment provides the developer an environment where he can focus on developing high performance functionality, and all intermediate layers of interface are taking care of by the environment.  Back
 
Topics:
Developer - Tools & Libraries, Finance
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102297
Streaming:
Download:
Share:
 
Speakers:
Chris Gottbrath
- TotalView Technologies, Inc., a Rogue Wave Software company
Abstract:
As GPU hardware becomes more prevalent in both research and commercial institutions, software that takes advantage of this specialized hardware is growing in demand. ...Read More
Abstract:
As GPU hardware becomes more prevalent in both research and commercial institutions, software that takes advantage of this specialized hardware is growing in demand. In many cases, it is infeasible or impossible to rewrite an existing program to run entirely on the GPU, so the goal is often to offload as much work as possible. As the IMSL Library team at Rogue Wave Software considers how best to tackle the GPU realm with a general mathematical library, the IMSL Fortran Library takes an initial step where the CUDA BLAS library is utilized to offload CPU work to GPU hardware. This presentation will discuss the approach and architecture of the solution. Benchmark results will show where success has been found. Plans for future products will also be covered.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102299
Streaming:
Download:
Share:
 
Speakers:
Didem Unat
- University of California, San Diego
Abstract:
We aim to facilitate GPU programming for finite difference applications. We have developed Mint, a source to source compiler to generate CUDA code from OpenMP code. ...Read More
Abstract:
We aim to facilitate GPU programming for finite difference applications. We have developed Mint, a source to source compiler to generate CUDA code from OpenMP code. Mint transforms omp parallel for loops into CUDA kernels and applies domain specific optimizations such as shared memory, register and kernel fuse optimizations. Since our translator targets structured grid problems, it optimizes the code better than the general purpose compilers. In this poster, we present translation and optimization steps along with our initial performance results.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10U01
Download:
Share:
 
Speakers:
Ian Johnson
- Florida State University
Abstract:
The goal of this project is to produce interactive scientific visualizations that can be used in educational games. ...Read More
Abstract:
The goal of this project is to produce interactive scientific visualizations that can be used in educational games. We use the computational power of OpenCL to enable features in the Blender Game Engine that would otherwise not be possible in real-time. By adding an interactive particle system to the game engine, we set the stage to demonstrate many interesting scientific phenomena (molecular dynamics, fluid dynamics, statistics) with the added benefit of real-time special effects for games in general.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10U02
Download:
Share:
 
Speakers:
Yongpeng Zhang
- North Carolina State University
Abstract:
In this poster, we propose GStream, a general-purpose, scalable data streaming framework on GPUs. The contributions of GStream are as follows: (1) We provide powerful, yet concise language abstractions suitable to describe conventional algorithms ...Read More
Abstract:
In this poster, we propose GStream, a general-purpose, scalable data streaming framework on GPUs. The contributions of GStream are as follows: (1) We provide powerful, yet concise language abstractions suitable to describe conventional algorithms as streaming problems. (2) We project these abstraction onto GPUs to fully exploit their inherent massive data- parallelism. (3) We demonstrate the viability of streaming on accelerators. Experiments show that the proposed framework provides flexibility, programmability and performance gains for various benchmarks from a variety of domains, including but not limited to data streaming, data parallel problems, numerical codes and text search.  Back
 
Topics:
Developer - Tools & Libraries
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10U03
Download:
Share:
 
Speakers:
Akira Nukada
- Tokyo Institute of Technology
Abstract:
We have released our FFT library for CUDA GPUs. Most of algorithms and auto-tuning technologies of FFT for CUDA are already published.
 
Topics:
Developer - Tools & Libraries
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10U04
Download:
Share:
Digital Content Creation & Film
Presentation
Media
Speakers:
Abstract:
Hear about the ongoing revolution in the production of photorealistic imagery being powered by GPUs. We will explore the algorithms and concepts behind iray - a CUDA accelerated software library from mental images/NVIDIA that provides an interac ...Read More
Abstract:

Hear about the ongoing revolution in the production of photorealistic imagery being powered by GPUs. We will explore the algorithms and concepts behind iray - a CUDA accelerated software library from mental images/NVIDIA that provides an interactive, push-button, fast synthetic digital camera in software to a variety of OEM applications and platforms. We will demonstrate iray embedded in commercial CAD and Digital Content Creation applications as well as in 3D cloud computing platforms.

  Back
 
Topics:
Digital Content Creation & Film, Cloud Visualization, Rendering and Ray Tracing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S10013
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to cr ...Read More
Abstract:

Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to create compelling 3D video content is challenging. This session is intended to provide a pragmatic guide to creating prosumer 3D video content and how the GPU greatly assists and speeds up this process. The intended audience is anyone interested in how to create compelling 3D movies at a prosumer level.

  Back
 
Topics:
Digital Content Creation & Film, Stereoscopic 3D
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102222
Streaming:
Download:
Share:
 
Speakers:
Abstract:
Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to cr ...Read More
Abstract:

Video editing is currently at two simultaneous inflections points: use of GPUs for video processing and the beginning of wide spread adoption of 3D. At this time however, identifying and navigating through the necessary tools and equipment to create compelling 3D video content is challenging. This session is intended to provide a pragmatic guide to creating prosumer 3D video content and how the GPU greatly assists and speeds up this process. The intended audience is anyone interested in how to create compelling 3D movies at a prosumer level.

  Back
 
Topics:
Digital Content Creation & Film
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102279
Download:
Share:
Embedded & Automotive
Presentation
Media
Speakers:
Kshitij Gupta
Abstract:
In this presentation, we will talk about our experiences of implementing an end-to-end automatic speech recognition system that runs in faster than real-time on embedded GPUs, targeted towards small form-factor consumer devices. Focusing specifi ...Read More
Abstract:

In this presentation, we will talk about our experiences of implementing an end-to-end automatic speech recognition system that runs in faster than real-time on embedded GPUs, targeted towards small form-factor consumer devices. Focusing specifically on some of the challenges encountered during the design process, a major portion of our talk will focus on giving insights into modifications we made to well-established speech algorithms to fit well within the GPU programming model. We will show how these changes helped us in realizing a highly optimized system on platforms with limited memory bandwidth and compute resources.

  Back
 
Topics:
Embedded & Automotive, Acoustics and Audio Processing, Mobile Applications, Signal and Audio Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102175
Streaming:
Download:
Share:
 
Speakers:
Theo Valich
Abstract:
Explore how advanced SoC technologies are transforming the world of automotive industry. Learn on how using nVidia Tegra increased the available range while pushing the envelope on next-gen driver experience. Sharing the lessons learned in the w ...Read More
Abstract:

Explore how advanced SoC technologies are transforming the world of automotive industry. Learn on how using nVidia Tegra increased the available range while pushing the envelope on next-gen driver experience. Sharing the lessons learned in the world of electric cars and challenges in constructing a mass production electric vehicle.

  Back
 
Topics:
Embedded & Automotive, Computer Graphics, Computer Vision and Machine Vision, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102303
Streaming:
Download:
Share:
 
Speakers:
Theo Valich
- Bright Side Network Inc
Abstract:
Learn how GPU technologies broke speed limits in automotive development. By using GPU-accelerated tools, small team of engineers created a complete certifiable vehicle in only two years, using fraction of the budget used in conventional industry. ...Read More
Abstract:
Learn how GPU technologies broke speed limits in automotive development. By using GPU-accelerated tools, small team of engineers created a complete certifiable vehicle in only two years, using fraction of the budget used in conventional industry. Talk will cover tools and techniques used in creation of XD concept, as well as how to overcome challenges moving a product from concept to mass production stage.   Back
 
Topics:
Embedded & Automotive, Computer Graphics, Computer Vision and Machine Vision
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102304
Streaming:
Download:
Share:
 
Speakers:
Vladimir Glavtchev
- BMW
Abstract:
We investigate the use of differentGPU-based implementations for performing real-time speed limit sign recognition on a resource-constrainedembedded system. ...Read More
Abstract:
We investigate the use of differentGPU-based implementations for performing real-time speed limit sign recognition on a resource-constrainedembedded system. The system recognized US and European Union speed-limits at over 88% accuracy while running in real-time. The system is hardware-accelerated using CUDA and OpenGL. It introduces a novel technique for detecting speed-limit signs which is only possible with the aid of GPU processing.  Back
 
Topics:
Embedded & Automotive
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10H01
Download:
Share:
 
Speakers:
Marius Vasiliu
- University of Paris Sud
Abstract:
NVIDIA GPU architecture becomes a very interesting hardware target for complex automotive application. ...Read More
Abstract:
NVIDIA GPU architecture becomes a very interesting hardware target for complex automotive application. We implemented the same automotive application on several different hardware targets and analyzed the maximum frame rate and the effective CPU charge. This paper shows how real-time applications like pedestrian detection and driving assistance take benefits from a massively parallel "central" architecture like GPU/CUDA. Real-time performance and zero-delay transfers can be achieved using a full asynchronous implementation. The same approach can really multiply the application performance by the number of GPU devices present on the embedded system, at a reasonable power consumption.  Back
 
Topics:
Embedded & Automotive
Type:
Poster
Event:
GTC Silicon Valley
Year:
2010
Session ID:
P10H02
Download:
Share:
Emerging Companies Summit
Presentation
Media
Speakers:
Jensen Huang
Abstract:
Jensen Huang was joined in a fireside chat by Quentin Hardy, National Editor at Forbes Magazine. They discussed the rise of GPUs, current trends in visual and parallel computing, and the transformational changes ahead for the industry. ...Read More
Abstract:

Jensen Huang was joined in a fireside chat by Quentin Hardy, National Editor at Forbes Magazine. They discussed the rise of GPUs, current trends in visual and parallel computing, and the transformational changes ahead for the industry.

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S104006
Download:
Share:
 
Speakers:
Jeff Herbst
Abstract:
The Emerging Companies Summit is a unique forum for startup companies to showcase innovative applications that leverage the GPU to solve visual and compute-intensive problems. The Opening Address includes an overview of NVIDIA''s GPU eco ...Read More
Abstract:

The Emerging Companies Summit is a unique forum for startup companies to showcase innovative applications that leverage the GPU to solve visual and compute-intensive problems. The Opening Address includes an overview of NVIDIA''s GPU ecosystem development activities and an interaction on stage with selected companies building groundbreaking applications on top of the GPU platform. The ECS is a great opportunity to discover new players in the GPU ecosystem, find great investments, explore partnership opportunities, network/ build relationships, and discuss the future of an industry that is reshaping computing.

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
ECS0900
Download:
Share:
 
Speakers:
Sam Blackman
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Elemental Technologies - covering the field of video processing. Find this session at 5 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001A
Download:
Share:
 
Speakers:
Rob Balgley
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001B
Download:
Share:
 
Speakers:
Chris Doran
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4001C
Download:
Share:
 
Speakers:
Chris Blewitt
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features miGenius - covering the field of cloud computing. Find this session at 5 minutes into the video.Panelists for this session include Dan'l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002A
Download:
Share:
 
Speakers:
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002B
Download:
Share:
 
Speakers:
Philip Lunn
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4002C
Download:
Share:
 
Speakers:
Jon Peddie
Abstract:
Moderated by Jon Peddie (President, Jon Peddie Research) The GPU (graphics processing unit) runs advanced applications which are transforming existing industries and creating new ones. Join our panel of leading industry experts as they discuss t ...Read More
Abstract:

Moderated by Jon Peddie (President, Jon Peddie Research) The GPU (graphics processing unit) runs advanced applications which are transforming existing industries and creating new ones. Join our panel of leading industry experts as they discuss the latest technology advances in the usage of GPU for Computer Vision, they will cover facial, gesture, human motion, and biometrics recognition, augmented reality, robotic computing and more. Panelists: Joe Stam (Sr. Applications Engineer, NVIDIA) Yoram Yaacovi (CTO & General Manager, Technologies at Microsoft Israel, R&D Center) Sam Cox (CEO, Milabra) Janko Mrsic-Flogel (CTO, Mirriad) Tom Dean (Research Scientist, Google)

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S104003
Download:
Share:
 
Speakers:
Michael Hummel
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features empulse GmbH - covering the field of databases & data mining. Find this session at 5 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004A
Download:
Share:
 
Speakers:
Natan Peterfreund
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Playcast Media Systems - covering the field of video processing. Find this session at 20 minutes into the video.Panelists will include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004B
Download:
Share:
 
Speakers:
Austin Shoemaker
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Cooliris - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4004C
Download:
Share:
 
Speakers:
Michel Tombroff
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Softkinetic - covering the field of computer vision. Find this session at 5 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005A
Download:
Share:
 
Speakers:
Uri Tal
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Rocketick - covering the field of high performance computing. Find this session at 20 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005B
Download:
Share:
 
Speakers:
Kristian Raue
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Jedox AG - covering the field of databases & data mining. Find this session at 35 minutes into the video.Panelists include Flip Gianos (Partner, Interwest), Charles Carmel (VP of Corporate Business Development, Cisco), Nathan Brookwood (Principal Analyst, Insight64) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4005C
Download:
Share:
 
Speakers:
Andrew Jamison
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Scalable Display Technologies - covering the field of imaging. Find this session at 5 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007A
Download:
Share:
 
Speakers:
Jeroen Snepvangers
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features RTT - covering the field of computer graphics. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007B
Download:
Share:
 
Speakers:
Michael Zeitlin
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders.This CEO on Stage session features Bunkspeed - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists for this session include Dan''l Lewin (Corporate VP, Microsoft), Drew Lanza (Partner, Morgenthaler), and Jon Peddie (President, JPR) & Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4007C
Download:
Share:
 
Speakers:
Jules Urbach
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features OTOY - covering the field of cloud computing. Find this session at 5 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008A
Download:
Share:
 
Speakers:
David Peters
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Universal Robotics - covering the field of machine learning & artificial intelligence. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008B
Download:
Share:
 
Speakers:
David Hayes
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features ICD - covering the field of mobile devices. Find this session at 35 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4008C
Download:
Share:
 
Speakers:
Yoram Burg
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features OptiTex - covering the field of physics simulation. Find this session at 5 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010A
Download:
Share:
 
Speakers:
Sylvain Ordureau
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features RTT - covering the field of computer graphics. Find this session at 20 minutes into the video.Panelists include Norman Winarsky (VP of Ventures, Licensing & Strategic Programs, SRI), Savitha Srinivasan (Corporate Venture Partner, IBM), Rob Enderle (Analyst, Enderle Group) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010B
Download:
Share:
 
Speakers:
Torsten Reil
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features NaturalMotion Limited - covering the field of computer graphics. Find this session at 35 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4010C
Download:
Share:
 
Speakers:
Jeff Han
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Perceptive Pixel - covering the field of imaging. Find this session at 5 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4011A
Download:
Share:
 
Speakers:
Lance Maurer
Abstract:
See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seco ...Read More
Abstract:

See the hottest new technologies from startups that could transform computing. In a lively and fast-paced exchange, the "Emerging Companies Summit - CEO on Stage" sessions feature CEOs from three startups who have 7 minutes and 30 seconds to introduce their companies and 7 minutes and 30 seconds to interact with a panel of industry analysts, investors and technology leaders. This CEO on Stage session features Cinnafilm - covering the field of film. Find this session at 20 minutes into the video.Panelists include Bill Tai (General Partner, Charles River Ventures), Paul Weiskopf (Sr. VP of Corporate Development, Adobe), Tim Bajarin (President, Creative Strategies) and Jeff Herbst (VP of Business Development, NVIDIA).

  Back
 
Topics:
Emerging Companies Summit
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
4011B
Download:
Share:
Energy Exploration
Presentation
Media
Speakers:
Abstract:
Mental Images' DiCE-based geospatial library is a CUDA and cluster-based visualization framework that enables scalable processing and rendering of huge amounts of subsurface data for interactive seismic interpretation. Geospatial exploration ...Read More
Abstract:

Mental Images' DiCE-based geospatial library is a CUDA and cluster-based visualization framework that enables scalable processing and rendering of huge amounts of subsurface data for interactive seismic interpretation. Geospatial exploration in the oil and gas industries is concerned with scanning the earth's subsurface structure for detecting oil and for cost-effective drilling of detected oil reservoirs. Efficient seismic interpretation requires the interpreters to be able to interactively explore huge amounts of volumetric seismic information with embedded stacked horizons to gain visual insight into the subsurface structure and to determine where oil recovery facilities and drilling infrastructure shall be built.

  Back
 
Topics:
Energy Exploration, Databases, Data Mining, Business Intelligence, Developer - Tools & Libraries, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2014
Streaming:
Download:
Share:
 
Speakers:
Scott Morton
- Hess Corporation
Abstract:
At Hess Corporation, we have moved the most computationally intensive parts of our seismic imaging codes from CPUs to GPUs over the past few years. ...Read More
Abstract:
At Hess Corporation, we have moved the most computationally intensive parts of our seismic imaging codes from CPUs to GPUs over the past few years. In this talk I will give an overview of seismic imaging, highlighting the physical and computational algorithms of these codes. I will discuss our software approach and the programming effort to port them to GPUs, concluding with a summary of our progress in adopting GPUs in production.  Back
 
Topics:
Energy Exploration, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2059
Streaming:
Download:
Share:
 
Speakers:
Javier Cabezas, Mauricio Araya
- Barcelona Supercomputing Center
Abstract:
Get a close look at implementing Reverse Time Migration (RTM) applications across multiple GPUs. We will focus on how RTM applications can be scaled using the GMAC asymmetric distributed shared memory (ADSM) library to break the problem ...Read More
Abstract:
Get a close look at implementing Reverse Time Migration (RTM) applications across multiple GPUs. We will focus on how RTM applications can be scaled using the GMAC asymmetric distributed shared memory (ADSM) library to break the problem into manageable chunks. We will provide an introduction to GMAC and discuss handling boundary conditions and using separate kernels to improve efficiency.  Back
 
Topics:
Energy Exploration, Developer - Algorithms, HPC and AI
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102226
Streaming:
Download:
Share:
Film
Presentation
Media
Abstract:
The arrival of fully programable GPUs is now changing the visual effects industry, which traditionally relied on CPU computation to create their spectacular imagery. Implementing the complex image processing algorithms used by VFX is a challenge ...Read More
Abstract:

The arrival of fully programable GPUs is now changing the visual effects industry, which traditionally relied on CPU computation to create their spectacular imagery. Implementing the complex image processing algorithms used by VFX is a challenge, but the payoffs in terms of interactivity and throughput can be enormous. Hear how The Foundry''s novel image processing architecture simplifies the implementation of GPU-enabled VFX software and eases the transition from a CPU based infrastructure to a GPU based one.

  Back
 
Topics:
Film, Developer - Tools & Libraries, Video and Image Processing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
S102125
Download:
Share:
Finance
Presentation
Media
Speakers:
Pierre Spatz
- Murex SAS
Abstract:
Murex will share its practical experience using GPUs to accelerate high-performance analytics based on GPU-enabled Monte Carlo and PDE methods. ...Read More
Abstract:
Murex will share its practical experience using GPUs to accelerate high-performance analytics based on GPU-enabled Monte Carlo and PDE methods. We will also briefly describe Murex's experience developing a high-level payoff scripting language that allows user-definable payoffs for single and cross-asset instruments.  Back
 
Topics:
Finance, Developer - Algorithms
Type:
Talk
Event:
GTC Silicon Valley
Year:
2010
Session ID:
2032
Streaming:
Download:
Share:
 
Speakers:
Scott Donovan
- Citadel Investment Group
Abstract:
Join Citadel to explore our three year undertaking on the feasibility of GPGPU computing for option pricing. ...Read More
Abstract:
Join Citadel to explore our three year undertaking on the feasibility of GPGPU computing for option pricing. We will discuss our 140X performance boost and the hurdles we had to overcome to integrate GPUs into our existing infrastructure. Please note that our talk will not get into the details of the model (that's proprietary information), but we will share our innovative solution to drive a grid of virtu