GTC ON-DEMAND

 
SEARCH SESSIONS
SEARCH SESSIONS

Search All
 
Refine Results:
 
Year(s)

SOCIAL MEDIA

EMAIL SUBSCRIPTION

 
 

GTC ON-DEMAND

Presentation
Media
Abstract:
Cognitive applications are reshaping the IT landscape with entire data centers designed and built solely for that purpose. Though computationally challenging, deep learning networks have become a critical building block to boost accuracy of cognitive offerings like Watson. We'll present a detailed performance study of deep learning workloads and how sharing accelerator resources can improve throughput by a factor of three, effectively turning a four GPU commodity cloud system into a high-end, 12-GPU supercomputer. Using Watson workloads from three major areas that incorporate deep learning technology (language classification, visual recognition, and speech recognition), we document effectiveness and scalability of this approach.
Cognitive applications are reshaping the IT landscape with entire data centers designed and built solely for that purpose. Though computationally challenging, deep learning networks have become a critical building block to boost accuracy of cognitive offerings like Watson. We'll present a detailed performance study of deep learning workloads and how sharing accelerator resources can improve throughput by a factor of three, effectively turning a four GPU commodity cloud system into a high-end, 12-GPU supercomputer. Using Watson workloads from three major areas that incorporate deep learning technology (language classification, visual recognition, and speech recognition), we document effectiveness and scalability of this approach.  Back
 
Topics:
Artificial Intelligence and Deep Learning, Performance Optimization
Type:
Talk
Event:
GTC Silicon Valley
Year:
2017
Session ID:
S7320
Download:
Share:
 
Abstract:
We'll introduce PowerAI and the S822LC for HPC. PowerAI is an optimized software stack for AI designed to take advantage of Power processor performance features, and in particular of the new NVLink interface between Power and the NVIDIA Tesla P100 GPU accelerator, first introduced with S822LC for HPC. We'll introduce performance enhancements of the PowerAI, including IBM Caffe with its performance optimization centered at enhance communications and other enhancements to frameworks, libraries, and the deep learning ecosystem for Power. With its high-performance NVLink connection, the new generation S822LC for HPC server is the first that offers a sweet spot of scalability, performance, and efficiency for deep learning applications. Together, these hardware and software enhancements enabled the first release of PowerAI to achieve best in industry training for Alexnet and VGGnet.
We'll introduce PowerAI and the S822LC for HPC. PowerAI is an optimized software stack for AI designed to take advantage of Power processor performance features, and in particular of the new NVLink interface between Power and the NVIDIA Tesla P100 GPU accelerator, first introduced with S822LC for HPC. We'll introduce performance enhancements of the PowerAI, including IBM Caffe with its performance optimization centered at enhance communications and other enhancements to frameworks, libraries, and the deep learning ecosystem for Power. With its high-performance NVLink connection, the new generation S822LC for HPC server is the first that offers a sweet spot of scalability, performance, and efficiency for deep learning applications. Together, these hardware and software enhancements enabled the first release of PowerAI to achieve best in industry training for Alexnet and VGGnet.  Back
 
Topics:
Artificial Intelligence and Deep Learning, Tools & Libraries, Performance Optimization
Type:
Talk
Event:
GTC Silicon Valley
Year:
2017
Session ID:
S7368
Download:
Share:
 
Abstract:

With its high-performance NVlink connection, the new generation S822LC for HPC server offers a sweet spot of scalability, performance and efficiency for Deep Learning applications. The next generation S822LC systems include the P100 GPUs which were optimized for Deep Learning workloads, NVlink for enhanced peer-to-peer GPU multiprocessing, and CPU-GPU NVlink for enhanced performance and programmability. At the same time, they remain upwardly compatible with earlier systems, accelerating existing Deep Learning frameworks building on the the familiar CUDA and cuDNN libraries. As part of the focus on cognitive application enablement at IBM, the new server will be accompanied with a rich pre optimized and pre-built deep learning software distribution to simplify and accelerate deployment.

With its high-performance NVlink connection, the new generation S822LC for HPC server offers a sweet spot of scalability, performance and efficiency for Deep Learning applications. The next generation S822LC systems include the P100 GPUs which were optimized for Deep Learning workloads, NVlink for enhanced peer-to-peer GPU multiprocessing, and CPU-GPU NVlink for enhanced performance and programmability. At the same time, they remain upwardly compatible with earlier systems, accelerating existing Deep Learning frameworks building on the the familiar CUDA and cuDNN libraries. As part of the focus on cognitive application enablement at IBM, the new server will be accompanied with a rich pre optimized and pre-built deep learning software distribution to simplify and accelerate deployment.

  Back
 
Topics:
HPC and AI
Type:
Talk
Event:
GTC Washington D.C.
Year:
2016
Session ID:
DCS16189
Download:
Share:
 
Abstract:

Over the past three decades, the Power Architecture has been an important asset in IBM's systems strategy. During the time, Power-based systems powered desktops, technical workstations, embedded devices, game consoles, supercomputers and commercial UNIX servers.

Over the past three decades, the Power Architecture has been an important asset in IBM's systems strategy. During the time, Power-based systems powered desktops, technical workstations, embedded devices, game consoles, supercomputers and commercial UNIX servers.

  Back
 
Topics:
OpenPOWER, HPC and Supercomputing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2015
Session ID:
S5682
Streaming:
Download:
Share:
 
Abstract:

The POWER8 processor is the latest RISC (Reduced Instruction Set Computer) microprocessor from IBM and the first processor supporting the new OpenPOWER software environment. Power8 was designed to deliver unprecedented performance for emerging workloads, such as Business Analytics and Big Data applications, Cloud computing and Scale out Datacenter workloads. It is fabricated using IBM's 22-nm Silicon on Insulator (SOI) technology with layers of metal, and it has been designed to significantly improve both single-thread performance and single-core throughput over its predecessor, the POWER7i processor.

The POWER8 processor is the latest RISC (Reduced Instruction Set Computer) microprocessor from IBM and the first processor supporting the new OpenPOWER software environment. Power8 was designed to deliver unprecedented performance for emerging workloads, such as Business Analytics and Big Data applications, Cloud computing and Scale out Datacenter workloads. It is fabricated using IBM's 22-nm Silicon on Insulator (SOI) technology with layers of metal, and it has been designed to significantly improve both single-thread performance and single-core throughput over its predecessor, the POWER7i processor.

  Back
 
Topics:
OpenPOWER, HPC and Supercomputing
Type:
Talk
Event:
GTC Silicon Valley
Year:
2015
Session ID:
S5696
Streaming:
Download:
Share:
 
 
Previous
  • Amazon Web Services
  • IBM
  • Cisco
  • Dell EMC
  • Hewlett Packard Enterprise
  • Inspur
  • Lenovo
  • SenseTime
  • Supermicro Computers
  • Synnex
  • Autodesk
  • HP
  • Linear Technology
  • MSI Computer Corp.
  • OPTIS
  • PNY
  • SK Hynix
  • vmware
  • Abaco Systems
  • Acceleware Ltd.
  • ASUSTeK COMPUTER INC
  • Cray Inc.
  • Exxact Corporation
  • Flanders - Belgium
  • Google Cloud
  • HTC VIVE
  • Liqid
  • MapD
  • Penguin Computing
  • SAP
  • Sugon
  • Twitter
Next