Aug
16
Wed
Using the Roofline Model and Intel Advisor
Aug 16 @ 1:00 pm – 2:00 pm

The IDEAS Productivity project, in partnership with the DOE Computing Facilities of the ALCF, OLCF, and NERSC and the DOE Exascale Computing Project (ECP), is resuming the webinar series on Best Practices for HPC Software Developers, which we began last year.

As part of this series, we will offer one-hour webinars on topics in scientific software development and high-performance computing, approximately once a month. Participation is free and open to the public, but registration will be required for each event.

The next webinar in the series was “Using the Roofline Model and Intel Advisor” with speaker Sam Williams of Lawrence Berkeley National Laboratory.  This webinar took place on August 16, 2017 and you can find a copy of the presentation and the video from the webinar in the Presentation Materials section below.

This webinar began by introducing the Roofline Model and its “Cache-Aware” variant.   Then, they moved to general guidelines and historical approaches to Roofline-based program analysis.  Next, they provided a short discussion of how changes in data locality and arithmetic intensity of two canonical benchmarks visually manifest in the context of these two Roofline formulations.  The speakers then conducted demonstrations of using Intel Advisor and the Roofline model within Intel Advisor.  The first demo was primarily instructive on how to compile, benchmark, and use Advisor. The second demo focused on using variants of a simple benchmark to highlight changes in the Roofline model as well as providing correlation to Advisor’s other capabilities.
Sep
20
Wed
Scalable Node Programming with OpenACC
Sep 20 @ 1:00 pm – 2:15 pm

The DOE Exascale Computing Project (ECP) is pleased to sponsor a webinar presented by Michael Wolfe of NVIDIA titled “Scalable Node Programming with OpenACC” that took place on Wednesday, September 20, at 1:00 pm ET.

Supercomputer nodes are becoming more parallel with each generation.  HPC applications must now utilize parallelism on-node as well as across many nodes for performance.  OpenACC is designed to expose and exploit parallelism, whether that be for a multicore, a manycore, or a GPU-accelerated node.

This webinar discussed:

  • what problems OpenACC addresses;
  • the important elements of OpenACC;
  • how and why OpenACC is being adopted and used in production science and engineering applications, including Gaussian, Fluent, VASP, and several weather and climate codes;
  • current status of OpenACC tools and of the specification;
  • building and using applications with OpenACC for different targets;
  • the OpenACC community and user events;
  • new features requested by users coming with OpenACC 2.6 and beyond.