Sep
24
Thu
The Second Extreme-scale Scientific Software Stack Forum (E4S Forum)
Sep 24 @ 8:00 am – 5:00 pm

The DOE Exascale Computing Project (ECP) Software Technology focus area is developing an HPC software ecosystem that will enable the efficient and performant execution of exascale applications. ECP’s Extreme-scale Scientific Software Stack (E4S) is developing a comprehensive and coherent software stack that will enable application developers to productively write highly parallel applications that can portably target diverse exascale architectures. E4S exists to accelerate the development, deployment, and use of HPC software, lowering the barriers for HPC users. This forum will bring together researchers to engage with the E4S community and build research collaborations with the broader HPC community. The forum presents a unique opportunity to engage with E4S projects and learn more about plans for deployment of this software stack.

More information (including registration) is available at the E4S Forum website.

Oct
2
Fri
Webinar: HDF5 Application Tuning (part 1)
Oct 2 @ 11:00 am – 12:00 pm

HDF5 Application Tuning: There is more than one way to skin a cat(fish)

This webinar shows a simple but practically relevant example of drastic performance differences when using HDF5 (a factor of four between the best and worst performers). It offers a systematic, step-by-step analysis and diagnosis of why that is the case. The webinar continues a series dedicated to troubleshooting performance problems, the effective use of diagnostic tools, and how to make the most of HDF5 and its unique capabilities.

More information about the webinar as well as presentation materials can be found here.

Oct
13
Tue
HDF5 User Group Meeting (HUG 2020)
Oct 13 – Oct 16 all-day

The HDF Group and NERSC will host a virtual HDF5 User Group Meeting from October 13 to October 16, 2020. There is no charge for this virtual meeting, but space is limited.

More information (including registration) is available at https://www.hdfgroup.org/hug/2020-hug.

Oct
14
Wed
Scalable Precision Tuning of Numerical Software
Oct 14 @ 1:00 pm – 2:00 pm

The IDEAS Productivity project, in partnership with the DOE Computing Facilities of the ALCF, OLCF, and NERSC and the DOE Exascale Computing Project (ECP), has resumed the webinar series on Best Practices for HPC Software Developers, which we began in 2016.

As part of this series, we offer one-hour webinars on topics in scientific software development and high-performance computing, approximately once a month. The October webinar is titled Scalable Precision Tuning of Numerical Software, and will be presented by Cindy Rubio González (UC Davis). The webinar will take place on Wednesday, October 14, 2020 at 1:00 pm ET.

Abstract:

The use of numerical software has grown rapidly over the past few years, providing the foundation for a large variety of applications including scientific software and machine learning. Given the variety of numerical errors that can occur, floating-point programs are difficult to write, test, and debug. One common practice among developers is to use the highest available precision when allocating variables. While more robust, this can degrade program performance significantly. This webinar will describe research related to the development of tools to assist programmers in tuning the precision of their floating-point programs. These tools take a data-driven approach: they search over the types of floating-point variables, lowering their precision subject to accuracy constraints and performance goals. In the last part of the webinar, the presenter will discuss challenges and opportunities for scalable precision tuning of large HPC applications.
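
To make the idea of searching over variable precisions concrete, here is a minimal sketch. It is not the presenter's tooling or algorithm: the toy kernel, the variable names `term` and `acc`, the greedy one-pass search, and the tolerance are all assumptions chosen for illustration. Single precision is emulated with the standard `struct` module.

```python
import struct


def to_single(x: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def rnd(x: float, precision: str) -> float:
    """Store x at the requested precision ('single' or 'double')."""
    return to_single(x) if precision == "single" else x


def kernel(config: dict) -> float:
    """Hypothetical toy kernel: sum of 1/k^2 for k = 1..20000.

    config maps each tunable variable ('term', 'acc') to a precision.
    """
    acc = 0.0
    for k in range(1, 20001):
        term = rnd(1.0 / (k * k), config["term"])
        acc = rnd(acc + term, config["acc"])
    return acc


def tune(kernel, variables, tol):
    """Greedy search (an illustrative simplification).

    Try lowering one variable at a time to single precision and keep the
    change only if the relative error against the all-double reference
    stays within tol.
    """
    config = {v: "double" for v in variables}
    reference = kernel(config)
    for v in variables:
        trial = dict(config)
        trial[v] = "single"
        rel_err = abs(kernel(trial) - reference) / abs(reference)
        if rel_err <= tol:
            config = trial
    return config


result = tune(kernel, ["term", "acc"], tol=1e-6)
print(result)
```

In this toy case the search lowers `term` but keeps the accumulator `acc` in double precision, because a single-precision accumulator stops absorbing the small late terms of the series. A real tuner would also weigh measured performance, which this sketch leaves out.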

Oct
29
Thu
Strategies for Working Remotely Panel Series – Sustainable Hybrid Approaches
Oct 29 @ 3:00 pm – 4:15 pm

In response to the COVID-19 pandemic and the need for many to transition to unplanned remote work, the IDEAS-ECP Productivity project launched the panel series Strategies for Working Remotely, which explores important topics in this area.

Abstract:

  • In Spring 2020 many workers abruptly transitioned from a primarily on-site to a primarily remote work experience. Many people will be (or already are) transitioning to a hybrid experience, spending some days on site and some remote. Working in this hybrid setting is likely to last longer for many people than the primarily remote setting. For some, the hybrid setting may go on indefinitely. In this panel discussion, we learn from colleagues who have worked in a hybrid setting during their careers, as well as some who are new to remote work. We discuss some of the challenges we have faced in primarily remote settings and how these challenges might be addressed in hybrid settings. Themes include making effective use of time on site, best practices and principles for teams who are blended remote and on site, and how to be effective and efficient in long-term hybrid settings.

Panelists:

  • Katie Antypas, LBL
  • David Bernholdt, ORNL
  • Mark C. Miller, LLNL
  • Hai Ah Nam, LANL
  • Valerie Taylor, ANL

Moderators:

  • Ashley Barker, ORNL
  • Mike Heroux, SNL
  • Elaine Raybourn, SNL
Nov
4
Wed
Reducing Technical Debt with Reproducible Containers
Nov 4 @ 1:00 pm – 2:00 pm

The IDEAS Productivity project, in partnership with the DOE Computing Facilities of the ALCF, OLCF, and NERSC and the DOE Exascale Computing Project (ECP), has resumed the webinar series on Best Practices for HPC Software Developers, which we began in 2016.

As part of this series, we offer one-hour webinars on topics in scientific software development and high-performance computing, approximately once a month. The November webinar is titled Reducing Technical Debt with Reproducible Containers, and will be presented by Tanu Malik (DePaul University). The webinar will take place on Wednesday, November 4, 2020 at 1:00 pm ET.

Abstract:

Computational experiments can be challenging to reproduce; researchers have to choose between pursuing a fast-paced research agenda and developing well-organized, sufficiently documented, and easily reproducible software. Like incurring fiscal debt, there are often tactical reasons to take on technical debt in scientific software—such as deferring documentation, organization, refactoring, and unit tests when pursuing a new idea or meeting a conference deadline. However, more often than not, researchers do not repay this technical debt, leading to irreproducible experiments.

The webinar will describe different levels of technical debt and quantify the cost of not repaying it. The presenter will introduce isolation in containers as a powerful mechanism for reducing portability debt and describe limitations of current container tools. The presenter will then introduce a vision of a reproducible container that aims to automate repayment of different types of technical debt, and will describe the current state of this vision through three tools. These tools use isolation, encapsulation, and monitoring to include necessary and sufficient content in the container, in terms of both software and data, and to describe the contents of the container. Finally, the presenter will show results of using reproducible containers on domain science and HPC use cases, and provide guidance.

Nov
10
Tue
UPC++: An Asynchronous RMA/RPC Library for Distributed C++ Applications
Nov 10 @ 2:30 pm – 6:30 pm

The UPC++ Team will offer a tutorial at SC20 introducing the basic concepts and advanced optimization techniques of UPC++, a C++ library supporting Partitioned Global Address Space (PGAS) programming. The tutorial will introduce the UPC++ memory and execution models and examine basic algorithm implementations. Participants will gain hands-on experience incorporating UPC++ features into several application examples. The presenters will also examine two irregular applications (a metagenomic assembler and a multifrontal sparse solver) and describe how they leverage UPC++ features to optimize communication performance.

Presenters:

  • Katherine A. Yelick (Lawrence Berkeley National Laboratory, University of California, Berkeley)
  • Amir Kamil (Lawrence Berkeley National Laboratory, University of Michigan)
  • Dan Bonachea (Lawrence Berkeley National Laboratory)
  • Paul H. Hargrove (Lawrence Berkeley National Laboratory)

More information about the tutorial (including registration) is available at the SC20 website.

Dec
9
Wed
Software Design for Longevity with Performance Portability
Dec 9 @ 1:00 pm – 2:00 pm

The IDEAS Productivity project, in partnership with the DOE Computing Facilities of the ALCF, OLCF, and NERSC and the DOE Exascale Computing Project (ECP), has resumed the webinar series on Best Practices for HPC Software Developers, which we began in 2016.

As part of this series, we offer one-hour webinars on topics in scientific software development and high-performance computing, approximately once a month. The December webinar is titled Software Design for Longevity with Performance Portability, and will be presented by Anshu Dubey (Argonne National Laboratory and University of Chicago). The webinar will take place on Wednesday, December 9, 2020 at 1:00 pm ET.

Abstract:

In the era of simultaneously increasing heterogeneity in hardware and application software, the topics of performance portability and longevity may seem at cross purposes. Key to achieving either objective individually is software design. Achieving both simultaneously is a much harder challenge, yet in today’s scientific computing landscape neither objective can be ignored. The questions that science is posing to computation are more complex, which implies greater investment in building science capabilities in the software, and therefore longevity is important. Those questions need more capable hardware, which can be obtained only through ever more heterogeneous platforms. This webinar will present a few basic principles of scientific software design that have been instrumental in mitigating some of the challenges that application developers are facing. These principles represent a combination of experience from the presenter’s own project and from the Exascale Computing Project Performance Portability Panel Series that took place during the summer of 2020.

Dec
17
Thu
Tutorial: In Situ Scientific Visualization and Analysis using ALPINE Ascent
Dec 17 @ 1:00 pm – 2:00 pm

Scientific visualization is a key component of simulation workflows. Visualization tools transform mesh data into pictures that help simulation users digest and understand complex science results. Visualization is often done post hoc using files written to disk; however, at extreme scales that model is strained by gaps between I/O and compute speeds. In-memory processing, a type of in situ processing, can help avoid these I/O bottlenecks and allow users to analyze more data.

We are developing Ascent to support production in-memory visualization for HPC simulations. It leverages both distributed-memory (MPI) and many-core (CUDA or OpenMP) parallelism to provide ray tracing, contouring, etc., using the same HPC resources as the host simulation application. Ascent is one of the infrastructure thrusts of the ECP ALPINE Software Technology Project and will be used to deploy new automated data reduction algorithms developed by ALPINE. The ALPINE team is actively working to help integrate Ascent into ECP application codes and support visualization needs.

This tutorial will cover how to present data to Ascent, transform data, render pictures, and export results. Attendees can follow along and execute tutorial examples using cloud-hosted Jupyter Notebooks or a public Ascent install.

Please join us to learn about Ascent, 10am – 11am PT, 1pm – 2pm ET!

Presenters:

  • Cyrus Harrison (Lawrence Livermore National Laboratory)
  • Matthew Larsen (Lawrence Livermore National Laboratory)
  • Hank Childs (University of Oregon)

Useful Links:

The tutorial has been recorded and is available on the ECP YouTube channel. The slides are available here.

Strategies for Working Remotely Panel Series – Year in Review: What have we learned so far?
Dec 17 @ 3:00 pm – 4:00 pm

In response to the COVID-19 pandemic and the need for many to transition to unplanned remote work, the IDEAS-ECP Productivity project launched the panel series Strategies for Working Remotely, which explores important topics in this area.

Abstract:

  • Earlier this year many workers abruptly transitioned from a primarily on-site to a primarily remote work experience due to a global pandemic. As we bring 2020 to a close, what have we learned so far, and what do we have yet to learn about working remotely, and working effectively in hybrid configurations? In this fireside chat, we look at key highlights from each of the Strategies for Working Remotely panel discussions in the series and dig deeper. What has worked, why, and where can we improve? What do we have yet to learn, or unlearn? “Ask me anything” questions can be submitted by the audience in advance to [email protected].

Panelists:

  • Lori Diachin, Lawrence Livermore National Laboratory
  • Tom Evans, Oak Ridge National Laboratory
  • Elaine Raybourn, Sandia National Laboratories

Moderator:

  • Ashley Barker, Oak Ridge National Laboratory