1st Variorum Lecture Series August 2021
The Variorum team will provide its first Variorum Lecture Series, where attendees will learn everything necessary to start using Variorum on various platforms to write portable power management code. The team will provide support through GitHub and Variorum mailing list during and after the lecture series. This Variorum Lecture Series will consist of two modules, each of 1.5 hours each. We will hold two sessions to accommodate different time zones as well as attendee schedules.
Session 1: August 6 and 13, 8:30am-10:00am PT (11:30am-1:00pm ET), targeting US/European attendees, and
Session 2: August 20 and 27, 4:00pm-5:30pm PT (7:00pm-8:30pm ET), targeting US/Asian attendees.
What is Variorum?
Variorum is a production-grade, open-source, vendor-neutral software infrastructure for exposing low-level control and monitoring of a system’s underlying hardware features. It can easily be ported to different hardware devices, as well as different generations within a particular device. This allows users to manage power, performance and thermal information seamlessly across hardware from different vendors. More specifically, Variorum’s flexible design supports a set of features that may exist on one generation of hardware, but not on another. Variorum can also be included as part of the system software stack for power management: such as runtime systems, resource managers, and other profiling tools. At present, Variorum supports 5 platforms (IBM, Intel, AMD, ARM and NVIDIA) and a total of ten microarchitectures across these platforms.
Contents of the Lectures
Module 1: Introduction to Variorum
- Challenges in Power Management and The HPC Power Stack
- Understanding Power Management Knobs on Intel, IBM, NVIDIA, ARM, and AMD platforms
- Variorum Library
- Build, dependencies, and setup
- Monitoring user applications non-intrusively
- Vendor-neutral Variorum API across diverse architectures
- Using Variorum for finer-grained monitoring, power capping, and management
Module 2: Integrating Variorum with System Software and Tools
- The HPC Power Stack revisited: need for power management at various levels
- GEOPM: job-level power management
- Kokkos and Caliper: application and workflow power management
- SLURM (Research Extensions): system-level power management
- Upcoming Features in Variorum
- The HPC Power Stack Roadmap
How to Attend
- The lecture series is available to everyone, and participants are welcome to attend any/all sessions.
- No-cost registration is necessary, meeting link and password will be sent to registrants. See “Tickets” above.
- Presenters will show in-depth demos during the lecture series. Presenters can provide support during and after the lecture series with setup and usage on supported architectures.
- Stephanie Brink, Tapasya Patki, Aniruddha Marathe, Barry Rountree (Lawrence Livermore National Laboratory)
Slides, recording and Q&A will be available after the lectures.