This online workshop organised by LRZ and VI-HPS as a PRACE training event will:
- give an overview of the VI-HPS programming tools suite,
- explain the functionality of individual tools, and how to use them effectively,
- offer hands-on experience and expert coaching assistance using the tools with your own application.
On completion participants should be familiar with common performance analysis and diagnosis techniques and how they can be employed in practice (on a range of HPC systems). Those who prepared their own application test cases will have been coached in the tuning of their measurement and analysis, and provided optimisation suggestions.
Presentations and hands-on sessions are planned on the following topics (tbc):
- Setting up, welcome and introduction
- APR performance reports
- MUST runtime error detection for MPI
- Caliper event annotation, logging & profiling
- Nsight system-wide and CUDA kernel profiling
- PAPI hardware performance counters
- Score-P instrumentation and measurement
- CUBE profile processing and exploration
- TAU performance system
- Scalasca automated trace analysis
- Vampir interactive trace analysis
- Paraver/Extrae/Dimemas trace analysis and performance prediction
- JUBE script-based workflow batch execution environment
- ... and potentially others to be added
A brief overview of the capabilities of these and associated tools is provided in the VI-HPS Tools Guide.
The workshop is organised in 5 days, with lectures and demos in the morning and hands-on sessions in the afternoon. The workshop will be held in English and run from 09:00 to not later than 17:30 each day, with breaks.
The number of participants is limited, therefore priority may be given to applicants with parallel codes already running on the workshop computer system, and those bringing codes from similar cluster systems to work on.
- Zoom app (recommended)
- SSH client (to connect to the HPC system)
- X Server (enabling remote visual tools)
Participants are encouraged to prepare their own MPI, OpenMP and hybrid MPI+OpenMP parallel application codes for analysis.
Programme in Detail
Hardware and Software Platforms
- 8-way Ivy Bridge-based Linux Cluster
- FDR14 Infiniband interconnect
- 31 nodes, 16 cores per node
- 64 GB DDR Memory per node
- Operating System: SUSE Linux Enterprise Server (SLES) 12 SP5
- Compilers: Intel icc, icpc, ifort 19.0
- Intel MPI
The LRZ IvyMUC cluster is the primary platform for the workshop and will be used for the hands-on exercises. Course accounts will be provided before the workshop. Other systems where up-to-date versions of the tools are installed can also be used when preferred, though support may be limited and participants are expected to already possess user accounts on non-local systems. Regardless of whichever systems they intend to use, participants should be familiar with the relevant procedures for compiling and running their parallel applications (via batch queues where appropriate).
Tuning Workshop SeriesCédric Valensi
Université de Versailles Saint-Quentin-en-Yvelines
Phone: +33 1 77 57 59 36
|This workshop is a PRACE training event, organised by LRZ and VI-HPS for the Gauss Centre for Supercomputing.|