The increase in computational power goes hand in hand with an increase in the size of the data to be managed, both on the input and on the output sides. IO can easily become a bottleneck for large scale architectures. The understanding of parallel file system mechanisms and parallel IO concepts enables users to efficiently use existing high level libraires like Netcdf or HDF5.
Topics:
• HDF5 High level IO libraries (3h)
• Parallel HDF5 and focus on MPI-IO hints (3h)
• Parallel file systems: Lustre (1h30)
• PDI: Parallel Data Interface (4h30)
Instructors: M. Haefele (Maison de la Simulation, CNRS), Thomas Leibovici (TGCC, CEA), Julien Bigot (maison de la Simulation)
Learning outcomes: After this course, participants should understand the tradeoffs implied by using a parallel file-system, and know how to efficiently use parallel IO libraries.
Prerequisites: Knowledge of C or Fortran programming language, parallel programing with MPI