[ONLINE] Data Management with iRODS and Compute @SURF

CET
ONLINE

ONLINE

Description

Would you want to practice reproducible research in HPC and preserve your data for the future?

Research Data Management is essential in effective and reproducible science. Due to ever increasing data volumes and complexity, researchers need modern tools to manage all stages of the data life cycle. Here we will use iRODS, a data management framework, to incorporate data management principles in compute pipelines. 

What?

In this course you will:

- Learn about the the iRODS data management framework and icommands
- Understand how to incorporate provenance in a compute workflow
- Know about FAIR in data processing workflows

Who?

- Everyone interested in learning advanced data management tools for compute and data processing workflows

Requirements

- Basic knowledge of Linux and shell commands

You should have

- Your own laptop with an up-to-date browser and a terminal emulator. The use of the operating systems Linux and macOS is preferred, but not mandatory. For Windows users we recommend to download MobaXterm (portable version) as terminal emulator.

 

IMPORTANT INFORMATION: WAITING LIST

If the course gets fully booked, no more registrations are accepted through this website. However, you can be included in the waiting list: for that, please send an email to training@surfsara.nl and you'll be informed when a place becomes available.

    • 1
      Welcome and introduction
    • 2
      Data Management, FAIR and iRODS
    • 3
      Hands-on: Data Handling with iRODS clients: icommands, python api, webdav
    • 11:00 AM
      Coffee break
    • 4
      The HPC System, Compute Workflows and iRODS
    • 12:00 PM
      Lunch
    • 5
      Hands-on: A Compute Workflow part 1
    • 6
      Hands-on: A Compute Workflow part 2
    • 3:00 PM
      Coffee break
    • 7
      Hands-on: Automating data handling with iRODS
    • 8
      Wrap up and Evaluation