Data Management with iRODS and Compute @SURFsara

VK1/VK2 (SURFsara)



Science Park 140, 1098 XG Amsterdam
With the advance of new technologies, data volumes and the number of files are constantly increasing. Additionally, new regulations (e.g. GDPR) set strict requirements on the storage and use of privacy-sensitive data. Data management has therefore become an essential part of data-driven research. In this course we will introduce how to manage data efficiently with the data management framework iRODS and how to build computational pipelines on HPC infrastructure that use this data.

Topics in this course will include:
- Data Life Cycle and FAIR principles
- iRODS concepts
- iRODS graphical user interface
- labeling data and searching for data in iRODS
- building a computational pipeline that draws on data managed in iRODS
    • 09:00 - 09:15
      Welcome and introduction 15m
    • 09:15 - 10:00
      Data Management, FAIR and iRODS 45m
    • 10:00 - 10:30
      Hands-on: iRODS Data Handling with Python 30m
    • 10:30 - 11:00
      Coffee break 30m
    • 11:00 - 12:00
      Hands-on: Metadata handling with Python 1h
    • 12:00 - 13:00
      Lunch 1h
    • 13:00 - 13:30
      Compute workflows with iRODS 30m
    • 13:30 - 14:30
      Hands-on: Compute workflows with iRODS - part I 1h
    • 14:30 - 15:00
      Coffee break 30m
    • 15:00 - 16:00
      Hands-on: Compute workflows with iRODS - part II 1h