The R part of course (first day) will be focused on presenting the basics of data analysis in R and visualization of data. The course will cover the introduction to the R statistical language introducing the basic data types and workflow. Afterwards, packages from the “tidyverse” collection will be presented. These includes packages for the loading of data, preprocessing data, basic data exploration, and visualization.
The Python oriented part (second day) will introduce essential data-scientific packages and will be complemented with hands-on exercises that will demonstrate their usage with real world data analytic problems, and showing how to tackle such problems.
The course will be up to 50% hands-on exercises covering all topics to practice the techniques, and patterns gained.
Purpose of the course (benefits for the attendees)
Target audience: Users that want to use Python and/or R for data analysis and prototyping. The participants will learn basic and intermediate skills for exploratory data analysis and visualization in the programming languages of R and Python.
About the tutor(s)
Tomáš Martinovič obtained his PhD in computational sciences at IT4Innovations, VSB - Technical University of Ostrava in 2018. From 2015 to 2018 he worked in a team focused on analysis of complex dynamical systems, where he worked on scalable implementations of algorithms from the field of nonlinear time series analysis. Since the start of 2019 he has been working in a team focused on high performance data analysis with the defined objective of research and transfer of knowledge in cooperation with industry.
Stanislav Böhm has a PhD in computer science, and is a researcher at IT4Innovations. He is interested in distributed systems, verification, and scheduling.