About the course
Content
Data Organization in Spreadsheets, OpenRefine for Data Cleaning, Data Management with SQL, Introduction to R
Learning outcomes
Spreadsheets:
- Good data entry practices - formatting data tables in spreadsheets
- How to avoid common formatting mistakes
- Approaches for handling dates in spreadsheets
- Basic quality control and data manipulation in spreadsheets
- Exporting data from spreadsheets
OpenRefine:
- Effectively clean and format data and automatically track any changes that you make
SQL:
- What relational databases are, how you can load data into them and how you can query databases to extract just the information that you need
R:
- Information about R syntax, the RStudio interface, on how to import CSV files
- The structure of data frames
- How to deal with factors, how to add/remove rows and columns, how to calculate summary statistics from a data frame
- Brief introduction to plotting
Programme
The schedule is available on https://subugoe.github.io/2021-12-06-dc-enlight-online/
Lecturers
Anne Hobert, Britta Timmermann, Julika Mimkes, Hanna Varachkina, Péter Király, Raisa Barthauer, Timo Gnadt