Introduction to data wrangling with OpenRefine (June 27th, 2024)

241_INF23

In our workshop “Data wrangling with OpenRefine” you will learn the basics of preparing and transforming tabular data with the open source software OpenRefine.

OpenRefine provides functions to identify and correct inconsistencies in large amounts of data under a graphical user interface that outwardly resembles spreadsheet software.

For example, it is possible to combine slightly different spellings of a name in different entries (e.g. TU Darmstadt and TU_Darmstadt) by clustering and then label them uniformly. Such data preparation often makes later analysis of the data much easier.

  • How do I create a project and import data?
  • How do I use facet, filter and cluster functions?
  • How do I transform data? (e.g. splitting cell contents)
  • How do I export data?

You will get answers to these questions in the workshop and can apply your new knowledge directly to a sample dataset.

Please set up OpenRefine on your computer before the workshop starts.

Instructions can be found here (current documents available 7 days before the workshop).

Target group: Doctoral candidates and postdocs without OpenRefine or other special prior knowledge (TU Darmstadt and RMU)

Trainer: Jens Freund | University and State Library Darmstadt

Language: English

Date/time: Thursday, June 27th, 2024 | 03:20 – 04:50 pm

Location: Online (Zoom)

Registration: Please fill in the Ingenium registration form.

You would like to participate in Ingenium events but do not have child care during that time? Here you can find more information about short-term child care at TU Darmstadt.