How to make your messy data usable? / OpenRefine

The practical workshop on cleaning your messy data with OpenRefine software.

First, we will cover spreadsheet best practices. Then, we will put that knowledge into practice with OpenRefine. This course will explore the depths of OpenRefine software and see what it can offer. This will include cleaning the data in bigger batches and unifying the data in one sweep (transforms and expressions). Additionally, we will introduce the possibility of downloading additional data from other databases and different extensions OpenRefine software has.

DOI: https://doi.org/10.5281/zenodo.10224704

Licence: Creative Commons Attribution 4.0 International

Keywords: Data exploration, data quality, Data exploration, Data validation, data cleaning

Target audience: Students, Researchers, PI, Postdocs and Staff members

Version: 1.0

Status: Active

Learning objectives:

Learning outcomes for the participants:
* Describe spreadsheet best practices
* Compare Excel and OpenRefine
* Apply transforms (cell editing, column editing, transposing) in OpenRefine
* Write simple expressions in OpenRefine
* Match your dataset with that of an external source

Authors: Diana Pilvar

Scientific topics: Data management, Data quality management


Activity log