e-learning

Cleaning GBIF data using OpenRefine

Abstract

In this tutorial we will use OpenRefine tool to clean occurrence records retrieved from GBIF (Global Biodiversity Information Facility).

About This Material

This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.

Questions this will address

  • How can I use OpenRefine to clean data?
  • How do I check and clean biodiversity data using OpenRefine?

Learning Objectives

  • Use OpenRefine faceting functionalities to apply mass editing and manage duplicates
  • Use OpenRefine clustering and filtering functionalities to edit, transform data notably using regular expression
  • Use OpenRefine to apply API services on your data

Licence: Creative Commons Attribution 4.0 International

Keywords: Ecology, biodiversity

Target audience: Students

Resource type: e-learning

Version: 5

Status: Active

Prerequisites:

Introduction to Galaxy Analyses

Learning objectives:

  • Use OpenRefine faceting functionalities to apply mass editing and manage duplicates
  • Use OpenRefine clustering and filtering functionalities to edit, transform data notably using regular expression
  • Use OpenRefine to apply API services on your data

Date modified: 2025-06-03

Date published: 2025-01-21

Authors: Laura Russell, Sophie Pamerlon, Yvan Le Bras

Contributors: Björn Grüning, Daniela Schneider, Saskia Hiltemann, Yvan Le Bras

Scientific topics: Ecology

External resources:

Activity log