Remove contamination and host reads

Abstract

Metagenomic sequencing captures all DNA present in a sample, including the microbial community, host DNA, and potential environmental or external contaminants (such as human DNA introduced during sample handling or sequencing).

About This Material

This is a hands-on tutorial from the Galaxy Training Network (GTN) that can be used for individual self-study or as teaching material in a classroom.

Questions this will address

  • What preprocessing steps are required to obtain cleaned reads for downstream analysis?
  • How can we identify and remove contaminant or host-derived reads from raw sequencing data?

Learning Objectives

  • Identify reads originating from contaminants or host genomes.
  • Remove those reads to produce high-quality, clean metagenomic data suitable for downstream analyses.

Licence: Creative Commons Attribution 4.0 International

Keywords: Microbiome

Target audience: Students

Resource type: e-learning

Version: 1

Status: Active

Prerequisites:

Introduction to Galaxy Analyses

Date modified: 2025-12-18

Date published: 2025-12-18

Authors: Bérénice Batut, Mina Hojat Ansari

Scientific topics: Metagenomics, Microbial ecology

