Register training material
14 materials found

Target audience: Beginners 


A Critical Guide to the PDB

This Critical Guide in the Introduction to Bioinformatics series provides a brief outline of the Protein Data Bank – the PDB – the world’s primary repository of biological macromolecular structures. The rationale for creating the resource and the kinds of information it provides are discussed,...

Scientific topics: Database management

Keywords: Introduction bioinformatics, Introduction pdb, Protein structure analysis, Protein structure databases, Protein structures

A Critical Guide to the PDB https://tess.elixir-europe.org/materials/a-critical-guide-to-the-pdb This Critical Guide in the Introduction to Bioinformatics series provides a brief outline of the Protein Data Bank – the PDB – the world’s primary repository of biological macromolecular structures. The rationale for creating the resource and the kinds of information it provides are discussed, and issues relating to its evolution and growth are explored. Specifically, this Guide introduces the principal features of the PDB, the nature (and quality) of its contents and how these may be interrogated. On reading this Guide, users will be able to: i) explain some of the ways in which knowledge of protein structures is useful; ii) identify the constituent databases of the wwPDB; iii) explain key features of the RCSB PDB in terms of its data distribution, growth and redundancy statistics; iv) search the PDB using simple and advanced keywords and full sequences, and analyse differences between them; and v) explain various structural quality criteria, and infer the quality of individual PDB entries. Database management Introduction bioinformatics, Introduction pdb, Protein structure analysis, Protein structure databases, Protein structures Beginners 2018-09-08
A Critical Guide to InterPro

This Critical Guide in the Introduction to Bioinformatics series provides an introduction to the InterPro database, the largest, most comprehensive, integrated protein family database in the world. The rationale for creating the resource, the nature of its contributing databases and the kinds of...

Scientific topics: Database management

Keywords: Introduction bioinformatics, Introduction interpro, Protein family classification, Protein family databases, Protein family hierarchies, Protein function annotation, Protein sequence analysis

A Critical Guide to InterPro https://tess.elixir-europe.org/materials/a-critical-guide-to-interpro This Critical Guide in the Introduction to Bioinformatics series provides an introduction to the InterPro database, the largest, most comprehensive, integrated protein family database in the world. The rationale for creating the resource, the nature of its contributing databases and the kinds of information they provide are discussed, and the role of InterPro in protein classification and function-annotation projects is outlined. Specifically, this Guide introduces the principal components of the InterPro database, the differences between them, and how their integration creates a resource whose diagnostic power is greater than the sum of its parts. On reading this Guide, users will be able to: i) explain how protein family databases are used to help annotate uncharacterised protein sequences; ii) identify InterPro’s constituent data resources and explain the main methods that underpin them; iii) search InterPro using keywords and full sequences; iv) analyse and interpret search results in terms of protein family hierarchies, their structural domains and functional features; and v) track the provenance of InterPro’s annotations. Database management Introduction bioinformatics, Introduction interpro, Protein family classification, Protein family databases, Protein family hierarchies, Protein function annotation, Protein sequence analysis Beginners 2018-09-08
A Critical Guide to the UniProtKB Flat-file Format

This Critical Guide briefly presents the need for biological databases and for a standard format for storing and organising biological data. Web-based interfaces have made databases more user-friendly, but knowledge of the underlying file format offers a deeper understanding of how to navigate...

Scientific topics: Database management

Keywords: Flat file databases, Flat files, Introduction bioinformatics, Uniprotkb flat file format

A Critical Guide to the UniProtKB Flat-file Format https://tess.elixir-europe.org/materials/a-critical-guide-to-the-uniprotkb-flat-file-format This Critical Guide briefly presents the need for biological databases and for a standard format for storing and organising biological data. Web-based interfaces have made databases more user-friendly, but knowledge of the underlying file format offers a deeper understanding of how to navigate and mine the information they contain, so that humans and machines can get the most out of them. This Guide explores the file format that underpins one of today’s most popular protein sequence databases – UniProtKB. Specifically, this Guide introduces the concept of database ‘flat-files’, and examines features of the UniProtKB flat-file format. On reading this Guide, users will be able to: i) identify key fields within UniProtKB/Swiss-Prot and UniProtKB/TrEMBL flat-files; ii) explain what these fields mean, what information they contain and what the information is used for; iii) analyse the information in different fields and infer structural and functional features of a sequence; iv) examine and investigate the provenance of annotations; and v) compare annotations at different time-points and evaluate the likely impact of annotation changes. Database management Flat file databases, Flat files, Introduction bioinformatics, Uniprotkb flat file format Beginners 2018-09-08
A Critical Guide to UniProtKB

This Critical Guide in the Introduction to Bioinformatics series provides a brief outline of the UniProt protein sequence database, with a particular focus on the UniProt Knowledgebase – UniProtKB. The rationale for creating the resource, its contributing databases and the kinds of information...

Scientific topics: Database management

Keywords: Introduction bioinformatics, Introduction uniprot, Protein sequence databases, Uniprot knowledgebase

A Critical Guide to UniProtKB https://tess.elixir-europe.org/materials/a-critical-guide-to-uniprotkb This Critical Guide in the Introduction to Bioinformatics series provides a brief outline of the UniProt protein sequence database, with a particular focus on the UniProt Knowledgebase – UniProtKB. The rationale for creating the resource, its contributing databases and the kinds of information they provide are discussed, and issues behind the quality of their annotations are explored. Specifically, this Guide introduces the principal components of the UniProt Knowledgebase, and the differences between them. On reading this Guide, users will be able to: i) identify and explain the characteristic features of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL entries; ii) distinguish annotations that are computed, and hence not experimentally validated; iii) search UniProtKB using keywords, full sequences and peptides, and interpret the results; iv) analyse and track the provenance of annotations; and v) infer which annotations are likely to be accurate and which erroneous. Database management Introduction bioinformatics, Introduction uniprot, Protein sequence databases, Uniprot knowledgebase Beginners 2018-09-08
A Critical Guide to BLAST

This Critical Guide in the Introduction to Bioinformatics series provides an overview of the BLAST similarity search tool, briefly examining the underlying algorithm and its rise to popularity. Several Web-based and stand-alone implementations are reviewed, and key features of typical search...

Keywords: Introduction bioinformatics, Introduction blast, Sequence database searching, Sequence similarity searching

A Critical Guide to BLAST https://tess.elixir-europe.org/materials/a-critical-guide-to-blast This Critical Guide in the Introduction to Bioinformatics series provides an overview of the BLAST similarity search tool, briefly examining the underlying algorithm and its rise to popularity. Several Web-based and stand-alone implementations are reviewed, and key features of typical search results are discussed. Specifically, this Guide introduces concepts and theories that underpin the BLAST search tool, and examines features of search outputs important for understanding and interpreting BLAST results. On reading this Guide, users will be able to: i) search a variety of Web-based sequence databases with different query sequences, and alter search parameters; ii) explain a range of typical search parameters, and the likely impacts on search outputs of changing them; iii) analyse the information conveyed in search outputs and infer the significance of reported matches; iv) examine and investigate the annotations of reported matches, and their provenance; and v) compare the outputs of different BLAST implementations and evaluate the implications of any differences. Introduction bioinformatics, Introduction blast, Sequence database searching, Sequence similarity searching Beginners 2018-09-08
A Critical Guide to Unix

This Critical Guide in the Introduction to Bioinformatics series briefly introduces the Unix Operating System, and provides a subset of some of the most helpful and commonly used commands, including those that allow various types of search, navigation and file manipulation. Several keystroke...

Keywords: Command line, Introduction bioinformatics, Introduction unix, Unix commands, Unix operating system

A Critical Guide to Unix https://tess.elixir-europe.org/materials/a-critical-guide-to-unix This Critical Guide in the Introduction to Bioinformatics series briefly introduces the Unix Operating System, and provides a subset of some of the most helpful and commonly used commands, including those that allow various types of search, navigation and file manipulation. Several keystroke short-cuts are also explained, which help to make the routine use of Unix commands more efficient. Specifically, this Guide showcases some of the simplest, most frequently used commands to help new users to understand and gain confidence in using the Unix Operating System. On reading the Guide, users will be able: i) to exploit a range of commands: to manipulate files, directories and processes; to navigate directory structures and explore their contents; to search for files, and search and compare file contents; and to direct command outputs into files or into other commands; and ii) to explain what many simple commands mean and how they’re used. Command line, Introduction bioinformatics, Introduction unix, Unix commands, Unix operating system Beginners 2018-09-08
PhenoMeNal: accessing metabolomics workflows in Galaxy

A description of PhenoMeNal and when to use it, the creation of a cloud research environment and how to access Galaxy workflows via PhenoMeNal.

Scientific topics: Chemical biology, Metabolomics

Keywords: Galaxy

PhenoMeNal: accessing metabolomics workflows in Galaxy https://tess.elixir-europe.org/materials/phenomenal-accessing-metabolomics-workflows-in-galaxy A description of PhenoMeNal and when to use it, the creation of a cloud research environment and how to access Galaxy workflows via PhenoMeNal. Chemical biology Metabolomics Galaxy Beginners
UPGMA Worked Example

A walkthrough of clustering 7 biological sequences (A-G) using the Unweighted Pair-Group Method with Arithmetic mean (UPGMA) method. Note that UPGMA is actually a generic method and thus the walkthrough could apply to any objects A-G for which pairwise distances can be calculated. A small CGI...

Keywords: Molecular phylogeny, Phylogenetics, Upgma

UPGMA Worked Example https://tess.elixir-europe.org/materials/upgma-worked-example A walkthrough of clustering 7 biological sequences (A-G) using the Unweighted Pair-Group Method with Arithmetic mean (UPGMA) method. Note that UPGMA is actually a generic method and thus the walkthrough could apply to any objects A-G for which pairwise distances can be calculated. A small CGI site for generating a UPGMA tree from a distance matrix can be found here. The source data for this worked example is a subset of Cytochrome C distances from Table 3 of one of the seminal phylogenetic papers: Fitch WM & Margoliash E (1967). Construction of phylogenetic trees. Science 155:279-84. PowerPoint timings are set up to be consistent with autolooping the presentation. Molecular phylogeny, Phylogenetics, Upgma Beginners 2014-08-04 2017-10-09
QuickGuides

A range of Quick Guides to commonly used bioinformatics tools and resources.

Keywords: Blast, Emboss, Mysql, Perl, Phylip, Unixlinux, Velvet

QuickGuides https://tess.elixir-europe.org/materials/quickguides A range of Quick Guides to commonly used bioinformatics tools and resources. Blast, Emboss, Mysql, Perl, Phylip, Unixlinux, Velvet Beginners 2014-07-31 2017-10-09
Perl 1 Training Course - For Programming Beginners

A set of slides from The Sainsbury Laboratory Perl course for absolute beginners

Keywords: Perl, Programming

Perl 1 Training Course - For Programming Beginners https://tess.elixir-europe.org/materials/perl-1-training-course-for-programming-beginners A set of slides from The Sainsbury Laboratory Perl course for absolute beginners Perl, Programming Beginners Bench biologists 2014-07-03 2017-10-09
Working with Affymetrix CEL files in R

This tutorial shows how to download some public Affymetrix microarray data, load the data into R, calculate expression values and do some very simple plotting

Keywords: Affymetrix, Microarrays, R

Working with Affymetrix CEL files in R https://tess.elixir-europe.org/materials/working-with-affymetrix-cel-files-in-r This tutorial shows how to download some public Affymetrix microarray data, load the data into R, calculate expression values and do some very simple plotting Affymetrix, Microarrays, R Beginners PhD students Researchers 2014-01-14 2017-10-09
Simple plotting in R

This quick and simple tutorial demonstrates some of the easy plotting tools in the R core software. the data are in the "data" directory and come from an old two-colour microarray experiment where spots on the array were printed with different buffers, in different concentrations and with...

Keywords: Bioinformatics, Microarray data analysis, Plotting data, R, Visualisation

Simple plotting in R https://tess.elixir-europe.org/materials/simple-plotting-in-r This quick and simple tutorial demonstrates some of the easy plotting tools in the R core software. the data are in the "data" directory and come from an old two-colour microarray experiment where spots on the array were printed with different buffers, in different concentrations and with different pins.  Also, both channels Cy5 and Cy3 were spotted identically, and so the fold ratio for all spots should be 1 - this helps us see bias in the data Bioinformatics, Microarray data analysis, Plotting data, R, Visualisation Beginners PhD students Researchers Scientists 2014-01-14 2017-10-09
Quality Control of Illumina Sequencing data in the cloud

This course was created as part of the SeqAhead project and contains both powerpoint presentations and course materials that teach students how to set up an Amazon EC2 virtual machine, connect to it, download some Illumina data from  the short read archive (SRA), assess the quality of the data...

Keywords: Bioinformatics, Cloud, Next generation sequencing, Next generation sequencing data analysis, Ngs bioinformatics, Qc ngs data

Quality Control of Illumina Sequencing data in the cloud https://tess.elixir-europe.org/materials/quality-control-of-illumina-sequencing-data-in-the-cloud This course was created as part of the SeqAhead project and contains both powerpoint presentations and course materials that teach students how to set up an Amazon EC2 virtual machine, connect to it, download some Illumina data from  the short read archive (SRA), assess the quality of the data and learn to trim both bad quality bases and adapter sequences Bioinformatics, Cloud, Next generation sequencing, Next generation sequencing data analysis, Ngs bioinformatics, Qc ngs data Beginners 2014-01-13 2017-10-09
ArrayExpress: Quick tour

This quick tour provides an overview of EMBL-EBI’s functional genomics database ArrayExpress. This course was updated in December 2015. An undergraduate-level understanding of biology is an advantage. You may wish to have a look at our Functional genomics: An introduction to EMBL-EBI...

Scientific topics: Functional genomics, Gene expression and microarray, Gene expression

Keywords: Gene Expression

ArrayExpress: Quick tour https://tess.elixir-europe.org/materials/arrayexpress-quick-tour This quick tour provides an overview of EMBL-EBI’s functional genomics database ArrayExpress. This course was updated in December 2015. An undergraduate-level understanding of biology is an advantage. You may wish to have a look at our Functional genomics: An introduction to EMBL-EBI resources before taking this course. Functional genomics Gene expression and microarray Gene expression Gene Expression Beginners 2016-06-14