Aarhus Universitets segl

Gut microbiota 16S rRNA gene data analysis and bioinformatics

Publikation: AndetUdgivelser på nettet - Net-publikationUndervisning

DOI

In this module, I will walk you through the necessary steps involved in the analysis of 16S rRNA microbiome amplicons data from raw sequences to publication quality visualizations and statistical analysis. Non-cultured 16S rRNA metagenomics is a promising method for understanding the ecology of an environment in regards with the number and the structure of the microbiome in association with the environmental factors, e.g. host-microbiome interactions. In prokaryotes there is a ubiquitous gene compartment integrated in the ribosome, so-called 16S rRNA genes, which are highly conserved among prokaryotes and at the same time having hypervariable regions (HVRs) V1 to V9, which are good targets for evolutionary and ecological studies on prokaryotes Jünemann et. al (2017). This module is mainly focused on 16S rRNA gene data, but I can carefully say that you can apply most of the techniques explained here to genome data and count multivariate datasets.
Note: all this workflow has been done on Jupyter notebook on a cluster node with 120 GB processer from Aarhus University, Denmark. In order to multitask in different nodes, tasks on Qiime2 have been summited to the cluster by separate bash scripts.

This module includes the following steps:

Steps in Qiime
1. Importing raw data into Qiime2
2. Filtering, dereplication, sample inference, chimera identification, and merging of paired-end reads by DADA2 package in qiime2
3. Training a primer-based region-specific classifier for taxonomic classification by Naïve-Bayes method (in Qiime2)
4. Creating a phylogenetic tree using SATE-enabled phyhlogenetic placement (SEPP) method
Steps in R
1. Importing unrooted tree, ASV table and repseqs with the metadata to R using qiime2R package into R
2. Preprocessing and cleaning up the sequences
3. Alpha diversity
4. Statistical analysis on alpha diversity metrics
5. Beta diversity
6. Statistical analysis on beta diversity metrics: a distance-based redundancy analysis (dbRDA) model
7. Differential abundance analysis of taxa by DESeq2
8. Network-based (graph-based) analysis
9. Analysis of chemical biomarkers for gut microbiome: Heatmap correlation
OriginalsprogEngelsk
Udgivelsesåraug. 2022
DOI
StatusUdgivet - aug. 2022

Se relationer på Aarhus Universitet Citationsformater

ID: 277650274