☰ Menu

        Sept. 2019 Microbial Community Analysis Workshop

Home
Introduction and Lectures
Intro to the Workshop and Core
What is Bioinformatics?
Experimental Design and Cost Estimation
Introduction to Command-Line and the Cluster
Logging in and Transferring Files
Intro to Command-Line
Advanced Command-Line (extra)
Running jobs on the Cluster and using modules
Intro to R and Rstudio
Getting Started
Intro to R
Prepare Data in R (extra)
Data in R (extra)
dbcAmplicons
dbcAmplicons Installing Software
dbcAmplicons - Amplicons talk
dbcAmplicons - Bioinformatics talk
Dataset and Metadata
dbcAmplicons - Data processing
dbcAmplicons w/Dada2
Coming soon
Microbial Community Analysis in R
Prepare MCA Analysis
MCA Analysis in phyloseq
Support
Cheat Sheets
Software and Links
Scripts
ETC
Closing thoughts
Workshop Photos
Github
Biocore website

dbcAmplicons pipeline: Amplicons

Amplicons the ‘old’ way

amplicons_figure1

dbcAmplicons

Originally conceived in late 2012 to lower per sample costs on relatively short, targeted (PCR) regions:

Uses the Illumina platform (mainly the MiSeq), capably of pooling thousands, or even tens of thousands of barcoded samples/targets per sequencing run.

Core Facility friendly, facilitates interactions between and across individual labs, standardizing workflows.

amplicons_figure2

Amplicons: Two Step PCR Approach

amplicons_figure3

Multiplex multiple amplicons targets

amplicons_figure4

Primer Design

amplicons_figure5 Prokaryote 16S Gene

PCR1 Template specific primer design

Each primer pair contains the following parts

amplicons_figure6

Examples PCR1 Template Specific Primers 16S V1-V3 (27F and 534R)

amplicons_figure7

PCR2 Barcoded Illumina Adapter Primers

Examples PCR2 Barcoded Illumina Adapter Primers

amplicons_figure8

Final product

amplicons_figure8b

QA/QC What is a “good” library?

amplicons_figure9

Benefits

DrawBacks

Nucleotide diversity

Critically important for imaging clusters, and data quality!

amplicons_figure10

Once a sample library is converted to clusters on a flow cell, “nucleotide diversity” refers to the distribution of nucleotides across the flow cell at any given cycle. From the viewpoint of the instrument software, a high diversity library translates into analyzing images containing an even distribution of spots from 4 different color channels corresponding to the 4 nucleotide bases A, T, C & G. In contrast, an unbalanced nucleotide distribution or “low diversity library” means that for any given image, or to two bases are present at a high percentage.

LOW Diversity Library amplicons_figure11

HIGH Diversity Library amplicons_figure12

Ways to Ensure Nucleotide Diversity

Appropriate nucleotide diversity and cluster density are important for high quality data. Low nucleotide diversity in combination with high cluster density will most-likely lead to poor data quality and/or low data yield.

Note: Experience has shown, that 15% shotgun spike-in, plus phase-shifted primers and/or multiple target region typically yields good results.