HTSMultiQC-cleaning-report

Loading report..

Highlight Samples

Regex mode off

    Rename Samples

    Click here for bulk input.

    Paste two columns of a tab-delimited table here (eg. from Excel).

    First column should be the old name, second column the new name.

    Regex mode off

      Show / Hide Samples

      Regex mode off

        Export Plots

        px
        px
        X

        Download the raw data used to create the plots in this report below:

        Note that additional data was saved in HTSMultiQC-cleaning-report_multiqc_report_data when this report was generated.


        Choose Plots

        If you use plots from MultiQC in a publication or presentation, please cite:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411

        Save Settings

        You can save the toolbox settings for this report to the browser.


        Load Settings

        Choose a saved report profile from the dropdown box below:

        About MultiQC

        This report was generated using MultiQC, version 1.10.dev0

        You can see a YouTube video describing how to use MultiQC reports here: https://youtu.be/qPbIlO_KWN0

        For more information about MultiQC, including other videos and extensive documentation, please visit http://multiqc.info

        You can report bugs, suggest improvements and find the source code for MultiQC on GitHub: https://github.com/ewels/MultiQC

        MultiQC is published in Bioinformatics:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411

        HTSMultiQC-cleaning-report

        A modular tool to aggregate results from bioinformatics analyses across many samples into a single report.

        Report generated on 2021-07-12, 11:24 based on data in: /share/workshop/gwas_workshop/jli/01-HTS_Preproc


        HTStream

        HTStream quality control and processing pipeline for High Throughput Sequencing data.

        Processing Overview

        General statistics from the HTStream pipeline.


        Preprocessing Statistics

        loading..

        Fragment Reduction

        Provides scaled statistics collected throughout the preprocessing pipeline, highlighting variable statistics across experiment.

        loading..

        Basepair Reduction

        Provides scaled statistics collected throughout the preprocessing pipeline, highlighting variable statistics across experiment.

        loading..

        hts_Stats

        Generates a JSON formatted file containing a set of statistical measures about the input read data.

        Showing 3/3 rows and 6/6 columns.
        Sample Name% PE% R1 Q30% R2 Q30GC ContentN ContentNotes
        SL378587_htsStats
        100.00%
        91.37%
        88.00%
        38.36%
        0.0009%
        RawReads stats
        SL378588_htsStats
        100.00%
        91.29%
        88.64%
        38.98%
        0.0005%
        RawReads stats
        SL378589_htsStats
        100.00%
        91.22%
        88.42%
        38.97%
        0.0004%
        RawReads stats

        Read Lengths: Paired End

        Distribution of read lengths for each sample.

        loading..

        Base by Cycle: Paired End

        Provides a measure of the uniformity of a distribution. The higher the average is at a certain position, the more unequal the base pair composition. N's are excluded from this calculation.

        loading..

        Quality by Cycle: Paired End

        Mean quality score for each position along the read. Sample is colored red if less than 60% of bps have mean score of at least Q30, orange if between 60% and 80%, and green otherwise.

        loading..

        hts_SeqScreener

        A simple sequence screening tool which uses a kmer lookup approach to identify reads from an unwanted source.

        Notice: No hits in any sample.

        hts_SuperDeduper

        A reference free duplicate read removal tool.

        Showing 3/3 rows and 3/3 columns.
        Sample Name% Duplicates% IgnoredNotes
        SL378587_htsStats
        5.90%
        0.01%
        remove PCR duplicates
        SL378588_htsStats
        6.70%
        0.01%
        remove PCR duplicates
        SL378589_htsStats
        6.25%
        0.01%
        remove PCR duplicates

        SuperDeduper: Duplicate Saturation

        Plots the number of duplicates against the number of unique reads per sample.

        loading..

        hts_AdapterTrimmer

        Trims adapters which are sequenced when the fragment insert length is shorter than the read length.

        Showing 3/3 rows and 4/4 columns.
        Sample Name% Bp Lost% AdaptersAvg. Bps TrimmedNotes
        SL378587_htsStats
        3.94%
        29.71%
        39.98
        trim adapters
        SL378588_htsStats
        2.58%
        19.28%
        40.37
        trim adapters
        SL378589_htsStats
        2.73%
        20.72%
        39.85
        trim adapters

        AdapterTrimmer: Trimmed Basepairs Composition

        Composition of basepairs trimmed from the ends of paired end and single end reads.

        loading..

        hts_QWindowTrim

        Uses a sliding window approach to remove the low quality ends of reads.

        Showing 3/3 rows and 5/5 columns.
        Sample Name% Bp Lost% R1 of Bp Lost% R2 of Bp LostAvg. Bps TrimmedNotes
        SL378587_htsStats
        0.15%
        37.45%
        62.55%
        0.43
        trim low qulity bases from ends of reads
        SL378588_htsStats
        0.19%
        36.72%
        63.28%
        0.55
        trim low qulity bases from ends of reads
        SL378589_htsStats
        0.17%
        37.20%
        62.80%
        0.49
        trim low qulity bases from ends of reads

        QWindowTrim: Trimmed Basepairs Composition

        Plots the number of low quality basepairs trimmed from ends of paired end and single end reads.

           
        loading..

        hts_NTrimmer

        Trims reads to the longest subsequence that contains no N's.

        Showing 3/3 rows and 5/5 columns.
        Sample NameTotal Bp Lost% R1 of Bp Lost% R2 of Bp Lost% DiscardedNotes
        SL378587_htsStats
        390
        81.54%
        18.46%
        0.00%
        remove any remanining N characters
        SL378588_htsStats
        746
        86.33%
        13.67%
        0.00%
        remove any remanining N characters
        SL378589_htsStats
        319
        62.38%
        37.62%
        0.00%
        remove any remanining N characters

        NTrimmer: Trimmed Basepairs Composition

        Plots the number of N bases trimmed from ends of paired end and single end reads.

           
        loading..

        hts_LengthFilter

        Discards reads below a minimum length threshold.

        Showing 3/3 rows and 2/2 columns.
        Sample Name% PE LostNotes
        SL378587_htsStats
        0.52%
        remove reads < 50bp
        SL378588_htsStats
        0.62%
        remove reads < 50bp
        SL378589_htsStats
        0.59%
        remove reads < 50bp

        hts_Stats 2

        Generates a JSON formatted file containing a set of statistical measures about the input read data.

        Showing 3/3 rows and 6/6 columns.
        Sample Name% PE% R1 Q30% R2 Q30GC ContentN ContentNotes
        SL378587_htsStats
        100.00%
        93.70%
        90.59%
        37.67%
        0.0000%
        final stats
        SL378588_htsStats
        100.00%
        93.36%
        90.74%
        38.55%
        0.0000%
        final stats
        SL378589_htsStats
        100.00%
        93.42%
        90.54%
        38.51%
        0.0000%
        final stats

        Read Lengths: Paired End

        Distribution of read lengths for each sample.

        loading..

        Base by Cycle: Paired End

        Provides a measure of the uniformity of a distribution. The higher the average is at a certain position, the more unequal the base pair composition. N's are excluded from this calculation.

        loading..

        Quality by Cycle: Paired End

        Mean quality score for each position along the read. Sample is colored red if less than 60% of bps have mean score of at least Q30, orange if between 60% and 80%, and green otherwise.

        loading..