Output
This document describes the output produced by the pipeline. Most of the plots are taken from the MultiQC report, which summarises results at the end of the pipeline.
The directories listed below will be created in the results directory after the pipeline has finished. All paths are relative to the top-level results directory.
FASTQC
This folder contains FastQC reports for Illumina data, both before and after trimming. `Raw_Reads` and `Trimmed_Reads` are the output directories.
- Output: Quality metrics for raw and trimmed short-read data.
Output files
* `fastqc/`
    * `*_fastqc.html`: FastQC report containing quality metrics.
    * `*_fastqc.zip`: Zip archive containing the FastQC report, tab-delimited data file and plot images.
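As a minimal sketch of how the `*_fastqc.zip` archive can be inspected without unpacking it, the snippet below reads the per-module PASS/WARN/FAIL statuses. It assumes the archive contains a `summary.txt` file with tab-delimited lines of the form status, module name, file name; the function names are hypothetical and not part of the pipeline.

```python
import zipfile


def parse_summary(text):
    """Parse tab-delimited lines of the form STATUS<TAB>module<TAB>filename."""
    statuses = {}
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) >= 2:
            # fields[0] is the status, fields[1] the FastQC module name.
            statuses[fields[1]] = fields[0]
    return statuses


def read_fastqc_summary(zip_path):
    """Return {module name: status} from summary.txt inside a FastQC zip.

    Assumption: the archive holds a single summary.txt somewhere in its tree.
    """
    with zipfile.ZipFile(zip_path) as archive:
        name = next(n for n in archive.namelist() if n.endswith("summary.txt"))
        text = archive.read(name).decode("utf-8")
    return parse_summary(text)
```

Calling `read_fastqc_summary` on each archive under `Raw_Reads` and `Trimmed_Reads` gives a quick way to compare module statuses before and after trimming.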
NANOPLOT
This folder contains NanoPlot reports for ONT data, both before and after trimming. `Raw_Reads` and `Trimmed_Reads` are the output directories.
- Output: Quality metrics for long-read data.
Output files
* `nanoplot/`
    * `*_QualityScatterPlot_dot.html` & `.png`: NanoPlot report containing quality metrics.
    * `*WeightedHistogramReadlength.html` & `.png`: Weighted histogram of read lengths.
    * `*NonWeightedHistogramReadlength.html` & `.png`: Non-weighted histogram of read lengths.
    * `NanoPlot-report.html`: Cumulative report of all the above metrics.
    * `NanoStats.txt`: Cumulative statistics on read lengths, read quality, base-pair quality and N50 metrics.
FASTP
This folder contains adapter-trimmed reads for both short-read and long-read data.
- Output: Adapter-trimmed reads for both long-read and short-read data.
Output files
* `fastp/`
    * `*_fastp.html`: fastp report on the trimmed reads and post-trimming quality metrics.
    * `*_fastp.json`: JSON file for the above report.
    * `*_fastp.fastq.gz`: Trimmed FASTQ file for both short and long reads.
Qualimap/BAMQC
This folder contains BAMQC reports for aligned reads. It includes quality metrics and coverage statistics for BAM files.
- Output: Quality metrics and coverage statistics reports.
Output files
* `Bamqc/`
    * `*qualimapReport.html`: Qualimap BAMQC report for the aligned BAM file.
    * `*raw_data_qualimapReport`: Folder reporting coverage, GC content, nucleotide content and mapping quality metrics.
    * `*genome_results.txt`: Cumulative report containing all the above metrics.
ALIGNMENT/MINIMAP2
This folder contains alignment files generated using Minimap2.
- Output: Aligned reads in BAM format.
Output files
* `minimap2/`
    * `*.bam`: Aligned BAM files.
SAMTOOLS
This folder contains BAM file statistics and reference indexing.
- Output: Statistics on each BAM file and a reference index.
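The per-BAM statistics in this folder can be summarised programmatically. The sketch below assumes the `*.flagstat` files hold the default plain-text output of `samtools flagstat`, where each line reads `<QC-passed> + <QC-failed> <description>`; the helper names are hypothetical.

```python
import re

# One flagstat line: "<passed> + <failed> <description>"
FLAGSTAT_LINE = re.compile(r"^(\d+) \+ (\d+) (.+)$")


def parse_flagstat(text):
    """Return a list of (qc_passed, qc_failed, description) tuples."""
    records = []
    for line in text.splitlines():
        match = FLAGSTAT_LINE.match(line.strip())
        if match:
            passed, failed, description = match.groups()
            records.append((int(passed), int(failed), description))
    return records


def qc_passed_count(records, prefix):
    """QC-passed count of the first record whose description starts with prefix."""
    for passed, _failed, description in records:
        if description.startswith(prefix):
            return passed
    raise KeyError(prefix)


def mapped_fraction(records):
    """Fraction of QC-passed reads that are mapped (0.0 if there are no reads)."""
    total = qc_passed_count(records, "in total")
    mapped = qc_passed_count(records, "mapped")
    return mapped / total if total else 0.0
```

Descriptions are matched by prefix rather than used as dictionary keys because several flagstat lines share the same leading text and differ only in a trailing parenthetical.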
Output files
* `Samtools/`
    * `*.flagstat`: Primary statistics on aligned BAM files.
    * `*.stats`: General statistics.
    * `*.reference.fasta.fai`: Reference index.
PRIMERTRIMMING
This folder contains reads with primers trimmed using `samtools ampliconclip` for ONT data and `ivar trim` for Illumina data.
- Output: BAM files with primers trimmed.
Output files
* `iVarTrim (Illumina)`
    * `*.ivar_trim.bam`: iVar-trimmed BAM files; the primers used here are SARS-CoV-2 based.
* `AmpliconClip (Oxford Nanopore)`
    * `*.ampliconclip.clipallowed.bam`: BAM files trimmed with `samtools ampliconclip`.
VariantCalling
This folder contains variant calling and demixing results generated using Freyja.
- Output: Variant calls and demixed sequences.
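The demixing results can be turned into a lineage-to-abundance mapping for downstream use. The sketch below assumes the Freyja demix `*.tsv` uses a row-wise layout in which each line is a key, a tab, and a value, with a `lineages` row and an `abundances` row holding space-separated entries in matching order. That layout is an assumption and should be checked against an actual output file; `parse_demix` is a hypothetical helper.

```python
def parse_demix(text):
    """Return {lineage: abundance} from the text of a Freyja demix TSV.

    Assumed layout: one "key<TAB>value" pair per line, where the 'lineages'
    and 'abundances' values are space-separated lists of equal length.
    """
    rows = {}
    for line in text.splitlines():
        key, sep, value = line.partition("\t")
        # Skip lines without a tab or with an empty key (e.g. a header line).
        if sep and key:
            rows[key] = value
    lineages = rows.get("lineages", "").split()
    abundances = [float(x) for x in rows.get("abundances", "").split()]
    return dict(zip(lineages, abundances))


def read_demix(path):
    """Read a demix file from disk and parse it."""
    with open(path, encoding="utf-8") as handle:
        return parse_demix(handle.read())
```

If either row is missing, the function returns an empty mapping rather than raising, which keeps batch processing of many samples simple.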
Output files
* `iVarTrim/VarCalls`
    * `*.tsv`: Variant calls from iVar-trimmed BAM files.
    * `*.mpileup`: mpileup files from iVar-trimmed BAM files only.
* `Freyja/VarCalls`
    * `*.depth.tsv`: Base-pair depth at each position.
    * `*.variants.tsv`: Variant calls from both iVar-trimmed and ampliconclip-trimmed BAM files.
* `Freyja/Demix`
    * `*.tsv`: Lineage abundances of SARS-CoV-2.
MultiQC
This folder contains an aggregated report describing quality control results from the pipeline.
- Output: MultiQC report.
Output files
* `multiqc/`
    * `multiqc_report.html`: A standalone HTML file that can be viewed in your web browser.
    * `multiqc_data/`: Directory containing parsed statistics from the different tools used in the pipeline.
    * `multiqc_plots/`: Directory containing static images from the report in various formats.
Execution Reports
This folder contains report metrics generated during the workflow execution.
- Output: Execution-based reports.