Skip to content

Output

This document describes the output produced by the pipeline. Most of the plots are taken from the MultiQC report, which summarises results at the end of the pipeline.

The directories listed below will be created in the results directory after the pipeline has finished. All paths are relative to the top-level results directory.

FASTQC

This folder contains FastQC reports for ILLUMINA data, both pre and post trimming. Raw_Reads and Trimmed_Reads are the output directories. - Output: Quality metrics for raw and trimmed short-read data.

Output files
  • fastqc/
    • *_fastqc.html: FastQC report containing quality metrics.
    • *_fastqc.zip: Zip archive containing the FastQC report, tab-delimited data file and plot images.

NANOPLOT

This folder contains NanoPlot reports for ONT data, both pre and post trimming. Raw_Reads and Trimmed_Reads are the output directories. - Output: Quality metrics for long-read data.

Output files * `nanoplot/` * `*_QualityScatterPlot_dot.html &.png`: Nanoplot report containing quality metrics. * `*WeightedHistogramReadlength.html & .png`: Weighted histogram on read lengths. * `*NonWeightedHistogramReadlength.html & .png`: Non-Weighted histogram on read lengths. * `NanoPlot-report.html`: Cumulative report of all the above metrics. * `NanoStats.txt`: Cumulative statistics of Read lengths, read quality, basepair quality, N50 metrics.

FASTP

This folder contains trimmed reads from both short and long reads. - Output: Adapter trimmed reads for both long and short-read data.

Output files * `fastp/` * `*_fastp.html`: Fastp report of trimmed reads and post trimming quality metrics. * `*_fastp.json`: json file for the above report. * `*_fastp.fastq.gz`: a trimmed fastq file for both short and long-reads

Qualimap/BAMQC

This folder contains BAMQC reports for aligned reads. It includes quality metrics and coverage statistics for BAM files. - Output: Quality metrics and coverage statistics reports.

Output files * `Bamqc/` * `*qualimapReport.html`: Qualimap - Bamqc report of aligned BAM file. * `*raw_data_qualimapReport`: Coverage, gc content, nucleotide content, mapping quality metrics are reported in this folder. * `*genome_results.txt`: cumulative report containing all the above metrics.

ALIGNMENT/MINIMAP2

This folder contains alignment files generated using Minimap2. - Output: Aligned reads in BAM format.

Output files * `minimap2/` * `*.bam`: Aligned bam files

SAMTOOLS

This folder contains BAM file statistics and reference indexing. - Output: Statistics on each BAM file and a reference index.

Output files * `Samtools/` * `*.flagstat`: Primary statistics on aligned bam files * `*.stats` : General statistics * `*.reference.fasta.fai`: Reference index

PRIMERTRIMMING

This folder contains reads with trimmed primers using samtools ampliconclip for ONT data and ivar trim for Illumina data. - Output: BAM files with primers trimmed.

Output files * `iVarTrim (Illumina)` * `*.ivar_trim.bam`: ivar trimmed bam files, the primers used here are SARS-CoV2 based. * `AmpliconClip (Oxford Nanopore)` * `*.ampliconclip.clipallowed.bam`: samtools amplicon clip trimmed bam files.

VariantCalling

This folder contains variant calling and demixing results using Freyja. - Output: Variant calls and demixed sequences.

Output files * `iVarTrim/VarCalls` * `*.tsv`: ivar trimmed variant calls * `*.mpileup` : mpileup files from ivar trimmed BAM files only. * `Freyja/VarCalls` * `*.depth.tsv` : Basepair depth at each position. * `*.variants.tsv` : Variant calls from both ivar and ampliconclip trimmed BAM files * `Freyja/Demix` * `*.tsv` : Lineage abundances of Sars-cov-2

MultiQC

This folder contains an aggregated report describing quality control results from the pipeline. - Output: MultQC report.

Output files * `multiqc/` * `multiqc_report.html`: a standalone HTML file that can be viewed in your web browser. * `multiqc_data/`: directory containing parsed statistics from the different tools used in the pipeline. * `multiqc_plots/`: directory containing static images from the report in various formats.

Execution Reports

This folder contains report metrics generated during the workflow execution. - Output: Execution based reports.

Output files * `pipeline_info/` * Reports generated by Nextflow: `execution_report.html`, `execution_timeline.html`, `execution_trace.txt` and `pipeline_dag.dot`/`pipeline_dag.svg`. * Reports generated by the pipeline: `pipeline_report.html`, `pipeline_report.txt` and `software_versions.tsv`. * Reformatted samplesheet files used as input to the pipeline: `samplesheet.valid.csv`.

Last update: 2024-11-02