Analyse variants from nanopore reads¶
Apply the variant calling pipeline to reads from Nanopore sequencer.
Purpose¶
- Analyse variants in Nanopore reads
Required inputs¶
- Single-end reads from Nanopore sequencer in gzipped fastq format
- Each sample is represented by a single gzipped fastq file
- Reference genome in fasta format
|-- reads/original
|-- <sample_1>.fastq.gz
|-- <sample_2>.fastq.gz
|-- reference/<reference>
|-- <reference>.fa
Generated outputs¶
- List of identified variants in VCF file, filtered by user-defined criteria
- Summary PDF report to assess quality of reads, mapping and variant calling
Example¶
How to run example:
cd /usr/local/snakelines/example/nanopore
snakemake \
--snakefile ../../snakelines.snake \
--configfile config.yaml \
--use-conda
Example configuration:
platform: nanopore
sequencing: single_end
samples:
- name: example_.*
reference: sars_cov_2
report_dir: report/example
threads: 16
reference:
index:
fai:
method: samtools
reads:
preprocess:
trimmed:
method: trimmomatic
temporary: False
crop: 500
quality: 20
headcrop: 20
minlen: 35
report:
quality_report:
method: fastqc
read_types:
- original
mapping:
mapper:
method: minimap2
index:
method: samtools
postprocess:
sorted:
method: samtools
report:
quality_report:
method: qualimap
map_types:
- original
variant:
caller:
method: clair
lineage:
caller:
method: pangolin
summary:
method: cat