whatshap

Phase small variants and haplotag long-read DNA sequencing BAM files using Whatshap

usage

nexus run --nf-workflow haplotagging_whatshap.nf \
    -c nextflow.config \
    -w work/ \
    --samples_tsv_file samples.tsv \
    --output_dir results/ \
    --reference_genome_fasta_file /path/to/file.fasta \
    --params_whatshap_phase "--mapq 20" \
    --params_whatshap_haplotag "--ignore-read-groups --skip-missing-contigs --output-threads 4"
Note

Nextflow config files are available here. Use the config file that matches your installed nexus version (e.g. nexus_v0.2.0_nextflow_slurm.config).

parameters

parameter description
--samples_tsv_file TSV file with the following columns: ‘sample_id’, ‘bam_file’, ‘bam_bai_file’, ‘small_variants_vcf_file’
--output_dir Directory to which output files will be copied.
--reference_genome_fasta_file Reference genome FASTA file.
--params_whatshap_phase Whatshap ‘phase’ parameters (default: ‘“--mapq 20”’). Note that the parameters need to be wrapped in quotes.
--params_whatshap_haplotag Whatshap ‘haplotag’ parameters (default: ‘“--ignore-read-groups --skip-missing-contigs --output-threads 4”’). Note that the parameters need to be wrapped in quotes.