hapcut2-whatshap

Phase small variants with HapCUT2 and haplotag long-read DNA sequencing BAM files using WhatsHap

usage

nexus run --nf-workflow haplotagging_hapcut2-whatshap.nf \
    -c nextflow.config \
    -w work/ \
    --samples_tsv_file samples.tsv \
    --output_dir results/ \
    --reference_genome_fasta_file /path/to/file.fasta \
    --read_technology "pacbio" \
    --params_extracthairs "" \
    --params_hapcut2 "" \
    --params_whatshap_haplotag "--ignore-read-groups --skip-missing-contigs --output-threads 4"
Note

Nextflow config files are available here. Use the config file that matches your installed nexus version (e.g. nexus_v0.2.0_nextflow_slurm.config).

parameters

parameter description
--samples_tsv_file TSV file with the following columns: ‘sample_id’, ‘bam_file’, ‘bam_bai_file’, ‘small_variants_vcf_file’ (VCF may be plain ‘.vcf’ or ‘.vcf.gz’)
--output_dir Directory to which output files will be copied.
--reference_genome_fasta_file Reference genome FASTA file.
--read_technology Read technology for extractHAIRS (choices: pacbio, ont, illumina; default: ‘pacbio’).
--params_extracthairs Extra extractHAIRS parameters (default: ‘““’). Note that the parameters need to be wrapped in quotes.
--params_hapcut2 Extra HAPCUT2 parameters (default: ‘““’). Note that the parameters need to be wrapped in quotes.
--params_whatshap_haplotag Whatshap ‘haplotag’ parameters (default: ‘“--ignore-read-groups --skip-missing-contigs --output-threads 4”’). Note that the parameters need to be wrapped in quotes.