call-rna-vars
Call RNA variants in a long-read RNA-seq BAM file.
Usage
Template:
exacto call-rna-vars \
--bam-file <bam_file> \
--bam-bai-file <bam_bai_file> \
--reference-genome-fasta-file <reference_genome_fasta_file> \
--reference-gene-annotation-file <reference_gene_annotation_file> \
--reference-gene-annotation-source <reference_gene_annotation_source> \
--reference-gene-annotation-assembly <reference_gene_annotation_assembly> \
--reference-gene-annotation-version <reference_gene_annotation_version> \
--output-dir <output_dir> \
--output-prefix <output_prefix> \
[--num-threads NUM_THREADS] \
[--chunk-size CHUNK_SIZE] \
[--min-mapping-quality MIN_MAPPING_QUALITY] \
[--reference-transcript-scoring-method REFERENCE_TRANSCRIPT_SCORING_METHOD] \
[--reference-transcript-selection-strategy REFERENCE_TRANSCRIPT_SELECTION_STRATEGY] \
[--reference-transcript-top-k REFERENCE_TRANSCRIPT_TOP_K] \
[--reference-transcript-threshold REFERENCE_TRANSCRIPT_THRESHOLD] \
[--min-average-base-quality MIN_AVERAGE_BASE_QUALITY] \
[--temp-dir TEMP_DIR]Example:
exacto call-rna-vars \
--bam-file tumor_transcriptome_assembly.sorted.filtered.bam \
--bam-bai-file tumor_transcriptome_assembly.sorted.filtered.bam.bai \
--reference-genome-fasta-file reference_genome.fasta \
--reference-gene-annotation-file gencode.gtf.gz \
--reference-gene-annotation-source gencode \
--reference-gene-annotation-assembly hg38 \
--reference-gene-annotation-version v44 \
--output-dir rna_variants_outputs/ \
--output-prefix tumorDescription
Call RNA variants in a long-read RNA-seq BAM file.
NoteAt a glance
Inputs: *.bam, *.bam.bai, *.fasta, *.gtf.gz
Outputs: *.tsv (RNA variants), *.tsv (transcript structures) under --output-dir
Typical next step: integrate-vars, translate-structs
Required arguments
| Flag | Type | Description |
|---|---|---|
--bam-file |
str |
Input BAM file. |
--bam-bai-file |
str |
Input BAM.BAI file. |
--reference-genome-fasta-file |
str |
Reference genome FASTA file. |
--reference-gene-annotation-file |
str |
Reference gene annotation file. |
--reference-gene-annotation-source |
str |
Reference gene annotation source. |
--reference-gene-annotation-assembly |
str |
Reference gene annotation assembly (e.g. ‘hg38’). |
--reference-gene-annotation-version |
str |
Reference gene annotation version (e.g. ‘v41’). |
--output-dir |
str |
Output directory. |
--output-prefix |
str |
Output prefix. |
Optional arguments
| Flag | Type | Default | Description |
|---|---|---|---|
--num-threads |
int |
4 |
Number of threads. |
--chunk-size |
int |
10000 |
Chunk size for variant calling. |
--min-mapping-quality |
int |
20 |
Minimum mapping quality. |
--reference-transcript-scoring-method |
str |
cosine_similarity |
Reference transcript scoring method. |
--reference-transcript-selection-strategy |
str |
top_k |
Reference transcript scoring method. |
--reference-transcript-top-k |
int |
1 |
Select top k reference transcripts. |
--reference-transcript-threshold |
float |
0.9 |
Select reference transcripts with scores greater than or equal to the threshold. |
--min-average-base-quality |
float |
30 |
Minimum average base quality. |
--temp-dir |
str |
Temp directory. |