build-transcriptome-var-graph
Build a transcriptome variation graph.
Usage
Template:
exacto build-transcriptome-var-graph \
--transcript-structures-tsv-file <transcript_structures_tsv_file> \
--fasta-file <fasta_file> \
--output-fasta-file <output_fasta_file> \
[--graph-type GRAPH_TYPE] \
[--num-threads NUM_THREADS] \
[--batch-size BATCH_SIZE]Example:
exacto build-transcriptome-var-graph \
--transcript-structures-tsv-file tumor_primary_structures.tsv \
--fasta-file reference_transcriptome.fasta \
--output-fasta-file transcriptome_var_graph.fastaDescription
Build a transcriptome variation graph.
NoteAt a glance
Inputs: *.tsv (transcript structures), *.fasta (reference transcriptome)
Outputs: *.fasta (transcriptome variation graph)
Typical next step: Endpoint — feeds downstream graph-aware analyses
Required arguments
| Flag | Type | Description |
|---|---|---|
--transcript-structures-tsv-file |
str |
Transcript structures TSV file. Expected columns: ‘transcript_model_id’, ‘index’, ‘chromosome_1’, ‘position_1’, ‘operation_1’, ‘strand_1’, ‘chromosome_2’, ‘position_2’, ‘operation_2’, ‘strand_2’, ‘sequence’, ‘num_cycles’. |
--fasta-file |
str |
Reference genome FASTA file (variation graph backbone). |
--output-fasta-file |
str |
Output FASTA file. |
Optional arguments
| Flag | Type | Default | Description |
|---|---|---|---|
--graph-type |
str |
individual |
Variation graph type. Either ‘individual’ or ‘population’. |
--num-threads |
int |
4 |
Number of threads. |
--batch-size |
int |
1000 |
Batch size. |