nextclade
Bioinformatics tool for virus genome alignment, clade assignment and qc checks. More information: https://docs.nextstrain.org/projects/nextclade/en/stable/user/nextclade-cli/index.html.
- Align sequences to user provided [r]eference, [o]utputting the alignment to a file:
nextclade run {{path/to/sequences.fa}} -r {{path/to/reference.fa}} -o {{path/to/alignment.fa}}
- Create a [t]SV report, auto-downloading the latest [d]ataset:
nextclade run {{path/to/fasta}} -d {{dataset_name}} -t {{path/to/report.tsv}}
- List all available datasets:
nextclade dataset list
- Download the latest SARS-CoV-2 dataset:
nextclade dataset get --name sars-cov-2 --output-dir {{path/to/directory}}
- Use a downloaded [D]ataset, producing all [O]utputs:
nextclade run -D {{path/to/dataset_dir}} -O {{path/to/output_dir}} {{path/to/sequences.fasta}}
- Run on multiple files:
nextclade run -d {{dataset_name}} -t {{path/to/output_tsv}} -- {{path/to/input_fasta_1 path/to/input_fasta_2 ...}}
- Try reverse complement if sequence does not align:
nextclade run --retry-reverse-complement -d {{dataset_name}} -t {{path/to/output_tsv}} {{path/to/input_fasta}}