Scroll to navigation

DIAMOND(1) User Commands DIAMOND(1)

NAME

diamond - accelerated BLAST compatible local sequence aligner

SYNOPSIS

diamond COMMAND [OPTIONS]

DESCRIPTION


DIAMOND is a sequence aligner for protein and translated DNA searches
and functions as a drop-in replacement for the NCBI BLAST software
tools. It is suitable for protein-protein search as well as DNA-protein
search on short reads and longer sequences including contigs and
assemblies, providing a speedup of BLAST ranging up to x20,000.

COMMANDS

Build DIAMOND database from a FASTA file
Align amino acid query sequences against a protein reference database
Align DNA query sequences against a protein reference database
View DIAMOND alignment archive (DAA) formatted file
Produce help message
Display version information
Retrieve sequences from a DIAMOND database file

OPTIONS

General options:

number of CPU threads
database file
output file
output format
0 = BLAST pairwise
5 = BLAST XML
6 = BLAST tabular
100 = DIAMOND alignment archive (DAA)
101 = SAM
Value 6 may be followed by a space-separated list of these keywords:
qseqid means Query Seq - id
qlen means Query sequence length
sseqid means Subject Seq - id
sallseqid means All subject Seq - id(s), separated by a ';'
slen means Subject sequence length
qstart means Start of alignment in query
qend means End of alignment in query
sstart means Start of alignment in subject
send means End of alignment in subject
qseq means Aligned part of query sequence
sseq means Aligned part of subject sequence
evalue means Expect value
bitscore means Bit score
score means Raw score
length means Alignment length
pident means Percentage of identical matches
nident means Number of identical matches
mismatch means Number of mismatches
positive means Number of positive - scoring matches
gapopen means Number of gap openings
gaps means Total number of gaps
ppos means Percentage of positive - scoring matches
qframe means Query frame
btop means Blast traceback operations(BTOP)
stitle means Subject Title
salltitles means All Subject Title(s), separated by a '<>'
qcovhsp means Query Coverage Per HSP
qtitle means Query title
Default: qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
verbose console output
enable debug log
disable console output

Makedb options:

input reference file in FASTA format

Aligner options:

input query file
file for unaligned queries
report unaligned queries (0=no, 1=yes)
maximum number of target sequences to report alignments for
report alignments within this percentage range of top alignment score (overrides --max-target-seqs)
compression for output files (0=none, 1=gzip)
maximum e-value to report alignments
minimum bit score to report alignments (overrides e-value setting)
minimum identity% to report an alignment
minimum query cover% to report an alignment
minimum subject cover% to report an alignment
enable sensitive mode (default: fast)
enable more sensitive mode (default: fast)
sequence block size in billions of letters (default=2.0)
number of chunks for index processing
directory for temporary files
gap open penalty (default=11 for protein)
gap extension penalty (default=1 for protein)
score matrix for protein alignment (default=BLOSUM62)
file containing custom scoring matrix
lambda parameter for custom matrix
K parameter for custom matrix
enable composition based statistics (0/1=default)
enable SEG masking of queries (yes/no)
genetic code to use to translate query (see user manual)
print full subject titles in output files
suppress reporting of identical self hits

Advanced options:

ignore translated sequences without an open reading frame of at least this length
number of standard deviations for ignoring frequent seeds
minimum number of identities for stage 1 hit
window size for local hit search
xdrop for ungapped alignment
minimum alignment score to continue local extension
band for hit verification
minimum score to keep a tentative alignment
xdrop for gapped alignment in bits
band for dynamic programming computation
number of seed shapes (0 = all available)
seed shapes
index mode (0=4x12, 1=16x9)
trace point fetch size
include subjects within this range of max-target-seqs
include subjects within this ratio of last hit
maximum number of HSPs per subject sequence to save for each query
effective database size (in letters)
disable auto appending of DAA and DMND file extensions
number of target sequences to fetch for seed extension

View options

DIAMOND alignment archive (DAA) file
only show alignments of forward strand

Getseq options

Sequence numbers to display.

AUTHOR

This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.

January 2017 diamond 0.8.31