NAME¶

last-train - Try to find suitable score parameters for aligning the given sequences

SYNOPSIS¶

last-train [options] lastdb-name sequence-file(s)

Try to find suitable score parameters for aligning the given sequences.

--revsym: force reverse-complement symmetry
--matsym: force symmetric substitution matrix
--gapsym: force insertion/deletion symmetry
--pid=PID: skip alignments with > PID% identity (default: 100)
--postmask=NUMBER: skip mostly-lowercase alignments (default=1)
--sample-number=N: number of random sequence samples (default: 20000 if --codon else 500)
--sample-length=L: length of each sample (default: 2000)
--scale=S: output scores in units of 1/S bits
--codon: DNA queries & protein reference, with frameshifts

-r SCORE: match score (default: 6 if Q>=1, else 5 if DNA)
-q COST: mismatch cost (default: 18 if Q>=1, else 5 if DNA)
-p NAME: match/mismatch score matrix (default: BL62 if protein)
-a COST: gap existence cost (default: 15 if DNA, 11 if protein, 21 if Q>=1)
-b COST: gap extension cost (default: 3 if DNA, 2 if protein, 9 if Q>=1)
-A COST: insertion existence cost
-B COST: insertion extension cost
-F LIST: frameshift probabilities: del-1,del-2,ins+1,ins+2 (default: 1-b,1-b,1-B,1-B)

-D LENGTH: query letters per random alignment (default: total sample length)
-E EG2: maximum expected alignments per square giga
-s STRAND: 0=reverse, 1=forward, 2=both (default: 2 if DNA and not lastdb -S2, else 1)
-S NUMBER: score matrix applies to forward strand of: 0=reference, 1=query (default: 1)
-C COUNT: omit gapless alignments in COUNT others with > scoreper-length
-T NUMBER: type of alignment: 0=local, 1=overlap (default: 0)
-R DIGITS: lowercase & simple-sequence options
-m COUNT: maximum initial matches per query position (default: 10)
-k STEP: use initial matches starting at every STEP-th position in each query (default: 1)
-P THREADS: number of parallel threads
-X NUMBER: N/X is ambiguous in: 0=neither sequence, 1=reference, 2=query, 3=both (default=0)
-Q NAME: input format: fastx, sanger (default=fasta)

February 2024

last-train 1542

Source file:	last-train.1.en.gz (from last-align 1542-1)
Source last updated:	2024-02-20T23:06:34Z
Converted to HTML:	2024-02-21T03:35:40Z