.TH GUSTAF 1 "August 2014" "gustaf 1.0.0" "" .SH NAME gustaf \- Gustaf - Generic mUlti-SpliT Alignment Finder: Tool for split-read mapping allowing multiple splits. .SH SYNOPSIS \fBgustaf\fP [\fIOPTIONS\fP] <\fIGENOME FASTA FILE\fP> <\fIREAD FASTA FILE\fP> .br \fBgustaf\fP [\fIOPTIONS\fP] <\fIGENOME FASTA FILE\fP> <\fIREAD FASTA FILE\fP> <\fIREAD FASTA FILE 2\fP> .SH DESCRIPTION GUSTAF uses SeqAns STELLAR to find splits as local matches on different strands or chromosomes. Criteria and penalties to chain these matches can be specified. Output file contains the breakpoints along the best chain. .sp The genome file is used as database input, the read file as query input. .sp All STELLAR options are supported. See STELLAR documentation for STELLAR parameters and options. .sp (c) 2011-2012 by Kathrin Trappe .SH REQUIRED ARGUMENTS .TP \fBFASTA_FILE_1\fP \fIINPUT_FILE\fP Valid filetypes are: \fI.fq\fP, \fI.fastq\fP, \fI.fasta\fP, and \fI.fa\fP. .TP \fBFASTA_FILE_2\fP List of \fIINPUT_FILE\fP's Either one (single-end) or two (paired-end) read files. Valid filetypes are: \fI.fq\fP, \fI.fastq\fP, \fI.fasta\fP, and \fI.fa\fP. .SH OPTIONS .TP \fB-h\fP, \fB--help\fP Display the help message. .TP \fB--version\fP Display version information. .SS Main Options: .TP \fB-tp\fP, \fB--transPen\fP \fIINTEGER\fP Interchromosomal translocation penalty Default: \fI5\fP. .TP \fB-ip\fP, \fB--invPen\fP \fIINTEGER\fP Inversion penalty Default: \fI5\fP. .TP \fB-op\fP, \fB--orderPen\fP \fIINTEGER\fP Intrachromosomal order change penalty Default: \fI0\fP. .TP \fB-oth\fP, \fB--overlapThresh\fP \fIDOUBLE\fP Allowed overlap between matches Default: \fI0.5\fP. .TP \fB-gth\fP, \fB--gapThresh\fP \fIINTEGER\fP Allowed gap length between matches, default value corresponse to expected size of microindels (5 bp) Default: \fI5\fP. .TP \fB-ith\fP, \fB--initGapThresh\fP \fIINTEGER\fP Allowed initial or ending gap length at begin and end of read with no breakpoint (e.g. due to sequencing errors at the end) Default: \fI15\fP. .TP \fB-bth\fP, \fB--breakendThresh\fP \fIINTEGER\fP Allowed initial or ending gap length at begin and end of read that creates a breakend/breakpoint (e.g. for reads extending into insertions) Default: \fI30\fP. .TP \fB-tth\fP, \fB--tandemThresh\fP \fIINTEGER\fP Minimal length of (small) insertion/duplication with double overlap to be considered tandem repeat Default: \fI50\fP. .TP \fB-pth\fP, \fB--breakpoint-pos-range\fP \fIINTEGER\fP Allowed difference in breakpoint position Default: \fI5\fP. .TP \fB-cbp\fP, \fB--complex-breakpoints\fP Disable inferring complex SVs .TP \fB-st\fP, \fB--support\fP \fIINTEGER\fP Number of supporting reads Default: \fI2\fP. .TP \fB-mst\fP, \fB--mate-support\fP \fIINTEGER\fP Number of supporting concordant mates Default: \fI2\fP. .TP \fB-ll\fP, \fB--library-size\fP \fIINTEGER\fP Library size of paired-end reads .TP \fB-le\fP, \fB--library-error\fP \fIINTEGER\fP Library error (sd) of paired-end reads .TP \fB-rc\fP, \fB--revcompl\fP Disable reverse complementing second mate pair input file. .SS Input Options: .TP \fB-m\fP, \fB--matchfile\fP \fIINPUT_FILE\fP File of (stellar) matches Valid filetypes are: \fI.gff\fP and \fI.GFF\fP. .SS Output Options: .TP \fB-gff\fP, \fB--gffOut\fP \fIOUTPUT_FILE\fP Name of gff breakpoint output file. Valid filetypes are: \fI.txt\fP and \fI.gff\fP. Default: \fIbreakpoints.gff\fP. .TP \fB-vcf\fP, \fB--vcfOut\fP \fIOUTPUT_FILE\fP Name of vcf breakpoint output file. Valid filetypes are: \fI.vcf\fP and \fI.txt\fP. Default: \fIbreakpoints.vcf\fP. .TP \fB-j\fP, \fB--jobName\fP \fISTRING\fP Job/Queue name Default: \fI\fP. .TP \fB-do\fP, \fB--dots\fP Enable graph output in dot format .SS Parallelization Options: .TP \fB-nth\fP, \fB--numThreads\fP \fIINTEGER\fP Number of threads for parallelization of I/O. Default: \fI1\fP. .SS Main Options: .TP \fB-e\fP, \fB--epsilon\fP \fIDOUBLE\fP Maximal error rate (max 0.25). In range [0.0000001..0.25]. Default: \fI0.05\fP. .TP \fB-l\fP, \fB--minLength\fP \fIINTEGER\fP Minimal length of epsilon-matches. In range [0..inf]. Default: \fI100\fP. .TP \fB-f\fP, \fB--forward\fP Search only in forward strand of database. .TP \fB-r\fP, \fB--reverse\fP Search only in reverse complement of database. .TP \fB-a\fP, \fB--alphabet\fP \fISTRING\fP Alphabet type of input sequences (dna, rna, dna5, rna5, protein, char). One of \fIdna\fP, \fIdna5\fP, \fIrna\fP, \fIrna5\fP, \fIprotein\fP, and \fIchar\fP. .TP \fB-v\fP, \fB--verbose\fP Set verbosity mode. .SS Filtering Options: .TP \fB-k\fP, \fB--kmer\fP \fIINTEGER\fP Length of the q-grams (max 32). In range [1..32]. .TP \fB-rp\fP, \fB--repeatPeriod\fP \fIINTEGER\fP Maximal period of low complexity repeats to be filtered. Default: \fI1\fP. .TP \fB-rl\fP, \fB--repeatLength\fP \fIINTEGER\fP Minimal length of low complexity repeats to be filtered. Default: \fI1000\fP. .TP \fB-c\fP, \fB--abundanceCut\fP \fIDOUBLE\fP k-mer overabundance cut ratio. In range [0..1]. Default: \fI1\fP. .SS Verification Options: .TP \fB-x\fP, \fB--xDrop\fP \fIDOUBLE\fP Maximal x-drop for extension. Default: \fI5\fP. .TP \fB-vs\fP, \fB--verification\fP \fISTRING\fP Verification strategy: exact or bestLocal or bandedGlobal One of \fIexact\fP, \fIbestLocal\fP, and \fIbandedGlobal\fP. Default: \fIexact\fP. .TP \fB-dt\fP, \fB--disableThresh\fP \fIINTEGER\fP Maximal number of verified matches before disabling verification for one query sequence (default infinity). In range [0..inf]. .TP \fB-n\fP, \fB--numMatches\fP \fIINTEGER\fP Maximal number of kept matches per query and database. If STELLAR finds more matches, only the longest ones are kept. Default: \fI50\fP. .TP \fB-s\fP, \fB--sortThresh\fP \fIINTEGER\fP Number of matches triggering removal of duplicates. Choose a smaller value for saving space. Default: \fI500\fP.