Scroll to navigation

GTHBSSMTRAIN(1)   GTHBSSMTRAIN(1)

NAME

gthbssmtrain - train splice site model

SYNOPSIS

gthbssmtrain [option ...] GFF3_file

DESCRIPTION

Create BSSM training data from annotation given in GFF3_file.

OPTIONS

-outdir

set name of output directory to which the training files are written default: training_data

-gcdonor

extract training data for GC donor sites default: yes

-filtertype

set type of features to used for filtering (usually 'exon' or 'CDS') default: exon

-goodexoncount

set the minimum number of good exons a feature must have to be included into the training data default: 1

-cutoff

set the minimum score an exon must have to count towards the ``good exon count'' (exons without a score count as good) default: 1.00

-extracttype

set type of features to be extracted as exons (usually 'exon' or 'CDS') default: CDS

-seqfile

set the sequence file from which to take the sequences default: undefined

-encseq

set the encoded sequence indexname from which to take the sequences default: undefined

-seqfiles

set the sequence files from which to extract the features use '--' to terminate the list of sequence files

-matchdesc

search the sequence descriptions from the input files for the desired sequence IDs (in GFF3), reporting the first match default: no

-matchdescstart

exactly match the sequence descriptions from the input files for the desired sequence IDs (in GFF3) from the beginning to the first whitespace default: no

-usedesc

use sequence descriptions to map the sequence IDs (in GFF3) to actual sequence entries. If a description contains a sequence range (e.g., III:1000001..2000000), the first part is used as sequence ID ('III') and the first range position as offset ('1000001') default: no

-regionmapping

set file containing sequence-region to sequence file mapping default: undefined

-seed

set seed for random number generator manually 0 generates a seed from the current time and the process id default: 0

-v

be verbose default: no

-gzip

write gzip compressed output files default: no

-bzip2

write bzip2 compressed output files default: no

-force

force writing to output files default: no

-help

display help and exit

-version

display version information and exit