'\" t
.\" Title: gt-genomediff
.\" Author: [FIXME: author] [see http://www.docbook.org/tdg5/en/html/author]
.\" Generator: DocBook XSL Stylesheets vsnapshot
.\" Date: 10/30/2023
.\" Manual: GenomeTools Manual
.\" Source: GenomeTools 1.6.5
.\" Language: English
.\"
.TH "GT\-GENOMEDIFF" "1" "10/30/2023" "GenomeTools 1\&.6\&.5" "GenomeTools Manual"
.\" -----------------------------------------------------------------
.\" * Define some portability stuff
.\" -----------------------------------------------------------------
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.\" http://bugs.debian.org/507673
.\" http://lists.gnu.org/archive/html/groff/2009-02/msg00013.html
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.ie \n(.g .ds Aq \(aq
.el .ds Aq '
.\" -----------------------------------------------------------------
.\" * set default formatting
.\" -----------------------------------------------------------------
.\" disable hyphenation
.nh
.\" disable justification (adjust text to left margin only)
.ad l
.\" -----------------------------------------------------------------
.\" * MAIN CONTENT STARTS HERE *
.\" -----------------------------------------------------------------
.SH "NAME"
gt-genomediff \- Calculates Kr: pairwise distances between genomes\&.
.SH "SYNOPSIS"
.sp
\fBgt genomediff\fR [option \&...] (INDEX | \-indexname NAME SEQFILE SEQFILE [\&...])
.SH "DESCRIPTION"
.PP
\fB\-indextype\fR [\fI\&...\fR]
.RS 4
specify type of index, one of: esa|pck|encseq\&. Where encseq is an encoded sequence and an enhanced suffix array will be constructed only in memory\&. (default: encseq)
.RE
.PP
\fB\-indexname\fR [\fIstring\fR]
.RS 4
Basename of encseq to construct\&. (default: undefined)
.RE
.PP
\fB\-unitfile\fR [\fIfilename\fR]
.RS 4
specifies genomic units, see below for description\&. (default: undefined)
.RE
.PP
\fB\-mirrored\fR [\fIyes|no\fR]
.RS 4
virtually append the reverse complement of each sequence (default: no)
.RE
.PP
\fB\-pl\fR [\fIvalue\fR]
.RS 4
specify prefix length for bucket sort recommendation: use without argument; then a reasonable prefix length is automatically determined\&. (default: 0)
.RE
.PP
\fB\-dc\fR [\fIvalue\fR]
.RS 4
specify difference cover value (default: 0)
.RE
.PP
\fB\-memlimit\fR [\fIstring\fR]
.RS 4
specify maximal amount of memory to be used during index construction (in bytes, the keywords
\fIMB\fR
and
\fIGB\fR
are allowed) (default: undefined)
.RE
.PP
\fB\-scan\fR [\fIyes|no\fR]
.RS 4
do not load esa index but scan it sequentially\&. (default: yes)
.RE
.PP
\fB\-thr\fR [\fIvalue\fR]
.RS 4
Threshold for difference (du, dl) in divergence calculation\&. default: 1e\-9
.RE
.PP
\fB\-abs_err\fR [\fIvalue\fR]
.RS 4
absolute error for expected shulen calculation\&. default: 1e\-5
.RE
.PP
\fB\-rel_err\fR [\fIvalue\fR]
.RS 4
relative error for expected shulen calculation\&. default: 1e\-3
.RE
.PP
\fB\-M\fR [\fIvalue\fR]
.RS 4
threshold for minimum logarithm\&. default: DBL_MIN
.RE
.PP
\fB\-v\fR [\fIyes|no\fR]
.RS 4
be verbose (default: no)
.RE
.PP
\fB\-help\fR
.RS 4
display help for basic options and exit
.RE
.PP
\fB\-help+\fR
.RS 4
display help for all options and exit
.RE
.PP
\fB\-version\fR
.RS 4
display version information and exit
.RE
.sp
The genomediff tool only accepts DNA input\&.
.sp
When used with sequence files or encseq, an enhanced suffix array will be built in memory\&. The ESA will not be created completely, but construction will use \fI\-memlimit\fR as a threshold and build it partwise, calculating the Shu\-length for each part\&.
.sp
File format for option \fI\-unitfile\fR (in Lua syntax):
.sp
.if n \{\
.RS 4
.\}
.nf
units = {
genome1 = { "path/file1\&.fa", "file2\&.fa" },
genome2 = { "file3\&.fa", "path/file4\&.fa" }
}
.fi
.if n \{\
.RE
.\}
.sp
Give the path to the files as they were given to the encseq tool! You can use
.sp
.if n \{\
.RS 4
.\}
.nf
$ gt encseq info INDEXNAME
.fi
.if n \{\
.RE
.\}
.sp
to get a list of files in an encoded sequence\&.
.sp
Comment lines in Lua start with \fI\-\-\fR and will be ignored\&.
.sp
See GTDIR/testdata/genomediff/unitfile1\&.lua for an example\&.
.sp
Options \fI\-pl\fR, \fI\-dc\fR and \fI\-memlimit\fR are options to influence ESA construction\&.
.SH "REPORTING BUGS"
.sp
Report bugs to \&.