Scroll to navigation

mbtg(1) General Commands Manual mbtg(1)

NAME

MBTG - Memory Based Tagger generator

SYNOPSYS

mbtg -T <filename> -s <setting filename>
or
mbtg [options]

DESCRIPTION

This programs generates, based on a tagged corpus, all the files needed to be able to tag a text with mbt.

OPTIONS

-h or --help
show help
-T <tagged training corpus file>
or
-E <enriched tagged training corpus file>
All further options have reasonable defaults, so using them is only needed for the experienced user. See the mbt manual for more details.
-s settingsfile
mbtg creates this file, which can be used to run mbt with minimal effort. (like mbt -s settings -T somefile)
-p pattern
the pattern for known words (default ddfa)
-P pattern
the pattern for unknown words (default dFapsss)
-% <number>
filter threshold for ambitag construction (default 5%)
-l <lexiconfile>
-L <file with list of frequent words>
-r <ambitagfile>
-k <known words case base>
-u <unknown words case base>
-K <known words instances file>
-U <unknown words instances file>
-V or --version
show version info
-e <sentence delimiter> (default '<utt>')
-X
keep the intermediate files
-Otimbl options

(Note: there is NO SPACE between O and the options)
<options> classifier options for both known and unknown words instances bases
K: <options> classifier options for known words instance base
U: <options> classifier options for unknown words case base
valid timbl options are: a d k m q v w x -

BUGS

possibly

AUTHORS

Ko van der Sloot Timbl@uvt.nl
Antal van den Bosch Timbl@uvt.nl

SEE ALSO

timbl(1) mbt(1) mbtserver(1)
2011 march 21