frog(1) | General Commands Manual | frog(1) |
NAME¶
frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parserSYNOPSYS¶
frog [options]DESCRIPTION¶
frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. frog's current version will tokenize, tag, lemmatize, and morphologically segment word tokens in Dutch text files, add IOB chunks and will assign a dependency graph to each sentence.OPTIONS¶
-c <configfile>set the configuration using 'file'
set debug level.
set input encoding. (default UTF8)
give some help
keep the intermediate files from the parser.
Last sentence only!
assume inputfile to hold one sentence per
line
send output to 'file' instead of stdout.
Defaults to the name of the inputfile with '.out' appended.
send all output to 'dir' instead of stdout.
Creates filenames from the inputfilename(s) with '.out' appended.
skip parts of the proces: Tokenizer (t),
Chunker (c), Multi-Word unit (m) or Parser (p)
Enable quotedetection in the tokenizer. May
run havock!
Run a server on 'port'
process 'file'
process 'xmlfile', which is supposed to be in
FoLiA format! If 'xmlfile' is empty, and --testdir=<dir> is
provided, all files in 'dir' will be processed as FoLia XML.
process all files in 'dir'. see also
--outputdir
location to store intermediate files. Default
/tmp.
show version info
generate FoLiA XML output and send it to
'dir'. Creates filenames from the inputfilename with '.xml' appended.
generate FoLiA XML output and send it to
'file'. Defaults to the name of the inputfile(s) with '.xml' appended.
When -X for FoLia is given, use 'id' to
give the doc an ID.
BUGS¶
likelyAUTHORS¶
Maarten van Gompel proycon@anaproy.nlSEE ALSO¶
ucto(1)2012 January 31 |