Scroll to navigation

HFST-LOOKUP(1) User Commands HFST-LOOKUP(1)

NAME

hfst-lookup - =perform transducer lookup (apply)

SYNOPSIS

hfst-lookup [OPTIONS...] [INFILE]

DESCRIPTION

perform transducer lookup (apply)

Common options:

-h, --help
Print help message
-V, --version
Print version info
-v, --verbose
Print verbosely while processing
-q, --quiet
Only print fatal erros and requested output
-s, --silent
Alias of --quiet

Input/Output options:

-i, --input=INFILE
Read input transducer from INFILE
-o, --output=OUTFILE
Write output to OUTFILE

-p, --pipe-mode[=STREAM] Control input and output streams

Lookup options:

-I, --input-strings=SFILE
Read lookup strings from SFILE
-O, --output-format=OFORMAT
Use OFORMAT printing results sets
-e, --epsilon-format=EPS
Print epsilon as EPS
-F, --input-format=IFORMAT
Use IFORMAT parsing input
-x, --statistics
Print statistics
-X, --xfst=VARIABLE
Toggle xfst VARIABLE
-c, --cycles=INT
How many times to follow input epsilon cycles
-b, --beam=B
Output only analyses whose weight is within B from the best analysis
-t, --time-cutoff=S
Limit search after having used S seconds per input (currently only works in optimized-lookup mode
-P, --progress
Show neat progress bar if possible

If OUTFILE or INFILE is missing or -, standard streams will be used. Format of result depends on format of INFILE OFORMAT is one of {xerox,cg,apertium}, xerox being default IFORMAT is one of {text,spaced,apertium}, default being text, unless OFORMAT is apertium VARIABLEs relevant to lookup are {print-pairs,print-space, quote-special,show-flags,obey-flags} Input epsilon cycles are followed by default INT=5 times. Epsilon is printed by default as an empty string. B must be a non-negative float. S must be a non-negative float. The default, 0.0, indicates no cutoff. If the input contains several transducers, a set containing results from all transducers is printed for each input string.

STREAM can be { input, output, both }. If not given, defaults to {both}. If input file is not specified with -I, input is read interactively line by line from the user. If you redirect input from a file, use --pipe-mode=input. --pipe-mode=output is ignored on non-windows platforms.

Todo:

For optimized lookup format, only strings that pass flag diacritic checks are printed and flag diacritic symbols are not printed. Support VARIABLE 'print-space' for optimized lookup format

Known bugs:

'quote-special' quotes spaces that come from 'print-space'

REPORTING BUGS

Report bugs to <hfst-bugs@helsinki.fi> or directly to our bug tracker at: <https://sourceforge.net/tracker/?atid=1061990&group_id=224521&func=browse>

hfst-lookup home page: <https://kitwiki.csc.fi/twiki/bin/view/KitWiki//HfstLookup>
General help using HFST software: <https://kitwiki.csc.fi/twiki/bin/view/KitWiki//HfstHome>

COPYRIGHT

Copyright © 2010 University of Helsinki, License GPLv3: GNU GPL version 3 <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law.
December 2015 HFST