.\" Automatically generated by Pod::Man 4.07 (Pod::Simple 3.32)
.\"
.\" Standard preamble:
.\" ========================================================================
.de Sp \" Vertical space (when we can't use .PP)
.if t .sp .5v
.if n .sp
..
.de Vb \" Begin verbatim text
.ft CW
.nf
.ne \\$1
..
.de Ve \" End verbatim text
.ft R
.fi
..
.\" Set up some character translations and predefined strings.  \*(-- will
.\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left
.\" double quote, and \*(R" will give a right double quote.  \*(C+ will
.\" give a nicer C++.  Capital omega is used to do unbreakable dashes and
.\" therefore won't be available.  \*(C` and \*(C' expand to `' in nroff,
.\" nothing in troff, for use with C<>.
.tr \(*W-
.ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p'
.ie n \{\
.    ds -- \(*W-
.    ds PI pi
.    if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch
.    if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\"  diablo 12 pitch
.    ds L" ""
.    ds R" ""
.    ds C` ""
.    ds C' ""
'br\}
.el\{\
.    ds -- \|\(em\|
.    ds PI \(*p
.    ds L" ``
.    ds R" ''
.    ds C`
.    ds C'
'br\}
.\"
.\" Escape single quotes in literal strings from groff's Unicode transform.
.ie \n(.g .ds Aq \(aq
.el       .ds Aq '
.\"
.\" If the F register is >0, we'll generate index entries on stderr for
.\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index
.\" entries marked with X<> in POD.  Of course, you'll have to process the
.\" output yourself in some meaningful fashion.
.\"
.\" Avoid warning from groff about undefined register 'F'.
.de IX
..
.if !\nF .nr F 0
.if \nF>0 \{\
.    de IX
.    tm Index:\\$1\t\\n%\t"\\$2"
..
.    if !\nF==2 \{\
.        nr % 0
.        nr F 2
.    \}
.\}
.\"
.\" Accent mark definitions (@(#)ms.acc 1.5 88/02/08 SMI; from UCB 4.2).
.\" Fear.  Run.  Save yourself.  No user-serviceable parts.
.    \" fudge factors for nroff and troff
.if n \{\
.    ds #H 0
.    ds #V .8m
.    ds #F .3m
.    ds #[ \f1
.    ds #] \fP
.\}
.if t \{\
.    ds #H ((1u-(\\\\n(.fu%2u))*.13m)
.    ds #V .6m
.    ds #F 0
.    ds #[ \&
.    ds #] \&
.\}
.    \" simple accents for nroff and troff
.if n \{\
.    ds ' \&
.    ds ` \&
.    ds ^ \&
.    ds , \&
.    ds ~ ~
.    ds /
.\}
.if t \{\
.    ds ' \\k:\h'-(\\n(.wu*8/10-\*(#H)'\'\h"|\\n:u"
.    ds ` \\k:\h'-(\\n(.wu*8/10-\*(#H)'\`\h'|\\n:u'
.    ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'^\h'|\\n:u'
.    ds , \\k:\h'-(\\n(.wu*8/10)',\h'|\\n:u'
.    ds ~ \\k:\h'-(\\n(.wu-\*(#H-.1m)'~\h'|\\n:u'
.    ds / \\k:\h'-(\\n(.wu*8/10-\*(#H)'\z\(sl\h'|\\n:u'
.\}
.    \" troff and (daisy-wheel) nroff accents
.ds : \\k:\h'-(\\n(.wu*8/10-\*(#H+.1m+\*(#F)'\v'-\*(#V'\z.\h'.2m+\*(#F'.\h'|\\n:u'\v'\*(#V'
.ds 8 \h'\*(#H'\(*b\h'-\*(#H'
.ds o \\k:\h'-(\\n(.wu+\w'\(de'u-\*(#H)/2u'\v'-.3n'\*(#[\z\(de\v'.3n'\h'|\\n:u'\*(#]
.ds d- \h'\*(#H'\(pd\h'-\w'~'u'\v'-.25m'\f2\(hy\fP\v'.25m'\h'-\*(#H'
.ds D- D\\k:\h'-\w'D'u'\v'-.11m'\z\(hy\v'.11m'\h'|\\n:u'
.ds th \*(#[\v'.3m'\s+1I\s-1\v'-.3m'\h'-(\w'I'u*2/3)'\s-1o\s+1\*(#]
.ds Th \*(#[\s+2I\s-2\h'-\w'I'u*3/5'\v'-.3m'o\v'.3m'\*(#]
.ds ae a\h'-(\w'a'u*4/10)'e
.ds Ae A\h'-(\w'A'u*4/10)'E
.    \" corrections for vroff
.if v .ds ~ \\k:\h'-(\\n(.wu*9/10-\*(#H)'\s-2\u~\d\s+2\h'|\\n:u'
.if v .ds ^ \\k:\h'-(\\n(.wu*10/11-\*(#H)'\v'-.4m'^\v'.4m'\h'|\\n:u'
.    \" for low resolution devices (crt and lpr)
.if \n(.H>23 .if \n(.V>19 \
\{\
.    ds : e
.    ds 8 ss
.    ds o a
.    ds d- d\h'-1'\(ga
.    ds D- D\h'-1'\(hy
.    ds th \o'bp'
.    ds Th \o'LP'
.    ds ae ae
.    ds Ae AE
.\}
.rm #[ #] #H #V #F C
.\" ========================================================================
.\"
.IX Title "EOCONV 1"
.TH EOCONV 1 "2016-12-24" "perl v5.24.1" "User Contributed Perl Documentation"
.\" For nroff, turn off justification.  Always turn off hyphenation; it makes
.\" way too many mistakes in technical documents.
.if n .ad l
.nh
.SH "NAME"
eoconv \- Convert text files between various Esperanto encodings
.SH "USAGE"
.IX Header "USAGE"
eoconv [\-q] \-\-from=\fIencoding\fR \-\-to=\fIencoding\fR [\fIfile\fR ...]
.PP
.Vb 4
\& Options:
\&   \-\-from       specify input encoding (see below)
\&   \-\-to         specify output encoding (see below)
\&   \-q, \-\-quiet  suppress warnings
\&
\&   \-\-help       detailed help message
\&   \-\-man        full documentation
\&   \-\-version    display version information
\&
\& Valid encodings:
\&   post\-h post\-H post\-x post\-X post\-caret pre\-caret latex
\&   html\-hex html\-dec iso\-8859\-3 utf\-7 utf\-8 utf\-16 utf\-32
.Ve
.SH "DESCRIPTION"
.IX Header "DESCRIPTION"
\&\fBeoconv\fR will read the given input files (or stdin if no files are
specified) containing Esperanto text in the encoding specified by
\&\fB\-\-from\fR, and then output it in the encoding specified by \fB\-\-to\fR.
.SH "OPTIONS"
.IX Header "OPTIONS"
.IP "\fB\-\-from=\fR\fIencoding\fR" 17
.IX Item "--from=encoding"
Specify character encoding for input
.IP "\fB\-\-to=\fR\fIencoding\fR" 17
.IX Item "--to=encoding"
Specify character encoding for output
.IP "\fB\-q\fR \fB\-\-quiet\fR" 17
.IX Item "-q --quiet"
Suppress non-essential warning messages
.IP "\fB\-?\fR \fB\-\-help\fR" 17
.IX Item "-? --help"
Print a brief help message and exit.
.IP "\fB\-\-man\fR" 17
.IX Item "--man"
Print the manual page and exit.
.IP "\fB\-\-version\fR" 17
.IX Item "--version"
Print version information and exit.
.SS "\s-1CHARACTER ENCODINGS\s0"
.IX Subsection "CHARACTER ENCODINGS"
.IP "\fIpost-h\fR" 17
.IX Item "post-h"
\&\s-1ASCII\s0 postfix h notation
.IP "\fIpost-H\fR" 17
.IX Item "post-H"
\&\s-1ASCII\s0 postfix H notation
.IP "\fIpost-x\fR" 17
.IX Item "post-x"
\&\s-1ASCII\s0 postfix x notation
.IP "\fIpost-X\fR" 17
.IX Item "post-X"
\&\s-1ASCII\s0 postfix X notation
.IP "\fIpost-caret\fR" 17
.IX Item "post-caret"
\&\s-1ASCII\s0 postfix caret (^) notation
.IP "\fIpre-caret\fR" 17
.IX Item "pre-caret"
\&\s-1ASCII\s0 prefix caret (^) notation
.IP "\fIlatex\fR, \fILaTeX\fR" 17
.IX Item "latex, LaTeX"
\&\s-1ASCII\s0 LaTeX sequences
.IP "\fIhtml-hex\fR, \fIHTML-hex\fR" 17
.IX Item "html-hex, HTML-hex"
\&\s-1ASCII HTML\s0 hexadecimal entities
.IP "\fIhtml-dec\fR, \fIHTML-dec\fR" 17
.IX Item "html-dec, HTML-dec"
\&\s-1ASCII HTML\s0 decimal entities
.IP "\fIiso\-8859\-3\fR, \fI\s-1ISO\-8859\-3\s0\fR, \fIlatin3\fR, \fIlatin\-3\fR, \fILatin3\fR, \fILatin\-3\fR" 17
.IX Item "iso-8859-3, ISO-8859-3, latin3, latin-3, Latin3, Latin-3"
\&\s-1ISO\-8859\-3\s0
.IP "\fIutf\-7\fR, \fI\s-1UTF\-7\s0\fR, \fIutf7\fR, \fI\s-1UTF7\s0\fR" 17
.IX Item "utf-7, UTF-7, utf7, UTF7"
Unicode \s-1UTF\-7\s0
.IP "\fIutf\-8\fR, \fI\s-1UTF\-8\s0\fR, \fIutf8\fR, \fI\s-1UTF8\s0\fR" 17
.IX Item "utf-8, UTF-8, utf8, UTF8"
Unicode \s-1UTF\-8\s0
.IP "\fIutf\-16\fR, \fI\s-1UTF\-16\s0\fR, \fIutf16\fR, \fI\s-1UTF16\s0\fR" 17
.IX Item "utf-16, UTF-16, utf16, UTF16"
Unicode \s-1UTF\-16\s0
.IP "\fIutf\-32\fR, \fI\s-1UTF\-32\s0\fR, \fIutf32\fR, \fI\s-1UTF32\s0\fR" 17
.IX Item "utf-32, UTF-32, utf32, UTF32"
Unicode \s-1UTF\-32\s0
.SH "ESPERANTO ORTHOGRAPHY"
.IX Header "ESPERANTO ORTHOGRAPHY"
Esperanto is written in an alphabet of 28 letters.  However, only 22
of these letters can be found in the standard \s-1ASCII\s0 character set.
The remaining six \*(-- `c', `g', `h', `j', and `s' with circumflex, and
`u' with breve \*(-- are not available in \s-1ASCII\s0; neither are they among
the characters available in the common 8\-bit \s-1ISO\-8859\-1\s0 character
encoding.  Therefore, while the six special Esperanto characters pose
no problem for handwritten texts, they were impossible to represent on
standard typewriters, and are somewhat problematic even on modern-day
computers.  Various encoding systems have been developed to represent
Esperanto text in printed and typed text.
.SS "POSTFIX-h \s-1NOTATION\s0"
.IX Subsection "POSTFIX-h NOTATION"
This was the solution proposed by the creator of Esperanto,
L. L. Zamenhof.  He recommended using `u' for `u\-breve' and appending
an `h' to a letter to indicate that it should have a circumflex.
However, the letters `u' and `h' are already part of the Esperanto
alphabet, so using them for another purpose invites ambiguity and
mispronunciation.  It also makes conversion of Esperanto text to
postfix-h notation `lossy' or one-way; it is generally not possible to
convert from postfix-h notation via automated means.  This notation
suffers from the additional drawback that the text cannot be sorted
with standard rules for \s-1ASCII\s0 text.
.SS "POSTFIX-H \s-1NOTATION\s0"
.IX Subsection "POSTFIX-H NOTATION"
This is the same as postfix-h notation, except that `H' is used
instead of `h' following a capital letter.
.SS "POSTFIX-x \s-1NOTATION\s0"
.IX Subsection "POSTFIX-x NOTATION"
This is the most common \s-1ASCII\s0 notation encountered today.  It involves
appending an `x' to a letter to indicate that it should have an accent
(be it circumflex or breve).  Since `x' is not a letter in the
Esperanto alphabet, no ambiguity results.  However, \s-1ASCII\s0 sorting
algorithms still fail with postfix-x text.
.SS "POSTFIX-X \s-1NOTATION\s0"
.IX Subsection "POSTFIX-X NOTATION"
This is the same as postfix-x notation, except that `X' is used
instead of `x' following a capital letter.
.SS "\s-1PREFIX\- AND\s0 POSTFIX-CARET \s-1NOTATION\s0"
.IX Subsection "PREFIX- AND POSTFIX-CARET NOTATION"
Two slightly less popular \s-1ASCII\s0 encodings are to prepend or append a
caret (`^') to a letter to indicate that it should have an accent.
.SS "\s-1ISO\-8859\-3 \s0(\s-1LATIN\-3\s0)"
.IX Subsection "ISO-8859-3 (LATIN-3)"
\&\s-1ISO 8859\-3,\s0 also known as Latin\-3 or South European, is an 8\-bit
character encoding for Esperanto.  High-bit characters are used to
encode the accented Esperanto letters.  \s-1ISO\-8859\-3\s0 can also be used
for encoding English, Finnish, German, Italian, Latin, Maltese,
Turkish, and Portuguese, making it useful for texts which mix
Esperanto with one or more of these languages.
.SS "\s-1UNICODE \s0(\s-1ISO/IEC 10646\s0)"
.IX Subsection "UNICODE (ISO/IEC 10646)"
Unicode is a standard for matching every character of every human
language to a specific code.  The mapping methods are known as Unicode
Transformation Formats (\s-1UTF\s0). Among them are \s-1UTF\-32, UTF\-16, UTF\-8\s0 and
\&\s-1UTF\-7,\s0 where the numbers indicate the number of bits in one unit.
.SS "LaTeX \s-1SEQUENCES\s0"
.IX Subsection "LaTeX SEQUENCES"
The popular LaTeX typesetting package is capable of representing
virtually any accented character.  Note that conversion from LaTeX
sequences assumes that characters to be accented are enclosed in
braces \*(-- for example, `\e^{C}' will be recognized as `C' with
circumflex, but `\e^C' will not be.
.SS "\s-1HTML ENTITIES\s0"
.IX Subsection "HTML ENTITIES"
Unicode codes for Esperanto characters can be escaped in \s-1HTML\s0
documents by using \s-1HTML\s0 entities.  The codes can be represented in
either decimal (base\-10) or hexadecimal (base\-16) notation; the two
are functionally equivalent.
.SH "BUGS AND LIMITATIONS"
.IX Header "BUGS AND LIMITATIONS"
Because the postfix-h and postfix-H notations are inherently
ambiguous, conversion from postfix-h or \-H text is unlikely to result
in coherent text.  Use at your own risk, and carefully proofread the
results.
.PP
Report bugs to <psychonaut@nothingisreal.com>.
.SH "AUTHOR"
.IX Header "AUTHOR"
Tristan Miller <psychonaut@nothingisreal.com>
.SH "SEE ALSO"
.IX Header "SEE ALSO"
\&\fIcharsets\fR\|(7), \fIascii\fR\|(7), \fIiso_8859\-3\fR\|(7), \fIunicode\fR\|(7), \fIutf\-8\fR\|(7), \fIlatex\fR\|(1)
.SH "LICENSE AND COPYRIGHT"
.IX Header "LICENSE AND COPYRIGHT"
Copyright (C) 2004\-2016 Tristan Miller.
.PP
Permission is granted to make and distribute verbatim or modified
copies of this manual provided the copyright notice and this
permission notice are preserved on all copies.