Scroll to navigation

Text::Kakasi::JP(3pm) User Contributed Perl Documentation Text::Kakasi::JP(3pm)
 

NAME

Text::Kakasi::JP - Text::Kakasi AEueEeUeXiXEXXaaXaXoXE

SYNOPSIS

  use Text::Kakasi;
  # functional
  $res = Text::Kakasi::getopt_argv('kakasi', '-ieuc', '-w');
  $str = Text::Kakasi::do_kakasi($japanese_text);
  # object-oriented
  $obj = Text::Kakasi->new('-ieuc','-w');
  $str = $obj->get($japanese_text);

DESCRIPTION

EeUeXaXXXaaXXXeeXIekakasi (kanji kana simple inverter)XOXIXXXoXXXXXO~XXXXXX XoAeoXXXXXPXXXXkakasiXXAeoXXXXXeeXIXIeXXinput methodXEXO~XXXXXEXieXAXXXEXAXu XPXXXeEXXoXOXeXXXEXXXXXXXXXEXXXiXXXPXuXEXEXEeXO~EN~XXXXXeeXXCXXCXXXXkakasi XIeXPXXEXXn~XIEXXXXAXn~XXEeXaIoIN~XXXeeXXXEXXXCXXPXXXXXUeXXXXXIe <http://kakasi.namazu.org/> XoXaeIXXXXXXXXX
Text::Kakasi XIeXXAaAAayAaXXEXOXoXUXXXXXXXeXOXXXOeXXXXXXXEXOXpXIAD‐EyXIXXXo XXXXXO~XXXXXXXoAeoXXXXXPXXXXXOXoXUXXXXXIEyXIe ver. 1.05 XE 100% XssXXXCXX XXXXPerl 5.8 XEXssXIXXXXCXXoXEXXXEeXIeXXXOeXXXXXXXEXOXpXXXoXXXXXO~XXXXXXXo XEXXEXIXXXXXXeXPXXXX

Functional APIs

A~iXO~:XEXXXIAaIAXCXIe "Text::Kakasi::" XIeXEIXXXXAEXXXPXXXXXCXO~XXXeeXEXCXIe Text::Kakasi XIeXXXa export XXXPXXXoXXXEXXXIXOXoXo import XXXeeXEeXIeXXIa~ XXXD‐XEXXXIXeXXXEeXXXPXXXX
  use Text::Kakasi qw/getopt_argv do_kakasi/;
$err = getopt_argv($arg1, $arg2, ...)
kakasiXoXeXueXXXXXPXXXXXuXoXIekakasiXIXXXXXXXcXoXEAEXAauXCXXXXXEXXXIe kakasi 2.3.4 XEeXXXXXeeXXXXXXXcXoXIXXIXXCXXXX
  -a[jE] -j[aE] -g[ajE] -k[ajKH]
  -E[aj] -K[ajkH] -H[ajkK] -J[ajkKH]
  -i{oldjis,newjis,dec,euc,sjis}
   -o{oldjis,newjis,dec,euc,sjis}
  -r{hepburn,kunrei} -p -s -f -c"chars" 
   [jisyo1, jisyo2,,,]
  Character Sets:
       a: ascii  j: jisroman  g: graphic  k: kana 
       (j,k     defined in jisx0201)
       E: kigou  K: katakana  H: hiragana J: kanji
       (E,K,H,J defined in jisx0208)
  Options:
    -i: input coding system    -o: output coding system
    -r: romaji conversion system
    -p: list all readings (with -J option)
    -s: insert separate characters (with -J option)
    -f: furigana mode (with -J option)
    -c: skip chars within jukugo
        (with -J option: default TAB CR LF BLANK)
    -C: romaji Capitalize (with -Ja or -Jj option)
    -U: romaji Upcase     (with -Ja or -Jj option)
    -u: call fflush() after 1 character output
    -w: wakatigaki mode
    
EeUeXOXoXIeXXAXXuXpXEe0XoXXXXCOXpXEeXIeXXXiXEXXXIA~IXoEOeXXXPXXXX
$processed_str = do_kakasi($str)
$strXEeAD‐XXXAEkakasiXoXAXOXXXXXeeXIXoEOeXXXPXXXXXXXeXXXIIaeAeXXEXAXXXXX XiXcXX"undef"XoEOeXXXPXXXX
Ver. 1.x XEXUXEXeXXXuXoXIXCXeXEe"kakasi"XoXAeXXXeeEXIXXIeXXXeXPXXXoXXXaXA XiXoXXXPXCXEXXXeXAeXXXAEXaXXXXXPXXXoXX
close_kanwadic()
XAaXXAaXXEeopenXXXiXXXXn~XO~XXXXXeeXocloseXXXPXXXXEeUeXOXoXIeXaXiXIXssXXAXIXss XIXXXaXEeAXXssXXXXXXXssXCXIeIoIN~XXXeeEXIXXIeXPXXXEXXXCXXXcXXXX

Object-Oriented APIs

Version 2.0 XeXeXXEeUeXaXXXaaXXXeeXIeXXXOeXXXXXXXEXOXp(OOP)XIAPIXaAeoXXXXXPXXXX
$k = Text::Kakasi->new($args ...)
XXXOeXXXXXXXEXoAXAXXXXPXXXXXuXoXXAXXssXXXeeXiXcXIeXX "Text::Kakasi->new->set($args ...)" XXXXXIXEAEXXXXeeXIXEXEXeXPXXXX
$k->set($args ...)
"getopt_argv" XOXIOOPXXXoXXXXXO~XXXXXXXX
  my $k = Text::Kakasi->new;
  $k->set('-w'); # Text::Kakasi::getopt_argv('-w');
    
XXXAEXXXXXXXoEOeXX "getopt_argv()" XEXIeXUXEXeXX "set" XIeXieXEeXXXOeXXXXXX XEXXXEXoEOeXXXIXCXXXEXXXIXeXXXEEXXXXXAeCXXCXXXX
  my $tokenized = $k->set('-w')->get($raw_japanese);
    
XXXAEXXXXXXXXIssXXXXXiXcXEeXIeXX "$k->error" XoXEXXXPXXXX
$k->error
XCXaaXEeXAEXD‐XiXXXaXXXA~XEXIXXXAEXXXXXXXoEOeXXXPXXXX
$processed = $k->get($raw_japanese);
"do_kakasi" XOXIOOPXXXoXXXXXO~XXXXXXXXXEXXXIAEoXAeXIXXXXXEXIeAauXAXCXXXX
  # Functional
  getopt_argv('-w'); $processed = do_kakasi($raw_japanese);
  # OOP
  $k->set('-w')->get($raw_japanese);
    

Perl 5.8 Features

Perl 5.8 XCXXEXXuXXXXXEEN~XXXIEXXaAaXXEIeEAEXssXXEncodeXaXXXaaXXXeeXEeXeXA~ XAEAeoXXXXXiXPXXXXXXEeUeXaXXXaaXXXeeXIe perl 5.8 XEXssXEeXAEXXXIXXCXXoIoIN~XXXeeXe XXXEeXEXA~XAEXXXPXXXX
"$k->set($args ...)"XOXIXuXoXEeXX"-i encoding" XXXeXO "-o encoding"XXXXXPXPXiXeeXiXcXXEXXuXXXXXEEN~XXXIeEncodeXXXOXXXPXXXX XuXoXEeXIekakasiXXXXXYXXXEXXXAEXXXeeXaXIXEXXXEeXaXXEncodeXXXXXYXXXEXXXAE XXXeeXaXIXIeAXXAEXEXXXPXXXX Ia~XXXD‐XX
  $result = $k->set(qw/-iutf8 -outf8 -w/)->get($utf8);
XIeXEXXXEXUXUeAEXXAXCXXXX
  $euc = encode('eucjp' => $utf8);
  getopt_argv('-w');
  $tmp = do_kakasi($euc);
  $result = decode('eucjp' => $tmp);
"-outf8",XXXPXPXiXAEXXXeeXiXcXIeXX "$k->get" XIeutf8XO~XeXXEO~XXIEXXu IoXoEOeXXXPXXXX
XXXiXeXIXXCXXIeXX $Text::Kakasi::HAS_ENCODEXEe0XoAssAeeXXXeeXXXEXCIoIN~XX XEXXXeXXXEeXXXeeXXXEXaXAeCXXCXXXXAeIXieEN~XoXIeXXEeUeXaXXXaaXXXeeXouseXXXXXYXEeXX AEXAaXXEeAssAeeXXXiXPXXXX

SEE ALSO

kakasi(1), <http://kakasi.namazu.org/>,Encode,perlunicode

COPYRIGHT

  (C) 1998, 1999, 2000 NOKUBI Takatsugu <knok@daionet.gr.jp>
  (C) 2003 Dan Kogai <dankogai@dan.co.jp>
There is no warranty for this free software. Anyone can modify and/or redistribute this module under GNU GENERAL PUBLIC LICENSE. See COPYING file that is included in the archive for more details.

POD ERRORS

Hey! The above document had some coding errors, which are explained below:
Around line 7:
Non-ASCII character seen before =encoding in 'AEueEeUeXiXEXXaaXaXoXE'. Assuming ISO8859-1
2003-05-26 perl v5.20.0