NAME¶
open_jtalk — Japanese TTS system
SYNOPSIS¶
open_jtalk [
options] [
infile]
DESCRIPTION¶
This manual page documents briefly the
open_jtalk command.
This manual page was written for the
Debian distribution because the
original program does not have a manual page. Instead, it has documentation in
the GNU
Info format; see below.
open_jtalk is a program that synthesize speech waveform from Japanese
texts. It uses HMMs trained by the HMM-based speech synthesis system (HTS).
OPTIONS¶
A summary of options is included below.
- -x dir
- dictionary directory
- -td tree
- decision tree files for state duration
- -tm tree
- Show version of program.
- -tf tree
- decision tree files for Log F0
- -tl tree
- decision tree files for low-pass filter
- -md pdf
- model files for state duration
- -mm pdf
- model files for spectrum
- -mf pdf
- model files for Log F0
- -ml pdf
- model files for low-pass filter
- -dm win
- window files for calculation delta of spectrum
- -df win
- window files for calculation delta of Log F0
- -dl win
- window files for calculation delta of low-pass filter
- -ow s
- filename of output wav audio (generated speech)
- -ot s
- filename of output trace information
- -s i
- sampling frequency [16000][1--48000]
- -p i
- frame period (point) [80][1--]
- -a f
- all-pass constant [0.42][0.0--1.0]
- -g i
- gamma = -1 / i (if i=0 then gamma=0) [0][0--]
- -b f
- postfiltering coefficient [0.0][-0.8--8.0]
- -l
- regard input as log gain and output linear one (LSP)
- -u f
- voiced/unvoiced threshold[0.5][0.0--1.0]
- -em tree
- decision tree files for GV of spectrum
- -ef tree
- decision tree files for GV of Log F0
- -el tree
- decision tree files for GV of low-pass filter
- -cm pdf
- filenames of GV for spectrum
- -cf pdf
- filenames of GV for Log F0
- -cl pdf
- filenames of GV for low-pass filter
- -jm f
- weight of GV for spectrum [1.0][0.0--2.0]
- -jf f
- weight of GV for Log F0 [1.0][0.0--2.0]
- -jl f
- weight of GV for low-pass filter [1.0][0.0--2.0]
- -k tree
- GV switch
- -z i
- audio buffer size [1600][0--48000]
- infile
- text file
option '-d' may be repeated to use multiple delta parameters. generated
spectrum, log F0, and low-pass filter coefficient sequences are saved in
natural endian, binary (float) format.
EXAMPLE¶
If you installed hts-voice-nitech-jp-atr503-m001 in the current directory, the
following command let you make a voice file from input.txt:
% open_jtalk -s 48000 -p 240 -a 0.55 \
-td tree-dur.inf -tm tree-mgc.inf -tf tree-lf0.inf \
-tl tree-lpf.inf -md dur.pdf -mm mgc.pdf \
-mf lf0.pdf -ml lpf.pdf -dm mgc.win1 \
-dm mgc.win2 -dm mgc.win3 -df lf0.win1 \
-df lf0.win2 -df lf0.win3 -dl lpf.win1 \
-em tree-gv-mgc.inf -ef tree-gv-lf0.inf -cm gv-mgc.pdf \
-cf gv-lf0.pdf -k gv-switch.inf -ow output.wav \
-x dic_dir input.txt
AUTHOR¶
This manual page was written by Koichi Akabe vbkaisetsu@gmail.com for the
Debian system (and may be used by others). Permission is granted to
copy, distribute and/or modify this document under the terms of the GNU
General Public License, Version 2 any later version published by the Free
Software Foundation.
On Debian systems, the complete text of the GNU General Public License can be
found in /usr/share/common-licenses/GPL.