.TH "htsengine" "1" .SH "NAME" hts_engine \(em HMM-based speech synthesis engine .SH "SYNOPSIS" .PP \fBhts_engine\fR [\fBoptions\fP] [\fBinfile\fP] .SH "DESCRIPTION" .PP This manual page documents briefly the \fBhts_engine\fR command. .PP This manual page was written for the \fBDebian\fP distribution because the original program does not have a manual page. Instead, it has documentation in the GNU \fBInfo\fP format; see below. .PP \fBhts_engine\fR is a program that synthesize speech waveform from HMMs trained by the HMM-based speech synthesis system (HTS). .SH "OPTIONS" .PP A summary of options is included below. .IP "\fB-td tree\fP" 10 decision tree files for state duration .IP "\fB-tm tree\fP" 10 Show version of program. .IP "\fB-tf tree\fP" 10 decision tree files for Log F0 .IP "\fB-tl tree\fP" 10 decision tree files for low-pass filter .IP "\fB-md pdf\fP" 10 model files for state duration .IP "\fB-mm pdf\fP" 10 model files for spectrum .IP "\fB-mf pdf\fP" 10 model files for Log F0 .IP "\fB-ml pdf\fP" 10 model files for low-pass filter .IP "\fB-dm win\fP" 10 window files for calculation delta of spectrum .IP "\fB-df win\fP" 10 window files for calculation delta of Log F0 .IP "\fB-dl win\fP" 10 window files for calculation delta of low-pass filter .IP "\fB-od s\fP" 10 filename of output label with duration .IP "\fB-om s\fP" 10 filename of output spectrum .IP "\fB-of s\fP" 10 filename of output Log F0 .IP "\fB-ol s\fP" 10 filename of output low-pass filter .IP "\fB-or s\fP" 10 filename of output raw audio (generated speech) .IP "\fB-ow s\fP" 10 filename of output wav audio (generated speech) .IP "\fB-ot s\fP" 10 filename of output trace information .IP "\fB-vp\fP" 10 use phoneme alignment for duration .IP "\fB-i i f1 .. fi\fP" 10 enable interpolation & specify number(i),coefficient(f) [1][1\-\-] .IP "\fB-s i\fP" 10 sampling frequency [16000][1\-\-48000] .IP "\fB-p i\fP" 10 frame period (point) [80][1\-\-] .IP "\fB-a f\fP" 10 all-pass constant [0.42][0.0\-\-1.0] .IP "\fB-g i\fP" 10 gamma = \-1 / i (if i=0 then gamma=0) [0][0\-\-] .IP "\fB-b f\fP" 10 postfiltering coefficient [0.0][\-0.8\-\-8.0] .IP "\fB-l\fP" 10 regard input as log gain and output linear one (LSP) .IP "\fB-r f\fP" 10 speech speed rate [1.0][0.0\-\-10.0] .IP "\fB-fm f\fP" 10 add half-tone [0.0][\-24.0\-\-24.0] .IP "\fB-u f\fP" 10 voiced/unvoiced threshold[0.5][0.0\-\-1.0] .IP "\fB-em tree\fP" 10 decision tree files for GV of spectrum .IP "\fB-ef tree\fP" 10 decision tree files for GV of Log F0 .IP "\fB-el tree\fP" 10 decision tree files for GV of low-pass filter .IP "\fB-cm pdf\fP" 10 filenames of GV for spectrum .IP "\fB-cf pdf\fP" 10 filenames of GV for Log F0 .IP "\fB-cl pdf\fP" 10 filenames of GV for low-pass filter .IP "\fB-jm f\fP" 10 weight of GV for spectrum [1.0][0.0\-\-2.0] .IP "\fB-jf f\fP" 10 weight of GV for Log F0 [1.0][0.0\-\-2.0] .IP "\fB-jl f\fP" 10 weight of GV for low-pass filter [1.0][0.0\-\-2.0] .IP "\fB-k tree\fP" 10 GV switch .IP "\fB-z i\fP" 10 audio buffer size [1600][0\-\-48000] .IP "\fBinfile\fP" 10 label file .PP option '\-d' may be repeated to use multiple delta parameters. generated spectrum, log F0, and low-pass filter coefficient sequences are saved in natural endian, binary (float) format. .SH "EXAMPLE" .PP If you installed hts-voice-nitech-jp-atr503-m001 in the current directory, the following command let you make a voice file from input.lab: .PP .RS .PP .nf % hts_engine \-s 48000 \-p 240 \-a 0.55 \\ \-td tree-dur.inf \-tm tree-mgc.inf \-tf tree-lf0.inf \\ \-tl tree-lpf.inf \-md dur.pdf \-mm mgc.pdf \\ \-mf lf0.pdf \-ml lpf.pdf \-dm mgc.win1 \\ \-dm mgc.win2 \-dm mgc.win3 \-df lf0.win1 \\ \-df lf0.win2 \-df lf0.win3 \-dl lpf.win1 \\ \-em tree-gv-mgc.inf \-ef tree-gv-lf0.inf \-cm gv-mgc.pdf \\ \-cf gv-lf0.pdf \-k gv-switch.inf \-ow output.wav \\ input.lab .fi .PP .RE .SH "AUTHOR" .PP This manual page was written by Koichi Akabe vbkaisetsu@gmail.com for the \fBDebian\fP system (and may be used by others). Permission is granted to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 any later version published by the Free Software Foundation. .PP On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-licenses/GPL. .\" created by instant / docbook-to-man, Mon 02 Jan 2012, 22:56