Scroll to navigation
NAME¶
heri-split - splits the dataset into training and testing sets
SYNOPSIS¶
heri-split [OPTIONS] dataset1 [dataset2...]
DESCRIPTION¶
heri-split splits the dataset into several training and testing sets as
it is required for N-fold cross-validation. Dataset contains one object per
line as in svmlight format. By default stratified sampling is used. That is,
all folds contain the same number of objects for each label.
OPTIONS¶
- -h, --help
- Display help information.
- -c, --folds count
- Set the number of folds. This is a mandatory option.
- -d, --output-dir dir
- Set the output directory. This is a mandatory option.
- -r,--random
- Use random sampling instead of stratified one.
- -s, --seed seed
- Set the seed value for pseudorandom generator.