User Manuals

NAME¶

svm-subset - a subset selection tool for LIBSVM

SYNOPSIS¶

svm-subset [ -s method ] dataset number [ output1 ] [ output2 ]

DESCRIPTION¶

Training large data is time consuming. Sometimes one should work on a smaller subset first. The python script subset.py randomly selects a specified number of samples. For classification data, we provide a stratified selection to ensure the same class distribution in the subset.

OPTIONS¶

-s method

0: -- stratified selection (classification only) (default)
1: -- random selection

output1: The subset. If output1 is omitted, the subset will be printed on the screen.
output2: The rest of data.

FILES¶

See svm-train(1) for the format of dataset

EXAMPLES¶

: svm-subset heart_scale 100 file1 file2

From heart_scale 100 samples are randomly selected and stored in file1. All remaining instances are stored in file2.

BUGS¶

Please report bugs to the Debian BTS.

AUTHOR¶

Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging)

Source file:	svm-subset.1.en.gz (from libsvm-tools 3.21+ds-1.2)
Source last updated:	2018-09-23T05:39:25Z
Converted to HTML:	2020-10-22T03:18:07Z