.\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.46.4. .TH PYNLPL-SAMPLER "1" "February 2016" "pynlpl-sampler 0.7.7" "User Commands" .SH NAME sampler \- manual page for pynlpl-sampler 0.7.7 .SH DESCRIPTION usage: pynlpl\-sampler [\-h] [\-t TESTSETSIZE] [\-d DEVSETSITE] [\-T TRAINSETSITE] .TP [\-S SEED] files [files ...] .PP Extracts random samples from datasets, supports multiple parallel datasets (such as parallel corpora), provided that corresponding data is on the same line. .SS "positional arguments:" .TP files The data sets to sample from, must be of equal size (i.e., same number of lines) .SS "optional arguments:" .TP \fB\-h\fR, \fB\-\-help\fR show this help message and exit .TP \fB\-t\fR TESTSETSIZE, \fB\-\-testsetsize\fR TESTSETSIZE Test set size (lines) (default: 0) .TP \fB\-d\fR DEVSETSITE, \fB\-\-devsetsite\fR DEVSETSITE Development set size (lines) (default: 0) .TP \fB\-T\fR TRAINSETSITE, \fB\-\-trainsetsite\fR TRAINSETSITE Training set size (lines), leave unassigned (0) to automatically use all of the remaining data (default: 0) .TP \fB\-S\fR SEED, \fB\-\-seed\fR SEED Seed for random number generator (default: 0)