Scroll to navigation

mlpack_preprocess_imputer(1) User Commands mlpack_preprocess_imputer(1)

NAME

mlpack_preprocess_imputer - impute data

SYNOPSIS


mlpack_preprocess_imputer -i string -m string -s string [-c double] [-d int] [-V bool] [-o string] [-h -v]

DESCRIPTION

This utility takes a dataset and converts a user-defined missing variable to another to provide more meaningful analysis.

The program does not modify the original file, but instead makes a separate file to save the output data; You can save the output by specifying the file name with'--output_file (-o)'.

For example, if we consider 'NULL' in dimension 0 to be a missing variable and want to delete whole row containing the NULL in the column-wise'dataset.csv', and save the result to 'result.csv', we could run :

$ mlpack_mlpack_preprocess_imputer --input_file dataset --output_file result --missing_value NULL --dimension 0 --strategy listwise_deletion

REQUIRED INPUT OPTIONS

File containing data.
User defined missing value.
imputation strategy to be applied. Strategies should be one of 'custom', 'mean', 'median', and 'listwise_deletion'.

OPTIONAL INPUT OPTIONS

--custom_value (-c) [double] User-defined custom imputation value. Default value 0.

The dimension to apply imputation to. Default value 0.
Default help info.
Print help on a specific option. Default value ''.
Display informational messages and the full list of parameters and timers at the end of execution.
Display the version of mlpack.

OPTIONAL OUTPUT OPTIONS

File to save output into. Default value ''.

ADDITIONAL INFORMATION

For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.

12 December 2020 mlpack-3.4.2