Scroll to navigation

slurm_submit_workers(1) Cooperative Computing Tools slurm_submit_workers(1)

NAME

slurm_submit_workers - submit work_queue_worker to a SLURM cluster.

SYNOPSIS

slurm_submit_workers [options] <servername> <port> <num-workers>

DESCRIPTION

slurm_submit_workers schedules the execution of work_queue_worker(1) on the SLURM batch system through its job submission interface, qsub. The number of work_queue_worker scheduled and run is given by the num-workers argument.

The servername and port arguments specify the hostname and port number of the master for the work_queue_worker to connect. These two arguments become optional when the auto mode option is specified for work_queue_worker.

OPTIONS

 -M
Name of the preferred master for worker.
 -N <name>
Preferred project name for work_queue_worker to connect.
 -c <cores>
Set the number of cores each worker should use (0=auto). (default=1)
 -C <catalog>
Set catalog server for work_queue_worker to <catalog>. <catalog> format: HOSTNAME:PORT.
 -t <seconds>
Abort work_queue_worker after this amount of idle time (default=900s).
 -d <subsystem>
Enable debugging on worker for this subsystem (try -d all to start).
 -w <size>
Set TCP window size
 -i <time>
Set initial value for backoff interval when worker fails to connect to a master. (default=1s)
 -b <time>
Set maxmimum value for backoff interval when worker fails to connect to a master. (default=60s)
 -z <size>
Set available disk space threshold (in MB). When exceeded worker will clean up and reconnect. (default=100MB)
 -A <arch>
Set architecture string for the worker to report to master instead of the value in uname.
 -O <os>
Set operating system string for the worker to report to master instead of the value in uname.
 -s <path>
Set the location for creating the working directory of the worker.
 -p <parameters>
SLURM qsub parameters.
 -h
Show help message.

EXIT STATUS

On success, returns zero. On failure, returns non-zero.

EXAMPLES

Submit 10 worker instances to run on SLURM and connect to a specific master:

slurm_submit_workers master.somewhere.edu 9123 10

Submit 10 work_queue_worker instances to run on SLURM in auto mode with their preferred project name set to Project_A and abort timeout set to 3600 seconds:

slurm_submit_workers -a -t 3600 -M Project_A 10

COPYRIGHT

The Cooperative Computing Tools are Copyright (C) 2005-2019 The University of Notre Dame. This software is distributed under the GNU General Public License. See the file COPYING for details.

SEE ALSO

CCTools 7.1.2 FINAL