sge_intro - Grid Engine, a facility for executing jobs on remote machines
Grid Engine is a facility for executing Unix-like batch jobs (shell scripts or
binaries) on a pool of cooperating CPUs. Jobs are queued and executed remotely
according to defined policies. The CPUs may be in desktop systems, with jobs
run on them at times when they would otherwise be idle or only lightly loaded.
The work load may be distributed according to the (generalized) load situation
of each machine and the resource requirements of the jobs.
User level checkpointing programs are supported and a transparent
checkpointing mechanism is provided (see sge_ckpt(1)). Checkpointing
jobs migrate from system to system without user intervention on load demand.
In addition to simple batch jobs, interactive jobs and parallel jobs can
also be submitted to Grid Engine.
The Grid Engine user interface consists of several programs which are described
- qacct extracts arbitrary accounting information from the cluster
- qalter changes the characteristics of already submitted jobs.
- qconf provides the user interface for configuring, modifying,
deleting and querying queues and the cluster configuration.
- qdel provides the means for a user/operator/manager to cancel
- qevent provides a means of watching Grid Engine events and acting
on jobs finishing.
- qhold holds back submitted jobs from execution.
- qhost displays status information about Grid Engine execution
- qlogin initiates a telnet or similar login session with automatic
selection of a suitable host.
- qmake is a replacement for the standard Unix make facility.
It extends make with an ability to distribute independent make steps
across a cluster of suitable machines.
- qmod allows the owner(s) of a queue to suspend and enable queues,
e.g. all queues associated with his machine (all currently active
processes in this queue are also signaled) or to suspend and enable jobs
executing in the queues.
- qmon provides a Motif command interface to all Grid Engine
functions. The status of all, or a private selection of, the configured
queues is displayed on-line by changing colors at corresponding queue
- qping can be used to check the status of Grid Engine daemons.
- qquota provides a status listing of all currently used resource
quotas (see sge_resource_quota(5)).
- qresub creates new jobs by copying currently running or pending
- qrls releases holds from jobs previously assigned to them e.g. via
qhold(1) (see above).
- qrdel provides the means to cancel advance reservations.
- qrsh can be used for various purposes such as providing remote
execution of interactive applications via Grid Engine comparable to the
standard Unix facility rsh, to allow for the submission of batch jobs
which, upon execution, support terminal I/O (standard/error output and
standard input) and terminal control, to provide a batch job submission
client which remains active until the job has finished or to allow for the
Grid Engine-controlled remote execution of the tasks of parallel
- qrstat provides a status listing of all advance reservations in the
- qrsub is the user interface for submitting an advance reservation
to Grid Engine.
- qselect prints a list of queue names corresponding to specified
selection criteria. The output of qselect is usually fed into other
Grid Engine commands to apply actions on a selected set of queues.
- qsh opens an interactive shell (in an xterm(1)) on a low
loaded host. Any kind of interactive job can be run in this shell.
- qstat provides a status listing of all jobs and queues associated
with the cluster.
- qtcsh is a fully compatible replacement for the widely known and
used Unix C-Shell (csh) derivative tcsh. It provides a
command-shell with the extension of transparently distributing execution
of designated applications to suitable and lightly loaded hosts via Grid
- qsub is the user interface for submitting a job to Grid
sge_ckpt(1), qacct(1), qalter(1), qconf(1),
qdel(1), qhold(1), qhost(1), qlogin(1),
qmake(1), qmod(1), qmon(1), qresub(1),
qrls(1), qrsh(1), qselect(1), qsh(1),
qstat(1), qsub(1), qtcsh(1).
Copyright: 2008 by Sun Microsystems, Inc. Parts of the manual page texts are
Copyright 2011 Univa Corporation, and various dates by other contributors. The
manual pages are licensed under the Sun Industry Standards Source License,
like the source, except where noted in some files by other contributors, which
are under other free licences noted in their source.