Scroll to navigation

datalad subdatasets(1) General Commands Manual datalad subdatasets(1)


datalad subdatasets - report subdatasets and their properties.


datalad subdatasets [-h] [-d DATASET] [--fulfilled FULFILLED] [-r] [-R LEVELS] [--contains PATH] [--bottomup] [--set-property NAME VALUE] [--delete-property NAME] [PATH ...]


The following properties are reported (if possible) for each matching subdataset record.

Name of the subdataset in the parent (often identical with the
relative path in the parent dataset)

Absolute path to the subdataset

Absolute path to the parent dataset

SHA1 of the subdataset commit recorded in the parent dataset

Condition of the subdataset: 'clean', 'modified', 'absent', 'conflict'
as reported by `git submodule`

URL of the subdataset recorded in the parent

Name of the subdataset recorded in the parent

Any additional configuration property on record.

Performance note: Property modification, requesting BOTTOMUP reporting order, or a particular numerical RECURSION_LIMIT implies an internal switch to an alternative query implementation for recursive query that is more flexible, but also notably slower (performs one call to Git per dataset versus a single call for all combined).

The following properties for subdatasets are recognized by DataLad (without the 'gitmodule_' prefix that is used in the query results):

If set to 'skip', the respective subdataset is skipped when DataLad
is recursively installing its superdataset. However, the subdataset
remains installable when explicitly requested, and no other features
are impaired.

If a subdataset was originally established by cloning, 'datalad-url'
records the URL that was used to do so. This might be different from
'url' if the URL contains datalad specific pieces like any URL of the
form "ria+<some protocol>...".


path/name to query for subdatasets. Defaults to the current directory. Constraints: value must be a string

show this help message. --help-np forcefully disables the use of a pager for displaying the help message
specify the dataset to query. If no dataset is given, an attempt is made to identify the dataset based on the input and/or the current working directory. Constraints: Value must be a Dataset or a valid identifier of a Dataset (e.g. a path)
if given, must be a boolean flag indicating whether to report either only locally present or absent datasets. By default subdatasets are reported regardless of their status. Constraints: value must be convertible to type bool
if set, recurse into potential subdataset.
limit recursion into subdataset to the given number of levels. Constraints: value must be convertible to type 'int'
limit report to the subdatasets containing the given path. If a root path of a subdataset is given the last reported dataset will be the subdataset itself. This option can be given multiple times, in which case datasets will be reported that contain any of the given paths. Constraints: value must be a string
whether to report subdatasets in bottom-up order along each branch in the dataset tree, and not top-down.
Name and value of one or more subdataset properties to be set in the parent dataset's .gitmodules file. The property name is case-insensitive, must start with a letter, and consist only of alphanumeric characters. The value can be a Python format() template string wrapped in '<>' (e.g. '<{gitmodule_name}>'). Supported keywords are any item reported in the result properties of this command, plus 'refds_relpath' and 'refds_relname': the relative path of a subdataset with respect to the base dataset of the command call, and, in the latter case, the same string with all directory separators replaced by dashes. This option can be given multiple times. Constraints: value must be a string
Name of one or more subdataset properties to be removed from the parent dataset's .gitmodules file. This option can be given multiple times. Constraints: value must be a string


datalad is developed by The DataLad Team and Contributors <>.

2021-07-23 datalad subdatasets 0.14.6