NAME¶

r.neighbors - Makes each cell category value a function of the category values assigned to the cells around it, and stores new cell values in an output raster map layer.

KEYWORDS¶

raster, algebra, statistics, aggregation, neighbor, focal statistics, filter, parallel

SYNOPSIS¶

r.neighbors
r.neighbors --help
r.neighbors [-ac] input=name [selection=name] output=name[,name,...] [size=integer] [method=string[,string,...]] [weighting_function=string] [weighting_factor=float] [weight=name] [quantile=float[,float,...]] [title=phrase] [nprocs=integer] [memory=memory in MB] [--overwrite] [--help] [--verbose] [--quiet] [--ui]

Flags:¶

-a: Do not align output with the input
-c: Use circular neighborhood
--overwrite: Allow output files to overwrite existing files
--help: Print usage summary
--verbose: Verbose module output
--quiet: Quiet module output
--ui: Force launching GUI dialog

Parameters:¶

input=name [required]: Name of input raster map
selection=name: Name of an input raster map to select the cells which should be processed
output=name[,name,...] [required]: Name for output raster map
size=integer: Neighborhood size
Default: 3
method=string[,string,...]: Neighborhood operation
Options: average, median, mode, minimum, maximum, range, stddev, sum, count, variance, diversity, interspersion, quart1, quart3, perc90, quantile
Default: average
weighting_function=string: Weighting function
Options: none, gaussian, exponential, file
Default: none
none: No weighting
gaussian: Gaussian weighting function
exponential: Exponential weighting function
file: File with a custom weighting matrix
weighting_factor=float: Factor used in the selected weighting function (ignored for none and file)
weight=name: Text file containing weights
quantile=float[,float,...]: Quantile to calculate for method=quantile
Options: 0.0-1.0
title=phrase: Title for output raster map
nprocs=integer: Number of threads for parallel computing
Default: 1
memory=memory in MB: Maximum memory to be used (in MB)
Cache size for raster rows
Default: 300

DESCRIPTION¶

r.neighbors looks at each cell in a raster input map, and examines the values assigned to the cells in some user-defined "neighborhood" around it. It outputs a new raster map layer in which each cell is assigned a value that is some (user-specified) function of the values in that cell’s neighborhood. For example, each cell in the output layer might be assigned a value equal to the average of the values appearing in its 3 x 3 cell "neighborhood" in the input layer. Note that the centre cell is also included in the calculation.

Figure: Illustration for an 3x3 average neighborhood

OPTIONS¶

The user must specify the names of the raster map layers to be used for input and output, the method used to analyze neighborhood values (i.e., the neighborhood function or operation to be performed), and the size of the neighborhood.

The user can optionally specify a selection map, to compute new values only where the raster cells of the selection map are not NULL. In case of a NULL cells, the values from the input map are copied into the output map. This may useful to smooth only parts of an elevation map (pits, peaks, ...).

Example how to use a selection map with method=average:
input map:

selection map, NULL values are marked as *:

* * * * *
* * 1 * *
* 1 1 1 *
* * 1 * *
* * * * *

The output map:

Without using the selection map, the output map would look like this:

It is also possible to weigh cells within the neighborhood. This can be either done with a custom weights matrix or by specifying a weighting function.

In order to use a custom weights matrix, file needs to be specified as a weighting_function and a path to a text file containing the weights needs to be given in the weight option.

Alternatively, gaussian and exponential weighting functions can be selected as weighting function.

For the gaussian weighting function, the user specifies the sigma value (σ) for the gauss filter in the weighting_factor option. The sigma value represents the standard deviation of the gaussian distribution, where the weighting formula for the gaussian filter is defined as follows:

exp(-(x*x+y*y)/(2*σ^2))/(2*π*σ^2)

Lower values for sigma result in a steeper curve, so that more weight is put on cells close to the center of the moving window with a steeper decrease in weights with distance from the center.

For the exponential weighting function, the user specifies a factor for an exponential kernel in the weighting_factor. Negative factors result in negative exponential decrease in weights from the center cell. The weighting formula for the exponential kernel is defined as follows:

exp(factor*sqrt(x*x+y*y))

Stronger negative values for the factor result in a steeper curve, so that more weight is put on cells close to the center of the moving window with a steeper decrease in weights with distance from the center.

Optionally, the user can also run r.neighbors specify the TITLE to be assigned to the raster map layer output, select to not align the resolution of the output with that of the input (the -a option). These options are described further below.

Neighborhood Operation Methods: The neighborhood operators determine what new value a center cell in a neighborhood will have after examining values inside its neighboring cells. Each cell in a raster map layer becomes the center cell of a neighborhood as the neighborhood window moves from cell to cell throughout the map layer. r.neighbors can perform the following operations:

average

The average value within the neighborhood. In the following example, the result would be:
(7*4 + 6 + 5 + 4*3)/9 = 5.6667
The result is rounded to the nearest integer (in this case 6).



   Raw Data     Operation     New Data


   +---+---+---+          +---+---+---+


   | 7 | 7 | 5 |          |   |   |   |


   +---+---+---+ average  +---+---+---+


   | 4 | 7 | 4 |--------->|   | 6 |   |


   +---+---+---+          +---+---+---+


   | 7 | 6 | 4 |          |   |   |   |


   +---+---+---+          +---+---+---+

median

The value found half-way through a list of the neighborhood’s values, when these are ranged in numerical order.

mode

The most frequently occurring value in the neighborhood.

minimum

The minimum value within the neighborhood.

maximum

The maximum value within the neighborhood.

range

The range value within the neighborhood.

stddev

The statistical standard deviation of values within the neighborhood (rounded to the nearest integer).

sum

The sum of values within the neighborhood.

count

The count of filled (not NULL) cells.

variance

The statistical variance of values within the neighborhood (rounded to the nearest integer).

diversity

The number of different values within the neighborhood. In the above example, the diversity is 4.

interspersion

The percentage of cells containing values which differ from the values assigned to the center cell in the neighborhood, plus 1. In the above example, the interspersion is:
5/8 * 100 + 1 = 63.5
The result is rounded to the nearest integer (in this case 64).

quart1, quart3

The result will be the first or the third quartile (equal of 25th and 75th percentiles).

perc90

The result will be the 90th percentile of neighborhood.

quantile

Any quantile as specified by "quantile" input parameter.

Neighborhood Size: The neighborhood size specifies which cells surrounding any given cell fall into the neighborhood for that cell. The size must be an odd integer and represent the length of one of moving window edges in cells. For example, a size value of 3 will result in



                              _ _ _


                             |_|_|_|


    3 x 3 neighborhood --->  |_|_|_|


                             |_|_|_|

Matrix weights: A custom matrix can be used if none of the neighborhood operation methods are desirable by using the weight. This option must be used in conjunction with the size option to specify the matrix size and file needs to be specified as weighting_function. The weights desired are to be entered into a text file. For example, to calculate the focal mean with a matrix size of 3,

r.neigbors in=input.map out=output.map size=3 weighting_function=file \
weight=weights.txt

The contents of the weight.txt file:

3 3 3
1 4 8
9 5 3

This corresponds to the following 3x3 matrix:

+-+-+-+
|3|3|3|
+-+-+-+
|1|4|8|
+-+-+-+
|9|5|3|
+-+-+-+

To calculate an annulus shaped neighborhood the contents of weight.txt file may be e.g. for size=5:

The way that weights are used depends upon the specific aggregate (method) being used. However, most of the aggregates have the property that multiplying all of the weights by the same factor won’t change the final result (an exception is method=count). Also, most (if not all) of them have the properties that an integer weight of N is equivalent to N occurrences of the cell value, and having all weights equal to one produces the same result as when weights are not used. When weights are used, the calculation for method=average is:



  sum(w[i]*x[i]) / sum(w[i])

In the case where all weights are zero, this will end up with both the numerator and denominator to zero, which produces a NULL result.

FLAGS¶

-a: If specified, r.neighbors will not align the output raster map layer with that of the input raster map layer. The r.neighbors program works in the current geographic region. It is recommended, but not required, that the resolution of the geographic region be the same as that of the raster map layer. By default, if unspecified, r.neighbors will align these geographic region settings.
-c: This flag will use a circular neighborhood for the moving analysis window, centered on the current cell.

The exact masks for the first few neighborhood sizes are as follows:

3x3     . X .		5x5	. . X . .	7x7	. . . X . . .


        X O X			. X X X .		. X X X X X .


        . X .			X X O X X		. X X X X X .
				. X X X .		X X X O X X X


 				. . X . .		. X X X X X .
							. X X X X X .


        						. . . X . . .
9x9	. . . . X . . . .		11x11   . . . . . X . . . . .
	. . X X X X X . .			. . X X X X X X X . .


        . X X X X X X X .			. X X X X X X X X X .


        . X X X X X X X .			. X X X X X X X X X .


        X X X X O X X X X			. X X X X X X X X X .


        . X X X X X X X .			X X X X X O X X X X X


        . X X X X X X X .			. X X X X X X X X X .


        . . X X X X X . .			. X X X X X X X X X .


        . . . . X . . . .			. X X X X X X X X X .
				        	. . X X X X X X X . .
				        	. . . . . X . . . . .

NOTES¶

The r.neighbors program works in the current geographic region with the current mask, if any. It is recommended, but not required, that the resolution of the geographic region be the same as that of the raster map layer. By default, r.neighbors will align these geographic region settings. However, the user can select to keep original input and output resolutions which are not aligned by specifying this (e.g., using the -a option).

r.neighbors doesn’t propagate NULLs, but computes the aggregate over the non-NULL cells in the neighborhood.

The -c flag and the weights parameter are mutually exclusive. Any use of the two together will produce an error. Differently-shaped neighborhood analysis windows may be achieved by using the weight= parameter to specify a weights file where all values are equal. The user can also vary the weights at the edge of the neighborhood according to the proportion of the cell that lies inside the neighborhood circle, effectively anti-aliasing the analysis mask.

For aggregates where a weighted calculation isn’t meaningful (specifically: minimum, maximum, diversity and interspersion), the weights are used to create a binary mask, where zero causes the cell to be ignored and any non-zero value causes the cell to be used.

r.neighbors copies the GRASS color files associated with the input raster map layer for those output map layers that are based on the neighborhood average, median, mode, minimum, and maximum. Because standard deviation, variance, diversity, and interspersion are indices, rather than direct correspondents to input values, no color files are copied for these map layers. (The user should note that although the color file is copied for average neighborhood function output, whether or not the color file makes sense for the output will be dependent on the input data values.)

Propagation of output precision¶

The following logic has been implemented: For any aggregate, there are two factors affecting the output type:

1: Whether the input map is integer or floating-point.
2: Whether the weighted or unweighted version of the aggregate is used.

These combine to create four possibilities:

input type/weight	integer	float
no	yes	no	yes
average	float	float	float	float
median	[1]	[1]	float	float
mode	integer	integer	[2]	[2]
minimum	integer	integer	float	float
maximum	integer	integer	float	float
range	integer	integer	float	float
stddev	float	float	float	float
sum	integer	float	float	float
count	integer	float	integer	float
variance	float	float	float	float
diversity	integer	integer	integer	integer
interspersion	integer	integer	integer	integer
quart1	[1]	[1]	float	float
quart3	[1]	[1]	float	float
perc90	[1]	[1]	float	float
quantile	[1]	[1]	float	float

[1] For integer input, quantiles may produce float results from interpolating between adjacent values.
[2] Calculating the mode of floating-point data is essentially meaningless.

With the current aggregates, there are 5 cases:

1: Output is always float: average, variance, stddev, quantiles (with interpolation).
2: Output is always integer: diversity, interspersion.
3: Output is integer if unweighted, float if weighted: count.
4: Output matches input: minimum, maximum, range, mode (subject to note 2 above), quantiles (without interpolation).
5: Output is integer for integer input and unweighted aggregate, otherwise float: sum.

Performance¶

To enable parallel processing, the user can specify the number of threads to be used with the nprocs parameter (default 1). The memory parameter (default 300) can also be provided to determine the size of the buffer for computation.
Figure: Benchmark on the left shows execution time for different number of cells, benchmark in the middle shows execution time for different sizes of neighborhood for a 10000x10000 raster and benchmark on the right shows execution time for different memory size for a 10000x10000 raster. See benchmark scripts in source code. (Intel Core i9-10940X CPU @ 3.30GHz x 28)

To reduce the memory requirements to minimum, set option memory to zero. To take advantage of the parallelization, GRASS GIS needs to be compiled with OpenMP enabled.

EXAMPLES¶

Measure occupancy of neighborhood¶

Set up 10x10 computational region to aid visual inspection of results

g.region rows=10 cols=10

Fill 50% of computational region with randomly located cells. "distance=0" will allow filling adjacent cells.

r.random.cells output=random_cells distance=0 ncells=50

Count non-empty (not NULL) cells in 3x3 neighborhood

r.neighbors input=random_cells output=counts method=count

Optionally - exclude centre cell from the count (= only look around)

r.mapcalc "count_around = if( isnull(random_cells), counts, counts - 1)"

AUTHORS¶

Original version: Michael Shapiro, U.S.Army Construction Engineering Research Laboratory
Updates for GRASS GIS 7 by Glynn Clements and others

SOURCE CODE¶

Available at: r.neighbors source code (history)

Accessed: Thursday Mar 07 18:15:12 2024

Source file:	r.neighbors.1grass.en.gz (from grass-doc 8.3.2-1~exp1)
Source last updated:	2024-03-07T17:18:27Z
Converted to HTML:	2024-03-08T09:56:21Z