Package 'mop' reference manual

Title:	Mobility Oriented-Parity Metric
Description:	A set of tools to perform multiple versions of the Mobility Oriented-Parity metric. This multivariate analysis helps to characterize levels of dissimilarity between a set of conditions of reference and another set of conditions of interest. If predictive models are transferred to conditions different from those over which models were calibrated (trained), this metric helps to identify transfer conditions that differ substantially from those of calibration. These tools are implemented following principles proposed in Owens et al. (2013) <doi:10.1016/j.ecolmodel.2013.04.011>, and expanded to obtain more detailed results that aid in interpretation.
Authors:	Marlon E. Cobos [aut, cre] , Hannah L. Owens [aut] , Jorge Soberón [aut] , A. Townsend Peterson [aut]
Maintainer:	Marlon E. Cobos <[email protected]>
License:	GPL (>= 3)
Version:	0.1.2
Built:	2025-03-03 04:18:00 UTC
Source:	https://github.com/marlonecobos/mop

mop: Mobility Oriented-Parity Metric

Description

mop contains a set of tools to calculate the Mobility Oriented-Parity metric, which allows a user to compare a set of conditions of reference versus another set of of interest.

Details

The main goals of the MOP metric are to explore conditions in the set of interest that are non-analogous to those in the reference set, and to quantify how different conditions in the set of interest are from the reference set. The tools included here help to identify conditions outside the ranges of the reference set with greater detail than in other implementations. These tools are based on the methods proposed by Owens et al. (2013; doi:10.1016/j.ecolmodel.2013.04.011).

Functions in mop

mop, mop_distance, out_range, match_na_raster

Data included

reference_matrix, matrix_of_interest, reference_layers, layers_of_interest

Author(s)

Maintainer: Marlon E. Cobos [email protected] (ORCID)

Authors:

Hannah L. Owens [email protected] (ORCID)
Jorge Soberón [email protected] (ORCID)
A. Townsend Peterson [email protected] (ORCID)

Example of variables for a set of interest

Description

A SpatRaster object representing variables in a set of interest. Variables represent future bioclimatic variables downloaded from the WorldClim database (https://worldclim.org/).

Format

A SpatRaster object.

Value

No return value. Used with function rast to bring raster variables to analysis.

Examples

layers_of_interest <- terra::rast(system.file("extdata",
                                              "layers_of_interest.tif",
                                              package = "mop"))

terra::plot(layers_of_interest)
layers_of_interest <- terra::rast(system.file("extdata",
                                              "layers_of_interest.tif",
                                              package = "mop"))

terra::plot(layers_of_interest)

Match NA cells for all layers in SpatRaster

Description

Option to match cells with NA values in a SpatRaster with multiple layers.

Usage

match_na_raster(layers)
match_na_raster(layers)

Arguments

layers

a SpatRaster object containing two or more variables to be matched.

Value

A SpatRaster object with NA cells matching in all layers.

Examples

# data
layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                      package = "mop"))

# add NA in some places
layers[20:24, 10:16][, 3] <- NA
terra::plot(layers)

# match NAs
matched <- match_na_raster(layers)
terra::plot(matched)
# data
layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                      package = "mop"))

# add NA in some places
layers[20:24, 10:16][, 3] <- NA
terra::plot(layers)

# match NAs
matched <- match_na_raster(layers)
terra::plot(matched)

Example of matrix with variables in a set of interest

Description

A numeric table representing variables in a set of interest.

Usage

matrix_of_interest
matrix_of_interest

Format

A matrix with 723 rows and 6 columns.

Examples

data("matrix_of_interest", package = "mop")

head(matrix_of_interest)
data("matrix_of_interest", package = "mop")

head(matrix_of_interest)

Analysis of extrapolation risks using the MOP metric

Description

Analysis to calculate the mobility-oriented parity metric and other sub-products to represent dissimilarities and non-analogous conditions when comparing a set of reference conditions (M; m) against another set of conditions of interest (G; g).

Usage

mop(m, g, type = "basic",  calculate_distance = FALSE,
    where_distance = "in_range", distance = "euclidean",
    scale = FALSE, center = FALSE, fix_NA = TRUE, percentage = 1,
    comp_each = 2000, tol = NULL, rescale_distance = FALSE,
    parallel = FALSE, n_cores = NULL, progress_bar = TRUE)
mop(m, g, type = "basic",  calculate_distance = FALSE,
    where_distance = "in_range", distance = "euclidean",
    scale = FALSE, center = FALSE, fix_NA = TRUE, percentage = 1,
    comp_each = 2000, tol = NULL, rescale_distance = FALSE,
    parallel = FALSE, n_cores = NULL, progress_bar = TRUE)

Arguments

`m`	a `SpatRaster` or matrix of variables representing a set of conditions of reference (e.g., the set of conditions in which a model was calibrated). If a matrix is used, each column represents a variable.
`g`	a `SpatRaster` or matrix of variables representing a set of conditions of interest for which dissimilarity values and non-analogous conditions will be detected (e.g., conditions in which a model is projected). Variable names must match between `m` and `g`.
`type`	`character`, type of MOP analysis to be performed. See `Details` for options.
`calculate_distance`	`logical`, whether to calculate distances (dissimilarities) between `m` and `g`. The default, FALSE, runs rapidly and does not assess dissimilarity levels.
`where_distance`	`character`, where to calculate distances, considering how conditions in `g` are positioned in comparison to the range of conditions in `m`. See `Details` for options.
`distance`	`character`, which distances are calculated, `euclidean` or `mahalanobis`. Valid if `calculate_distance = TRUE`.
`scale`	scaling options, `logical` or `numeric-alike` as in `scale`.
`center`	`logical` or `numeric-alike` center options as in `scale`.
`fix_NA`	`logical`, whether to fix layers so cells with NA values are the same in all layers. Setting to FALSE may save time if the rasters are big and have no NA matching problems.
`percentage`	`numeric`, percentage of `m` closest conditions used to derive mean environmental distances to each combination of conditions in `g`.
`comp_each`	`numeric`, number of combinations in `g` to be used for distance calculations at a time. Increasing this number requires more RAM.
`tol`	tolerance to detect linear dependencies when calculating Mahalanobis distances. The default, NULL, uses `.Machine$double.eps`.
`rescale_distance`	`logical`, whether or not to re-scale distances 0-1. Re-scaling prevents comparisons of dissimilarity values obtained from runs with different values of `percentage`.
`parallel`	`logical`, whether calculations should be performed in parallel using `n_cores` of the computer. Using this option will speed up the analysis but will demand more RAM.
`n_cores`	`numeric`, number of cores to be used in parallel processing. If `parallel = TRUE` and `n_cores = NULL` (all CPU cores on current host - 1) will be used.
`progress_bar`	`logical`, whether to show a progress bar.

Details

type options return results that differ in the detail of how non-analogous conditions are identified.

basic - makes calculation as proposed by Owens et al. (2013) doi:10.1016/j.ecolmodel.2013.04.011.
simple - calculates how many variables in the set of interest are non-analogous to those in the reference set.
detailed - calculates five additional extrapolation metrics. See mop_detailed under Value below for full details.

where_distance options determine what values should be used to calculate dissimilarity

in_range - only conditions inside m ranges
out_range - only conditions outside m ranges
all - all conditions

When the variables used to represent conditions have different units, scaling and centering are recommended. This step is only valid when Euclidean distances are used.

Value

A object of class mop_results containing:

summary - a list with details of the data used in the analysis:
- variables - names of variables considered.
- type - type of MOP analysis performed.
- scale - value according to the argument scale.
- center - value according to the argument center.
- calculate_distance - value according to the argument calculate_distance.
- distance - option regarding distance used.
- percentage - percentage of m used as reference for distance calculation.
- rescale_distance - value according to the argument rescale_distance.
- fix_NA - value according to the argument fix_NA.
- N_m - total number of elements (cells with values or valid rows) in m.
- N_g - total number of elements (cells with values or valid rows) in g.
- m_ranges - matrix with ranges of variables in reference conditions (m).
mop_distances - if calculate_distance = TRUE, a SpatRaster or vector with distance values for the set of interest (g). Higher values represent greater dissimilarity compared to the set of reference (m).
mop_basic - a SpatRaster or vector, for the set of interest, representing conditions in which at least one of the variables is non-analogous to the set of reference. Values should be: 1 for non-analogous conditions, and NA for conditions inside the ranges of the reference set.
mop_simple - a SpatRaster or vector, for the set of interest, representing how many variables in the set of interest are non-analogous to those in the reference set. NA is used for conditions inside the ranges of the reference set.
mop_detailed - a list containing:
- interpretation_combined - a data.frame to help identify combinations of variables in towards_low_combined and towards_high_combined that are non-analogous to m.
- towards_low_end - a SpatRaster or matrix for all variables representing where non-analogous conditions were found towards low values of each variable.
- towards_high_end - a SpatRaster or matrix for all variables representing where non-analogous conditions were found towards high values of each variable.
- towards_low_combined - a SpatRaster or vector with values representing the identity of the variables found to have non-analogous conditions towards low values. If vector, interpretation requires the use of the data.frame interpretation_combined.
- towards_high_combined - a SpatRaster or vector with values representing the identity of the variables found to have non-analogous conditions towards high values. If vector, interpretation requires the use of the data.frame interpretation_combined.

Examples

# data
reference_layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                                            package = "mop"))

layers_of_interest <- terra::rast(system.file("extdata",
                                              "layers_of_interest.tif",
                                              package = "mop"))

# analysis
mop_res <- mop(m = reference_layers, g = layers_of_interest)

summary(mop_res)
# data
reference_layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                                            package = "mop"))

layers_of_interest <- terra::rast(system.file("extdata",
                                              "layers_of_interest.tif",
                                              package = "mop"))

# analysis
mop_res <- mop(m = reference_layers, g = layers_of_interest)

summary(mop_res)

MOP distance calculation

Description

Calculates distances from each of the points of interest in g_matrix to a defined percentage of the reference conditions in m_matrix.

Usage

mop_distance(m_matrix, g_matrix, distance = "euclidean", percentage = 1,
             comp_each = 2000, tol = NULL, parallel = FALSE, n_cores = NULL,
             progress_bar = TRUE)
mop_distance(m_matrix, g_matrix, distance = "euclidean", percentage = 1,
             comp_each = 2000, tol = NULL, parallel = FALSE, n_cores = NULL,
             progress_bar = TRUE)

Arguments

`m_matrix`	matrix of variables representing the set of conditions to be used as reference. Each column represents a variable.
`g_matrix`	matrix of variables representing the set of conditions to be compared against the reference conditions (where distances are to be calculated). Each column represents a variable. Variable names must match those in `m_matrix`.
`distance`	`character`, one of two options: "euclidean" or "mahalanobis".
`percentage`	`numeric`, percentage of points of m (the closest ones) used to derive mean environmental distances to each g point.
`comp_each`	`numeric`, number of points of the g matrix to be used for distance calculations at a time (default = 2000). Increasing this number requires more RAM.
`tol`	tolerance to detect linear dependencies when calculating Mahalanobis distances. The default, NULL, uses `.Machine$double.eps`.
`parallel`	`logical`, if TRUE, calculations will be performed in parallel using `n_cores` of the computer. Using this option will speed up the analysis but will demand more RAM.
`n_cores`	`numeric`, number of cores to be used in parallel processing. Uses current host CPU cores - 1 by default.
`progress_bar`	`logical`, whether to show a progress bar for calculations. Valid when calculations are not run in parallel.

Value

A numeric vector with values of distances calculated according to parameters used.

Examples

# data
data("reference_matrix", package = "mop")
data("matrix_of_interest", package = "mop")

# analysis
mop_dist <- mop_distance(m_matrix = reference_matrix,
                         g_matrix = matrix_of_interest)
# data
data("reference_matrix", package = "mop")
data("matrix_of_interest", package = "mop")

# analysis
mop_dist <- mop_distance(m_matrix = reference_matrix,
                         g_matrix = matrix_of_interest)

Constructor of S3 objects of class mop_results

Description

Constructor of S3 objects of class mop_results

Usage

new_mop_results(summary = new("list"), mop_distances = NULL,
                mop_basic = NULL, mop_simple = NULL,
                mop_detailed = new("list"))
new_mop_results(summary = new("list"), mop_distances = NULL,
                mop_basic = NULL, mop_simple = NULL,
                mop_detailed = new("list"))

Arguments

`summary`	a list with a summary of the data and parameters used in analysis. Default = empty list.
`mop_distances`	a `SpatRaster` or numeric vector of distances from the set of conditions of reference to the set of conditions of interest. Default = NULL.
`mop_basic`	a `SpatRaster` or numeric vector showing conditions in the set of interest outside the ranges in the reference set. The value `1` indicates conditions outside one or more ranges. Default = NULL.
`mop_simple`	a `SpatRaster` or numeric vector showing conditions in the set of interest outside the ranges in the reference set. Values indicate how many variables are outside reference ranges. Default = NULL.
`mop_detailed`	a list with a detailed representation of mop results in conditions outside the range of reference. Default = empty list.

Value

An object of class mop_results.

Detect values outside ranges of reference conditions

Description

Options to identify which values in a set of conditions of interest (g_matrix) are outside the range of a set of conditions of reference (m_matrix).

Usage

out_range(m_matrix, g_matrix, type = "basic")
out_range(m_matrix, g_matrix, type = "basic")

Arguments

`m_matrix`	matrix of variables representing the set of conditions to be used as reference. Each column represents a variable.
`g_matrix`	matrix of variables representing the set of conditions to be compared against the reference conditions (where conditions outside range are to be detected). Each column represents a variable. Variable names must match those in `m_matrix`.
`type`	`character`, type of identification to be performed. See `Details` for options.

Details

Results are produced according to type:

basic - helps to identify conditions outside ranges, in general, one or variables are only counted as 1. This is always returned.
simple - identifies the number of variables with conditions outside ranges, for each condition of interest outside ranges, the number of non-analogous variables is returned.
detailed - produces various results (including the two above):
- high_all - identifies non-analogous conditions towards high values of variables, for each variable independently.
- low_all - identifies non-analogous conditions towards low values of variables, for each variable independently.
- high_combined - values are used to identify combinations of variables with non-analogous conditions towards high values of the variables.
- low_combined - values are used to identify combinations of variables with non-analogous conditions towards low values of the variables.
- interpretation - a data.frame to help identify which variables are considered in combined results.

Value

A list containing the ranges in m_matrix, results from analysis according to type, and table to help with interpretations. NA values represent conditions of interest inside ranges of reference conditions. See Details.

Examples

# data
data("reference_matrix", package = "mop")
data("matrix_of_interest", package = "mop")

# analysis
out <- out_range(m_matrix = reference_matrix,
                 g_matrix = matrix_of_interest)
# data
data("reference_matrix", package = "mop")
data("matrix_of_interest", package = "mop")

# analysis
out <- out_range(m_matrix = reference_matrix,
                 g_matrix = matrix_of_interest)

Print a short version of elements in mop objects

Description

Print a short version of elements in mop objects

Usage

## S3 method for class 'mop_results'
print(x, ...)
## S3 method for class 'mop_results'
print(x, ...)

Arguments

`x`	object of class mop_results.
`...`	further arguments to be passed to or from other methods. Ignored in this function.

Value

A short description of objects in the console.

Example of variables for a set of reference

Description

A SpatRaster object representing variables in a set of reference. Variables represent current bioclimatic variables downloaded from the WorldClim database (https://worldclim.org/).

Format

A SpatRaster object.

Value

No return value. Used with function rast to bring raster variables to analysis.

Examples

reference_layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                                            package = "mop"))

terra::plot(reference_layers)
reference_layers <- terra::rast(system.file("extdata", "reference_layers.tif",
                                            package = "mop"))

terra::plot(reference_layers)

Example of matrix with variables in a set of reference

Description

A numeric table representing variables in a set of reference.

Usage

reference_matrix
reference_matrix

Format

A matrix with 723 rows and 6 columns.

Examples

data("reference_matrix", package = "mop")

head(reference_matrix)
data("reference_matrix", package = "mop")

head(reference_matrix)

Summary of attributes and results

Description

Summary of attributes and results

Usage

## S3 method for class 'mop_results'
summary(object, ...)
## S3 method for class 'mop_results'
summary(object, ...)

Arguments

`object`	object of class mop_results.
`...`	additional arguments affecting the summary produced. Ignored in this function.

Value

A printed summary.

Package 'mop'

Help Index

mop: Mobility Oriented-Parity Metric

Description

Details

Functions in mop

Data included

Author(s)

See Also

Example of variables for a set of interest

Description

Format

Value

Examples

Match NA cells for all layers in SpatRaster

Description

Usage

Arguments

Value

Examples

Example of matrix with variables in a set of interest

Description

Usage

Format

Examples

Analysis of extrapolation risks using the MOP metric

Description

Usage

Arguments

Details

Value

See Also

Examples

MOP distance calculation

Description

Usage

Arguments

Value

Examples

Constructor of S3 objects of class mop_results

Description

Usage

Arguments

Value

Detect values outside ranges of reference conditions

Description

Usage

Arguments

Details

Value

Examples

Print a short version of elements in mop objects

Description

Usage

Arguments

Value

Example of variables for a set of reference

Description

Format

Value

Examples

Example of matrix with variables in a set of reference

Description

Usage

Format

Examples

Summary of attributes and results

Description

Usage

Arguments

Value