SALib.analyze.pawn module#

SALib.analyze.pawn.analyze(problem: Dict, X: ndarray, Y: ndarray, S: int = 10, print_to_console: bool = False, seed: int = None)[source]#

Performs PAWN sensitivity analysis.

The PAWN method [1] is a moment-independent approach to Global Sensitivity Analysis (GSA). It is described as producing robust results at relatively low sample sizes (see [2]) for the purpose of factor ranking and screening.

The distribution of model outputs is examined rather than their variation as is typical in other common GSA approaches. The PAWN method further distinguishes itself from other moment-independent approaches by characterizing outputs by their cumulative distribution function (CDF) as opposed to their probability distribution function. As the CDF for a given random variable is typically normally distributed, PAWN can be more appropriately applied when outputs are highly-skewed or multi-modal, for which variance-based methods may produce unreliable results.

PAWN characterizes the relationship between inputs and outputs by quantifying the variation in the output distributions after conditioning an input. A factor is deemed non-influential if distributions coincide at all S conditioning intervals. The Kolmogorov-Smirnov statistic is used as a measure of distance between the distributions.

This implementation reports the PAWN index at the min, mean, median, and max across the slides/conditioning intervals as well as the coefficient of variation (CV) and standard deviation (stdev). The median value is the typically reported value. As the CV is (standard deviation / mean), it indicates the level of variability across the slides, with values closer to zero indicating lower variation.

Notes

Compatible with:

all samplers

This implementation ignores all NaNs.

When applied to grouped factors, the analysis is conducted on each factor individually, and the mean of their results are reported.

Examples

>>> X = latin.sample(problem, 1000)
>>> Y = Ishigami.evaluate(X)
>>> Si = pawn.analyze(problem, X, Y, S=10, print_to_console=False)
Parameters:
  • problem (dict) – The problem definition

  • X (numpy.array) – A NumPy array containing the model inputs

  • Y (numpy.array) – A NumPy array containing the model outputs

  • S (int) – Number of slides; the conditioning intervals (default 10)

  • print_to_console (bool) – Print results directly to console (default False)

  • seed (int) – Seed value to ensure deterministic results

References

  1. Pianosi, F., Wagener, T., 2015.

    A simple and efficient method for global sensitivity analysis based on cumulative distribution functions. Environmental Modelling & Software 67, 1-11. https://doi.org/10.1016/j.envsoft.2015.01.004

  2. Pianosi, F., Wagener, T., 2018.

    Distribution-based sensitivity analysis from a generic input-output sample. Environmental Modelling & Software 108, 197-207. https://doi.org/10.1016/j.envsoft.2018.07.019

  3. Baroni, G., Francke, T., 2020.

    An effective strategy for combining variance- and distribution-based global sensitivity analysis. Environmental Modelling & Software, 134, 104851. https://doi.org/10.1016/j.envsoft.2020.104851

  4. Baroni, G., Francke, T., 2020.

    GSA-cvd Combining variance- and distribution-based global sensitivity analysis baronig/GSA-cvd

SALib.analyze.pawn.cli_action(args)[source]#
SALib.analyze.pawn.cli_parse(parser)[source]#