Methods

gsax implements ten methods for global sensitivity analysis (GSA). All of them answer the same broad question — which input parameters actually drive my model's output? — but they differ in what exactly they measure, how many model evaluations they cost, and whether they need a dedicated sampling design or can work with data you already have.

If you're new to the package, start with Choosing a Method, then jump to the section for the method you picked. Each method section opens with what it measures and when you'd choose it, followed by the estimator details.

Throughout this page, $D$ is the number of input parameters and $N$ is a sample count.

Choosing a Method

Three questions narrow the field quickly.

1. Can you still choose where to run the model? Four methods need their own sampling design, which gsax generates for you: Sobol' (Saltelli matrices), eFAST (search curves), Morris (trajectories), and DGSM (plain Monte Carlo plus autodiff). The other six — HDMR, PCE, Shapley effects, HSIC, PAWN, and Borgonovo delta — are given-data methods: they accept any set of $(X, Y)$ pairs, including simulation runs you already have.

2. What should the number mean? Variance-based methods (Sobol', HDMR, PCE, eFAST, Shapley) report fractions of output variance — "parameter 3 explains 40% of the output's spread". Screening methods (Morris, DGSM) trade that precision for cheap, reliable rankings. Moment-independent methods (HSIC, PAWN, Borgonovo delta) measure how strongly an input affects the whole output distribution — the right lens when your output is skewed or heavy-tailed and variance feels like the wrong summary.

3. What's your evaluation budget? Sobol' needs $N (2 D + 2)$ model runs by default ( $N$ typically 1024+). Morris needs only $r (D + 1)$ with $r \approx 10 – 50$ trajectories. DGSM gets the whole gradient for roughly the price of one evaluation per sample point (JAX-differentiable models only). The given-data methods cost nothing beyond the runs you already have.

Common situations:

"I can run the model freely and want the standard variance decomposition." Use Sobol' via Saltelli sampling — the reference method, with first-order, total-order, and second-order indices.
"My model is expensive and has many parameters." Screen first with Morris ( $r (D + 1)$ runs), or with DGSM if the model is JAX-differentiable. Fix the negligible parameters, then spend the remaining budget on Sobol' for the survivors.
"I only have existing simulation data." Any given-data method works. HDMR or PCE for variance-based indices via a surrogate; HSIC, PAWN, or Borgonovo delta for distribution-based indices.
"My inputs are correlated." Sobol', PCE, eFAST, DGSM, Morris, and Shapley all assume independent inputs. Use HDMR (which separates structural from correlation-induced variance), or HSIC / PAWN / Borgonovo delta (which make no independence assumption).
"My output distribution is skewed or heavy-tailed." Use PAWN or Borgonovo delta — both compare whole output distributions rather than variances.
"I want one fair importance number per parameter that sums to 1." Use Shapley effects.
"I also want a fast emulator of my model." Use HDMR (emulate_hdmr) or PCE (emulate_pce).

Comparison table

Consideration	Sobol'	HDMR	PCE	Shapley	eFAST	DGSM	Morris	HSIC	PAWN	Borgonovo delta
Sampling requirement	Structured Saltelli design, $N (2 D + 2)$ evaluations (default)	Any $(X, Y)$ pairs	Any $(X, Y)$ pairs	Any $(X, Y)$ pairs	Search curves, $N \times D$ evaluations	Plain MC, $N$ evaluations + autodiff	Trajectory or radial design, $r (D + 1)$ evaluations (deduplicated)	Any $(X, Y)$ pairs	Any $(X, Y)$ pairs	Any $(X, Y)$ pairs
Input independence	Assumed	Handled via ANCOVA decomposition	Assumed	Assumed (dependent-input Shapley is future work)	Assumed	Assumed	Assumed	Not assumed	Not assumed	Not assumed
Input distributions	Uniform + Gaussian	Uniform + Gaussian (via CDF mapping)	Uniform + Gaussian	Uniform + Gaussian (both backends)	Uniform + Gaussian	Uniform + Gaussian (+ truncated Normal)	Uniform + Gaussian (truncated-quantile grid)	Uniform + Gaussian (via CDF mapping)	Uniform + Gaussian (via CDF mapping)	Any (rank-based classes; marginals not used)
Output shapes	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series (both backends)	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series	Scalar, multi-output, time-series
What the numbers mean	Exact variance fractions (given enough samples)	Variance fractions from a B-spline surrogate (fit-dependent)	Variance fractions from a polynomial surrogate (fit-dependent)	Exact allocation within the fitted surrogate; depends on fit quality	Exact variance fractions (given enough samples)	Bounds on $S_{T}$ , not exact indices	Screening ranks ( $μ^{*}$ as $S_{T}$ proxy), not variance fractions	Dependence measure, not variance fractions	Distributional (KS) distance, not variance fractions	Distributional (L1) distance, not variance fractions
Second-order indices	Direct estimation from cross-matrices	From interaction component functions	Analytical from coefficients	Not available (interaction variance folded into $Sh$ )	Not available	Not available	Not available	Not available	Not available	Not available
Interaction detection	Via $S_{2}$ and the gap $S_{T} - S_{1}$	Via explicit interaction component functions	Via $S_{2}$ from coefficients	Via the gaps $Sh - S_{1}$ and $S_{T} - Sh$	Via the gap $S_{T} - S_{1}$ only	Not available (bounds only)	Via large $σ$ relative to $μ^{*}$ (not pair-attributable)	Via the Total HSIC − R2-HSIC gap	Not available (first-order only)	Not available (the $δ - S_{1}$ gap flags influence beyond first-order variance)
Surrogate/emulator	No	Yes (`emulate_hdmr`)	Yes (`emulate_pce`)	Fits HDMR or PCE internally (no emulator returned)	No	No	No	No	No	No

Background: Variance-Based Sensitivity Analysis

Why Global Sensitivity Analysis?

Unlike local sensitivity methods (e.g. partial derivatives at a nominal point), global sensitivity analysis explores the entire parameter space. This matters for non-linear models, where interactions and non-monotonic responses mean a gradient at one point can be misleading. GSA quantifies each parameter's contribution to output uncertainty across the whole input domain.

In practice, GSA serves several roles:

Parameter identifiability: parameters with near-zero sensitivity across all outputs are effectively unidentifiable from data and may need to be fixed rather than estimated; high-sensitivity parameters are the ones data can constrain.
Experimental design: for time-series outputs, watching sensitivity indices evolve over time helps pick measurement times when outputs are most informative about the parameters of interest.
Model simplification: if interaction indices are negligible, the model response is approximately additive, and simpler surrogate models may suffice.

The Hoeffding–Sobol' Decomposition

The theoretical foundation of variance-based GSA is the Hoeffding (ANOVA) decomposition. Any square-integrable function $f (X)$ of $D$ independent inputs can be uniquely decomposed into summands of increasing dimensionality:

f (X) = f_{0} + \sum_{i = 1}^{D} f_{i} (X_{i}) + \sum_{i < j} f_{i j} (X_{i}, X_{j}) + \dots + f_{1, 2, \dots, D} (X_{1}, \dots, X_{D})

where $f_{0} = E [f (X)]$ is the overall mean, each $f_{i}$ captures the main effect of parameter $i$ , each $f_{i j}$ captures the pairwise interaction between $i$ and $j$ , and so on. Because these component functions are mutually orthogonal, the total output variance decomposes additively:

Var (Y) = \sum_{i} V_{i} + \sum_{i < j} V_{i j} + \dots + V_{1, 2, \dots, D}

where $V_{i} = Var [f_{i} (X_{i})]$ , $V_{i j} = Var [f_{i j} (X_{i}, X_{j})]$ , etc.

Sobol' Sensitivity Indices

Dividing each variance component by $Var (Y)$ yields the Sobol' sensitivity indices:

First-order index $S_{i}$ — the fraction of output variance you could remove by fixing parameter $i$ at its true value (its main effect, ignoring interactions):

S_{i} = \frac{V_{i}}{Var (Y)} = \frac{{Var}_{X_{i}} [E_{X_{\sim i}} (Y ∣ X_{i})]}{Var (Y)}

Second-order index $S_{i j}$ — the additional variance from the pairwise interaction between $i$ and $j$ , beyond their individual main effects:

S_{i j} = \frac{V_{i j}}{Var (Y)}

Total-order index $S_{T_{i}}$ — the fraction of output variance parameter $i$ is involved in at all, counting every interaction it participates in. A parameter with $S_{T_{i}} \approx 0$ can safely be fixed:

S_{T_{i}} = \frac{E_{X_{\sim i}} [{Var}_{X_{i}} (Y ∣ X_{\sim i})]}{Var (Y)} = 1 - \frac{{Var}_{X_{\sim i}} [E_{X_{i}} (Y ∣ X_{\sim i})]}{Var (Y)}

where $X_{\sim i}$ denotes all inputs except $X_{i}$ . By construction, $S_{T_{i}} \geq S_{i}$ always holds, with equality when parameter $i$ has no interactions. The gap $S_{T_{i}} - S_{i}$ quantifies how much of parameter $i$ 's influence comes through interactions.

Sobol' Indices via Saltelli Sampling

This is the reference method and gsax's default workflow: exact, model-free variance decomposition with well-understood convergence. Pick it when you can afford a dedicated sampling design and your inputs are independent. gsax uses the Saltelli sampling scheme (Saltelli 2002, 2010), which arranges quasi-random sample matrices so that first-order ( $S_{1}$ ), total-order ( $S_{T}$ ), and second-order ( $S_{2}$ ) indices can all be estimated from a single batch of model evaluations.

The Pick-Freeze Sampling Scheme

The method generates two independent $N \times D$ quasi-random sample matrices $A$ and $B$ using a Sobol' low-discrepancy sequence (via scipy.stats.qmc.Sobol). For each parameter $j$ , a cross-matrix ${AB}^{(j)}$ is constructed by taking all columns from $A$ except column $j$ , which is replaced by column $j$ from $B$ . This "pick-and-freeze" construction allows conditional expectations to be estimated via sample averages.

The cost is $N (D + 2)$ model evaluations for all first-order and total-order indices, or $N (2 D + 2)$ when second-order indices are included (calc_second_order=True, the default).

Estimators

gsax implements the following estimators:

First-order — Saltelli (2010):

{\hat{S}}_{i} = \frac{\frac{1}{N} \sum_{n = 1}^{N} f (B)_{n} \cdot (f ({AB}^{(i)})_{n} - f (A)_{n})}{Var (Y)}

Total-order — Jansen (1999):

{\hat{S}}_{T_{i}} = \frac{\frac{1}{2 N} \sum_{n = 1}^{N} {(f (A)_{n} - f ({AB}^{(i)})_{n})}^{2}}{Var (Y)}

Variance normalisation: all estimators normalise by a pooled output variance computed over the concatenation of $A$ and $B$ outputs (i.e. $Var ([A; B])$ over $2 N$ points). Pooling both base-sample vectors doubles the effective sample size and gives a more robust variance estimate.

How to use it

gsax.sample() generates the Sobol' quasi-random sequence and builds the Saltelli cross-matrices. Duplicate rows are removed so your model only evaluates unique input points.
You evaluate your model on sampling_result.samples.
gsax.analyze() reconstructs the Saltelli layout internally and computes all indices in a single jit(vmap(...)) pass.

Two optional knobs align results with SALib. gsax.analyze(..., prenormalize=True) applies SALib-style output standardization once per output slice before computing the estimators, which changes the point-estimate path to match SALib more closely. When bootstrapping (num_resamples > 0), ci_method="quantile" reports percentile bootstrap bounds and ci_method="gaussian" reports symmetric bounds from the bootstrap standard deviation; either way, gsax returns explicit lower/upper endpoint arrays rather than SALib's symmetric confidence widths.

Index summary

Index	Meaning
$S_{1} (i)$	Fraction of output variance due to parameter $i$ alone (main effect).
$S_{T} (i)$	Fraction of output variance due to parameter $i$ including all its interactions. $S_{T} \geq S_{1}$ always.
$S_{2} (i, j)$	Fraction of output variance due to the pairwise interaction between $i$ and $j$ , beyond their individual effects.

When to use Sobol': you can afford the structured Saltelli design — $N (D + 2)$ evaluations for first/total only, $N (2 D + 2)$ with second-order (default) — and want exact, model-free variance decomposition with independent inputs.

RS-HDMR (Random Sampling High-Dimensional Model Representation)

RS-HDMR is a given-data, variance-based method: it fits a B-spline surrogate to any set of $(X, Y)$ pairs and derives sensitivity indices analytically from the surrogate's variance decomposition. Pick it when model runs are expensive and you want to reuse existing data, when your inputs may be correlated, or when you also want a fast emulator of the model.

Theoretical Background

High-Dimensional Model Representation (HDMR) exploits the observation that, for many practical problems, only the low-order interactions among input variables significantly influence the output. The RS-HDMR variant constructs component functions from randomly sampled input–output data, rather than requiring structured grids. The model is decomposed as:

f (X) \approx f_{0} + \sum_{i} f_{i} (X_{i}) + \sum_{i < j} f_{i j} (X_{i}, X_{j}) + \sum_{i < j < k} f_{i j k} (X_{i}, X_{j}, X_{k})

where each component function is expanded in a B-spline basis and fitted via backfitting with Tikhonov regularisation.

ANCOVA Decomposition

Unlike the classical Sobol' decomposition, which assumes independent inputs, RS-HDMR uses an ANCOVA (analysis of covariance) decomposition that separates each component's variance into:

Structural variance ( $S_{a}$ ): the contribution that would remain if all inputs were independent — analogous to the classical Sobol' index.
Correlative variance ( $S_{b}$ ): the additional contribution arising from correlations between inputs.

This distinction matters because many real-world models have correlated inputs (e.g. coupled physical parameters), and conflating structural and correlative contributions can produce misleading sensitivity rankings.

How to use it

You provide any set of $(X, Y)$ pairs — no sampling design required.
gsax.analyze_hdmr() maps inputs to $[0, 1]$ via their marginal CDFs, optionally standardises outputs once over the sample axis (prenormalize=True), builds B-spline basis matrices, and fits component functions via backfitting with Tikhonov regularisation.
The ANCOVA decomposition splits each component's variance into structural ( $S_{a}$ ) and correlative ( $S_{b}$ ) parts. Total-order indices ( $S_{T}$ ) sum contributions from all terms involving a given parameter.

When prenormalization is enabled, the surrogate is trained on standardized outputs, but gsax.hdmr.emulate() (or the top-level alias gsax.emulate_hdmr()) maps predictions back to the original output scale before returning them.

Index summary

Index	Meaning
$S_{a} (t)$	Structural (uncorrelated) variance contribution of term $t$ . For first-order terms with independent inputs, equivalent to Sobol' $S_{1}$ .
$S_{b} (t)$	Correlative variance contribution of term $t$ (due to input correlations).
$S (t)$	Total contribution per term: $S_{a} + S_{b}$ .
$S_{T} (i)$	Total-order per parameter: sum of $S$ for all terms involving parameter $i$ .

When to use HDMR:

Model evaluations are expensive and you want to reuse existing runs
Inputs may be correlated (Sobol' assumes independent inputs)
You need a surrogate/emulator for fast prediction at new inputs (gsax.hdmr.emulate)

PCE (Polynomial Chaos Expansion)

PCE is the second given-data, surrogate-based route to Sobol indices: it fits an orthogonal polynomial surrogate to $(X, Y)$ data and reads the indices directly from the expansion coefficients (Sudret, 2008), with no Monte Carlo estimation noise. Pick it when your model is smooth. The polynomial basis follows the Wiener-Askey scheme: Legendre polynomials for uniform inputs, Hermite polynomials for unbounded Gaussian inputs; truncated Gaussian inputs use Legendre polynomials after CDF mapping to $[- 1, 1]$ .

How to use it

You provide any set of $(X, Y)$ pairs; Y may be scalar (N,), multi-output (N, K), or time-series (N, T, K) — all output slices share one polynomial basis and are fitted in a single solve.
gsax.analyze_pce() maps inputs to the appropriate reference domain, builds the design matrix from a total-degree multi-index, and fits coefficients via regularized least squares.
Sobol indices ( $S_{1}$ , $S_{T}$ , $S_{2}$ ) are computed analytically from the squared coefficients.
Leave-one-out cross-validation RMSE quantifies surrogate accuracy.

When to use PCE:

You want analytical Sobol indices without Monte Carlo sampling noise
Your model is smooth enough to be well-approximated by low-order polynomials
You have mixed uniform and Gaussian inputs (the Wiener-Askey scheme selects the appropriate basis automatically)
You need a fast emulator (emulate_pce mirrors the training output layout)

Shapley Effects

The Shapley effect ${Sh}_{i}$ is a single, fairly allocated importance score per parameter: each parameter's share of the output variance, with every interaction split evenly among its participants, so the scores sum to exactly 1. Pick it when you need one defensible number per parameter — for ranking, reporting, or budget allocation — rather than the two-sided $S_{1}$ / $S_{T}$ view. It applies the Shapley value from cooperative game theory to variance-based sensitivity analysis, treating the output variance as a payout divided among the inputs, viewed as players whose coalition worths are the partial variances of the ANOVA decomposition (Owen, 2014; Song, Nelson & Staum, 2016). Like HDMR and PCE, it is a given-data method: any set of $(X, Y)$ pairs works.

Theoretical Background

For independent inputs, the Hoeffding–Sobol' decomposition splits the output variance into partial variances $V_{u}$ indexed by subsets $u \subseteq {1, \dots, D}$ of the parameters. The Shapley effect of parameter $i$ allocates each interaction's variance equally among its participants:

{Sh}_{i} = \sum_{u ∋ i} \frac{V_{u}}{| u |}

so a main-effect variance $V_{i}$ is attributed entirely to parameter $i$ , a pairwise interaction variance $V_{i j}$ is split half-and-half between $i$ and $j$ , and so on. Under independent inputs this yields:

Bracketing: $S_{1, i} \leq {Sh}_{i} \leq S_{T, i}$ — the Shapley effect always lies between the first-order and total-order Sobol indices.
Exact partition: unlike $S_{1}$ (which omits interactions, so $\sum_{i} S_{1, i} \leq 1$ ) and $S_{T}$ (which counts each interaction once per participant, so $\sum_{i} S_{T, i} \geq 1$ ), Shapley effects split every interaction fairly and sum to exactly 1 with no gaps or double counting.

Independence assumption (v1 limitation): gsax currently assumes independent inputs. The Shapley value is particularly attractive for dependent inputs — where Sobol indices lose their clean interpretation — but the dependent-input formulation requires conditional-variance estimation and is future work. Do not rely on the indices when inputs are strongly correlated.

How gsax computes them

gsax computes Shapley effects analytically from a fitted surrogate's variance decomposition — no permutation Monte Carlo, no conditional-variance sampling, and no external shap dependency:

backend="pce" (default) fits a polynomial chaos expansion and groups the squared orthonormal coefficients by the support of their multi-index (Sudret, 2008) — exact within the fitted polynomial.
backend="hdmr" fits the RS-HDMR B-spline surrogate and uses the structural ( $S_{a}$ ) variances of its component functions as the partial variances $V_{u}$ , truncated at maxorder.

Both backends accept scalar (N,), multi-output (N, K), and time-series (N, T, K) Y.

Normalization is by the surrogate's total decomposed variance $\sum_{u} V_{u}$ , so $\sum_{i} {Sh}_{i} = 1$ exactly — the Shapley efficiency property (Owen, 2014). $S_{1}$ and $S_{T}$ from the same surrogate use the same denominator, so for backend="pce" they match analyze_pce exactly, while for backend="hdmr" they differ from analyze_hdmr (which normalizes by $Var (Y)$ ) by a factor of explained_variance.

How much of the output variance the surrogate actually captured is reported separately in the explained_variance field, $\sum_{u} V_{u} / Var (Y)$ : close to 1 for a good fit, below 1 when truncation or fit error leaves variance unexplained, and above 1 when an overfit surrogate over-counts shared variance — an honest diagnostic rather than a silently renormalized result. A UserWarning is emitted when it strays far from 1. Interactions above maxorder (HDMR) or the polynomial order (PCE) are absent from the allocation.

How to use it

You provide any set of $(X, Y)$ pairs — no sampling design required.
gsax.analyze_shapley() fits the selected surrogate backend ("pce" or "hdmr"), extracts its variance decomposition, and allocates each partial variance equally among the parameters in its interaction set.
The result carries Sh alongside S1 and ST computed from the same surrogate, so the three indices are directly comparable and the ordering $S_{1} \leq Sh \leq S_{T}$ is visible at a glance.

python

import jax.numpy as jnp
import gsax
from gsax.benchmarks.ishigami import PROBLEM, evaluate

X = gsax.sample_mc(PROBLEM, N=2000, seed=42)
Y = evaluate(jnp.asarray(X))

# PCE backend (default) — exact within the fitted polynomial
result = gsax.analyze_shapley(PROBLEM, jnp.asarray(X), Y)
print("Sh:", result.Sh)              # (D,) Shapley effects
print("sum:", result.Sh.sum())       # == 1 (Shapley efficiency property)
print("explained:", result.explained_variance)  # sum_u V_u / Var(Y) — fit quality
print("order:", result.order)        # effective surrogate order used
print("S1:", result.S1)              # first-order, same surrogate
print("ST:", result.ST)              # total-order, same surrogate

# HDMR backend — B-spline surrogate; HDMR-only knobs
result_hdmr = gsax.analyze_shapley(PROBLEM, jnp.asarray(X), Y, backend="hdmr", maxorder=2)

Backend-specific keyword arguments are validated: explicitly setting a knob that belongs to the non-selected backend (e.g. backend="pce" with maxorder=3) raises ValueError.

Index summary

Index	Meaning
$Sh (i)$	Shapley effect: parameter $i$ 's fair share of decomposed variance, including an equal split of every interaction it participates in. $\sum_{i} {Sh}_{i} = 1$ exactly (Shapley efficiency).
$S_{1} (i)$	First-order index from the same surrogate (main effect only).
$S_{T} (i)$	Total-order index from the same surrogate (main effect plus all interactions counted in full).
`explained_variance`	Fraction of $Var (Y)$ the surrogate captured, $\sum_{u} V_{u} / Var (Y)$ — a separate fit-quality diagnostic, not a per-parameter index.

When to use Shapley effects:

You want a single, fairly allocated importance score per parameter that sums to exactly 1 (e.g. for ranking, reporting, or budget allocation)
Interactions matter and you want them attributed to their participants rather than omitted ( $S_{1}$ ) or double-counted ( $S_{T}$ )
You have existing $(X, Y)$ pairs and want analytical indices without permutation Monte Carlo noise
Your inputs are independent (required in this version)

References

Owen, A.B. (2014). Sobol' indices and Shapley value. SIAM/ASA Journal on Uncertainty Quantification, 2(1), 245-251.
Song, E., Nelson, B.L. & Staum, J. (2016). Shapley effects for global sensitivity analysis: Theory and computation. SIAM/ASA Journal on Uncertainty Quantification, 4(1), 1060-1083.
Sudret, B. (2008). Global sensitivity analysis using polynomial chaos expansions. Reliability Engineering & System Safety, 93(7), 964-979.

eFAST (Extended Fourier Amplitude Sensitivity Test)

eFAST computes the same first-order and total-order Sobol indices as the Saltelli workflow, but through a frequency-based decomposition with a simpler sampling design of $N \times D$ evaluations. Pick it when you need $S_{1}$ and $S_{T}$ but not second-order indices. Instead of pick-and-freeze matrices, eFAST evaluates the model along sinusoidal search curves in the input space, then applies the discrete Fourier transform to extract variance contributions from the spectral content of the output.

How it works

For each parameter $i$ , eFAST constructs a search curve by assigning the highest frequency $ω_{0}$ to parameter $i$ (the "focal" parameter) and lower complementary frequencies $ω_{j}$ to all other parameters. The model is evaluated at $N$ points along each curve, yielding one output vector per parameter.

The Fourier power spectrum of the output along each curve is then decomposed:

First-order index — the fraction of total variance captured by harmonics of $ω_{0}$ :

S_{i} = \frac{D_{1}}{V} = \frac{\sum_{p = 1}^{M} | F_{p ω_{0}} |^{2}}{V}

where $V$ is the total variance (via Parseval's theorem) and $M$ is the interference factor controlling how many harmonics are summed.

Total-order index — the complement of the low-frequency (non-focal) variance:

S_{T_{i}} = 1 - \frac{D_{t}}{V} = 1 - \frac{\sum_{k \leq ⌊ ω_{0} / 2 ⌋} | F_{k} |^{2}}{V}

The low-frequency content at or below $⌊ ω_{0} / 2 ⌋$ is driven entirely by the complementary parameters' slower oscillations, so subtracting it from unity gives the total effect of the focal parameter including all its interactions.

How to use it

gsax.sample_efast() (or efast.sample()) generates search-curve samples with shape (N * D, D), where each contiguous block of $N$ rows corresponds to one parameter's search curve.
You evaluate your model on all N * D rows.
gsax.analyze_efast() (or efast.analyze()) splits the output by curve, computes the Fourier spectrum for each, and extracts $S_{1}$ and $S_{T}$ indices.

Index summary

Index	Meaning
$S_{1} (i)$	Fraction of output variance from the focal parameter's harmonics (main effect).
$S_{T} (i)$	Total effect including interactions, computed as $1 - D_{t} / V$ .

eFAST does not produce second-order ( $S_{2}$ ) interaction indices. If pairwise interactions are needed, use the Sobol workflow instead.

When to use eFAST:

You only need $S_{1}$ and $S_{T}$ (no $S_{2}$ required)
You want a simpler sampling design without the Saltelli cross-matrix structure
You are screening a large number of parameters
The total cost is $N \times D$ evaluations, which can be lower than Saltelli's $N (D + 2)$ (first/total only) or $N (2 D + 2)$ (with second-order, the default) when $N$ is chosen smaller than the Saltelli base count

Reference

Saltelli, A., Tarantola, S. & Chan, K.P.-S. (1999). A quantitative model-independent method for global sensitivity analysis of model output. Technometrics, 41(1), 39-56.

DGSM (Derivative-based Global Sensitivity Measures)

DGSM is the cheapest quantitative method when your model is JAX-differentiable: it uses exact gradients from automatic differentiation to compute bounds on the total Sobol index $S_{T}$ , at roughly the cost of one model evaluation per sample point. Pick it as a fast screening or sanity-check step before committing to a full Sobol' analysis — or use Morris (below) if your model is a black box.

The DGSM Moments

For a model $f (X)$ with $D$ inputs, DGSM computes two statistics for each parameter $i$ :

Mean squared derivative (importance measure):

ν_{i} = E [{(\frac{\partial f}{\partial X_{i}})}^{2}]

Mean derivative:

σ_{i} = E [\frac{\partial f}{\partial X_{i}}]

These moments are estimated from $N$ i.i.d. Monte Carlo samples. Because DGSM uses jax.jacrev (reverse-mode autodiff), the cost of computing the full Jacobian for all $D$ inputs in a single pass is comparable to a single model evaluation, making DGSM particularly efficient for high-dimensional problems.

Bounds on the Total Sobol Index

DGSM does not compute Sobol indices directly. Instead, it provides an upper bound and a lower bound on the total-order index $S_{T_{i}}$ .

Poincaré upper bound (Sobol' & Kucherenko, 2009):

S_{T_{i}} \leq \frac{C (p_{i}) \cdot ν_{i}}{Var (Y)}

where $C (p_{i})$ is the Poincaré constant of the $i$ -th input's marginal distribution.

Kucherenko–Song lower bound (Kucherenko & Song, 2016):

S_{T_{i}} \geq \frac{Var (X_{i}) \cdot σ_{i}^{2}}{Var (Y)}

When the upper and lower bounds are close, DGSM gives a tight bracket on $S_{T}$ without the cost of a full Sobol analysis.

Poincaré Constants by Distribution

The Poincaré constant depends on the marginal distribution of each input:

Distribution	Poincaré Constant $C$
Uniform $[a, b]$	$(b - a)^{2} / π^{2}$
Gaussian $N (μ, σ^{2})$	$σ^{2}$
Truncated Normal	Spectral solve (P1 finite-element Neumann eigenproblem)

For truncated normal inputs, the constant is computed numerically by solving a weighted eigenproblem on a finite-element grid. gsax handles this automatically when the input spec declares truncation bounds.

How to use it

gsax.sample_mc() generates plain Monte Carlo samples from the declared input distributions.
You pass your JAX-differentiable function and the samples to gsax.analyze_dgsm().
Internally, jax.jacrev computes the Jacobian via reverse-mode autodiff, and the DGSM moments and bounds are derived.
The returned DGSMResult contains nu, sigma, upper_bound, lower_bound, and var_y.

Alternatively, if the Jacobian has been computed externally (e.g. for non-JAX models), you can pass pre-computed Y and dfdx arrays directly.

Index summary

Field	Meaning
$ν_{i}$	Mean squared derivative: $E [(\partial f / \partial X_{i})^{2}]$ . Higher values indicate stronger influence.
$σ_{i}$	Mean derivative: $E [\partial f / \partial X_{i}]$ . Non-zero when the effect is non-symmetric.
Upper bound	Poincaré bound: $C_{i} \cdot ν_{i} / Var (Y)$ . Conservative upper bound on $S_{T}$ .
Lower bound	Kucherenko–Song bound: $Var (X_{i}) \cdot σ_{i}^{2} / Var (Y)$ . Guaranteed lower bound on $S_{T}$ .

When to use DGSM:

You have a JAX-differentiable model and want fast screening without the cost of Saltelli or eFAST sampling
You want bounds on $S_{T}$ rather than exact indices
You are screening a large number of parameters where autodiff is cheaper than structured designs
You want a quick sanity check before running a full Sobol analysis

References

Sobol', I.M. & Kucherenko, S. (2009). Derivative based global sensitivity measures and their link with global sensitivity indices. Mathematics and Computers in Simulation, 79(10), 3009-3017.
Kucherenko, S. & Song, S. (2016). Derivative-based global sensitivity measures and their link with Sobol' sensitivity indices. Reliability Engineering & System Safety, 148, 81-95.
Lamboni, M., Iooss, B., Popelin, A.-L. & Gamboa, F. (2013). Derivative-based global sensitivity measures: General links with Sobol' indices and numerical tests. Mathematics and Computers in Simulation, 87, 44-54.

Morris (Elementary Effects Screening)

Morris is a global screening method: with only $r (D + 1)$ model evaluations ( $r$ typically 10–50 trajectories), it ranks parameters and flags which ones are negligible. Pick it as a triage step for expensive black-box models — fix the parameters Morris rules out, then spend your remaining budget on an exact method like Sobol' for the survivors. Technically it is a globalized one-at-a-time (OAT) design: it measures coarse finite-difference effects of each input at many locations spread across the input domain, then summarises them into robust importance measures.

How it works

The design consists of $r$ trajectories, each a path of $D + 1$ points where consecutive points differ in exactly one coordinate. Each trajectory contributes one elementary effect per input — a finite-difference slope:

E E_{i} = \frac{f (x + Δ e_{i}) - f (x)}{Δ}

where $e_{i}$ is the unit vector along input $i$ and $Δ$ is the step in unit-cube coordinates. gsax implements two designs:

Trajectory design (Morris 1991, default): each trajectory is a random walk on a $p$ -level grid (num_levels, default 4) with the canonical step $Δ = p / (2 (p - 1))$ , visiting inputs in a random order.
Radial design (Campolongo et al. 2011, method="radial"): star designs around scrambled-Sobol' base points, where each elementary effect compares a one-coordinate swap against the shared base point with a per-step $Δ_{i} = b_{i} - a_{i}$ .

Both uniform and Gaussian marginals are supported. The design includes the unit-cube boundaries, which an unbounded inverse CDF would map to infinity, so each Gaussian coordinate is confined to $[q, 1 - q]$ (truncation_quantile, default $q = 0.005$ — the 0.5%–99.5% quantile range) before the inverse-CDF transform; uniform marginals are untouched, and deduplication and prefix-nesting are unaffected.

The $r$ elementary effects per input are reduced to three screening measures:

$μ_{i}$ — the mean elementary effect. Sign cancellation can mask non-monotonic influence, which is why $μ$ alone is unreliable.
$μ_{i}^{*}$ — the mean absolute elementary effect (Campolongo et al. 2007). This is the headline importance measure — read it as "how strongly does the output respond, on average, when this input moves?" — and a good proxy for the total-order index $S_{T}$ ranking.
$σ_{i}$ — the standard deviation of the elementary effects (ddof=1). A large $σ_{i}$ relative to $μ_{i}^{*}$ means the effect of input $i$ changes across the domain, indicating nonlinearity or interactions with other inputs.

The canonical output is the $μ^{*}$ – $σ$ scatter plot: parameters near the origin are negligible, parameters far along the $μ^{*}$ axis are influential, and parameters high above the diagonal act mainly through nonlinearity or interactions.

Morris is closely related to DGSM: as $Δ \to 0$ , $μ_{i}^{*} \to E | \partial f / \partial x_{i} |$ , so Morris is the black-box, macro-step analog of gsax's DGSM — use DGSM when the model is JAX-differentiable, Morris when it is not.

How to use it

gsax.sample_morris() builds the trajectories, removes exact duplicate rows (grid designs collide often in low dimensions, so this saves real model evaluations, just like Saltelli sampling), and returns only the unique rows.
You evaluate your model on sampling_result.samples.
gsax.analyze_morris() reconstructs the expanded design internally, drops trajectories containing non-finite values with a warning, and reduces one elementary effect per trajectory and parameter to $μ$ , $μ^{*}$ , and $σ$ . Pass num_resamples > 0 (with a JAX PRNG key) for bootstrap confidence intervals over trajectories.

Elementary effects are computed in unit-cube coordinates, so $μ^{*}$ is directly comparable across parameters regardless of their physical ranges; MorrisResult.to_physical_units() rescales to derivative-scale values in the problem's native units (uniform-marginal problems only — for Gaussian marginals the inverse-CDF transform is nonlinear, so the measures stay in grid coordinates). MorrisSamplingResult.downsample() prefix-slices to fewer trajectories without re-simulation, mirroring SamplingResult.downsample().

Compared to SALib's Morris implementation, gsax adds unique-row deduplication, vectorized multi-output and time-series analysis (SALib's Morris is scalar-only), bootstrap confidence intervals, the radial design, and prefix-nested downsampling.

Index summary

Measure	Meaning
$μ (i)$	Mean elementary effect. Sign cancellation can hide non-monotonic influence.
$μ^{*} (i)$	Mean absolute elementary effect. Headline importance measure; proxy for the $S_{T}$ ranking.
$σ (i)$	Standard deviation of the elementary effects. Large $σ / μ^{*}$ indicates nonlinearity or interactions.

When to use Morris:

You want a cheap screening pass before committing to a full Sobol' run
Your model is a black box (not JAX-differentiable — otherwise consider DGSM)
You have many parameters and a tight evaluation budget — the cost is $r (D + 1)$ with $r$ typically 10-50
You only need a ranking and an interaction flag, not exact variance fractions

References

Morris, M.D. (1991). Factorial sampling plans for preliminary computational experiments. Technometrics, 33(2), 161-174.
Campolongo, F., Cariboni, J. & Saltelli, A. (2007). An effective screening design for sensitivity analysis of large models. Environmental Modelling & Software, 22(10), 1509-1518.
Campolongo, F., Cariboni, J. & Saltelli, A. (2011). From screening to quantitative sensitivity analysis. A unified approach. Computer Physics Communications, 182(4), 978-988.
Saltelli, A. et al. (2008). Global Sensitivity Analysis: The Primer, ch. 3. Wiley.

HSIC (Hilbert–Schmidt Independence Criterion)

HSIC measures the statistical dependence between each input and the output — any dependence, including nonlinear, non-monotone, and heteroscedastic effects that variance-based indices can underweight. Pick it when you suspect your model's behaviour isn't well summarised by variance, when your inputs may be correlated, or when you want statistical significance tests attached to the indices. It works in a reproducing kernel Hilbert space (RKHS), mapping inputs and outputs through Gaussian RBF kernels. Like HDMR, it is a given-data method: any set of $(X, Y)$ pairs works, with no independence assumption on the inputs.

The HSIC Dependence Measure

Each input $X_{i}$ and the output $Y$ are passed through a characteristic kernel — a Gaussian RBF whose bandwidth is set automatically by the median heuristic (the median pairwise distance between sample points). Writing $K$ and $L$ for the two $N \times N$ kernel matrices, gsax uses the biased V-statistic estimator

\hat{HSIC} (X_{i}, Y) = \frac{1}{N^{2}} tr (K H L H), H = I - \frac{1}{N} 1 1^{⊤}

where $H$ is the centering matrix. For characteristic kernels, $HSIC (X_{i}, Y) = 0$ if and only if $X_{i}$ and $Y$ are independent, so a larger value signals stronger dependence.

First-Order and Total Indices

gsax reports two normalised indices per parameter.

R2-HSIC (first-order) — the normalised dependence between input $i$ and the output, in $[0, 1]$ ; read it as a kernel analogue of a squared correlation coefficient (centred kernel alignment):

R_{HSIC, i}^{2} = \frac{\hat{HSIC} (X_{i}, Y)}{\sqrt{\hat{HSIC} (X_{i}, X_{i}) \hat{HSIC} (Y, Y)}}

Total HSIC — the analogue of a total-order index, capturing dependence carried through interactions with the other inputs. It is built from augmented product kernels $k_{d}^{*} = 1 + k_{c, d}$ (Larsen & Alexanderian, 2026), where $k_{c, d}$ is the centred kernel for input $d$ . The constant term makes the product of augmented kernels capture all interaction orders rather than only the highest, which yields correct total indices even for purely additive models. The total index for input $i$ then follows from comparing the complement product kernel (all inputs except $i$ ) with the full product kernel.

Unlike Sobol indices, R2-HSIC values are individual dependence measures and do not sum to 1.

Permutation p-values

Because HSIC is a dependence measure rather than a variance fraction, gsax attaches a permutation test to each first-order index: the output labels are randomly shuffled n_perms times to build a null distribution of HSIC values, and the p-value uses the Phipson–Smyth correction $(c + 1) / (M + 1)$ , where $M$ is the number of permutations (n_perms) and $c$ counts permuted HSIC values at least as large as the observed one. A small p-value (< 0.05) indicates a statistically significant dependence between the input and the output.

How to use it

gsax.sample_mc() generates plain Monte Carlo samples — any sampling strategy works, since no structured design is required.
You evaluate your model on the samples.
gsax.analyze_hsic() transforms each input to $[0, 1]$ via its marginal CDF, builds the kernel matrices with the median heuristic, and computes all indices and p-values in a single JIT-compiled pass.

HSIC is $O (N^{2})$ in time and memory because it forms $N \times N$ kernel matrices; for large $N$ , pass chunk_size to build them in row blocks and limit peak memory. For outputs of large magnitude, set prenormalize=True to standardise $Y$ before kernel construction.

Index summary

Index	Meaning
$R_{HSIC}^{2} (i)$	Normalised first-order kernel dependence between input $i$ and the output, in $[0, 1]$ .
Total HSIC $(i)$	Total dependence of input $i$ including interactions, via augmented complement product kernels.
p-value $(i)$	Permutation p-value for the first-order dependence (Phipson–Smyth corrected).

When to use HSIC:

You want a measure that captures any dependence — nonlinear, non-monotone, or heteroscedastic — not just variance contributions
Your inputs may be correlated (HSIC makes no independence assumption)
You have existing $(X, Y)$ pairs and want indices without additional model runs
You want statistical significance testing via permutation p-values

References

Gretton, A., Herbrich, R., Smola, A., Bousquet, O. & Schölkopf, B. (2005). Kernel methods for measuring independence. Journal of Machine Learning Research, 6, 2075-2129.
Da Veiga, S. (2015). Global sensitivity analysis with dependence measures. Reliability Engineering & System Safety, 142, 346-362.
Larsen and Alexanderian (2026). Total HSIC sensitivity indices via augmented product kernels. arXiv preprint arXiv:2603.00849.

PAWN (CDF-Based Sensitivity)

PAWN asks a different question from the variance-based methods: not "how much variance does this input explain?" but "how much does the entire output distribution shift when this input is held fixed?". Pick it when you care about tails, skewness, or other distributional features that variance misses. It compares the unconditional output CDF against conditional CDFs obtained by fixing each input within a bin, using the Kolmogorov–Smirnov (KS) distance as the measure of separation (Pianosi & Wagener, 2015). Like HSIC and HDMR, it is a given-data method: any $(X, Y)$ pairs work, with no independence assumption on the inputs.

The KS Distance

For parameter $i$ , its range is partitioned into n_bins equal-width bins. Within each bin $b$ , PAWN forms the conditional output CDF $F_{Y ∣ X_{i} \in b}$ from the samples whose $i$ -th input falls in that bin, and compares it with the unconditional CDF $F_{Y}$ (built from all samples) via the Kolmogorov–Smirnov statistic — the largest absolute gap between the two CDFs:

{KS}_{i, b} = sup_{y} | F_{Y} (y) - F_{Y ∣ X_{i} \in b} (y) |

A large KS value in a bin means fixing $X_{i}$ there substantially changes the output distribution; a value near zero means the output is insensitive to that input over that region.

Aggregating Across Bins

Each parameter yields one KS value per bin. The PAWN index reduces these to a single number per input using one of three statistics:

median (default) — robust to a single anomalous bin.
max — the worst-case shift across the input range.
mean — the average shift.

Because it is built on CDFs rather than moments, the PAWN index is moment-independent and invariant under monotone transformations of the output — it captures tail and skewness changes that variance-based indices miss.

How to use it

gsax.sample_mc() generates plain Monte Carlo samples (Monte Carlo, Latin Hypercube, or Sobol sequences all work — no structured design required).
You evaluate your model on the samples.
gsax.analyze_pawn() maps each input to $[0, 1]$ , assigns samples to bins, and computes the per-bin KS distances and their aggregate in a single JIT-compiled pass. Pass n_bootstrap > 0 for bootstrap confidence intervals.

The number of bins (n_bins, default 10) trades conditioning resolution against sample density per bin; with very few samples per bin the KS statistic becomes noisy, so increase $N$ or decrease n_bins.

Index summary

Index	Meaning
PAWN $(i)$	Aggregated (median / max / mean) KS distance between the unconditional and conditional output CDFs for input $i$ , in $[0, 1]$ . Higher means stronger influence on the output distribution.

When to use PAWN:

You care about distributional changes beyond variance, such as tail behaviour or skewness shifts
You want a moment-independent index, invariant under monotone output transforms
You have existing $(X, Y)$ pairs from any sampling strategy
Your inputs may be correlated (no independence assumption or structured design)

Reference

Pianosi, F. & Wagener, T. (2015). A simple and efficient method for global sensitivity analysis based on cumulative distribution functions. Environmental Modelling & Software, 67, 1-11.

Borgonovo Delta (Density-Based Sensitivity)

Borgonovo's $δ$ index is the second moment-independent method in gsax, and the natural companion to PAWN: where PAWN summarises a distributional shift by the largest gap between CDFs, $δ$ measures the expected L1 distance between the entire output density and the output density conditional on an input (Borgonovo, 2007). Pick it when you want a distribution-based index on a fixed $[0, 1]$ scale, or as a drop-in, faster replacement for SALib.analyze.delta:

δ_{i} = \frac{1}{2} E_{X_{i}} [\int | f_{Y} (y) - f_{Y ∣ X_{i}} (y) | d y]

The index is $0$ when fixing $X_{i}$ never changes the output distribution, and $1$ when the output is a deterministic function of $X_{i}$ alone. Because it compares whole densities rather than variances, $δ$ captures influence carried through tails, skewness, or multimodality that variance-based indices underweight, and it is invariant under monotone transformations of the output. Like HSIC and PAWN, it is a given-data method: any $(X, Y)$ pairs work, with no independence assumption on the inputs.

How it works

gsax implements the given-data estimator of Plischke, Borgonovo & Smith (2013):

For each input, the samples are ordered by that input's rank and split into $M$ equal-frequency classes. By default $M$ follows the Plischke sample-size heuristic (roughly $N^{2 / 7}$ , at most 48 classes); override it with n_classes.
The unconditional density $f_{Y}$ and each class-conditional density $f_{Y ∣ X_{i} \in C_{m}}$ are estimated by Gaussian KDE with Silverman bandwidths on a fixed grid of grid_size points spanning $[min Y, max Y]$ .
The L1 distances are integrated with the trapezoid rule and averaged with class weights, giving the plug-in estimate

{\hat{δ}}_{i} = \sum_{m = 1}^{M} \frac{n_{m}}{2 N} \int | {\hat{f}}_{Y} (y) - {\hat{f}}_{Y ∣ X_{i} \in C_{m}} (y) | d y

The plug-in estimate is biased upward at finite $N$ , so by default gsax applies Plischke's bootstrap bias reduction $2 {\hat{δ}}_{i} - \overset{―}{{\hat{δ}}_{i}^{(b)}}$ over n_bootstrap resamples, with percentile confidence intervals from the same replicates. Because this correction subtracts a bootstrap mean from twice the plug-in estimate, the reported $δ$ (and its percentile-interval bounds) can fall marginally below $0$ for weak or near-noninfluential inputs at small $N$ , even though the true index and the plug-in estimate both lie in $[0, 1]$ .

The same class partition also yields the given-data first-order Sobol index (variance of the class means over the total variance) at negligible extra cost, so every analysis returns both $δ$ and $S_{1}$ .

The estimator matches SALib.analyze.delta (same equal-frequency rank partition, class-count heuristic, Silverman KDE factors, and 100-point output grid) with three differences: the central estimate is computed on the original sample — deterministic given the data, where SALib evaluates it on a random resample; a constant output column yields $δ = S_{1} = 0$ instead of an error; and a bootstrap replicate that happens to be constant (reachable for rare-event outputs) contributes the point estimate rather than a spurious zero, where SALib raises LinAlgError.

How to use it

gsax.sample_mc() generates plain Monte Carlo samples (any sampling strategy works — no structured design is required).
You evaluate your model on the samples.
gsax.analyze_borgonovo() partitions each input into rank classes and computes $δ$ , $S_{1}$ , and their bootstrap intervals in a single JIT-compiled kernel, vmapped over output columns and scanned over bootstrap replicates.

Set n_bootstrap=0 to skip bias correction and confidence intervals (raw plug-in estimate), or bias_correct=False to keep the intervals but report the uncorrected estimate. For large time-series outputs, lower chunk_size to bound peak memory, which scales with chunk_size * D * N * grid_size.

Index summary

Index	Meaning
$δ (i)$	Expected L1 distance between the unconditional and conditional output densities for input $i$ , in $[0, 1]$ . Higher means stronger influence on the output distribution; $0$ means no influence at all.
$S_{1} (i)$	Given-data first-order Sobol index from the same class partition — the variance-based view of the same conditioning, for comparison at no extra cost.

When to use Borgonovo delta:

You care about influence on the whole output distribution — tails, skewness, multimodality — not just variance
You want a moment-independent index with a fixed $[0, 1]$ scale, invariant under monotone output transforms
You have existing $(X, Y)$ pairs from any sampling strategy, possibly with correlated inputs
You use SALib.analyze.delta and want a deterministic, JIT-compiled equivalent that also handles multi-output and time-series Y

Reference

Borgonovo, E. (2007). A new uncertainty importance measure. Reliability Engineering & System Safety, 92(6), 771-784.
Plischke, E., Borgonovo, E. & Smith, C.L. (2013). Global sensitivity measures from given data. European Journal of Operational Research, 226(3), 536-550.

Output Shapes

All ten methods share the same output contract: scalar, multi-output, and time-series outputs. The shape of Y determines the shape of all returned index arrays (read S1 / ST as the method's per-parameter measures — mu / mu_star / sigma for Morris, nu / sigma and the bounds for DGSM; only Sobol and PCE produce S2):

Y shape	S1 / ST shape	S2 shape
`(N,)`	`(D,)`	`(D, D)`
`(N, K)`	`(K, D)`	`(K, D, D)`
`(N, T, K)`	`(T, K, D)`	`(T, K, D, D)`

D is always the last axis. Confidence interval arrays (when using bootstrap) prepend a leading dimension of 2 for [lower, upper].

How a 2-D Y is read depends on problem.output_names. Without it, a 2-D Y is always (N, K) — multiple outputs, no time dimension. With exactly one entry in output_names and more than one column, a 2-D (N, M) Y is read as M timepoints of that single labeled output and flows through as (N, M, 1), keeping the labeled output axis in results; a lone column (N, 1) stays a scalar output (N, K=1) (pass (N, 1, 1) explicitly for a genuine 1-timepoint series). With several entries, the column count must equal len(output_names). A 1-D (N,) Y is one output regardless of how many names are declared.

You need not pass exactly the canonical layout: every public entry point resolves Y through the same inference ladder. Exact canonical shapes pass silently; unambiguously recoverable layouts — a transposed (K, N) array, or a 3-D (N, K, T) array whose middle axis matches len(output_names) — are fixed with a UserWarning naming the transformation; ambiguous layouts raise. gsax never guesses.

Time-series outputs are particularly useful for dynamic models, where the evolution of sensitivity indices over time can reveal which parameters dominate at different stages of a process — for example, a parameter that is highly influential early in a batch but negligible later.

Data Cleaning

gsax.analyze() automatically drops sample groups that contain non-finite values (NaN, Inf). The Saltelli layout requires groups of rows to stay together, so if any row in a group is non-finite, the entire group is removed. A message is printed when this happens. The nan_counts field on the result reports how many NaN values remain in the computed indices.

References

Sobol', I.M. (2001). Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Mathematics and Computers in Simulation, 55(1-3), 271-280.
Saltelli, A. (2002). Making best use of model evaluations to compute sensitivity indices. Computer Physics Communications, 145(2), 280-297.
Saltelli, A., Annoni, P., Azzini, I., Campolongo, F., Ratto, M., & Tarantola, S. (2010). Variance based sensitivity analysis of model output. Computer Physics Communications, 181(2), 259-270.
Jansen, M.J.W. (1999). Analysis of variance designs for model output. Computer Physics Communications, 117(1-2), 35-43.
Li, G., Rabitz, H., Yelvington, P.E., Oluwole, O.O., Bacon, F., Kolb, C.E., & Schoendorf, J. (2010). Global sensitivity analysis for systems with independent and/or correlated inputs. Journal of Physical Chemistry A, 114(19), 6022-6032.
Rabitz, H. & Alis, O. (1999). General foundations of high-dimensional model representations. Journal of Mathematical Chemistry, 25(2-3), 197-233.
Sudret, B. (2008). Global sensitivity analysis using polynomial chaos expansions. Reliability Engineering & System Safety, 93(7), 964-979.
Owen, A.B. (2014). Sobol' indices and Shapley value. SIAM/ASA Journal on Uncertainty Quantification, 2(1), 245-251.
Song, E., Nelson, B.L. & Staum, J. (2016). Shapley effects for global sensitivity analysis: Theory and computation. SIAM/ASA Journal on Uncertainty Quantification, 4(1), 1060-1083.
Saltelli, A., Tarantola, S. & Chan, K.P.-S. (1999). A quantitative model-independent method for global sensitivity analysis of model output. Technometrics, 41(1), 39-56.

Methods ​

Choosing a Method ​

Comparison table ​

Background: Variance-Based Sensitivity Analysis ​

Why Global Sensitivity Analysis? ​

The Hoeffding–Sobol' Decomposition ​

Sobol' Sensitivity Indices ​

Sobol' Indices via Saltelli Sampling ​

The Pick-Freeze Sampling Scheme ​

Estimators ​

How to use it ​

Index summary ​

RS-HDMR (Random Sampling High-Dimensional Model Representation) ​

Theoretical Background ​

ANCOVA Decomposition ​

How to use it ​

Index summary ​

PCE (Polynomial Chaos Expansion) ​

How to use it ​

Shapley Effects ​

Theoretical Background ​

How gsax computes them ​

How to use it ​

Index summary ​

References ​

eFAST (Extended Fourier Amplitude Sensitivity Test) ​

How it works ​

How to use it ​

Index summary ​

Reference ​

DGSM (Derivative-based Global Sensitivity Measures) ​

The DGSM Moments ​

Bounds on the Total Sobol Index ​

Poincaré Constants by Distribution ​

How to use it ​

Index summary ​

References ​

Morris (Elementary Effects Screening) ​

How it works ​

How to use it ​

Index summary ​

References ​

HSIC (Hilbert–Schmidt Independence Criterion) ​

The HSIC Dependence Measure ​

First-Order and Total Indices ​

Permutation p-values ​

How to use it ​

Index summary ​

References ​

PAWN (CDF-Based Sensitivity) ​

The KS Distance ​

Aggregating Across Bins ​

How to use it ​

Index summary ​

Reference ​

Borgonovo Delta (Density-Based Sensitivity) ​

How it works ​

How to use it ​

Index summary ​

Reference ​

Output Shapes ​

Data Cleaning ​

References ​

Methods

Choosing a Method

Comparison table

Background: Variance-Based Sensitivity Analysis

Why Global Sensitivity Analysis?

The Hoeffding–Sobol' Decomposition

Sobol' Sensitivity Indices

Sobol' Indices via Saltelli Sampling

The Pick-Freeze Sampling Scheme

Estimators

How to use it

Index summary

RS-HDMR (Random Sampling High-Dimensional Model Representation)

Theoretical Background

ANCOVA Decomposition

How to use it

Index summary

PCE (Polynomial Chaos Expansion)

How to use it

Shapley Effects

Theoretical Background

How gsax computes them

How to use it

Index summary

References

eFAST (Extended Fourier Amplitude Sensitivity Test)

How it works

How to use it

Index summary

Reference

DGSM (Derivative-based Global Sensitivity Measures)

The DGSM Moments

Bounds on the Total Sobol Index

Poincaré Constants by Distribution

How to use it

Index summary

References

Morris (Elementary Effects Screening)

How it works

How to use it

Index summary

References

HSIC (Hilbert–Schmidt Independence Criterion)

The HSIC Dependence Measure

First-Order and Total Indices

Permutation p-values

How to use it

Index summary

References

PAWN (CDF-Based Sensitivity)

The KS Distance

Aggregating Across Bins

How to use it

Index summary

Reference

Borgonovo Delta (Density-Based Sensitivity)

How it works

How to use it

Index summary

Reference

Output Shapes

Data Cleaning

References