Formula sheet

Uncertainty Quantification and Sensitivity Analysis Formula Sheet

Formula sheet for uncertainty propagation, correlated inputs, Monte Carlo estimates, sensitivity indices, failure probability, guards and validation evidence.

This formula sheet collects uncertainty quantification and sensitivity analysis formulas used to turn model outputs, measurements and simulations into engineering decisions. Use it when a nominal result is not enough: the question is whether uncertainty can change the action.

The formulas are most useful when the decision boundary is explicit. State the output metric, acceptance limit, input evidence, correlation assumptions, validation basis and action if the uncertainty is larger than the margin.

Symbols and Conventions

Common symbols:

SymbolMeaning
x_iuncertain input
ymodel output or decision metric
f(x)model mapping inputs to output
\mu_xmean or expected value of input x
\sigma_xstandard deviation of input x
u_xstandard uncertainty of input x
u_ystandard uncertainty of output y
\Sigma_xinput covariance matrix
\nabla fgradient of the model output with respect to inputs
NMonte Carlo sample count
P_fprobability of failure
g(x)limit-state function; positive is usually safe
S_ifirst-order variance-based sensitivity index
S_{T_i}total-effect sensitivity index

Keep units attached to physical quantities. Sensitivity coefficients may have units; normalized sensitivities and variance fractions are dimensionless.

Deterministic Margin

For an upper-bound requirement:

\displaystyle y\le y_{\lim}

the deterministic margin is:

\displaystyle m=y_{\lim}-y.

For a lower-bound requirement:

y\ge y_{\min}

the deterministic margin is:

m=y-y_{\min}.

If m is small compared with output uncertainty, the nominal pass is not stable.

Decision Guard

A simple guarded upper-bound rule is:

\displaystyle y+k u_y \le y_{\lim}

where k is a coverage or engineering guard factor. The margin after uncertainty is:

\displaystyle m_g=y_{\lim}-(y+k u_y).

If m_g<0, the design should be held, redesigned, tested further or justified with stronger evidence.

Mean, Variance and Standard Uncertainty

For sampled input data:

\displaystyle \bar{x}=\frac{1}{n}\sum_{i=1}^{n}x_i

and:

\displaystyle s_x^2=\frac{1}{n-1}\sum_{i=1}^{n}(x_i-\bar{x})^2.

The standard deviation is:

s_x=\sqrt{s_x^2}.

The standard uncertainty of a mean estimate is:

\displaystyle u_{\bar{x}}=\frac{s_x}{\sqrt{n}}.

Do not confuse spread of individual parts or operating cases with uncertainty in the estimated mean. They answer different engineering questions.

Standardization and Z-Score

For a normally distributed or approximately standardized quantity:

\displaystyle z=\frac{x-\mu}{\sigma}.

For a limit x_{\lim}:

\displaystyle z_{\lim}=\frac{x_{\lim}-\mu}{\sigma}.

A larger positive z_{\lim} means the limit is farther above the mean in standard-deviation units. This is useful for quick screening, but it depends on the distribution assumption.

Local Linear Propagation

For:

y=f(x_1,x_2,\ldots,x_n),

the first-order approximation is:

\displaystyle \Delta y\approx \sum_i \frac{\partial f}{\partial x_i}\Delta x_i.

For independent inputs:

\displaystyle u_y^2\approx \sum_i \left(\frac{\partial f}{\partial x_i}u_{x_i}\right)^2.

The contribution from input i is:

\displaystyle c_i^2=\left(\frac{\partial f}{\partial x_i}u_{x_i}\right)^2.

The fractional contribution is:

\displaystyle F_i=\frac{c_i^2}{u_y^2}.

This identifies which input dominates the output uncertainty near the nominal point.

Matrix Covariance Propagation

Let:

J=\nabla f=\begin{bmatrix} \dfrac{\partial f}{\partial x_1} & \dfrac{\partial f}{\partial x_2} & \cdots & \dfrac{\partial f}{\partial x_n} \end{bmatrix}.

With input covariance matrix \Sigma_x:

u_y^2\approx J\Sigma_xJ^T.

For multiple outputs y=f(x) with Jacobian J:

\Sigma_y\approx J\Sigma_xJ^T.

Use the matrix form when inputs are correlated or when several outputs share the same uncertain inputs.

Two-Input Correlation

For two inputs:

u_y^2\approx a^2u_x^2+b^2u_z^2+2ab\operatorname{cov}(x,z),

where:

\displaystyle a=\frac{\partial y}{\partial x},\qquad b=\frac{\partial y}{\partial z}.

Using correlation coefficient \rho_{xz}:

\operatorname{cov}(x,z)=\rho_{xz}u_xu_z.

Positive correlation increases uncertainty when the two sensitivities have the same sign. It can reduce uncertainty when the sensitivities have opposite signs.

Relative Uncertainty for Products and Powers

For a product:

y=A x^a z^b

with independent uncertainties, the relative standard uncertainty is approximately:

\displaystyle \left(\frac{u_y}{|y|}\right)^2\approx a^2\left(\frac{u_x}{|x|}\right)^2+ b^2\left(\frac{u_z}{|z|}\right)^2.

For a general product:

y=C\prod_i x_i^{a_i},

the relative form is:

\displaystyle \left(\frac{u_y}{|y|}\right)^2\approx \sum_i a_i^2\left(\frac{u_{x_i}}{|x_i|}\right)^2.

This is useful for first-pass sizing equations, power laws, heat-transfer correlations and material indices. It fails near zero or when inputs are strongly nonlinear over the uncertainty range.

Interval Propagation by Bounds

If only bounds are defensible:

x_i\in [x_{i,\min},x_{i,\max}],

then evaluate:

y_{\min}=\min_{x\in \mathcal{X}} f(x),

and:

y_{\max}=\max_{x\in \mathcal{X}} f(x).

For monotonic models, the extrema occur at interval endpoints. For non-monotonic or constrained models, endpoint checking may miss the true worst case.

Coverage Interval

For approximately normal output uncertainty:

y\pm k u_y.

Common first-pass values are:

k\approx 1 \quad \text{for about } 68\%,

and:

k\approx 2 \quad \text{for about } 95\%.

The coverage factor should match the consequence of the decision, the distribution shape and whether u_y is well supported by evidence.

Monte Carlo Propagation

For sampled inputs:

x^{(k)}=\left(x_1^{(k)},x_2^{(k)},\ldots,x_n^{(k)}\right),

evaluate:

y^{(k)}=f\left(x^{(k)}\right),\qquad k=1,\ldots,N.

The sample mean is:

\displaystyle \bar{y}=\frac{1}{N}\sum_{k=1}^{N}y^{(k)}.

The sample standard deviation is:

\displaystyle s_y=\sqrt{\frac{1}{N-1}\sum_{k=1}^{N}(y^{(k)}-\bar{y})^2}.

Monte Carlo is appropriate when the model is nonlinear, threshold-based, discontinuous, correlated or too complex for closed-form propagation.

Monte Carlo Exceedance Probability

For an upper limit:

\displaystyle y\le y_{\lim},

define an indicator:

\displaystyle I^{(k)}= \begin{cases} 1,& y^{(k)}>y_{\lim}\\ 0,& y^{(k)}\le y_{\lim}. \end{cases}

The estimated exceedance probability is:

\displaystyle \hat{P}_{ex}=\frac{1}{N}\sum_{k=1}^{N}I^{(k)}.

Approximate sampling standard error is:

\displaystyle SE_{\hat{P}}\approx \sqrt{\frac{\hat{P}_{ex}(1-\hat{P}_{ex})}{N}}.

Use this uncertainty when comparing a simulated failure probability with a release threshold.

Percentiles from Simulation

The pth percentile of simulated output is:

y_p=\operatorname{quantile}(y,p).

For a guarded upper-bound release:

\displaystyle y_{0.95}\le y_{\lim}

or:

\displaystyle y_{0.99}\le y_{\lim}

may be more meaningful than checking only the mean. The chosen percentile must be tied to the consequence and requirement.

Monte Carlo Convergence

For the sample mean:

\displaystyle SE_{\bar{y}}=\frac{s_y}{\sqrt{N}}.

For a target half-width h using multiplier z:

\displaystyle N\gtrsim \left(\frac{z s_y}{h}\right)^2.

This formula estimates sample count for a mean. Tail percentiles and rare failures usually require many more samples or specialized methods.

One-at-a-Time Sensitivity

For a baseline input x_i and perturbation \Delta x_i:

\displaystyle S_i^{OAT}=\frac{f(x_i+\Delta x_i)-f(x_i-\Delta x_i)}{2\Delta x_i}.

The normalized sensitivity is:

\displaystyle \tilde{S}_i=\frac{x_i}{y}\frac{\partial y}{\partial x_i}.

This elasticity tells the percent change in output for a percent change in input near the baseline. It is local and can miss interactions.

Finite-Difference Sensitivity with Noise

Forward difference:

\displaystyle \frac{\partial y}{\partial x_i}\approx \frac{f(x_i+\Delta x_i)-f(x_i)}{\Delta x_i}.

Central difference:

\displaystyle \frac{\partial y}{\partial x_i}\approx \frac{f(x_i+\Delta x_i)-f(x_i-\Delta x_i)}{2\Delta x_i}.

Choose \Delta x_i large enough to overcome numerical noise and small enough to represent the local response. In simulation models, sensitivity can be corrupted by solver tolerance, mesh changes, event switches or discontinuities.

Regression-Based Sensitivity

For a fitted linear surrogate:

y=\beta_0+\sum_i \beta_i x_i+\epsilon,

a standardized coefficient is:

\displaystyle \beta_i^*=\beta_i\frac{s_{x_i}}{s_y}.

Large |\beta_i^*| indicates a strong linear effect in the sampled range. Regression coefficients are not reliable when inputs are strongly collinear or when the response is nonlinear without an adequate surrogate.

Variance-Based Sensitivity

For independent inputs, a first-order sensitivity index is:

\displaystyle S_i=\frac{\operatorname{Var}\left(\operatorname{E}[Y\mid X_i]\right)}{\operatorname{Var}(Y)}.

It measures the fraction of output variance explained by input X_i alone.

The total-effect index is:

\displaystyle S_{T_i}=1-\frac{\operatorname{Var}\left(\operatorname{E}[Y\mid X_{\sim i}]\right)}{\operatorname{Var}(Y)}.

It includes direct effects and interactions involving X_i. If S_{T_i} is much larger than S_i, interactions are important.

Limit-State Reliability

Define a limit-state function:

g(x)>0 \quad \text{safe}

and:

g(x)\le 0 \quad \text{failure}.

The probability of failure is:

P_f=P(g(X)\le 0).

From Monte Carlo:

\displaystyle \hat{P}_f=\frac{N_f}{N},

where N_f is the number of failed samples.

For a normally distributed safety margin g:

\displaystyle \beta=\frac{\mu_g}{\sigma_g},

and:

P_f=\Phi(-\beta),

where \Phi is the standard normal cumulative distribution function.

Chance Constraint

A probabilistic requirement may be written:

P(g(X)\ge 0)\ge 1-\alpha.

Equivalently:

P_f\le \alpha.

For an upper-bound output:

\displaystyle P(Y\le y_{\lim})\ge 1-\alpha.

Chance constraints should not be used unless the input distributions and tail behavior are credible enough for the consequence.

Robust Objective

A simple risk-aware objective can penalize spread:

J_{robust}=\operatorname{E}[J(X)]+k\sigma_J.

For a lower-is-better objective, this discourages designs that look good only at nominal inputs. For a performance metric that must be high, the conservative form may be:

P_{robust}=\operatorname{E}[P(X)]-k\sigma_P.

The value of k should be a policy or requirement choice, not a tuning knob chosen after seeing the result.

Model Residual Metrics

For validation data y_i and model predictions \hat{y}_i:

r_i=y_i-\hat{y}_i.

Mean bias:

\displaystyle \bar{r}=\frac{1}{n}\sum_i r_i.

Root mean square error:

\displaystyle RMSE=\sqrt{\frac{1}{n}\sum_i r_i^2}.

Normalized residual:

\displaystyle z_i=\frac{r_i}{u_i},

where u_i is the expected standard uncertainty for the comparison. Large structured residuals indicate model-form error, sensor bias, missing physics or operating-range drift.

Coverage of Prediction Intervals

If a model reports prediction interval [L_i,U_i], empirical coverage is:

\displaystyle \hat{C}=\frac{1}{n}\sum_i I(L_i\le y_i\le U_i).

For a nominal 95\% interval, measured coverage far below 95\% means the uncertainty model is overconfident or the validation data are outside the intended envelope.

Evidence-Weighted Decision Record

A compact uncertainty decision record should include:

ItemFormula or check
Nominal marginm=y_{\lim}-y or m=y-y_{\min}
Combined uncertaintyu_y^2\approx J\Sigma_xJ^T
Guarded marginm_g=y_{\lim}-(y+k u_y)
Failure probability\hat{P}_f=N_f/N
Dominant inputlargest F_i or S_{T_i}
Validation residualRMSE, \bar{r}, normalized residuals
Actionrelease, hold, test, redesign, monitor or retire model

The action should follow from the decision metric, not from the fact that an analysis was performed.

Practical Checklist

Before accepting an uncertainty or sensitivity analysis, check:

  1. The engineering decision and acceptance threshold are stated.
  2. Input uncertainty evidence is named: measurement, supplier data, field data, test, judgement or standard.
  3. Units, distributions, intervals and correlations are explicit.
  4. Local propagation is used only where linearization is credible.
  5. Monte Carlo convergence is checked on the decision metric, not only the mean.
  6. Sensitivity ranks the quantity that controls the decision.
  7. Tail probabilities and percentiles have enough evidence for the consequence.
  8. Validation residuals are independent of calibration data when possible.
  9. Model-form uncertainty is not hidden inside arbitrary input spread.
  10. The decision record states what new evidence would change the conclusion.

Common Mistakes

  • Reporting a nominal result without comparing uncertainty to the decision margin.
  • Assuming independence when inputs share calibration, supplier lot, environment or operating mode.
  • Adding probability distributions that have no evidence basis.
  • Trusting Monte Carlo sample count without checking convergence of tails or failures.
  • Running sensitivity analysis on a convenient output instead of the release metric.
  • Treating RSS propagation as valid through discontinuities, saturations or threshold logic.
  • Using a robust objective after tuning k to prefer a desired design.
  • Validating against the same data used for calibration and calling it independent evidence.
  • Reporting percentiles without stating distribution assumptions and sample uncertainty.
  • Letting an early concept model become production evidence without a model-authority boundary.

The central habit is to make uncertainty decision-focused. The calculation should reveal whether the engineering action is stable, what input can change it, and what evidence would reduce risk most efficiently.

REF

See also