Exercise set

Uncertainty Quantification and Sensitivity Analysis Exercises

Solved uncertainty and sensitivity exercises for propagation, correlation, Monte Carlo, percentiles, failure probability, guards and validation.

Branch: Mathematical Engineering
Content: Exercise set
Updated: Jul 03, 2026
Revision: v1.0.1 · published

These exercises practise uncertainty quantification and sensitivity analysis as engineering decision tools. Each problem converts uncertainty, variability, sampling evidence or sensitivity into a release, hold, redesign or validation decision.

The examples are simplified, but the habit is realistic: state the limit, quantify uncertainty on the decision metric, identify what drives the conclusion, and decide whether the evidence is strong enough for the consequence.

How to use these exercises

For each problem, compute the requested uncertainty metric and then decide what it means for the engineering action. A result is not complete until the pass/fail boundary, assumptions and residual risk are explicit.

Use local propagation for transparent first-pass checks, Monte Carlo for nonlinear or threshold-driven cases, and validation residuals when the model itself may be wrong.

Release Evidence Notes

Uncertainty and sensitivity evidence should state the model output, decision limit, input distributions, correlation assumptions, sample size, convergence criterion, validation data and consequence of a wrong decision. A percentile or failure probability is not useful unless the input evidence and model boundary are defensible.

Sensitivity evidence should identify which inputs drive the decision and which ones are merely convenient to vary. A high sensitivity can call for better measurement, tighter supplier control, more conservative design margin or a model update; a low sensitivity should not be used to ignore an input outside the validated range.

Engineering Boundary Notes

These exercises use simplified propagation, Monte Carlo and sensitivity calculations. They do not replace a full uncertainty management plan, reliability analysis, model validation, probability elicitation, calibration of input distributions or risk acceptance review. Simulation precision does not compensate for weak input assumptions or an invalid model.

Common Release Mistakes

reporting many Monte Carlo samples without checking convergence or input evidence;
treating correlated inputs as independent when they share a cause or source;
quoting a percentile without stating the confidence, sample size and model limit;
using local linear propagation across a nonlinear or threshold region;
focusing on nominal sensitivity while the release decision is controlled by tail risk;
accepting a model because uncertainty is small even though validation residuals are large.

Scenario Map

Scenario	Main calculation	Engineering decision
Guarded margin	nominal value plus uncertainty guard	Decide whether a nominal pass remains a release pass.
Propagation	derivatives, RSS and covariance	Identify uncertainty contributors and correlation effects.
Monte Carlo	exceedance, percentile and convergence	Decide whether simulated evidence is stable enough.
Sensitivity	finite differences, normalized effects and variance indices	Prioritize which input needs better evidence.
Reliability	limit-state margin and failure probability	Compare risk with an acceptance threshold.
Validation	residuals, coverage and evidence gates	Hold a model when uncertainty claims are not supported.

Validation Package Checklist

output metric, limit state and decision consequence are defined;
input distributions, correlations and evidence sources are recorded;
propagation method or Monte Carlo model is appropriate for the nonlinearity;
sample size, convergence and tail-estimate stability are checked;
dominant sensitivities and weak assumptions are named;
validation residuals, calibration data or independent checks support the model;
final decision states accept, hold, redesign, collect data or reduce uncertainty.

Exercise 1: Guarded Margin for a Temperature Limit

A component has predicted maximum temperature:

T=103.5\ \text{deg C}

with standard uncertainty:

u_T=2.1\ \text{deg C}.

The release limit is:

\displaystyle T_{\lim}=110\ \text{deg C}.

Use a guard factor:

k=2.

Find the nominal margin, guarded temperature and guarded margin.

Solution

The nominal margin is:

\displaystyle m=T_{\lim}-T=110-103.5=6.5\ \text{deg C}.

The guarded temperature is:

T_g=T+k u_T=103.5+2(2.1)=107.7\ \text{deg C}.

The guarded margin is:

\displaystyle m_g=T_{\lim}-T_g=110-107.7=2.3\ \text{deg C}.

The guarded check passes, but with limited remaining margin.

Engineering Comment

The nominal value alone looked comfortable. The uncertainty guard consumes most of the margin, so the release should require evidence that $u_T$ is credible for the actual board, enclosure and operating load.

Plausibility Check

The guard adds $4.2\ \text{deg C}$ to the nominal prediction. The original $6.5\ \text{deg C}$ margin should shrink to $2.3\ \text{deg C}$, which it does.
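The guarded-margin check can be sketched in a few lines of Python. This is a minimal illustration for an upper limit only; the function name `guarded_margin` and its argument names are assumptions made for this sketch, not part of the exercise.

```python
def guarded_margin(nominal, u, limit, k=2.0):
    """Return (nominal margin, guarded value, guarded margin) for an upper limit."""
    guarded = nominal + k * u          # push the prediction toward the limit
    return limit - nominal, guarded, limit - guarded

# Exercise 1 values: T = 103.5 deg C, u_T = 2.1 deg C, limit = 110 deg C, k = 2
m, t_g, m_g = guarded_margin(103.5, 2.1, 110.0, k=2.0)
passes = m_g >= 0.0                    # guarded release check
print(round(m, 2), round(t_g, 2), round(m_g, 2), passes)
```

Keeping the nominal and guarded margins side by side makes it visible how much of the margin the uncertainty guard consumes.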

Exercise 2: Local Propagation for a Product Model

A heat loss estimate uses:

Q=UA\Delta T.

Nominal values are:

U=35\ \text{W/(m}^2\,\text{K)},\qquad A=2.0\ \text{m}^2,\qquad \Delta T=18\ \text{K}.

Independent standard uncertainties are:

u_U=3\ \text{W/(m}^2\,\text{K)},\qquad u_A=0.05\ \text{m}^2,\qquad u_{\Delta T}=0.8\ \text{K}.

Find $Q$ and the standard uncertainty $u_Q$ using relative uncertainty propagation.

Solution

The nominal heat loss is:

Q=35(2.0)(18)=1260\ \text{W}.

For a product:

\displaystyle \left(\frac{u_Q}{Q}\right)^2= \left(\frac{u_U}{U}\right)^2+ \left(\frac{u_A}{A}\right)^2+ \left(\frac{u_{\Delta T}}{\Delta T}\right)^2.

Substitute:

\displaystyle \left(\frac{u_Q}{Q}\right)^2= \left(\frac{3}{35}\right)^2+ \left(\frac{0.05}{2.0}\right)^2+ \left(\frac{0.8}{18}\right)^2.

Calculate:

\displaystyle \left(\frac{u_Q}{Q}\right)^2= 0.00735+0.000625+0.00198\approx0.00995.

So:

\displaystyle \frac{u_Q}{Q}\approx0.0997.

The output standard uncertainty is:

u_Q=0.0997(1260)\approx126\ \text{W}.

Engineering Comment

The heat-transfer coefficient dominates because it has the largest relative uncertainty. Improving area measurement would not materially reduce the uncertainty unless $U$ is also better characterized.

Plausibility Check

The largest relative uncertainty is about $8.6\%$ from $U$, so a combined result near $10\%$ is plausible. Ten percent of $1260\ \text{W}$ is about $126\ \text{W}$.
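The relative propagation for a pure product of independent inputs can be checked numerically. The sketch below assumes the same independence and first-order conditions as the exercise; the helper name `product_relative_uncertainty` is hypothetical.

```python
import math

def product_relative_uncertainty(values, uncertainties):
    """Relative standard uncertainty of a product of independent inputs (root-sum-square)."""
    return math.sqrt(sum((u / v) ** 2 for v, u in zip(values, uncertainties)))

values = [35.0, 2.0, 18.0]    # U, A, delta T from Exercise 2
uncerts = [3.0, 0.05, 0.8]    # matching standard uncertainties

q = math.prod(values)                          # nominal heat loss in W
rel = product_relative_uncertainty(values, uncerts)
u_q = rel * q                                  # absolute standard uncertainty in W

# Share of the relative variance carried by each input
shares = [((u / v) ** 2) / rel ** 2 for v, u in zip(values, uncerts)]
print(q, round(rel, 4), round(u_q, 1), [round(s, 2) for s in shares])
```

The variance shares show directly why better area measurement would change little while $U$ remains poorly characterized.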

Exercise 3: Correlated Inputs in a Clearance Calculation

A clearance is calculated as:

C=X-Y.

The standard uncertainties are:

u_X=0.08\ \text{mm},\qquad u_Y=0.06\ \text{mm}.

The correlation coefficient is:

\rho_{XY}=0.75.

Find the standard uncertainty of $C$.

Solution

For $C=X-Y$, the sensitivities are:

\displaystyle \frac{\partial C}{\partial X}=1,\qquad \frac{\partial C}{\partial Y}=-1.

The covariance is:

\operatorname{cov}(X,Y)=\rho_{XY}u_Xu_Y =0.75(0.08)(0.06) =0.0036\ \text{mm}^2.

The propagated variance is:

u_C^2=u_X^2+u_Y^2-2\operatorname{cov}(X,Y).

Substitute:

u_C^2=0.08^2+0.06^2-2(0.0036) =0.0064+0.0036-0.0072 =0.0028\ \text{mm}^2.

Therefore:

u_C=\sqrt{0.0028}=0.0529\ \text{mm}.

Engineering Comment

Positive correlation reduces uncertainty in a difference. If both dimensions move together because they share a process setup, their difference can be more stable than either individual measurement.

Plausibility Check

If the variables were independent, the uncertainty would be $\sqrt{0.08^2+0.06^2}=0.10\ \text{mm}$. Positive correlation in a subtraction should reduce it, and $0.0529\ \text{mm}$ does.
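The same result follows from the general first-order formula $u_y^2=\sum_i\sum_j c_i c_j \rho_{ij} u_i u_j$. The sketch below is one way to write it, assuming a linear or locally linear model; the function name `propagate` is an assumption for illustration.

```python
import math

def propagate(sens, u, rho):
    """First-order propagation with correlation: u_y^2 = sum_i sum_j c_i c_j rho_ij u_i u_j."""
    n = len(sens)
    var = sum(sens[i] * sens[j] * rho[i][j] * u[i] * u[j]
              for i in range(n) for j in range(n))
    return math.sqrt(var)

sens = [1.0, -1.0]          # dC/dX, dC/dY for C = X - Y
u = [0.08, 0.06]            # standard uncertainties in mm

u_corr = propagate(sens, u, [[1.0, 0.75], [0.75, 1.0]])   # Exercise 3 correlation
u_indep = propagate(sens, u, [[1.0, 0.0], [0.0, 1.0]])    # independence for comparison
print(round(u_corr, 4), round(u_indep, 4))
```

Running both cases makes the effect of the correlation assumption explicit. For a sum $X+Y$ the sign of the second sensitivity flips, and the same positive correlation would increase the uncertainty instead.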

Exercise 4: Interval Bound for a Monotonic Stress Model

A simplified stress model is:

\displaystyle \sigma=\frac{F}{A}.

The force and area are bounded by:

F\in[9.5,10.8]\ \text{kN},

and:

A\in[78,82]\ \text{mm}^2.

Find the worst-case maximum stress.

Solution

Stress increases with force and decreases with area. The maximum occurs at maximum force and minimum area:

\displaystyle \sigma_{\max}=\frac{10.8\ \text{kN}}{78\ \text{mm}^2}.

Convert:

10.8\ \text{kN}=10{,}800\ \text{N}.

Since:

1\ \text{N/mm}^2=1\ \text{MPa},

the stress is:

\displaystyle \sigma_{\max}=\frac{10{,}800}{78}=138.5\ \text{MPa}.

Engineering Comment

Interval analysis is defensible when only bounds are known. Endpoint checking gives the exact worst case when the model is monotonic in each input, but non-monotonic models may need optimization over the interval rather than endpoint checking.

Plausibility Check

A $10\ \text{kN}$ load over about $80\ \text{mm}^2$ gives roughly $125\ \text{MPa}$. The worst-case value should be somewhat higher, so $138.5\ \text{MPa}$ is plausible.
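Endpoint checking can be automated by evaluating the model at every corner of the input box. This is a sketch that is only valid under the monotonicity assumption stated above; `corner_bounds` is a hypothetical helper name.

```python
from itertools import product

def corner_bounds(model, intervals):
    """Evaluate the model at every corner of the input box.

    Only gives true bounds if the model is monotonic in each input.
    """
    values = [model(*corner) for corner in product(*intervals)]
    return min(values), max(values)

def stress(force_n, area_mm2):
    return force_n / area_mm2          # N/mm^2 is numerically MPa

# Exercise 4 bounds, force converted from kN to N
sigma_min, sigma_max = corner_bounds(stress, [(9500.0, 10800.0), (78.0, 82.0)])
print(round(sigma_min, 1), round(sigma_max, 1))
```

The number of corners doubles with each input, so this brute-force approach suits only a handful of bounded inputs.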

Exercise 5: Monte Carlo Exceedance Probability

A Monte Carlo study evaluates $N=20{,}000$ samples of a design metric. The limit is exceeded in:

N_f=74

samples. Estimate the exceedance probability and its approximate sampling standard error.

Solution

The exceedance probability estimate is:

\displaystyle \hat{P}_{ex}=\frac{N_f}{N} =\frac{74}{20{,}000} =0.00370.

The approximate standard error is:

\displaystyle SE_{\hat{P}}=\sqrt{\frac{\hat{P}_{ex}(1-\hat{P}_{ex})}{N}}.

Substitute:

\displaystyle SE_{\hat{P}}=\sqrt{\frac{0.00370(0.99630)}{20{,}000}} =0.000429.

The estimate is:

\hat{P}_{ex}=0.370\%\quad \text{with } SE\approx0.043\%.

Engineering Comment

If the release threshold is $0.5\%$, this looks acceptable. If the threshold is $0.3\%$, it is not. The decision depends on the risk criterion, not only on the existence of a Monte Carlo run.

Plausibility Check

Seventy-four failures out of twenty thousand is a few tenths of a percent. The standard error is much smaller than the estimate but not negligible near tight limits.
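The estimate and its standard error can be computed directly from the counts. The upper bound below uses a normal approximation with $z=1.96$, which is an assumption added here for illustration and becomes unreliable when the failure count is very small.

```python
import math

def exceedance_estimate(n_fail, n_total):
    """Point estimate and binomial standard error of an exceedance probability."""
    p = n_fail / n_total
    se = math.sqrt(p * (1.0 - p) / n_total)
    return p, se

p_ex, se = exceedance_estimate(74, 20_000)
upper = p_ex + 1.96 * se               # assumed normal-approximation upper bound

print(f"{p_ex:.4%} +/- {se:.4%}, upper bound {upper:.4%}")
print("below 0.5% threshold:", upper < 0.005)
print("below 0.3% threshold:", upper < 0.003)
```

Comparing the upper bound rather than the point estimate with the threshold keeps the sampling uncertainty inside the decision.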

Exercise 6: Sample Count for Mean Convergence

A preliminary simulation gives output standard deviation:

s_y=4.5\ \text{units}.

You want the approximate $95\%$ confidence half-width for the mean to be no more than:

h=0.5\ \text{units}.

Use:

\displaystyle N\gtrsim\left(\frac{z s_y}{h}\right)^2

with:

z=1.96.

Find the required sample count.

Solution

Substitute:

\displaystyle N\gtrsim\left(\frac{1.96(4.5)}{0.5}\right)^2.

Calculate:

\displaystyle \frac{1.96(4.5)}{0.5}=17.64.

Then:

N\gtrsim17.64^2\approx311.2.

Rounding up, use at least:

N=312

samples for this mean estimate.

Engineering Comment

This sample count is for the mean, not for tail probability or rare failures. A release decision based on the 99th percentile or $P_f$ may require many more samples.

Plausibility Check

The desired half-width is about one ninth of the standard deviation, so a few hundred samples are reasonable.
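A short sketch of the sample-count rule, with a contrast case for a rare-event probability. The second function uses the binomial relation $SE/p=\sqrt{(1-p)/(Np)}$; the target values $p=0.001$ and a 10 percent relative standard error are assumptions chosen only to illustrate the scale difference.

```python
import math

def samples_for_mean(s, half_width, z=1.96):
    """Samples needed so the confidence half-width of the mean is at most half_width."""
    return math.ceil((z * s / half_width) ** 2)

def samples_for_probability(p, rel_se):
    """Approximate samples needed so a probability estimate has relative standard error rel_se."""
    return math.ceil((1.0 - p) / (p * rel_se ** 2))

n_mean = samples_for_mean(4.5, 0.5)             # Exercise 6 values
n_tail = samples_for_probability(0.001, 0.10)   # assumed rare-event target

print(n_mean, n_tail)
```

The mean needs a few hundred samples here, while the assumed rare-event target needs on the order of one hundred thousand.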

Exercise 7: Percentile Release Check

A simulated load distribution has:

y_{95}=87.2\ \text{kN}

for the 95th percentile. The allowable load is:

\displaystyle y_{\lim}=90.0\ \text{kN}.

The simulation percentile uncertainty is estimated as:

u_{95}=1.6\ \text{kN}.

Use a guarded percentile:

y_{g}=y_{95}+u_{95}.

Does the 95th percentile check pass?

Solution

The guarded percentile is:

y_g=87.2+1.6=88.8\ \text{kN}.

Compare:

88.8\ \text{kN}<90.0\ \text{kN}.

The guarded 95th percentile passes.

The margin is:

90.0-88.8=1.2\ \text{kN}.

Engineering Comment

The margin is not large. If the percentile came from a small sample or an unvalidated distribution, this should be treated as a conditional pass pending stronger tail evidence.

Plausibility Check

The unguarded percentile is $2.8\ \text{kN}$ below the limit. Adding $1.6\ \text{kN}$ of uncertainty should leave $1.2\ \text{kN}$ , which matches the result.

Exercise 8: One-at-a-Time Sensitivity

A baseline model output is:

y_0=240.

An input $x$ has baseline value:

x_0=60.

When $x$ is increased by $5$, the output becomes $258$. When $x$ is decreased by $5$, the output becomes $223$.

Estimate the central finite-difference sensitivity and normalized sensitivity.

Solution

The central sensitivity is:

\displaystyle \frac{\partial y}{\partial x}\approx \frac{258-223}{2(5)} =\frac{35}{10} =3.5.

The normalized sensitivity is:

\displaystyle \tilde{S}_x=\frac{x_0}{y_0}\frac{\partial y}{\partial x} =\frac{60}{240}(3.5) =0.875.

Engineering Comment

A normalized sensitivity of $0.875$ means a 1 percent increase in $x$ produces roughly a 0.875 percent increase in $y$ near the baseline. This is a strong local effect.

Plausibility Check

The output changes by $35$ units over a $10$ unit input span, so a slope of $3.5$ is direct. Since $x/y=0.25$, the normalized sensitivity should be about one quarter of $3.5$, or $0.875$.
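The central difference and its normalized form are easy to script. The sketch below also compares the forward and backward one-sided slopes, an extra check not required by the exercise, because a gap between them hints at curvature near the baseline.

```python
def central_sensitivity(y_plus, y_minus, step):
    """Central finite-difference estimate of dy/dx."""
    return (y_plus - y_minus) / (2.0 * step)

def normalized_sensitivity(dy_dx, x0, y0):
    """Dimensionless sensitivity: percent change in y per percent change in x."""
    return dy_dx * x0 / y0

x0, y0, step = 60.0, 240.0, 5.0
y_plus, y_minus = 258.0, 223.0

slope = central_sensitivity(y_plus, y_minus, step)
s_norm = normalized_sensitivity(slope, x0, y0)

forward = (y_plus - y0) / step      # one-sided slopes for a curvature check
backward = (y0 - y_minus) / step
print(slope, s_norm, forward, backward)
```

Here the one-sided slopes are 3.6 and 3.4, so the response is close to linear over this step and the local estimate is reasonable.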

Exercise 9: Variance Contribution Ranking

Three independent inputs contribute to output uncertainty:

Input	Sensitivity	Standard uncertainty
$x_1$	$4.0$	$0.20$
$x_2$	$1.5$	$0.60$
$x_3$	$8.0$	$0.05$

Compute each variance contribution and identify the dominant input.

Solution

Contribution from $x_1$:

c_1^2=(4.0\cdot0.20)^2=0.64.

Contribution from $x_2$:

c_2^2=(1.5\cdot0.60)^2=0.81.

Contribution from $x_3$:

c_3^2=(8.0\cdot0.05)^2=0.16.

Total variance:

u_y^2=0.64+0.81+0.16=1.61.

Fractions:

\displaystyle F_1=\frac{0.64}{1.61}=39.8\%,

\displaystyle F_2=\frac{0.81}{1.61}=50.3\%,

\displaystyle F_3=\frac{0.16}{1.61}=9.9\%.

The dominant contributor is $x_2$.

Engineering Comment

The largest sensitivity does not automatically dominate. Input $x_3$ has the largest sensitivity, but its uncertainty is small. Input $x_2$ deserves the first evidence-improvement effort.

Plausibility Check

The products are $0.8$, $0.9$ and $0.4$. Squaring them gives the ranking $x_2$, $x_1$, $x_3$, as calculated.
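The ranking can be reproduced with a small helper. This sketch assumes independent inputs and first-order propagation, as in the exercise; the name `variance_fractions` is hypothetical.

```python
def variance_fractions(inputs):
    """Return per-input variance fractions and total variance for independent inputs.

    inputs maps a name to (sensitivity, standard uncertainty).
    """
    contrib = {name: (c * u) ** 2 for name, (c, u) in inputs.items()}
    total = sum(contrib.values())
    return {name: v / total for name, v in contrib.items()}, total

fractions, total_var = variance_fractions({
    "x1": (4.0, 0.20),
    "x2": (1.5, 0.60),
    "x3": (8.0, 0.05),
})
dominant = max(fractions, key=fractions.get)
print({k: round(v, 3) for k, v in fractions.items()}, round(total_var, 2), dominant)
```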

Exercise 10: First-Order and Total Sensitivity Indices

A global sensitivity study reports:

Input	$S_i$	$S_{T_i}$
$A$	$0.42$	$0.47$
$B$	$0.18$	$0.41$
$C$	$0.09$	$0.12$

Which input has the strongest direct effect, and which input shows the strongest interaction effect?

Solution

The strongest direct effect is the largest first-order index:

S_A=0.42.

So input $A$ has the strongest direct effect.

Interaction involvement can be screened by:

S_{T_i}-S_i.

For $A$:

0.47-0.42=0.05.

For $B$:

0.41-0.18=0.23.

For $C$:

0.12-0.09=0.03.

Input $B$ shows the strongest interaction effect.

Engineering Comment

Input $B$ would be underestimated by a one-at-a-time study. It matters mainly through interactions, so experiments or simulations should vary it jointly with other inputs.

Plausibility Check

$A$ has the highest direct index, but $B$ has the largest gap between total and first-order effect. The two conclusions are different and both are useful.
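The screening step can be written as a short comparison of the reported indices. This sketch only post-processes the table values; it does not compute the indices themselves, which would require a global sampling study.

```python
# Reported (first-order, total) indices from the Exercise 10 table
indices = {"A": (0.42, 0.47), "B": (0.18, 0.41), "C": (0.09, 0.12)}

# Gap between total and first-order index as an interaction screen
gaps = {name: total - first for name, (first, total) in indices.items()}

strongest_direct = max(indices, key=lambda name: indices[name][0])
strongest_interaction = max(gaps, key=gaps.get)

print(strongest_direct, strongest_interaction,
      {name: round(gap, 2) for name, gap in gaps.items()})
```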

Exercise 11: Failure Probability from a Normal Safety Margin

A safety margin is modeled as approximately normal:

g\sim N(\mu_g,\sigma_g^2)

with:

\mu_g=12.0,\qquad \sigma_g=4.0.

Failure occurs when:

g\le 0.

Find the reliability index:

\displaystyle \beta=\frac{\mu_g}{\sigma_g}

and estimate the failure probability using:

P_f=\Phi(-\beta).

Use $\Phi(-3.0)=0.00135$.

Solution

The reliability index is:

\displaystyle \beta=\frac{12.0}{4.0}=3.0.

Therefore:

P_f=\Phi(-3.0)=0.00135.

As a percentage:

P_f=0.135\%.

Engineering Comment

This result depends strongly on the normal-margin assumption. If the margin distribution is skewed, bounded, mixed-mode or driven by rare operational states, the normal approximation may be misleading.

Plausibility Check

A mean margin three standard deviations above zero should correspond to a small lower-tail probability, around one tenth of a percent. The value is plausible.
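Without a table, $\Phi$ can be evaluated from the complementary error function in Python's standard library. The sketch assumes the normal-margin model of the exercise; the function names are chosen for illustration.

```python
import math

def normal_cdf(z):
    """Standard normal CDF via the complementary error function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def failure_probability(mu_g, sigma_g):
    """Reliability index and failure probability for a normal margin with failure at g <= 0."""
    beta = mu_g / sigma_g
    return beta, normal_cdf(-beta)

beta, p_f = failure_probability(12.0, 4.0)
print(beta, f"{p_f:.5f}")
```

Because the tail probability changes quickly with $\beta$, small errors in $\mu_g$ or $\sigma_g$ can shift $P_f$ by a large factor, which is one reason the distribution assumption deserves scrutiny.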

Exercise 12: Chance Constraint Release Screen

A controller design must satisfy:

P(Y\le 50)\ge 0.99.

A Monte Carlo study with $N=10{,}000$ samples gives:

N_{ex}=86

samples with $Y>50$.

Does the chance constraint pass?

Solution

The estimated exceedance probability is:

\displaystyle \hat{P}_{ex}=\frac{86}{10{,}000}=0.0086.

The estimated satisfaction probability is:

\hat{P}(Y\le 50)=1-0.0086=0.9914.

Compare with the requirement:

0.9914>0.99.

The nominal Monte Carlo estimate passes.

Engineering Comment

The pass is narrow because the exceedance threshold is $1\%$ and the estimate is $0.86\%$. A release package should include sampling uncertainty and evidence that the input distributions are credible in the tail.

Plausibility Check

Eighty-six exceedances in ten thousand samples is less than one percent. The satisfaction probability should therefore be just above $99\%$.
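How narrow the pass is becomes clearer once sampling uncertainty is added. The sketch below applies a one-sided normal-approximation bound with $z=1.645$; that guard level is an assumption for illustration, not a requirement stated in the exercise.

```python
import math

n_total, n_exceed = 10_000, 86
allowed = 0.01                       # 1 - 0.99 from the chance constraint

p_ex = n_exceed / n_total
se = math.sqrt(p_ex * (1.0 - p_ex) / n_total)
upper = p_ex + 1.645 * se            # assumed one-sided 95% upper bound

nominal_pass = p_ex <= allowed
guarded_pass = upper <= allowed
print(f"estimate {p_ex:.4%}, SE {se:.4%}, upper {upper:.4%}")
print(nominal_pass, guarded_pass)
```

Under this assumed guard the upper bound sits slightly above $1\%$, so the nominal pass does not survive the sampling uncertainty without more samples.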

Exercise 13: Robust Objective Comparison

Two design candidates have simulated cost metric $J$:

Candidate	$\operatorname{E}[J]$	$\sigma_J$
A	$100$	$18$
B	$108$	$6$

For a lower-is-better objective, use:

J_{robust}=\operatorname{E}[J]+k\sigma_J

with:

k=1.5.

Which candidate is preferred?

Solution

For candidate A:

J_{robust,A}=100+1.5(18)=127.

For candidate B:

J_{robust,B}=108+1.5(6)=117.

Since:

117<127,

candidate B is preferred under the robust objective.

Engineering Comment

Candidate A has better nominal expected cost, but its uncertainty is much larger. The robust objective prefers the design with less spread because the decision penalizes risk.

Plausibility Check

The uncertainty penalty for A is $27$, while for B it is $9$. That difference is large enough to reverse the nominal ranking.
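The comparison, together with the risk weight at which the ranking flips, can be sketched as follows. The break-even value of $k$ is an additional check derived from setting the two robust objectives equal; it is not part of the exercise statement.

```python
def robust_objective(mean, sigma, k):
    """Mean-plus-k-sigma objective for a lower-is-better metric."""
    return mean + k * sigma

candidates = {"A": (100.0, 18.0), "B": (108.0, 6.0)}   # (E[J], sigma_J)
k = 1.5

scores = {name: robust_objective(m, s, k) for name, (m, s) in candidates.items()}
preferred = min(scores, key=scores.get)

# k at which both candidates score the same: 100 + 18k = 108 + 6k
(m_a, s_a), (m_b, s_b) = candidates["A"], candidates["B"]
k_break_even = (m_b - m_a) / (s_a - s_b)

print(scores, preferred, round(k_break_even, 3))
```

Candidate B is preferred for any $k$ above roughly $0.67$, so the conclusion is not sensitive to the exact choice of $k=1.5$.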

Exercise 14: Validation Residual Bias and RMSE

A model is compared with five independent validation measurements. Residuals are:

r=[2.0,\ -1.0,\ 3.0,\ 4.0,\ -2.0].

Compute the mean bias and RMSE.

Solution

The mean residual is:

\displaystyle \bar{r}=\frac{2-1+3+4-2}{5} =\frac{6}{5} =1.2.

The RMSE is:

\displaystyle RMSE=\sqrt{\frac{1}{5}(2^2+(-1)^2+3^2+4^2+(-2)^2)}.

Calculate:

\displaystyle RMSE=\sqrt{\frac{4+1+9+16+4}{5}} =\sqrt{\frac{34}{5}} =2.61.

Engineering Comment

The model has a positive average bias and a larger RMS residual. A release decision should compare both with the uncertainty claimed by the model and with the engineering margin.

Plausibility Check

Residuals are mostly a few units, so an RMSE around $2.6$ is reasonable. The positive residuals are larger than the negative ones, so positive bias is expected.
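Bias and RMSE can be computed together, and the identity $RMSE^2=\bar{r}^2+s_r^2$ (with $s_r$ the population standard deviation of the residuals) separates systematic offset from scatter. The decomposition is an added illustration beyond what the exercise asks.

```python
import math

residuals = [2.0, -1.0, 3.0, 4.0, -2.0]   # Exercise 14 validation residuals
n = len(residuals)

bias = sum(residuals) / n
rmse = math.sqrt(sum(r * r for r in residuals) / n)

# Scatter about the mean residual, so that rmse^2 = bias^2 + scatter^2
scatter = math.sqrt(sum((r - bias) ** 2 for r in residuals) / n)

print(round(bias, 2), round(rmse, 2), round(scatter, 2))
```

Here most of the RMSE comes from scatter rather than bias, so a constant offset correction alone would not bring the residuals down much.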

Exercise 15: Prediction Interval Coverage

A model reports 95 percent prediction intervals for $n=40$ validation cases. The measured value falls inside the reported interval in:

N_{in}=34

cases. Estimate empirical coverage and decide whether the interval model looks overconfident.

Solution

The empirical coverage is:

\displaystyle \hat{C}=\frac{N_{in}}{n} =\frac{34}{40} =0.85.

So:

\hat{C}=85\%.

The reported intervals claim 95 percent coverage but achieved only 85 percent on validation cases.

The interval model looks overconfident.

Engineering Comment

Undercoverage means the uncertainty bands are too narrow, the validation data are outside the intended envelope, or the model is missing important variability. This is a model-authority problem, not just a plotting issue.

Plausibility Check

Missing 6 cases out of 40 means 15 percent are outside the interval. A 95 percent interval should miss about 5 percent, so the observed coverage is clearly low.
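With only 40 cases, it is worth checking whether 34 hits could plausibly occur by chance under true 95 percent coverage. The sketch below uses an exact binomial tail and assumes the validation cases are independent, which is an assumption added here.

```python
import math

def prob_at_most(k, n, p):
    """Binomial probability of at most k successes in n independent trials."""
    return sum(math.comb(n, i) * p ** i * (1.0 - p) ** (n - i) for i in range(k + 1))

n_cases, n_inside = 40, 34
coverage = n_inside / n_cases

# Chance of seeing 34 or fewer hits if the intervals truly had 95% coverage
tail = prob_at_most(n_inside, n_cases, 0.95)

print(f"coverage {coverage:.0%}, tail probability {tail:.3f}")
```

The tail probability is on the order of one percent, which supports reading the undercoverage as real rather than as sampling noise.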

Exercise 16: Evidence Value from Sensitivity Ranking

A release metric has total uncertainty:

u_y=5.0.

The largest variance contributor is input $x_1$ with contribution fraction:

F_1=64\%.

If new calibration halves the standard uncertainty of $x_1$, estimate the new total output uncertainty. Assume all other contributions stay the same.

Solution

The total variance is:

u_y^2=25.

The variance from $x_1$ is:

0.64(25)=16.

The remaining variance is:

25-16=9.

Halving standard uncertainty quarters the variance contribution:

\displaystyle 16\left(\frac{1}{2}\right)^2=4.

The new total variance is:

u_{y,new}^2=4+9=13.

Therefore:

u_{y,new}=\sqrt{13}=3.61.

Engineering Comment

Sensitivity ranking becomes valuable when it tells where evidence will reduce decision risk. Here, calibrating $x_1$ reduces output uncertainty from $5.0$ to $3.61$, a meaningful improvement.

Plausibility Check

Because $x_1$ dominates but does not account for all variance, halving its standard uncertainty should reduce total uncertainty substantially but not by half. The result matches that expectation.
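The effect of improving one input can be explored with a small function. This is a sketch assuming independent contributions; `uncertainty_after_reduction` and its arguments are hypothetical names.

```python
import math

def uncertainty_after_reduction(u_total, fraction, scale):
    """Total standard uncertainty after scaling one input's standard uncertainty.

    fraction is that input's share of the total variance; scale multiplies
    its standard uncertainty (0.5 means halved).
    """
    var = u_total ** 2
    return math.sqrt(var * (1.0 - fraction) + var * fraction * scale ** 2)

u_new = uncertainty_after_reduction(5.0, 0.64, 0.5)     # Exercise 16 case
u_floor = uncertainty_after_reduction(5.0, 0.64, 0.0)   # x1 known perfectly

print(round(u_new, 2), round(u_floor, 2))
```

The floor of $3.0$ shows the limit of this strategy: once $x_1$ is halved, most of the achievable gain from that input has already been taken.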

Exercise 17: Decision Margin After Improved Evidence

A nominal design metric is:

y=72.

The upper limit is:

\displaystyle y_{\lim}=80.

Before improved evidence, output uncertainty is:

u_y=5.0.

After improved evidence, it is:

u_y=3.6.

Use guard factor:

k=2.

Compare the guarded margins before and after evidence improvement.

Solution

Initial guarded value:

y_{g,old}=72+2(5.0)=82.

Initial guarded margin:

m_{g,old}=80-82=-2.

The initial guarded check fails.

Improved guarded value:

y_{g,new}=72+2(3.6)=79.2.

Improved guarded margin:

m_{g,new}=80-79.2=0.8.

After improved evidence, the guarded check passes with margin $0.8$.

Engineering Comment

This shows why reducing uncertainty can be equivalent to design improvement. The nominal design did not change, but the evidence became strong enough to support a guarded release.

Plausibility Check

The original nominal margin is $8$. A two-sigma guard of $10$ fails by $2$; a two-sigma guard of $7.2$ passes by $0.8$. The arithmetic is consistent.

Exercise 18: UQ Release Decision Gate

A model-supported release package gives:

Check	Requirement	Result
Guarded margin	$\ge 0$	$1.4$
Monte Carlo exceedance	$\le 0.5\%$	$0.42\%$
Dominant input evidence	required	calibrated
Validation RMSE	$\le 3.0$	$3.4$
Prediction interval coverage	$\ge 90\%$	$86\%$

Should the model-supported decision be released?

Solution

The guarded margin passes:

1.4>0.

The exceedance probability passes:

0.42\%<0.5\%.

The dominant input evidence is present.

The validation RMSE fails:

3.4>3.0.

Prediction interval coverage also fails:

86\%<90\%.

The model-supported decision should not be released.

Engineering Comment

The UQ calculation appears acceptable, but validation does not support the model’s claimed accuracy. Release should be held for recalibration, model-form review, wider uncertainty bands or a restricted operating envelope.

Plausibility Check

Two validation gates fail. Even if uncertainty propagation and Monte Carlo checks pass, unsupported model accuracy is enough to block release.
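The gate logic can be expressed as an all-must-pass check that also reports which gates failed. This is a minimal sketch of the table above; the structure and names are assumptions, and a real release process would carry evidence references alongside each result.

```python
def evaluate_gates(gates):
    """Return (release, failed gate names); release requires every gate to pass."""
    failed = [name for name, passed in gates if not passed]
    return not failed, failed

# Results from the Exercise 18 table, percentages kept in percent units
gates = [
    ("Guarded margin", 1.4 >= 0.0),
    ("Monte Carlo exceedance", 0.42 <= 0.5),
    ("Dominant input evidence", True),            # reported as calibrated
    ("Validation RMSE", 3.4 <= 3.0),
    ("Prediction interval coverage", 86.0 >= 90.0),
]

release, failed = evaluate_gates(gates)
print(release, failed)
```

Reporting the failed gates by name, rather than a single pass or fail flag, points directly at the evidence that must improve before release.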

Review Checklist

Before accepting an uncertainty or sensitivity analysis, check:

the decision metric, acceptance threshold and action are explicit;
input evidence and units are traceable;
variability, lack of knowledge and model-form uncertainty are separated;
independence or correlation assumptions are justified;
local propagation is valid near the operating point;
Monte Carlo convergence is checked on the decision metric;
sensitivity ranks the input that controls the action, not a convenient output;
failure probability or percentile checks include sampling uncertainty;
validation residuals and interval coverage support the claimed uncertainty;
the report states what new evidence would change the decision.

Common Mistakes

Treating a nominal pass as release evidence without a guarded margin.
Assuming all inputs are independent because covariance data are inconvenient.
Using local linear propagation through a threshold, saturation or discontinuity.
Reporting a Monte Carlo histogram without exceedance or percentile evidence.
Ranking sensitivities on mean performance when the decision depends on tail risk.
Ignoring interaction effects when total sensitivity is much larger than first-order sensitivity.
Reporting failure probability without defining failure.
Validating a model with calibration data and calling it independent.
Treating low RMSE as sufficient when interval coverage is poor.
Reducing uncertainty in an input that barely contributes to the decision metric.

The central habit is to make uncertainty actionable. A good UQ result tells which decision is stable, which input can overturn it, and which evidence would most efficiently reduce risk.
