Formula sheet

Engineering Statistics and Reliability Data Formula Sheet

Mathematical engineering formula sheet for engineering statistics, confidence intervals, sample size, DOE, regression, reliability life data, censored data, Monte Carlo evidence, and validation.

Branch: Mathematical Engineering
Content: Formula sheet
Updated: Jun 23, 2026
Revision: v1.0.0 · reviewed

This formula sheet collects engineering statistics and reliability data formulas for estimation, test planning, experimental design, model validation, life data, censored observations, and decision evidence. Use it to make calculations traceable, not to hide weak assumptions behind statistical notation.

Every formula needs a decision boundary: what population is represented, what measurement is used, what error is costly, and what action follows from the result.

Symbols and Conventions

Common symbols:

Symbol	Meaning
$n$	sample size
$x_i$	measured observation
$\bar{x}$	sample mean
$s$	sample standard deviation
$\mu$	population mean
$\sigma$	population standard deviation
$SE$	standard error
$z$	standard normal multiplier or z-score
$t$	Student t multiplier
$p$	probability or proportion
$\lambda$	failure rate for an exponential model
$T$	total exposure time, cycles, distance, or transactions
$R(t)$	reliability over mission time $t$
$\beta$	Weibull shape parameter
$\eta$	Weibull scale parameter

Keep units attached to the engineering variable. Statistical multipliers are dimensionless; the interval or estimate has the same unit as the measured quantity.

Mean, Variance, and Standard Deviation

Sample mean:

\displaystyle \bar{x}=\frac{1}{n}\sum_{i=1}^{n}x_i

Sample variance:

\displaystyle s^2=\frac{1}{n-1}\sum_{i=1}^{n}(x_i-\bar{x})^2

Sample standard deviation:

s=\sqrt{s^2}

Use $n-1$ for the usual unbiased sample variance estimate. Before interpreting $s$ , check whether observations are independent, representative, and measured with the same method.

Weighted Mean

For observations with weights $w_i$ :

\displaystyle \bar{x}_w=\frac{\sum_i w_i x_i}{\sum_i w_i}

Weighted means are useful when measurements have different exposure, duration, uncertainty, or population weight. They are misleading if the weights are chosen after seeing the result.

Z-Score

For an observation $x$ :

\displaystyle z=\frac{x-\mu}{\sigma}

For sample-based screening:

\displaystyle z\approx\frac{x-\bar{x}}{s}

Use z-scores only when the distribution and spread are meaningful for the decision. A large z-score can indicate a real shift, a special-cause event, a measurement error, or a model mismatch.

Standard Error of a Mean

If observations are independent:

\displaystyle SE_{\bar{x}}=\frac{s}{\sqrt{n}}

Standard error measures uncertainty in the mean estimate, not the spread of individual observations. A small standard error does not imply that every part, patient, packet, sensor, or test article is close to the mean.

Confidence Interval for a Mean

When $s$ is estimated from data:

\displaystyle \bar{x}\pm t_{\alpha/2,n-1}\frac{s}{\sqrt{n}}

For large samples or known $\sigma$ :

\displaystyle \bar{x}\pm z_{\alpha/2}\frac{\sigma}{\sqrt{n}}

Worked Check

A calibration offset has:

\bar{x}=0.42\ \text{mm}

s=0.18\ \text{mm}

n=9

Use:

t_{0.975,8}=2.31

Half-width:

\displaystyle h=2.31\frac{0.18}{\sqrt{9}}=0.139\ \text{mm}

Interval:

0.42\pm0.139=0.281\ \text{mm to}\ 0.559\ \text{mm}

Engineering Comment

If the allowable mean offset is $0.50\ \text{mm}$ , this interval crosses the limit. The result is not a clean release; it calls for correction, more representative data, or a guarded decision rule.

Confidence Interval for a Proportion

For a simple large-sample screen:

\displaystyle \hat{p}=\frac{x}{n}

\displaystyle SE_{\hat{p}}=\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}

Approximate interval:

\hat{p}\pm z_{\alpha/2}SE_{\hat{p}}

This approximation is weak for small samples, rare failures, or proportions near 0 or 1. Use an exact or adjusted method when release risk is high.

Sample Size for Mean Precision

To estimate a mean with target half-width $h$ :

\displaystyle n\approx\left(\frac{zs}{h}\right)^2

Round up.

Worked Check

If:

s=0.24\ \text{bar}

target half-width is:

h=0.06\ \text{bar}

and:

z=1.96

then:

\displaystyle n=\left(\frac{1.96(0.24)}{0.06}\right)^2=61.5

Round up:

n=62

Engineering Comment

This assumes stable variance and representative sampling. If results differ by operator, lot, temperature, site, or duty cycle, the design should be blocked or stratified rather than simply enlarged.

Sample Size for Proportion Precision

For a target half-width $h$ :

\displaystyle n\approx\frac{z^2p(1-p)}{h^2}

If $p$ is unknown, use $p=0.5$ for a conservative maximum variance screen.

Difference Between Two Means

For independent samples:

\Delta=\bar{x}_1-\bar{x}_2

Standard error:

\displaystyle SE_{\Delta}=\sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}}

Signal-to-noise screen:

\displaystyle S=\frac{|\Delta|}{SE_{\Delta}}

This is not a full test unless the assumptions and degrees of freedom are handled. It is useful for quick engineering review of whether a claimed improvement is large relative to noise.

Guarded Acceptance with Measurement Uncertainty

For a symmetric limit $L$ on absolute error and expanded uncertainty $U$ :

A_g=L-U

Accept only when:

|E_{meas}|\leq A_g

Worked Check

If:

L=3.0\%

U=0.45\%

then:

A_g=3.0-0.45=2.55\%

A measured error of:

|E_{meas}|=2.7\%

fails the guarded rule even though it is below the nominal $3.0\%$ limit.

Engineering Comment

Guard bands reduce false acceptance risk but can increase false rejection. The project should define the rule before testing.

Process Capability

For two-sided specification limits:

\displaystyle C_p=\frac{USL-LSL}{6\sigma}

Upper capability:

\displaystyle C_{pu}=\frac{USL-\mu}{3\sigma}

Lower capability:

\displaystyle C_{pl}=\frac{\mu-LSL}{3\sigma}

Overall centered capability:

C_{pk}=\min(C_{pu},C_{pl})

Capability values require a stable process, credible distribution, adequate measurement system, and specification limits that actually represent engineering requirements.

Factorial Effect Estimate

For a two-level factor $A$ coded low/high, the main effect can be screened as:

Effect_A=\bar{y}_{A+}-\bar{y}_{A-}

For an interaction between factors $A$ and $B$ :

Effect_{AB}=\bar{y}_{AB+}-\bar{y}_{AB-}

where $AB+$ means runs where the product of coded factor signs is positive.

Randomization, replication, blocking, and pre-defined response variables matter more than the arithmetic. A factorial effect from a biased run order can be a temperature drift, tooling drift, operator sequence, or battery state rather than a real factor effect.

Linear Regression

Simple linear model:

y_i=\beta_0+\beta_1x_i+\epsilon_i

Least-squares slope:

\displaystyle \hat{\beta}_1=\frac{\sum_i(x_i-\bar{x})(y_i-\bar{y})}{\sum_i(x_i-\bar{x})^2}

Intercept:

\hat{\beta}_0=\bar{y}-\hat{\beta}_1\bar{x}

Predicted value:

\hat{y}=\hat{\beta}_0+\hat{\beta}_1x

Regression should be checked with residuals, leverage, operating range, measurement uncertainty, and physical plausibility. A good fit inside a test range is not proof outside that range.

Bias and RMSE

Prediction residual:

e_i=y_i-\hat{y}_i

Mean bias:

\displaystyle Bias=\frac{1}{n}\sum_i e_i

Root mean square error:

\displaystyle RMSE=\sqrt{\frac{1}{n}\sum_i e_i^2}

Bias shows systematic offset. RMSE combines bias and scatter. For validation, compare both with the engineering margin.

Failure Rate and MTBF from Exposure

For $N_f$ failures over total exposure $T$ :

\displaystyle \hat{\lambda}=\frac{N_f}{T}

For a constant-rate model:

\displaystyle MTBF=\frac{1}{\hat{\lambda}}=\frac{T}{N_f}

Use consistent exposure units: operating hours, cycles, starts, transactions, distance, or mission count. Do not mix calendar time with operating exposure without a defensible conversion.

Exponential Reliability

For constant failure rate:

R(t)=e^{-\lambda t}

Probability of failure by time $t$ :

F(t)=1-R(t)=1-e^{-\lambda t}

Mean time to failure:

\displaystyle MTTF=\frac{1}{\lambda}

This model is appropriate only when the failure rate is approximately constant over the use interval. It does not represent early-life screening failures or wear-out unless the interval is limited and justified.

Zero-Failure Demonstration Screen

For zero observed failures over exposure $T$ , a one-sided confidence lower bound on MTBF under an exponential model can be screened as:

\displaystyle MTBF_C\geq\frac{T}{-\ln(1-C)}

where $C$ is confidence level.

Worked Check

Ten units run:

300\ \text{h/unit}

with zero failures. Total exposure:

T=10(300)=3000\ \text{h}

For:

C=0.90

-\ln(1-0.90)=2.303

Therefore:

\displaystyle MTBF_{90}\geq\frac{3000}{2.303}=1303\ \text{h}

Engineering Comment

This is not a field reliability guarantee. It depends on the exponential assumption, failure definition, unit representativeness, test stress, censoring, and whether the released configuration matches the tested configuration.

Weibull Life Data

Weibull cumulative failure probability:

\displaystyle F(t)=1-\exp\left[-\left(\frac{t}{\eta}\right)^{\beta}\right]

Reliability:

\displaystyle R(t)=\exp\left[-\left(\frac{t}{\eta}\right)^{\beta}\right]

Hazard rate:

\displaystyle h(t)=\frac{\beta}{\eta}\left(\frac{t}{\eta}\right)^{\beta-1}

Interpretation:

$\beta<1$ suggests decreasing hazard, often early-life or screening-dominated failures;
$\beta=1$ gives constant hazard, equivalent to exponential life;
$\beta>1$ suggests increasing hazard, often wear-out or degradation.

The interpretation is engineering evidence, not automatic truth. Check failure modes, censoring, operating conditions, and confidence bounds.

Censored Data Exposure

Right-censored units contribute exposure even when they have not failed. A simple exponential failure-rate estimate with complete exposure is:

\displaystyle \hat{\lambda}=\frac{N_f}{\sum_i t_i}

where failed and censored units both contribute observed time $t_i$ .

Worked Check

Four devices are observed:

Unit	Observation
failed	$120\ \text{h}$
failed	$180\ \text{h}$
still running	$240\ \text{h}$
still running	$240\ \text{h}$

Failures:

N_f=2

Exposure:

T=120+180+240+240=780\ \text{h}

Failure rate estimate:

\displaystyle \hat{\lambda}=\frac{2}{780}=0.00256\ \text{h}^{-1}

MTBF estimate:

\displaystyle MTBF=\frac{1}{0.00256}=390\ \text{h}

Engineering Comment

Discarding censored units would underuse valid exposure. Treating censored units as failures would be overly pessimistic. The right method depends on the censoring mechanism and life model.

Monte Carlo Propagation

For a model:

y=f(x_1,x_2,\ldots,x_k)

Monte Carlo simulation draws random samples from input distributions and evaluates:

y_j=f(x_{1,j},x_{2,j},\ldots,x_{k,j})

Estimated mean:

\displaystyle \bar{y}=\frac{1}{N}\sum_{j=1}^{N}y_j

Estimated probability of exceeding a limit $L$ :

\displaystyle \hat{P}(y>L)=\frac{\#(y_j>L)}{N}

Monte Carlo results are only as valid as the input distributions, correlations, model form, sample count, and validation evidence.

Queueing and Waiting-Time Screen

For a stable single-server M/M/1 screen:

\displaystyle \rho=\frac{\lambda}{\mu}

Average number in system:

\displaystyle L=\frac{\rho}{1-\rho}

Average time in system:

\displaystyle W=\frac{1}{\mu-\lambda}

This model assumes Poisson arrivals, exponential service times, one server, first-in-first-out behavior, and steady state. Use it as an order-of-magnitude check for test labs, service queues, packet systems, inspection stations, or maintenance response.

Validation Acceptance Metrics

Normalized error:

\displaystyle e_N=\frac{y_{model}-y_{meas}}{y_{scale}}

Coverage check for uncertainty:

|y_{model}-y_{meas}|\leq U_{combined}

where $U_{combined}$ should include model uncertainty, measurement uncertainty, input uncertainty, and numerical uncertainty when relevant.

Validation should specify:

operating range covered;
independent data source;
acceptance threshold;
residual pattern review;
uncertainty basis;
action if the model passes only part of the range.

Common Mistakes

Common engineering statistics mistakes include:

reporting a mean without spread, confidence, or measurement uncertainty;
using more samples to compensate for biased sampling;
treating repeated measurements on one unit as if they represented many units;
using a normal model for bounded, censored, rare-event, or strongly skewed data without checking;
using MTBF without failure definition, exposure basis, confidence, or failure-mode context;
ignoring censored units in reliability data;
accepting a regression model outside the tested range;
using Monte Carlo output without validating input distributions and correlations.

Formula Selection Checklist

Before using a statistical result for release or design change, confirm:

the engineering decision is stated;
the measured variable and units are defined;
the population and sampling method match the decision;
the measurement system is adequate;
the assumptions behind the formula are credible;
uncertainty or confidence is compared with engineering margin;
the result changes an action: release, hold, retest, redesign, monitor, or escalate.

REF

Disciplines

Engineering Statistics and Reliability Data Formula Sheet

Symbols and Conventions

Mean, Variance, and Standard Deviation

Weighted Mean

Z-Score

Standard Error of a Mean

Confidence Interval for a Mean

Worked Check

Engineering Comment

Confidence Interval for a Proportion

Sample Size for Mean Precision

Worked Check

Engineering Comment

Sample Size for Proportion Precision

Difference Between Two Means

Guarded Acceptance with Measurement Uncertainty

Worked Check

Engineering Comment

Process Capability

Factorial Effect Estimate

Linear Regression

Bias and RMSE

Failure Rate and MTBF from Exposure

Exponential Reliability

Zero-Failure Demonstration Screen

Worked Check

Engineering Comment

Weibull Life Data

Censored Data Exposure

Worked Check

Engineering Comment

Monte Carlo Propagation

Queueing and Waiting-Time Screen

Validation Acceptance Metrics

Common Mistakes

Formula Selection Checklist

See also