Algorithmic Performance Formula Sheet

Core formulas and checks for algorithmic performance: Big O growth, loops, memory footprint, amortized cost, queueing, throughput, latency percentiles, and reliability.

Branch: Computer Engineering
Content: Formula sheet
Updated: May 29, 2026
Revision: v1.0.4 · reviewed

This formula sheet collects common estimates used when reviewing algorithms, data structures, and computer-system performance. The equations are first-pass checks, not substitutes for profiling. Use them to compare growth rates, find capacity limits, set benchmark expectations, and identify workloads that need more detailed measurement.

Always state what $n$ means. It may be the number of elements, bytes, vertices, edges, requests, records, samples, tasks, or states. Also state whether the estimate is a best-case, average-case, expected, amortized, or worst-case bound.

Common growth classes

Constant time:

O(1)

Logarithmic time:

O(\log n)

Linear time:

O(n)

Linearithmic time:

O(n\log n)

Quadratic time:

O(n^2)

Exponential time:

O(2^n)

Factorial time:

O(n!)

Asymptotic notation suppresses constants and lower-order terms. For engineering decisions, combine growth class with measured constants, memory traffic, allocation behaviour, and latency requirements.

Loop operation counts

A single loop over $n$ items with constant work per iteration usually costs, for constants $a$ and $b$:

T(n)=an+b

so:

T(n)=O(n)

A nested loop over all pairs usually costs, for constants $a$, $b$, and $c$:

T(n)=an^2+bn+c

so:

T(n)=O(n^2)

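For triangular nested loops, where the inner loop runs $i$ times on outer iteration $i$, the total number of inner iterations is: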

\displaystyle \sum_{i=1}^{n} i=\frac{n(n+1)}{2}

so the growth is still:

O(n^2)

For a loop that repeatedly halves the remaining work, the number of iterations $k$ needed to reduce $n$ items to one is approximately:

k \approx \log_2 n

so the growth is:

O(\log n)
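
The triangular sum and the halving count can be checked by brute force. The following is a minimal Python sketch; the function names are hypothetical and chosen only for illustration, and the halving loop assumes integer division, which gives $\lfloor \log_2 n \rfloor$ steps.

```python
import math


def count_triangular_iterations(n: int) -> int:
    """Count inner-loop executions when the inner loop runs i times on outer pass i."""
    count = 0
    for i in range(1, n + 1):
        for _ in range(i):
            count += 1
    return count


def count_halving_steps(n: int) -> int:
    """Count integer halvings needed to reduce n to 1."""
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps


n = 1000
print(count_triangular_iterations(n), n * (n + 1) // 2)   # 500500 500500
print(count_halving_steps(n), math.floor(math.log2(n)))   # 9 9
```

Counting iterations this way is only practical for small $n$, but it is a quick way to confirm a closed form before relying on it.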

Recurrences

Binary search can be written as:

\displaystyle T(n)=T\left(\frac{n}{2}\right)+c

which gives:

T(n)=O(\log n)

Merge sort is commonly written as:

\displaystyle T(n)=2T\left(\frac{n}{2}\right)+cn

which gives:

T(n)=O(n\log n)

A simple exhaustive subset search often has:

T(n)=2T(n-1)+c

which gives:

T(n)=O(2^n)

Recurrences describe the structure of the work; they do not prove performance. Constant factors, allocation, recursion depth, and cache behaviour still need review.
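
The three recurrences above can be evaluated numerically to see their growth. This is a sketch under stated assumptions: the base cases $T(1)=c$ and $T(0)=c$ are not given in the sheet and are chosen here for convenience, $n/2$ is taken as integer division, and the function names are hypothetical.

```python
def binary_search_cost(n: int, c: int = 1) -> int:
    """T(n) = T(n/2) + c, with assumed base case T(1) = c."""
    if n <= 1:
        return c
    return binary_search_cost(n // 2, c) + c


def merge_sort_cost(n: int, c: int = 1) -> int:
    """T(n) = 2T(n/2) + cn, with assumed base case T(1) = c."""
    if n <= 1:
        return c
    return 2 * merge_sort_cost(n // 2, c) + c * n


def subset_search_cost(n: int, c: int = 1) -> int:
    """T(n) = 2T(n-1) + c, with assumed base case T(0) = c (unrolled iteratively)."""
    cost = c
    for _ in range(n):
        cost = 2 * cost + c
    return cost


print(binary_search_cost(1024))   # 11     (log2(1024) + 1)
print(merge_sort_cost(1024))      # 11264  (1024 * 11)
print(subset_search_cost(10))     # 2047   (2**11 - 1)
```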

Data-structure operation estimates

Array index access:

O(1)

Array scan:

O(n)

Insertion or deletion in the middle of a contiguous array:

O(n)

Balanced binary search tree lookup, insertion, or deletion:

O(\log n)

Unbalanced binary search tree worst-case lookup, insertion, or deletion:

O(n)

Hash table expected lookup, insertion, or deletion:

O(1)

Hash table worst-case lookup under severe collision:

O(n)

These are model values. Real systems should also check memory locality, allocation, resizing, synchronization, ordering requirements, and adversarial input behaviour.

Graph storage and traversal

For a graph with $V$ vertices and $E$ edges, adjacency-list storage is:

O(V+E)

Adjacency-matrix storage is:

O(V^2)

Breadth-first search or depth-first search with adjacency lists:

O(V+E)

Checking whether an edge exists in an adjacency matrix:

O(1)

Sparse graphs often satisfy:

E \ll V^2

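Dense graphs approach the maximum edge count, which is $V(V-1)$ for a simple directed graph and $V(V-1)/2$ for a simple undirected graph, so to within a constant factor: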

E \approx V^2

The representation should match graph density, update frequency, memory limits, and the operations that dominate the workload.
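
The $O(V+E)$ traversal bound can be seen by counting work in a breadth-first search over an adjacency list. The sketch below is illustrative only: the function name, the dictionary-of-lists representation, and the small directed example graph are assumptions, not part of the sheet.

```python
from collections import deque


def bfs_work(adjacency: dict, start) -> tuple:
    """Run BFS from start; return (vertices dequeued, edges examined)."""
    visited = {start}
    queue = deque([start])
    vertices_dequeued = 0
    edges_examined = 0
    while queue:
        v = queue.popleft()
        vertices_dequeued += 1
        for w in adjacency[v]:
            edges_examined += 1          # each outgoing edge is looked at once
            if w not in visited:
                visited.add(w)
                queue.append(w)
    return vertices_dequeued, edges_examined


# Hypothetical directed graph with 4 vertices and 4 edges.
graph = {0: [1, 2], 1: [3], 2: [3], 3: []}
print(bfs_work(graph, 0))   # (4, 4)
```

Each reachable vertex is dequeued once and each of its outgoing edges is examined once, which is where the $V+E$ total comes from.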

Memory footprint

Approximate memory for an array of $n$ fixed-size elements:

M \approx nS_e+S_{header}

where $S_e$ is element size and $S_{header}$ is container overhead.

Approximate memory for a pointer-based node structure:

M \approx n(S_e+pS_p+S_m)

where $p$ is the number of pointers per node, $S_p$ is the pointer size, and $S_m$ is the metadata or allocator overhead per node.

Hash table load factor:

\displaystyle \alpha=\frac{n}{m}

where $n$ is the number of stored entries and $m$ is the number of buckets. A higher load factor reduces unused memory but usually increases collision cost and probe length.
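
As a rough illustration, the footprint and load-factor formulas translate directly into code. The sizes used here (8-byte elements and pointers, a 64-byte header, 16 bytes of per-node overhead) are assumed values for the example, not measurements, and the function names are hypothetical.

```python
def array_bytes(n: int, elem_bytes: int, header_bytes: int = 0) -> int:
    """M ~ n * S_e + S_header."""
    return n * elem_bytes + header_bytes


def node_bytes(n: int, elem_bytes: int, pointers_per_node: int,
               pointer_bytes: int = 8, overhead_bytes: int = 16) -> int:
    """M ~ n * (S_e + p * S_p + S_m)."""
    return n * (elem_bytes + pointers_per_node * pointer_bytes + overhead_bytes)


def load_factor(entries: int, buckets: int) -> float:
    """alpha = n / m."""
    return entries / buckets


n = 1_000_000
print(array_bytes(n, 8, header_bytes=64))        # 8000064
print(node_bytes(n, 8, pointers_per_node=2))     # 40000000
print(load_factor(750, 1000))                    # 0.75
```

Under these assumed sizes, a two-pointer node structure takes about five times the memory of the contiguous array for the same payload.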

Amortized cost

For a dynamic array that doubles capacity when full, a sequence of $n$ appends has total copy cost bounded by a constant multiple of $n$:

T_{total}=O(n)

Amortized append cost is therefore:

T_{append}=O(1)

Amortized cost is not the same as worst-case cost for one operation. A resize can still create a latency spike. Real-time systems may need reserved capacity, bounded containers, incremental resizing, or static allocation.
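
A small simulation makes both points concrete: the total copy cost stays below $2n$, while a single append can still copy hundreds of elements. This is a sketch of a doubling policy with an assumed initial capacity of 1; real containers use different growth factors and starting sizes, and the function name is hypothetical.

```python
def doubling_copy_cost(n_appends: int, initial_capacity: int = 1) -> tuple:
    """Simulate n appends into a doubling array.

    Returns (total elements copied during resizes, largest single-resize copy).
    """
    capacity = initial_capacity
    size = 0
    total_copies = 0
    worst_single = 0
    for _ in range(n_appends):
        if size == capacity:             # full: allocate double, copy everything
            total_copies += size
            worst_single = max(worst_single, size)
            capacity *= 2
        size += 1
    return total_copies, worst_single


total, worst = doubling_copy_cost(1000)
print(total, worst)        # 1023 512
print(total / 1000)        # 1.023 copies per append on average
```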

Throughput and service time

If one worker completes one operation in average service time $S_t$, ideal throughput is:

\displaystyle X=\frac{1}{S_t}

With $c$ identical independent workers, ideal throughput is:

\displaystyle X_{ideal}=\frac{c}{S_t}

Actual throughput is lower when there is contention, synchronization, serial work, I/O, retries, cache interference, or uneven load balancing.

If work has a serial fraction $s$ and a parallel fraction $1-s$, Amdahl’s Law gives ideal speedup on $p$ processors:

\displaystyle Speedup(p)=\frac{1}{s+\frac{1-s}{p}}

This shows why serial bottlenecks can dominate even when many processors are available.
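
The throughput and Amdahl formulas can be sketched as two small functions. The input values (2 ms service time, 8 workers, 10% serial fraction) are assumed for illustration, and the function names are hypothetical.

```python
def ideal_throughput(service_time_s: float, workers: int = 1) -> float:
    """X_ideal = c / S_t, in operations per second."""
    return workers / service_time_s


def amdahl_speedup(serial_fraction: float, processors: int) -> float:
    """Speedup(p) = 1 / (s + (1 - s) / p)."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / processors)


print(round(ideal_throughput(0.002, workers=8), 2))   # 4000.0
for p in (1, 4, 16, 1024):
    print(p, round(amdahl_speedup(0.1, p), 2))
# 1 1.0
# 4 3.08
# 16 6.4
# 1024 9.91
```

With a 10% serial fraction the speedup can never exceed $1/s = 10$, no matter how many processors are added.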

Queueing checks

Little’s Law:

L=\lambda W

where $L$ is average number in the system, $\lambda$ is arrival rate or throughput, and $W$ is average time in the system.

For a simple service model with $c$ parallel servers and service rate $\mu$ per server, utilization is:

\displaystyle \rho=\frac{\lambda}{c\mu}

A stable queue requires:

\rho<1

As $\rho$ approaches 1, waiting time can rise sharply. Capacity planning should account for burstiness, variability, priority rules, backpressure, and tail latency, not only average utilization.
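
Little's Law and the utilization check reduce to one-line calculations. The numbers below (200 requests per second, 50 ms in the system, 4 servers at 60 requests per second each) are assumed for illustration, and the function names are hypothetical.

```python
def little_l(arrival_rate: float, time_in_system_s: float) -> float:
    """L = lambda * W: average number of items in the system."""
    return arrival_rate * time_in_system_s


def utilization(arrival_rate: float, servers: int, service_rate: float) -> float:
    """rho = lambda / (c * mu)."""
    return arrival_rate / (servers * service_rate)


def is_stable(arrival_rate: float, servers: int, service_rate: float) -> bool:
    """A stable queue requires rho < 1."""
    return utilization(arrival_rate, servers, service_rate) < 1.0


print(round(little_l(200, 0.05), 3))        # 10.0 requests in flight on average
print(round(utilization(200, 4, 60), 3))    # 0.833
print(is_stable(200, 4, 60))                # True
print(is_stable(250, 4, 60))                # False (250 / 240 > 1)
```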

Latency decomposition

End-to-end latency can be estimated as:

T_{end}=T_{queue}+T_{service}+T_{network}+T_{serialization}+T_{storage}+T_{render}+T_{retry}

Only include terms that apply to the system under review. The important point is to define start and stop events consistently.

Average latency:

\displaystyle \bar{T}=\frac{1}{n}\sum_{i=1}^{n}T_i

Maximum observed latency:

T_{max}=\max(T_1,T_2,\ldots,T_n)

Percentile latency $P_q$ is the value at or below which $q$ percent of observations fall. For interactive and real-time systems, $P95$, $P99$, and deadline misses are often more informative than the mean.
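
The gap between mean and tail latency shows up even in a tiny sample. The sketch below uses the nearest-rank percentile definition, which is one of several common conventions and is an assumption here; the sample values and function name are hypothetical.

```python
import math


def percentile_nearest_rank(samples: list, q: float) -> float:
    """Return the smallest sample such that at least q percent are at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < q <= 100:
        raise ValueError("q must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(q * len(ordered) / 100)   # 1-based rank
    return ordered[rank - 1]


# Hypothetical latencies in milliseconds, with one slow outlier.
latencies_ms = [12, 15, 11, 14, 13, 250, 12, 16, 13, 14]

print(sum(latencies_ms) / len(latencies_ms))        # 37.0  (mean)
print(max(latencies_ms))                            # 250   (maximum)
print(percentile_nearest_rank(latencies_ms, 50))    # 13    (median)
print(percentile_nearest_rank(latencies_ms, 95))    # 250   (P95)
```

One outlier in ten pulls the mean to 37 ms although the median request took 13 ms, which is why the mean alone describes neither the typical nor the worst request.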

Bandwidth and transfer time

Transfer time for payload size $B$ over effective bandwidth $R$:

\displaystyle T_{transfer}=\frac{B}{R}

If a fixed setup latency $T_0$ is present:

\displaystyle T_{total}=T_0+\frac{B}{R}

This distinction matters for small messages. A high-bandwidth link can still perform poorly for small requests if fixed latency dominates.

For a data bus with width $w$ bits and clock rate $f$, ideal raw bandwidth in bits per second, assuming one transfer per clock cycle, is:

R_{raw}=wf

Effective bandwidth is lower after protocol overhead, idle cycles, arbitration, encoding, retries, and contention:

R_{eff}=R_{raw}\eta

where $\eta$ is efficiency between 0 and 1.
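
The effect of fixed latency on small messages is easy to see numerically. In this sketch the link speed (1 GB/s), setup latency (100 microseconds), bus parameters, and efficiency of 0.8 are assumed example values, and the function names are hypothetical.

```python
def transfer_time_s(payload_bytes: float, bandwidth_bytes_per_s: float,
                    setup_latency_s: float = 0.0) -> float:
    """T_total = T_0 + B / R."""
    return setup_latency_s + payload_bytes / bandwidth_bytes_per_s


def effective_bandwidth_bits_per_s(width_bits: int, clock_hz: float,
                                   efficiency: float) -> float:
    """R_eff = w * f * eta, assuming one transfer per clock cycle."""
    return width_bits * clock_hz * efficiency


small = transfer_time_s(1_000, 1e9, setup_latency_s=100e-6)          # 1 KB message
large = transfer_time_s(1_000_000_000, 1e9, setup_latency_s=100e-6)  # 1 GB message

print(f"{small * 1e6:.1f} us")   # 101.0 us, almost all of it setup latency
print(f"{large:.4f} s")          # 1.0001 s, almost all of it transfer time
print(f"{effective_bandwidth_bits_per_s(64, 1e9, 0.8) / 1e9:.1f} Gbit/s")  # 51.2 Gbit/s
```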

Reliability and failure probability

If an operation has independent failure probability $p$ and is attempted $n$ times, the expected number of failures is:

E[f]=np

Probability of no failures across $n$ independent attempts:

P_0=(1-p)^n

Probability of at least one failure:

P_{\geq 1}=1-(1-p)^n

These simplified equations are useful for order-of-magnitude checks, but real systems often have correlated failures, retries, overload coupling, and common-cause faults.
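
A short calculation shows how quickly small per-attempt failure rates add up. The values $p = 0.001$ and $n = 1000$ are assumed for illustration, the function names are hypothetical, and the sketch relies on the same independence assumption as the formulas above.

```python
def expected_failures(p: float, n: int) -> float:
    """E[f] = n * p."""
    return n * p


def prob_no_failures(p: float, n: int) -> float:
    """P_0 = (1 - p) ** n, assuming independent attempts."""
    return (1.0 - p) ** n


def prob_at_least_one_failure(p: float, n: int) -> float:
    """P_{>=1} = 1 - (1 - p) ** n."""
    return 1.0 - prob_no_failures(p, n)


p, n = 0.001, 1000
print(round(expected_failures(p, n), 3))            # 1.0
print(round(prob_no_failures(p, n), 3))             # 0.368
print(round(prob_at_least_one_failure(p, n), 3))    # 0.632
```

A one-in-a-thousand failure rate over a thousand attempts gives roughly a 63% chance of seeing at least one failure.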

Benchmark interpretation

Speedup from an optimization:

\displaystyle Speedup=\frac{T_{old}}{T_{new}}

Percentage time reduction:

\displaystyle Reduction=\frac{T_{old}-T_{new}}{T_{old}}\times 100\%

Operations per second:

\displaystyle OPS=\frac{N_{ops}}{T}

Memory bandwidth used:

\displaystyle R_m=\frac{B_{read}+B_{written}}{T}

For a benchmark to be meaningful, report hardware, compiler or runtime version, input size, input distribution, warm-up method, sample count, measurement overhead, memory use, and latency distribution.
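
The four benchmark ratios can be computed from raw timings as follows. The timings and byte counts are assumed example values, not measurements, and the function names are hypothetical.

```python
def speedup(t_old: float, t_new: float) -> float:
    """Speedup = T_old / T_new."""
    return t_old / t_new


def time_reduction_percent(t_old: float, t_new: float) -> float:
    """Reduction = (T_old - T_new) / T_old * 100%."""
    return (t_old - t_new) / t_old * 100.0


def ops_per_second(n_ops: int, elapsed_s: float) -> float:
    """OPS = N_ops / T."""
    return n_ops / elapsed_s


def memory_bandwidth(bytes_read: float, bytes_written: float, elapsed_s: float) -> float:
    """R_m = (B_read + B_written) / T, in bytes per second."""
    return (bytes_read + bytes_written) / elapsed_s


print(speedup(120.0, 80.0))                             # 1.5
print(round(time_reduction_percent(120.0, 80.0), 2))    # 33.33
print(ops_per_second(1_000_000, 2.0))                   # 500000.0
print(memory_bandwidth(4e9, 2e9, 2.0) / 1e9)            # 3.0 (GB/s)
```

Note that a 1.5x speedup corresponds to a 33% time reduction, not 50%; the two figures are easy to confuse when reporting results.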

Practical checklist

Use these formulas with a short engineering checklist:

Define $n$, workload mix, and input distribution.
Estimate asymptotic growth and memory footprint.
Check worst-case or adversarial behaviour.
Check latency percentiles, not just average time.
Compare theory with measurements on representative data.
Review allocation, locality, synchronization, and I/O.
Document invariants and failure behaviour.

A formula sheet can narrow the design space quickly. It cannot decide whether a structure is appropriate unless the workload, hardware, and operational constraints are clear.
