Hidden Variables Formula

The Formula

P(A \mid B) \neq P(A \mid B, C) when hidden variable C confounds the relationship

When to use: What's lurking behind the scenes that we forgot to account for?

Quick Example

Price of gas affects demandβ€”but so does income, weather, and alternatives (hidden).

Notation

C denotes a confounding (hidden) variable; P(A \mid B, C) conditions on it to reveal the true relationship

What This Formula Means

Quantities or factors that influence a mathematical or real-world situation but are not explicitly included in the current model or expression.

What's lurking behind the scenes that we forgot to account for?

Formal View

C confounds A and B if P(A \mid B) \neq P(A \mid B, C); Simpson's paradox: the sign of association between A and B can reverse when conditioning on C

Worked Examples

Example 1

easy
A formula gives a car's stopping distance as d = 0.044v^2 + 0.75v (metres, km/h). Identify the hidden variables that this formula ignores.

Solution

  1. 1
    Hidden variable 1: road surface condition (dry, wet, icy) β€” friction coefficient varies greatly.
  2. 2
    Hidden variable 2: tyre quality and pressure β€” affects grip.
  3. 3
    Hidden variable 3: driver reaction time variation β€” the 0.75v term assumes a fixed reaction time.
  4. 4
    Hidden variable 4: slope of the road β€” braking downhill vs uphill is very different.
  5. 5
    The formula is a simplified model valid only under assumed conditions.

Answer

\text{Hidden: road friction, tyre condition, variable reaction time, road slope}
Hidden variables are factors that influence the output but are not explicitly represented in the model. Identifying them reveals the model's limitations and conditions of validity.

Example 2

medium
The correlation between shoe size and reading ability in children appears strong. Identify the hidden variable and explain why correlation does not imply causation here.

Common Mistakes

  • Concluding causation from correlation without considering hidden variables β€” ice cream sales and drownings both rise in summer due to heat, not each other
  • Failing to include a relevant variable in a model and then getting misleading results
  • Assuming that because a hidden variable is hard to measure, it can be ignored β€” unmeasured does not mean unimportant

Why This Formula Matters

Many modeling errors come from hidden variables β€” two quantities appear correlated because a third hidden variable drives both of them.

Frequently Asked Questions

What is the Hidden Variables formula?

Quantities or factors that influence a mathematical or real-world situation but are not explicitly included in the current model or expression.

How do you use the Hidden Variables formula?

What's lurking behind the scenes that we forgot to account for?

What do the symbols mean in the Hidden Variables formula?

C denotes a confounding (hidden) variable; P(A \mid B, C) conditions on it to reveal the true relationship

Why is the Hidden Variables formula important in Math?

Many modeling errors come from hidden variables β€” two quantities appear correlated because a third hidden variable drives both of them.

What do students get wrong about Hidden Variables?

Correlation without causation often signals hidden variables.

What should I learn before the Hidden Variables formula?

Before studying the Hidden Variables formula, you should understand: modeling.