Math · Statistics & Probability · Grade 9-12 · 5 min read

Conditional Probability

⚡ In one breath

Conditional probability P(AB)P(A|B) is the chance of AA given that BB has occurred — you divide the overlap by the size of the known condition.

📐 The formula

P(A|B) = \frac{P(A
Venn diagram of Event A and Event BA two-event view of conditional probability.ABA ∩ B
A: Event A
B: Event B
A two-event view of conditional probability.

Orient

The one-line idea, why it matters, and the intuition.

Section 1

Quick Answer

Conditional probability P(AB)P(A|B) is the chance of AA given that BB has occurred — you divide the overlap by the size of the known condition. Use it whenever new information narrows the situation, like 'given the card is red, what's the chance it's a heart?' The cue is the word 'given' or 'if we know.' Before calculating, ask: Has some information already been revealed that shrinks the set of possible outcomes?

Section 2

Why This Matters

Conditional probability is how reasoning updates with evidence — it powers medical test interpretation, Bayes' rule, and the formal definition of independence (P(AB)=P(A)P(A|B)=P(A)). Students who forget that the denominator becomes P(B)P(B) instead of 1 misread risk and overcount. Recognizing it by "Has some information already been revealed that shrinks the set of possible outcomes?" — rather than by familiar numbers — is what lets a student tell it apart from joint probability and independent events and reversed conditional p(ba)p(b|a) in a mixed problem set.

Section 3

Intuitive Explanation

A standard deck: 'given the drawn card is red, what's the chance it's a heart?' — your world shrinks to the 26 red cards, of which 13 are hearts, so 1326=12\frac{13}{26}=\frac{1}{2}, not 1352\frac{13}{52}. This is the clean version of the idea because the visible structure matches the concept before any formula or procedure is chosen.

Do not confuse P(AB)P(A|B) with P(BA)P(B|A) — 'chance of disease given a positive test' and 'chance of a positive test given disease' are different numbers, and swapping them is the classic base-rate error. That contrast matters because many wrong answers come from recognizing a surface feature, such as a familiar number or word, instead of the actual task.

A useful way to slow down is to name the signal words and then test them. Words like **given that**, **if we know**, **among those who**, **conditional on**, **now that B happened** are helpful clues, but they are not enough by themselves. They must point to the same structure as the mental model: P(AB)P(A|B) is the chance of AA once you restrict attention to only the cases where BB already happened.

The recognition test is simple: Has some information already been revealed that shrinks the set of possible outcomes? If yes, conditional probability is probably the right tool; if not, compare with Joint probability or Independent events or Reversed conditional P(BA)P(B|A) before calculating.

Core idea

P(AB)P(A|B) is the chance of AA once you restrict attention to only the cases where BB already happened.

Recognize

The cues that signal this concept and how to distinguish it from look-alikes.

Section 4

When to Use

Use Conditional Probability when new information about one event narrows the possibilities and you want the updated probability of another. Strong signals include **given that**, **if we know**, **among those who**, **conditional on**, **now that B happened**. The safest workflow is to read the final question first, identify what kind of answer it wants, and then test the structure. Do not use conditional probability just because familiar numbers appear; first decide whether the situation answers "Has some information already been revealed that shrinks the set of possible outcomes?" with yes.

✨ Pro tip

Ask: Has some information already been revealed that shrinks the set of possible outcomes?

Section 5

How to Recognize It

Before using Conditional Probability, check the structure of the problem, not just the vocabulary. These questions force the same recognition move from several angles: the task, the signal words, the nearest confusion, and the thing that would make the concept fail.

  1. Has some information already been revealed that shrinks the set of possible outcomes?

    If yes, the problem matches conditional probability. If no, pause before applying the procedure, because the same numbers may belong to a different idea.

  2. Which words signal the structure?

    Look for given that, if we know, among those who, conditional on. These words are useful only after the situation matches them; a keyword without structure is not proof.

  3. What is the nearest confusion?

    Joint probability is the common trap here: The chance both AA and BB happen, out of the full sample space. Compare the desired final answer before choosing a method.

  4. What answer form should I expect?

    The answer should fit this mental model: P(AB)P(A|B) is the chance of AA once you restrict attention to only the cases where BB already happened. If the expected answer sounds more like joint probability, use the comparison table before solving.

  5. What would make this NOT Conditional Probability?

    Do not confuse P(AB)P(A|B) with P(BA)P(B|A) — 'chance of disease given a positive test' and 'chance of a positive test given disease' are different numbers, and swapping them is the classic base-rate error. This tells you when to switch tools instead of forcing the concept.

Section 6

Conditional Probability vs Common Confusions

The hard part is recognizing when the task is really about conditional probability instead of a nearby idea. Read the final answer the problem wants, then ask which row describes the structure before you start calculating.

Conditional Probability

Meaning
Use this when new information about one event narrows the possibilities and you want the updated probability of another. The deciding question is: Has some information already been revealed that shrinks the set of possible outcomes?
Key test
Has some information already been revealed that shrinks the set of possible outcomes?
Formula
P(AB)=P(A and B)P(B)P(A|B) = \frac{P(A \text{ and } B)}{P(B)}
Example
From a standard deck, a card is drawn and seen to be red. What is the probability it is a heart?

Joint probability

Meaning
The chance both AA and BB happen, out of the full sample space.
Key test
Use when nothing is assumed yet and you want both to occur.
Formula
P(AB)P(A\cap B)
Example
Chance a card is red AND a heart from all 52

Independent events

Meaning
A special case where P(AB)=P(A)P(A|B)=P(A) — the condition changes nothing.
Key test
Use when the condition has no effect; then conditioning is pointless.
Formula
P(AB)=P(A)P(B)P(A\cap B)=P(A)P(B)
Example
Coin flip given a die roll

Reversed conditional $P(B|A)$

Meaning
The condition and event are swapped, generally a different value.
Key test
Use when the given information is the other event.
Formula
P(BA)=P(AB)P(A)P(B|A)=\frac{P(A\cap B)}{P(A)}
Example
Test-positive given disease vs disease given test-positive

Apply

Worked examples and the mistakes most students make.

Section 7

Formula & Notation

P(AB)=P(A and B)P(B)P(A|B) = \frac{P(A \text{ and } B)}{P(B)}
P(AB)=P(AB)P(B)P(A|B) = \frac{P(A \cap B)}{P(B)} where P(B)>0P(B) > 0

How to read it: P(AB)P(A|B) reads 'probability of AA given BB'; the vertical bar means 'given that'

Section 8

Worked Examples

Example 1 — Heart given red

Easy

Problem

From a standard deck, a card is drawn and seen to be red. What is the probability it is a heart?

Solution

  1. Information ('it is red') has narrowed the world to red cards only.

    Name the structure before touching arithmetic — that is what makes the right method obvious.

  2. Ask the recognition question: Has some information already been revealed that shrinks the set of possible outcomes?

    If the answer is yes, the concept applies; the cue, not a keyword, decides the method.

  3. Divide hearts by the known condition (red cards), not by all 52.

    The rule is chosen only after the structure matches, so the steps mean something.

  4. P(heartred)=1326P(\text{heart}|\text{red})=\frac{13}{26}.

    Keep units, shape, or answer form tied to the story so the work does not become symbol pushing.

  5. Check the answer against the original question.

    It should fit the mental model — probability inside a shrunk world. If it does not, revisit the recognition step before changing the arithmetic.

Answer

12\frac{1}{2}

Takeaway: Conditioning shrinks the denominator to the known event.

Example 2 — Joint, not conditional

Standard

Problem

From a full deck, what is the probability a card is red AND a heart, before seeing anything?

Solution

  1. Notice why this looks like the same concept.

    Nearby language or numbers can tempt you toward probability inside a shrunk world.

  2. Nothing has been revealed, so the world is still all 52 cards.

    Spotting what actually changed is what separates this from the concept it resembles.

  3. Use joint probability over the full deck, not a conditional.

    The nearby idea may share numbers but answers a different question, so it needs a different move.

  4. State the result in the language of the actual task.

    1352=14\frac{13}{52}=\frac{1}{4}. Name it for what the problem really asked, not the concept you first expected.

  5. Say the contrast in one sentence.

    No 'given' means joint probability over the whole sample space, not conditional.

Answer

1352=14\frac{13}{52}=\frac{1}{4}

Takeaway: No 'given' means joint probability over the whole sample space, not conditional.

Example 3 — Spot the trap: Probability inside a shrunk world

Application

Problem

A student starts with this idea: "Dividing by the whole sample space" What should they check before accepting that reasoning?

Solution

  1. Pause before the first move.

    The first move is a decision, not a calculation — does the situation really match probability inside a shrunk world.

  2. Run the recognition test: Has some information already been revealed that shrinks the set of possible outcomes?

    This is the single check that the trap skips.

  3. divide by P(B)P(B), the known condition, which shrinks the denominator.

    Stating the safer rule turns the mistake into a checkable step instead of a vague "be careful."

  4. Compare with the nearest confusion, Joint probability.

    The chance both AA and BB happen, out of the full sample space.

  5. State the corrected decision and reuse it.

    Using the concept only when the structure matches leaves a process the student can repeat on a new problem.

Answer

divide by P(B)P(B), the known condition, which shrinks the denominator.

Takeaway: The recognition step prevents the common trap: Dividing by the whole sample space

Section 9

Common Mistakes

Common slip-up

Dividing by the whole sample space

The right idea

divide by P(B)P(B), the known condition, which shrinks the denominator.

Common slip-up

Swapping P(AB)P(A|B) and P(BA)P(B|A)

The right idea

read carefully which event is the 'given.'

Common slip-up

Assuming independence to skip the formula

The right idea

only set P(AB)=P(A)P(A|B)=P(A) if you have verified the events are independent.

Practice

Try it, then see where this concept fits in the path.

Section 10

Mini Practice

Try these on your own. Tap Reveal when you want to check.

  1. What clue tells you this is a Conditional Probability situation: From a standard deck, a card is drawn and seen to be red. What is the probability it is a heart?

    Hint: Has some information already been revealed that shrinks the set of possible outcomes?

  2. From a standard deck, a card is drawn and seen to be red. What is the probability it is a heart?

    Hint: Divide hearts by the known condition (red cards), not by all 52.

  3. Why is this a contrast case instead of Conditional Probability: From a full deck, what is the probability a card is red AND a heart, before seeing anything?

    Hint: Nothing has been revealed, so the world is still all 52 cards.

  4. Fix this thinking: Dividing by the whole sample space

    Hint: Name the recognition cue before choosing a rule.

  5. Which is the better fit here: Conditional Probability or Joint probability? Explain the deciding difference.

    Hint: For Conditional Probability, ask: Has some information already been revealed that shrinks the set of possible outcomes?

  6. Write one sentence that would remind a classmate how to recognize Conditional Probability.

    Hint: Use the mental model "Probability inside a shrunk world." and one signal word.

Want the full set?

50 practice questions for this concept — free to try, every one with a complete worked solution showing the why, not just the answer.

Section 11

Frequently Asked Questions

How do I know when to use Conditional Probability?

Use Conditional Probability when new information about one event narrows the possibilities and you want the updated probability of another. Do not start from the numbers alone; first name the structure of the situation. The fastest check is: Has some information already been revealed that shrinks the set of possible outcomes? If the answer is yes and the wording matches cues like given that, if we know, among those who, then conditional probability is probably the right tool.

What is Conditional Probability most often confused with?

Conditional Probability is often confused with Joint probability. Joint probability means The chance both AA and BB happen, out of the full sample space. The difference is not just vocabulary; it changes the action you take. For conditional probability, the key test is "Has some information already been revealed that shrinks the set of possible outcomes?" For joint probability, the better cue is: Use when nothing is assumed yet and you want both to occur.

What is the fastest recognition cue for Conditional Probability?

Look for given that, if we know, among those who, conditional on, but treat those words as clues, not proof. A word problem can contain a familiar keyword and still ask for a different idea. After noticing the cue, ask the recognition question: Has some information already been revealed that shrinks the set of possible outcomes? That question protects you from using a memorized procedure in the wrong place.

What mistake should I avoid with Conditional Probability?

Avoid this thinking: "Dividing by the whole sample space" That mistake usually happens when the student jumps to a rule before checking the situation. The safer version is: divide by P(B)P(B), the known condition, which shrinks the denominator. A good habit is to say the mental model out loud first: "Probability inside a shrunk world." Then choose the calculation or representation.

How can I tell this apart from Independent events?

Independent events is the better fit when the task is about this: A special case where P(AB)=P(A)P(A|B)=P(A) — the condition changes nothing. Conditional Probability is the better fit when new information about one event narrows the possibilities and you want the updated probability of another. If both ideas seem possible, compare what the problem wants as the final answer. The desired output often reveals whether you should use conditional probability or switch to the nearby concept.

Why does Conditional Probability matter?

Conditional probability is how reasoning updates with evidence — it powers medical test interpretation, Bayes' rule, and the formal definition of independence (P(AB)=P(A)P(A|B)=P(A)). Students who forget that the denominator becomes P(B)P(B) instead of 1 misread risk and overcount. The practical value is recognition: once you can spot conditional probability, you can choose a method before calculating. That makes later topics easier because you are not memorizing isolated tricks; you are recognizing the same structure when it appears in a new representation.

Section 12

Learning Path

Conditional Probability

You are here

Next →

You're at the end!
Before this, students should be comfortable with Probability and Independent Events. This page focuses on the recognition cue: Has some information already been revealed that shrinks the set of possible outcomes? That cue is the bridge between earlier skills and later problem solving: students first learn to identify the structure, then they learn which calculation, diagram, graph, or proof move belongs to it. After this, students can use conditional probability as a tool in larger problems.

Section 13

See Also