Marginal Distributions- Understanding Joint Probability Tables

What Marginal Distributions Actually Are

Marginal distributions are the probabilities you get when you ignore some variables in a joint probability table. That's it. No fancy definitions needed.

When you have data on multiple variables at once—like height and weight, or age and income—you can "sum out" the variables you don't care about. What remains is a marginal distribution.

Statisticians call it "marginal" because in old-school probability tables, these sums were written in the margins of the table. The name stuck.

The Foundation: Joint Probability Tables

Before you can understand marginal distributions, you need a joint probability table. This is just a table showing the probability of two (or more) things happening together.

Here's a basic example with weather and traffic:

Heavy Traffic Light Traffic Total (Marginal)
Rainy 0.15 0.10 0.25
Sunny 0.20 0.55 0.75
Total (Marginal) 0.35 0.65 1.00

The cells inside the table show joint probabilities—probabilities of two things happening together. The cells on the edges (where totals are) show marginal probabilities.

How to Calculate Marginal Distributions

You calculate marginal distributions by summing across rows or summing down columns. That's the whole process.

The Row Sum (Weather Distribution)

To find the marginal distribution of weather:

P(Rainy) = 0.15 + 0.10 = 0.25

P(Sunny) = 0.20 + 0.55 = 0.75

You're summing out the traffic variable. What you get is the probability distribution of weather, ignoring traffic entirely.

The Column Sum (Traffic Distribution)

To find the marginal distribution of traffic:

P(Heavy Traffic) = 0.15 + 0.20 = 0.35

P(Light Traffic) = 0.10 + 0.55 = 0.65

You're summing out the weather variable. Now you have the probability distribution of traffic, ignoring weather.

Why This Matters

Marginal distributions let you answer questions like:

Any time you want to know the probability of one thing without caring about another variable, you're working with marginal distributions.

Marginal vs. Conditional vs. Joint

People confuse these three constantly. Here's the difference:

Type What It Tells You Example from Table Above
Joint Two things together P(Rainy AND Heavy Traffic) = 0.15
Marginal One thing, ignoring others P(Rainy) = 0.25
Conditional One thing, given another P(Heavy Traffic | Rainy) = 0.15/0.25 = 0.60

Notice the conditional probability uses the marginal. That's a clue that marginals are foundational—they're often the denominator in Bayes' theorem calculations.

Getting Started: Calculate Marginals in 3 Steps

Here's how to extract marginal distributions from any joint probability table:

  1. Identify your joint table — Make sure all cells sum to 1.00. If they don't, you have a probability table problem, not a marginal problem.
  2. Choose your variable — Decide which variable you want the marginal distribution for. If you want Weather, sum across columns. If you want Traffic, sum down rows.
  3. Add the probabilities — Sum all joint probabilities for that variable. Write the result in the margin. Repeat for each outcome of your variable.

That's literally all there is to it. No calculus. No complicated formulas. Just addition.

Common Mistakes

Mixing up rows and columns: If you're asked for the marginal of the variable in rows, sum the columns. If asked for the marginal of the variable in columns, sum the rows. Always sum over the variable you don't want.

Forgetting to normalize: Your marginal probabilities should always sum to 1. If they don't, you made an arithmetic error. Go back and check your additions.

Confusing marginal with conditional: P(Rainy) = 0.25 is marginal. P(Rainy | Heavy Traffic) = 0.15/0.35 = 0.43 is conditional. Different things.

When You'll Use This

Marginal distributions show up in:

Any multivariate problem eventually requires extracting single-variable distributions. That's marginal distributions doing the work.