Standard Deviation and Variance- Statistical Measures Explained

What Are Variance and Standard Deviation?

These two measures tell you how spread out your data is. That's it. If your data points cluster tightly around the mean, both values are small. If they're scattered far and wide, both values are large.

Variance is the average of the squared differences from the mean. Standard deviation is the square root of variance. They measure the same thing—spread—just in different units.

Why Should You Care?

Without these measures, you're flying blind. You might know the average temperature in two cities is 70°F, but one city swings from 30°F to 110°F while the other stays between 65°F and 75°F. The average tells you nothing about risk, consistency, or volatility.

Standard deviation and variance show up everywhere:

Population vs. Sample: The Critical Distinction

Before calculating anything, ask yourself: do you have all possible data points or just a sample?

Population variance/standard deviation: You have every single data point. Divide by N.

Sample variance/standard deviation: You have a subset. Divide by N-1. This correction (Bessel's correction) gives you a better estimate of the true population spread from limited data.

How to Calculate Variance

Step by Step

  1. Find the mean (average) of all data points
  2. Subtract the mean from each data point to get deviations
  3. Squaring each deviation eliminates negative values
  4. Average the squared deviations by dividing by N (population) or N-1 (sample)

The Formulas

Population Variance:

σ² = Σ(xᵢ - μ)² / N

Sample Variance:

s² = Σ(xᵢ - x̄)² / (n - 1)

How to Calculate Standard Deviation

Standard deviation is just the square root of variance. That's the whole calculation.

Population SD: σ = √[Σ(xᵢ - μ)² / N]

Sample SD: s = √[Σ(xᵢ - x̄)² / (n - 1)]

The squaring and then square-rooting might seem redundant, but it serves a purpose: squaring gives more weight to points far from the mean. A data point twice as far from the mean contributes four times as much to the variance.

Worked Example

Data set: 2, 4, 4, 4, 5, 5, 7, 9

Step 1: Mean = (2 + 4 + 4 + 4 + 5 + 5 + 7 + 9) / 8 = 40 / 8 = 5

Step 2: Deviations from mean: -3, -1, -1, -1, 0, 0, 2, 4

Step 3: Squared deviations: 9, 1, 1, 1, 0, 0, 4, 16

Step 4: Sum of squared deviations = 32

Step 5: Variance = 32 / 8 = 4 (population) or 32 / 7 ≈ 4.57 (sample)

Step 6: Standard deviation = √4 = 2 (population) or √4.57 ≈ 2.14 (sample)

Variance vs. Standard Deviation: Direct Comparison

FeatureVarianceStandard Deviation
Formulaσ² or s²σ or s (square root of variance)
UnitsOriginal units squaredSame as original data
InterpretabilityHarder to interpret directlyEasy to interpret in context
Use caseTheoretical calculations, algorithmsReporting results, real-world context
Sensitivity to outliersHigh (due to squaring)High (inherited from variance)

When to Use Which

Use standard deviation when:

Use variance when:

Common Mistakes

Mixing up population and sample formulas. Using N instead of N-1 when you have a sample makes your standard deviation underestimate the true population spread.

Ignoring units. Variance gives you squared units. If you're measuring height in inches, variance is in square inches—a meaningless unit for interpretation.

Forgetting that standard deviation can't be negative. If you calculate a negative SD, something went wrong in your math.

Assuming low SD means all data points are close to the mean. This is usually true, but skewed distributions can fool you. Always visualize your data.

Quick Reference: Getting Started

Need to calculate this right now? Here's your cheat sheet:

For most practical work, standard deviation is what you want. It gives you an interpretable number in your original units. Variance is mostly useful when you're doing the math—it's the intermediate step that makes other calculations work.