Normal Distribution Standard Deviation- Statistical Guide
What Normal Distribution Actually Is
Normal distribution is a probability distribution where data clusters around a mean. Most values sit near the center. Fewer values appear as you move away from the middle. It looks like a bell curve.
This shape isn't arbitrary. It appears constantly in nature, test scores, measurements, and errors. Height, IQ scores, blood pressure readings—they all tend toward this pattern.
The curve is symmetric. Flip it left to right and you get the same shape. The mean, median, and mode all sit at the same point: dead center of the curve.
Standard Deviation: The Distance From Average
Standard deviation measures how spread out your data is. A small standard deviation means numbers cluster close to the mean. A large standard deviation means they're scattered far from it.
It's calculated in units of your original data. If you're measuring height in inches, the standard deviation is in inches. If you're measuring test scores, it's in points.
The symbol for standard deviation is σ (sigma) for a population or s for a sample.
The 68-95-99.7 Rule: Your Quick Reference
For any normal distribution:
- About 68% of data falls within one standard deviation of the mean
- About 95% of data falls within two standard deviations
- About 99.7% of data falls within three standard deviations
This rule works for every normal distribution. No matter the mean or standard deviation, the percentages stay the same. This is why these concepts pair so well together.
Why This Matters
You can predict probabilities without calculating integrals. If a dataset is normally distributed with a mean of 100 and standard deviation of 15, you know roughly 68% of scores fall between 85 and 115.
You can identify outliers. Values beyond three standard deviations from the mean are rare—less than 0.3% of your data. If you see them, something unusual is happening.
You can compare different datasets. A test with a mean of 500 and standard deviation of 100 isn't automatically "harder" than one with a mean of 20 and standard deviation of 5. Standard deviation lets you make fair comparisons.
Reading the Curve: Z-Scores
A z-score tells you how many standard deviations a value sits from the mean. The formula is simple:
z = (x - μ) / σ
Where x is the value, μ is the mean, and σ is the standard deviation.
A z-score of 0 means the value equals the mean. A z-score of 1 means it's one standard deviation above the mean. A z-score of -2 means it's two standard deviations below.
Z-scores let you compare values from different normal distributions. A score of 130 on an IQ test (mean 100, SD 15) has a z-score of 2. A weight of 180 pounds in a population with mean 150 and SD 20 also has a z-score of 1.5. Now you can see which is more extreme relative to its distribution.
Standard Deviation vs Variance
Variance is the square of standard deviation. If standard deviation is 10, variance is 100. That's it.
Variance shows up more in statistical formulas because it has nicer mathematical properties. Standard deviation shows up more in reports because it's in the same units as your data.
Both measure the same thing—spread. Pick whichever makes your math easier or your audience's understanding clearer.
Common Mistakes People Make
Assuming everything is normally distributed. Real data often isn't. Income, website traffic, and population sizes are frequently skewed. Check your data before applying these rules.
Forgetting that 95% doesn't include all the remaining data. About 2.5% sits above two standard deviations, and 2.5% sits below. These extremes matter in hypothesis testing.
Confusing population and sample standard deviations. The formula differs slightly. Sample standard deviation uses n-1 in the denominator (Bessel's correction) to correct for estimation bias. Population standard deviation uses n.
Quick Reference: Key Percentiles
| Standard Deviations from Mean | Percent of Data Below | Percent of Data Above |
|---|---|---|
| -3σ | 0.13% | 99.87% |
| -2σ | 2.28% | 97.72% |
| -1σ | 15.87% | 84.13% |
| 0 (mean) | 50% | 50% |
| +1σ | 84.13% | 15.87% |
| +2σ | 97.72% | 2.28% |
| +3σ | 99.87% | 0.13% |
How to Calculate Standard Deviation: Step by Step
Here's the process for a sample:
- Find the mean. Add all values and divide by the number of values.
- Subtract the mean from each value. This gives you deviations from the mean.
- Square each deviation. Negative numbers become positive.
- Add all squared deviations together.
- Divide by (n-1) for a sample, or n for a population.
- Take the square root.
Example: Data set is 2, 4, 6, 8, 10
Mean = 30 / 5 = 6
Deviations: -4, -2, 0, 2, 4
Squared: 16, 4, 0, 4, 16
Sum = 40
Variance = 40 / 4 = 10 (sample) or 40 / 5 = 8 (population)
Standard deviation = √10 ≈ 3.16 (sample) or √8 ≈ 2.83 (population)
When to Use This
Normal distribution and standard deviation work together when:
- You're describing data that clusters around an average
- You need to find probabilities for ranges of values
- You're comparing scores from different tests or populations
- You want to spot values that don't fit the pattern
- You're running statistical tests that assume normality
They don't work when your data is heavily skewed, has multiple peaks, or comes from a fundamentally different distribution. Always visualize your data first.
The Bottom Line
Normal distribution gives you a shape. Standard deviation gives you a ruler. Together, they let you make precise statements about where data points sit and how likely they are.
Most people overthink this. The math is straightforward. The hard part is knowing when the assumptions apply and when they don't. Check your data. If it looks normal, use these tools. If it doesn't, find different ones.