Statistics Problems- Solutions and Examples
Statistics Problems: The Complete Breakdown
You're stuck on a statistics problem. Maybe it's a probability question that makes no sense, or a hypothesis test you can't figure out. This guide cuts through the confusion and gives you actual solutions with working examples.
No motivational garbage. Just the math, explained plainly.
Most Common Statistics Problems Students Face
These are the problem types that wreck people most often:
- Finding mean, median, and mode from data sets
- Calculating standard deviation and variance
- Probability of compound events
- Z-score calculations and normal distribution
- Hypothesis testing (H₀ vs H₁)
- Correlation and regression equations
If you're staring at any of these right now, keep reading. We solve each one below.
Descriptive Statistics: Mean, Median, Mode
These are the basics. Most people can find the mean (average) fine. The median trips people up when there's an even number of values.
The Problem
Find the mean, median, and mode for this data set: 4, 8, 6, 5, 3, 8, 9, 8, 2
The Solution
Mean: Add everything up, divide by how many numbers exist.
4 + 8 + 6 + 5 + 3 + 8 + 9 + 8 + 2 = 53
53 ÷ 9 = 5.89
Median: Put numbers in order, find the middle one.
Sorted: 2, 3, 4, 5, 6, 8, 8, 8, 9
The 5th value is 6
Mode: The number that shows up most. That's 8 (appears 3 times)
Standard Deviation and Variance
These measure how spread out your data is. People hate calculating these by hand. Here's why.
The Problem
Find the standard deviation for: 2, 4, 6, 8, 10
The Solution (Step by Step)
Step 1: Find the mean. (2+4+6+8+10) ÷ 5 = 6
Step 2: Subtract the mean from each value, then square it.
- (2-6)² = 16
- (4-6)² = 4
- (6-6)² = 0
- (8-6)² = 4
- (10-6)² = 16
Step 3: Find the average of those squared differences.
16 + 4 + 0 + 4 + 16 = 40
40 ÷ 5 = 8
Step 4: Take the square root.
√8 = 2.83
That's your standard deviation. The variance is just step 3's result: 8.
Probability Problems
Probability questions follow specific formulas. The confusion usually comes from mixing up "and" vs "or" problems.
Independent Events (Using "AND")
What's the probability of flipping heads twice in a row?
P(A and B) = P(A) × P(B)
0.5 × 0.5 = 0.25 (25%)
Mutually Exclusive Events (Using "OR")
What's the probability of rolling a 2 or a 5 on a die?
P(A or B) = P(A) + P(B)
⅙ + ⅙ = 2/6 = 1/3 (33.3%)
Non-Mutually Exclusive Events
In a deck of cards, what's P(Jack or Red)?
These events overlap (red jacks exist). Formula:
P(A or B) = P(A) + P(B) - P(A and B)
4/52 + 26/52 - 2/52 = 28/52 = 7/13 (53.8%)
Z-Scores and the Normal Distribution
Z-scores tell you how many standard deviations a value is from the mean. Useful for comparing different data sets.
The Formula
Z = (X - μ) ÷ σ
Where X is your value, μ is the mean, and σ is standard deviation.
The Problem
A test has a mean of 70 and standard deviation of 10. You scored 85. What's your Z-score?
The Solution
Z = (85 - 70) ÷ 10
Z = 15 ÷ 10
Z = 1.5
You scored 1.5 standard deviations above average. Using a Z-table, that puts you around the 93rd percentile. Not bad.
Hypothesis Testing: The Process
This is where most students give up. Hypothesis testing follows a rigid 5-step process. Memorize it.
Step 1: State Your Hypotheses
H₀ (Null): No effect, no difference, nothing going on
H₁ (Alternative): There IS an effect or difference
Step 2: Choose Your Significance Level
Almost always α = 0.05 (5%). This is standard.
Step 3: Calculate Your Test Statistic
Depends on what you're testing. Common ones:
- Z-test for proportions
- T-test for means (when σ is unknown)
- Chi-square for categorical data
Step 4: Find the Critical Value
Look this up in a table based on your α and test type.
Step 5: Make Your Decision
If |test statistic| > critical value: Reject H₀
If |test statistic| < critical value: Fail to reject H₀
Example: One-Sample Z-Test
A company claims their batteries last 100 hours. You test 50 batteries and get a mean of 97 hours with σ = 15. Test at α = 0.05.
Step 1: H₀: μ = 100, H₁: μ ≠ 100
Step 2: α = 0.05
Step 3: Z = (97 - 100) ÷ (15 ÷ √50) = -3 ÷ 2.12 = -1.42
Step 4: Critical values for two-tailed test at α = 0.05: ±1.96
Step 5: |-1.42| < 1.96 → Fail to reject H₀
The data doesn't prove the company's claim is wrong.
Correlation and Regression
Correlation measures the relationship strength between two variables. Regression gives you an equation to predict values.
The Problem
Hours studied vs. test score. Data: (2, 60), (4, 70), (6, 85), (8, 95). Find the regression equation.
The Solution
Using the formulas (or a calculator):
Slope (b) = r(SD_y ÷ SD_x)
After calculating: b ≈ 5.8, intercept ≈ 49
Regression equation: ŷ = 5.8x + 49
Interpretation: Each additional hour studied increases score by about 5.8 points.
Quick Comparison: When to Use Which Test
| Scenario | Test to Use | Why |
|---|---|---|
| Compare one group to a known value | One-sample t-test | Small sample, unknown σ |
| Compare two group means | Two-sample t-test | Independent groups |
| Compare before/after on same group | Paired t-test | Dependent data |
| Compare three or more groups | ANOVA | Avoids multiple t-test errors |
| Test relationships between categories | Chi-square | Categorical data only |
| Predict one variable from another | Linear regression | Continuous variables |
Getting Started: Your Action Plan
Stop drowning in theory. Here's how to actually solve statistics problems:
Before You Start Any Problem
- Identify the question type. Descriptive? Probability? Hypothesis test? Regression?
- Write down what you're given. Sample size, means, standard deviations, α level
- Pick your formula. Don't guess. Match the problem type to the right approach
While Solving
- Show every step. Teachers grade the work. You can't get partial credit with scribbled answers.
- Double-check your arithmetic. Most wrong answers come from calculation errors, not concept mistakes
- Label everything. "Z = 1.96" means nothing without context
After You Finish
- Does the answer make sense? A probability over 100%? That's wrong.
- Check your units. If calculating money, your answer better be in dollars
Common Formulas Reference
- Mean: Σx ÷ n
- Standard Deviation: √[Σ(x - x̄)² ÷ n]
- Probability: Desired outcomes ÷ Total outcomes
- Z-score: (X - μ) ÷ σ
- Correlation: r = Σ(x - x̄)(y - ȳ) ÷ √[Σ(x - x̄)² × Σ(y - ȳ)²]
- Regression slope: b = r(SD_y ÷ SD_x)
Final Thoughts
Statistics isn't about memorizing every formula. It's about recognizing which formula applies to your specific problem. Work through enough examples and patterns start clicking.
Start with the problems above. Practice the ones that match what you're working on. Check your work. Repeat until it's automatic.