Cumulative Frequency Distribution- How to Create and Interpret
What Is Cumulative Frequency Distribution?
A cumulative frequency distribution shows how data points accumulate as you move through your dataset. Instead of showing how many values fall in each class interval, it shows how many values fall at or below each interval.
Think of it as a running total. You start counting from the lowest values and keep adding until you reach the highest. That's your cumulative frequency.
It's useful when you need to answer questions like:
- What percentage of students scored 70 or below?
- How many products sold for under $50?
- What's the median income in your dataset?
Key Terms You Need to Know
Frequency — the number of times a value or range appears in your data.
Cumulative Frequency (CF) — the running total of frequencies as you progress through ordered classes.
Less than type — cumulative frequency where each class boundary represents "less than" that boundary.
More than type — cumulative frequency where each class boundary represents "more than" that boundary.
How to Calculate Cumulative Frequency
The formula is dead simple:
CF = Previous Cumulative Frequency + Current Class Frequency
You start with the first class. Its cumulative frequency equals its regular frequency. Then for each subsequent class, you add the previous CF to the current class frequency.
Example Calculation
Let's say you have test scores for 25 students:
| Score Range | Frequency | Cumulative Frequency |
|---|---|---|
| 0-20 | 2 | 2 |
| 21-40 | 5 | 2 + 5 = 7 |
| 41-60 | 8 | 7 + 8 = 15 |
| 61-80 | 7 | 15 + 7 = 22 |
| 81-100 | 3 | 22 + 3 = 25 |
The last cumulative frequency should equal your total number of observations. If it doesn't, you've made an error somewhere.
Creating a Cumulative Frequency Table
Here's the step-by-step process:
- Organize your data into class intervals with clear boundaries
- Count the frequency for each class
- Add a cumulative frequency column
- Calculate each CF value by adding the current frequency to the previous CF
- Verify — final CF must equal total observations
Class Boundaries Matter
When working with continuous data, use proper class boundaries to avoid gaps. If your intervals are 10-20, 20-30, your boundaries should be 10-20, 20-30 (the 20 belongs to the second class).
For "less than" ogives, use upper boundaries. For "more than" ogives, use lower boundaries.
Drawing a Cumulative Frequency Graph (Ogive)
An ogive is just a line graph plotting class boundaries against cumulative frequencies.
Less Than Ogive Steps
- Calculate cumulative frequencies for each class
- Use upper class boundaries as your x-axis points
- Plot CF against these boundaries
- Connect points with a smooth curve or straight lines
- Extend the line to intersect the x-axis at the lower boundary of the first class
More Than Ogive Steps
- Calculate "more than" cumulative frequencies
- Use lower class boundaries as your x-axis points
- Plot and connect as above
The two ogives intersect at the median. You can use this graphically to find the median value.
How to Interpret Cumulative Frequency Distributions
Reading an ogive is straightforward once you know what you're looking at.
Finding Percentiles
To find the value at the 75th percentile:
- Calculate 75% of total frequency
- Locate that value on the y-axis
- Draw a horizontal line to intersect the curve
- Drop vertically to the x-axis
- Read the value
Finding Median
The median is the value where cumulative frequency equals n/2. On your graph, locate n/2 on the y-axis, draw a horizontal line to the curve, then drop to the x-axis. That's your median.
Reading Percentages Directly
If your y-axis shows cumulative percentages instead of frequencies, you can read what percentage of data falls below any given value directly from the graph.
Common Mistakes to Avoid
- Wrong class boundaries — this throws off your entire graph
- Forgetting to extend the curve to the x-axis for the first/last class
- Mixing up less than and more than types — they're not interchangeable
- Not checking that final CF equals total observations — always verify
Practical How-To: Create Your First Cumulative Frequency Distribution
Let's walk through creating one from scratch using exam grades.
Step 1: Gather your data
Grades for 30 students: 45, 52, 67, 73, 81, 35, 58, 62, 78, 85, 41, 55, 69, 74, 88, 47, 53, 68, 76, 82, 38, 56, 64, 72, 79, 91, 49, 57, 66, 77
Step 2: Create class intervals
Range is 35-91. Use intervals of 10: 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 91-100
Step 3: Tally frequencies
31-40: 2 (35, 38)
41-50: 5 (41, 45, 47, 49, 52)
51-60: 6 (52, 53, 55, 56, 57, 58)
61-70: 6 (62, 64, 66, 67, 68, 69)
71-80: 7 (72, 73, 74, 76, 77, 78, 79)
81-90: 3 (81, 82, 85)
91-100: 1 (91)
Step 4: Calculate cumulative frequencies
| Class | Frequency | Cumulative Frequency |
|---|---|---|
| 31-40 | 2 | 2 |
| 41-50 | 5 | 7 |
| 51-60 | 6 | 13 |
| 61-70 | 6 | 19 |
| 71-80 | 7 | 26 |
| 81-90 | 3 | 29 |
| 91-100 | 1 | 30 |
Step 5: Draw the ogive
Upper boundaries: 40, 50, 60, 70, 80, 90, 100
Plot points: (40, 2), (50, 7), (60, 13), (70, 19), (80, 26), (90, 29), (100, 30)
To find the median: n/2 = 15. Draw horizontal line at y=15. It hits the curve around x=62. So the median grade is approximately 62.
When to Use Cumulative Frequency vs Regular Frequency
Regular frequency distributions are better for seeing the shape of your data — where values cluster, whether it's normal or skewed.
Cumulative frequency is better for answering "how many fall below X" type questions. It's also the only way to quickly estimate percentiles and quartiles graphically.
For most statistical work, you'll want both. Use the regular frequency to understand distribution shape. Use cumulative to find specific values and thresholds.
Quick Reference: Finding Key Values
| What You Want | Method |
|---|---|
| Median | Find n/2 on y-axis, read x-value |
| Quartiles (Q1, Q3) | Find n/4 and 3n/4 on y-axis |
| Percentiles | Find (P/100) × n on y-axis |
| Value below X% | Find X% of n on y-axis, read x-value |
That's the whole thing. Calculate cumulative frequencies, plot your ogive, read values off the graph. No mysteries here.