Calculating Percentile- Statistics Tutorial

What Is a Percentile and Why You Need to Know How to Calculate It

A percentile tells you the value below which a given percentage of observations fall. If your score is in the 90th percentile, it means you scored higher than 90% of everyone who took the test.

That's it. No fancy definitions needed.

Doctors use percentiles to track baby growth. Companies use them to analyze salary distributions. Schools use them to report test scores. If you work with data, you'll need to calculate percentiles at some point.

The Basic Percentile Formula

The most common method uses this formula:

P = (n/N) × 100

Where:

Most people get tripped up on which formula to use. There are actually several methods, and different software packages use different ones.

How to Calculate Percentile: Step-by-Step

Method 1: The Linear Interpolation Method (Most Common)

This is what Excel, Python, and most statistical software use by default.

Step 1: Sort your data in ascending order

Step 2: Calculate the index using: i = (P/100) × (N + 1)

Step 3: If i is a whole number, that's your percentile value

Step 4: If i has a decimal, interpolate between the values at positions floor(i) and ceil(i)

Method 2: Nearest Rank Method

Simpler but less precise. Used in some older statistical packages.

Rank = ceil(P/100 × N)

The value at that rank is your percentile.

Method 3: Weighted Percentile

Used when your data has weights or frequencies attached to it. Multiply each value by its weight before calculating.

Real Example: Finding the 75th Percentile

You have test scores: 45, 67, 23, 89, 56, 78, 91, 34

Step 1: Sort them: 23, 34, 45, 56, 67, 78, 89, 91

Step 2: N = 8 values

Step 3: i = (75/100) × (8 + 1) = 0.75 × 9 = 6.75

Step 4: Interpolate between position 6 and 7

75th percentile = 78 + 0.75 × (89 - 78) = 86.25

Percentile vs Percentage: Stop Confusing Them

A percentage is a raw score out of 100. A percentile is a rank position in a distribution.

Scoring 80% on a test means you got 80 out of 100 questions right. Scoring in the 80th percentile means you outperformed 80% of test-takers.

These are not the same thing. People mix them up constantly.

Percentile Calculation Methods Compared

Method Best For Used By Accuracy
Linear Interpolation General statistics, continuous data Excel, Python, R High
Nearest Rank Discrete data, simple rankings Some older statistical packages Medium
Weighted Percentile Survey data, grouped frequencies Advanced analytics tools High (when weights apply)
Median-Based (Tukey's Method) Robust statistics, outliers present Exploratory data analysis Medium-High

How to Calculate Percentile in Excel

Excel gives you three options:

Use PERCENTILE.INC for most work. Use PERCENTILE.EXC only if you need values strictly between 0 and 1.

How to Calculate Percentile in Python

NumPy makes this straightforward:

import numpy as np

data = [23, 34, 45, 56, 67, 78, 89, 91]
percentile_75 = np.percentile(data, 75)
print(percentile_75)  # Output: 86.25

You can also calculate multiple percentiles at once:

np.percentile(data, [25, 50, 75, 90])

Common Percentile Uses in the Real World

Getting Started: Quick Checklist

  1. Collect your dataset
  2. Sort values from lowest to highest
  3. Decide which percentile you need (25th, 50th, 90th, etc.)
  4. Choose your calculation method based on your tools
  5. Calculate or use software to compute
  6. Interpret the result in context

Watch Out for These Mistakes

Empty or missing values: Decide whether to exclude them before calculating. Different choices give different results.

Small datasets: Percentiles from datasets with fewer than 20 values are unreliable. The interpolation becomes meaningless.

Method mismatch: If you're comparing percentiles from different tools, verify they're using the same method. Excel's PERCENTILE.EXC and PERCENTILE.INC give different results.

Endpoint values: Some methods cannot calculate the 0th or 100th percentile. Check your tool's limitations.

Which Method Should You Use?

For most business and analytical work, linear interpolation is the right choice. It gives smooth, continuous results that match what people expect.

Use nearest rank only when you need simple ordinal rankings and your data is clearly ordinal.

Use weighted percentiles when your data represents aggregated counts or survey responses where each row has a frequency weight.

The method matters less than being consistent. Pick one method and stick with it across all your analyses.