Calculating the 97th Percentile- Statistics Guide

What Is the 97th Percentile?

The 97th percentile is the value below which 97% of observations fall. In plain terms, if your score is at the 97th percentile, you scored higher than 97 out of 100 people.

That's it. That's the definition.

People overcomplicate this. They don't. Here's what you actually need to know.

Why the 97th Percentile Matters

You encounter percentiles more than you think:

The 97th percentile specifically flags extreme values. It's useful when you want to identify outliers, set performance thresholds, or understand where someone sits in a distribution.

The Formula for Calculating Percentiles

There's no single "correct" method. Different fields use different approaches. Here are the most common:

Method 1: Linear Interpolation (Most Common)

Position = (P / 100) × (N + 1)

Where:

Method 2: Nearest Rank Method

Position = ceil(P / 100 × N)

This rounds up to the nearest whole number. Simpler but less precise for small datasets.

Method 3: Weighted Percentile (Used in Growth Charts)

Pediatric growth charts use a different approach based on LMS parameters (Lambda, Mu, Sigma). Don't worry about the math—this is pre-calculated for you.

How to Calculate the 97th Percentile: Step by Step

Let's work through a real example. You have these test scores:

Data set: 45, 52, 67, 73, 78, 82, 85, 89, 92, 95, 98, 99, 100

Step 1: Sort your data (already sorted above)

Step 2: Count your observations. N = 13

Step 3: Apply the formula. Using the linear interpolation method:

Position = (97 / 100) × (13 + 1) = 0.97 × 14 = 13.58

Step 4: Interpolate between the 13th and 14th values. Since we only have 13 values, we take the highest value.

The 97th percentile = 100

97th Percentile vs Other Percentiles

PercentileWhat It MeansTypical Use Case
50thMedian, half below, half aboveCentral tendency
84thOne standard deviation above meanHigh performance threshold
97thExtreme high performerIdentifying top outliers
99thTop 1%Elite tier, anomaly detection

The jump from 97th to 99th is smaller than you think in raw data terms. But in practical terms, it's the difference between "very high" and "almost nobody is higher."

Common Mistakes When Calculating Percentiles

Mistake 1: Confusing Percentile with Percentage

A score of 97% means you got 97 out of 100 questions correct. The 97th percentile means you scored higher than 97% of test-takers. These are completely different things.

Mistake 2: Using the Wrong Method

Excel's PERCENTILE.EXC and PERCENTILE.INC give different results. EXC excludes the endpoints, INC includes them. Pick one and stick with it.

Mistake 3: Assuming Normal Distribution

The 97th percentile of a normal distribution is straightforward to calculate. Most real-world data isn't normally distributed. Your data might be skewed.

Mistake 4: Small Sample Sizes

With fewer than 30 data points, percentile calculations become unreliable. The 97th percentile of 5 values is essentially meaningless.

Calculating the 97th Percentile in Excel and Google Sheets

Excel

=PERCENTILE.EXC(A1:A100, 0.97)

Or for the inclusive version:

=PERCENTILE.INC(A1:A100, 0.97)

Google Sheets

=PERCENTILE(A1:A100, 0.97)

Google's PERCENTILE function uses the linear interpolation method (similar to EXC for most datasets).

Python (NumPy)

import numpy as np
data = [45, 52, 67, 73, 78, 82, 85, 89, 92, 95, 98, 99, 100]
percentile_97 = np.percentile(data, 97)
print(percentile_97)

When to Use the 97th Percentile

This specific percentile shows up in certain contexts more than others:

Quick Reference: 97th Percentile Calculation Cheat Sheet

The Bottom Line

Calculating the 97th percentile is straightforward once you understand the formula. The harder part is knowing which method to use and when your data is suitable for percentile analysis at all.

For small datasets, the nearest-rank method works fine. For larger datasets where precision matters, use linear interpolation. And always check whether your field has standard conventions before reinventing the wheel.