Measures of Position and Outliers
Learn how to standardize data with Z-scores, identify where values fall within a distribution using percentiles, and detect unusual observations (outliers).
The Z-Score (Standardized Values)
Definition
The Z-Score (or Standardized Value) represents the number of standard deviations that an observation is from the mean. It tells us how "unusual" a data point is.
Population Z-Score
= population mean, = population std dev
Sample Z-Score
= sample mean, = sample std dev
LogicLens: Why Z-Scores Are Powerful
- • Dimensionless: Z-scores have no units, so you can compare values from completely different datasets (e.g., SAT scores vs. ACT scores).
- • Positive Z: The value is above the mean.
- • Negative Z: The value is below the mean.
- • Z = 0: The value equals the mean exactly.
Example: Comparing Test Scores
Alice scored 1200 on the SAT (mean = 1060, std dev = 217).
Bob scored 28 on the ACT (mean = 21, std dev = 5.2).
Who performed better relative to their peers?
Alice's Z-Score
Bob's Z-Score
Bob's Z-score (1.35) is higher, so Bob performed better relative to his peers.
Z-Score Calculator
Quick Examples:
Percentiles & Quartiles
Percentiles
The -th Percentile () is the value below which percent of the data falls.
Example: If you score in the 90th percentile, you scored higher than 90% of all test-takers.
Quartiles
Quartiles divide the data into four equal parts:
- • = 25th Percentile ()
- • = 50th Percentile () = Median
- • = 75th Percentile ()
The Interquartile Range (IQR)
The IQR measures the spread of the middle 50% of the data. It is resistant to outliers because it ignores the extreme values in the top and bottom quarters of the dataset.
Example: Finding Quartiles
Given the ordered dataset: 3, 5, 7, 8, 12, 15, 18, 21
| Quartile | Position | Value |
|---|---|---|
| Median of lower half: (5 + 7)/2 | 6 | |
| (Median) | (8 + 12)/2 | 10 |
| Median of upper half: (15 + 18)/2 | 16.5 |
Identifying Outliers (The 1.5 × IQR Rule)
What is an Outlier?
An outlier is an observation that is unusually far from the rest of the data. Outliers can indicate data entry errors, measurement issues, or genuinely unusual cases that warrant further investigation.
The Fence Method (Step-by-Step)
- 1Find (25th Percentile) and (75th Percentile)
- 2Calculate
- 3Calculate the Fences:
Lower Fence
Upper Fence
Any data point below the Lower Fence or above the Upper Fence is classified as an outlier.
Example: Outlier Detection
Given: , . Is the value 150 an outlier?
Since 150 is exactly at the Upper Fence, it is typically not considered an outlier (we look for values beyond the fences). However, values like 151 would be flagged as outliers.
Try It Yourself
Percentiles, Quartiles & Outlier Calculator
(45 data points)
Quartiles
IQR & Fences
No Outliers Detected
All 45 data points fall within the acceptable range [13.00, 61.00].
Sorted Data
Adaptive Assessment
Unlock Your Personalized Quiz
Sign in to access AI-generated practice problems tailored to this section.