transpcr.gif (812 bytes) Interquartile Range

The range from the minimum to maximum data values can be distorted by one or two unusual observations. To avoid this problem, we often focus on the range of the data values in the middle, particularly on the middle 50 percent of the data values. To find the middle 50 percent, first find the median of the bottom half of the data values. In the data sets below, the data values in the bottom half (i.e., those observations below the median displayed in red) are displayed in blue. It is then easy to find the median (the boldface value) of the blue numbers. This observation is the middle or half-way number of the bottom half of the numbers, so it marks the boundary between the first and second quarters of the data. This number is called the first quartile or Q1.

Set A: 7 8 9 10 11 11 12 12 12 13 13 13 14 14 14
  Min     Q1       Q2       Q3     Max
Set B: 1 2 3 4 5 5 6 12 18 19 19 19 20 20 20

Similarly, the data values in the top half (i.e., those observations above the median) are displayed in green. The middle number of the top half of the numbers (boldfaced) is the third quartile or Q3 because it marks the boundary between the third and fourth quarters of the data. Note that the median, because it divides the second quarter of the observations from the third quarter, is the second quartile or Q2. The range between Q3 and Q1 is the range for the middle 50 percent of the observations. Because this range is between the two quartiles, it is known as the interquartile range (IQR). In the case of Set A, the IQR = 13 - 10 = 3 while for Set B, the IQR = 19 - 4 = 15. As did the range for these data sets, the IQR indicates that the data in Set B are more spread out away from the center than are the data in Set A.
 

If there are many observations, then the Q1 data value will be greater than or equal to 25 percent of the data values. Similarly, the median Q2 will be greater than or equal to 50 percent of the data values and Q3 will be greater than or equal to 75 percent of the data values. However, when there are small numbers of observations, it is impossible to hit those proportions exactly. For example, in the above data sets, the Q3 value is greater than or equal to 12 of the 15 data values (or 80 percent). In such cases, some computer programs calculate Q1 and Q3 as above, but some instead try to split the difference, or "interpolate," between data values. This means that the same value for the quartiles and the interquartile range are not always reported by all computer programs. Click on the "Computer" icon to the left to see a comparison of the numbers reported by several different statistical software packages.

 

File:
© 1999, Duxbury Press.