Formalities

You probably recognized on the previous page that the shape of the sampling distribution for the slope was close to the normal distribution. Indeed, the sampling distribution for the slope is a normal distribution with a mean equal to b and a standard deviation equal to
population standard deviation of slope
where
population standard deviation of model errors
is the standard deviation of the errors from the model of Y using X; that is, the typical distance from an observation to the regression line. Click on the Equations button for the mathematical details.

  The standard deviation of the sampling distribution tells us how wide the distribution is, and this tells us how far off our estimate might be. The smaller the standard deviation the narrower the sampling distribution. Just as in previous tests, increasing the number of observations will produce a narrower distribution, so our slope estimate will be more accurate. Also note that if X has a larger standard deviation then the sampling distribution will also be narrower.

If we knew the population value of the standard deviation for the sampling distribution of the slope, then we could calculate a z-score and use the normal distribution to identify surprising values. However, we must estimate the sampling distribution's standard deviation. So, just as was the case for the one-group and two-group comparisons, we must instead use the Student t statistic. The standard deviation of the sampling distribution for the slope is easy to estimate from the data. The numerator is based on the typical model squared error--the sum of the error squares divided by the degrees of freedom. This is sometimes called the "mean squared error" (MSE) and is given by:
mean squared error for regression
and the denominator is estimated by the square root of n times the standard deviation for X. Putting everything together,
t-statistic for slope
We will generally use the computer to calculate the Student t for the slope so it is more important to understand the concepts in the above formula than the computational details. The right-most expression tells us when the t for the slope will be large--when the slope b is large, when the number of observations n is large, when the standard deviation of X (the predictor) is large, and when the aggregate error MSE is small (i.e., the data points are close to the best-fitting line).

 

File: /swithers/seestats/SeeingStatisticsFiles/seeing/reg/test2.html
© 1999, Duxbury Press.
The sampling distribution of the slope allows us to identify surprising values. In this case, values of the slope more extreme than +/- .4 are extremely rare and values more extreme than +/- .3 are surprising.  
File: /swithers/seestats/SeeingStatisticsFiles/seeing/reg/test2.html
© 1999, Duxbury Press.
LIGN=CENTER VALIGN=MIDDLE>2 3 4 5 6 7 8 9 10 Stress 2.5 5 10 15 17.5 20 25 30 35 40 FailTime 63 58 55 61 62 37 38 45 46 19

Move the line up and down by dragging the box or change the slope by clicking and dragging anywhere else so that the line represents all the observations. Try to place the line so that it is closest to all the observations.

Based on your line, do you think there is a positive, negative, or no relationship between Stress and FailTime?

Return to Top


Biology

Ott (1993) presents this example on p. 452: Fifteen male volunteers ate a low-cholesterol diet for four weeks. Below are the ages and the reduction in cholesterol (in mg per 100 ml of blood serum) for each participant:
Participant 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Age 45 43 46 49 50 37 34 30 31 26 22 58 60 52 27
Reduce 30 52 45 38 62 55 25 30 40 17 28 44 61 58 45

Move the line up and down by dragging the box or change the slope by clicking and dragging anywhere else so that the line represents all the observations. Try to place the line so that it is closest to all the observations.

Based on your line, do you think there is a positive, negative, or no relationship between Age and Reduce?

Return to Top

 


File: /swithers/seestats/SeeingStatisticsFiles/seeing/reg/test2.html
© 1999, Duxbury Press.
a dieter's age and the amount of cholesterol reduced?

Return to Top

 


File: /swithers/seestats/SeeingStatisticsFiles/seeing/reg/test2.html
© 1999, Duxbury Press.
al">File: