transpcr.gif (812 bytes) Correlation Coefficient Formula

The prior page suggested agreement between the two z-scores for each observation as a useful index for how correlated the variables are. This leads to the following formula:

Open the Equations window with the button in the right margin if you would like to see the mathematical details for this formula. The formula can be understood intuitively. The correlation is simply the average product of all the pairs of z-scores. If there is a perfect positive relationship, then all the z-scores will be equal. That in turn means that all their products will be positive so that the sum of the products will be large. It turns out (see the Equations window for details) that for a perfect positive relationship, the typical or average z-score product equals 1.

 
On the other hand, if there is a perfect negative relationship, then all the z-scores will have the same value, but with opposite signs from each other. That in turn means that all their products will be negative, so that the sum of the products will be a large negative number.

Finally, if there is no relationship between the two variables, about half the time the z-scores in a pair will have the same sign and so give a positive product and about half the time the z-scores will have opposite signs and so give a negative product. In the sum, the positive and negative products will tend to cancel, yielding a sum near 0.

 
Example Calculations

Below are the correlation coefficient calculations for the pre- and post-SKQ scores. The first two columns present the data and their means and standard deviations. The middle two columns are the respective z-scores calculated from

 
The last column is the product of the two z-scores in each row. The average or mean of the last column is 0.39 and equals the correlation coefficient.

The correlation coefficient

indicates a moderate positive relationship between the Statistical Knowledge Scores at the beginning of the semester and those at the end. That is, students with higher scores at the beginning of the semester tended to have higher scores at the end of the semester as well.

 
  Almost all statistical computer programs can calculate the correlation coefficient for you. However, when examining those correlation coefficients you should remember their conceptual definition illustrated above.

File:
© 1999, Duxbury Press.