Exam 3 Review

QSCI 482 Prof. Conquest          TAs Kennedy, Malinick, Norman
      REVIEW QUESTIONS FOR EXAM 3 [EXAM 3 IS THURSDAY DEC. 12, 2002]

1. This is a continuation of Problem #1 from HW 7. [Comparison
of 3 types of Timber Harvest: None, Moderate, Intensive.] 1e and 1f come
after having done a one-factor analysis of variance [ANOVA] on the data.

1e. Compute the best available 95% confidence interval for the INTENSIVE
mean.

1f. Compute the best available 95% confidence interval for the mean that
you get by pooling the two means from the NONE and the MODERATE groups.

2. The lengths of fish raised under 4 types of environmental conditions
are displayed below, along with the sample sizes. The data are normally
distributed.

                  TREATMENT GROUP

          Group 1       Group 2    Group 3      Group 4
Mean       53.533       83.200     78.722       74.667
n           5            5          5            5

SOURCE OF VARIATION   df     SS

Total(corrected)           3125.880
Between Groups             2588.212
Within Groups (error)      537.668

GROUP       q(Group 2)     q(Group 3)    q(Group 4)
Group 1      _________     _________    __________
Group 4         3.29       _________
Group 3         1.73

a. Fill in the analysis of variance table [alpha=.05] to test for equality
of mean fish lengths under the 4 types of environmental conditions. Write
out your null and alternative hypotheses, and include the final P-value
along with your results.

b. Fill in the 4 missing q-values that you need to do either a Tukey or
Student-Newman-Keuls (SNK) multiple comparison.

c. Now that you have all the computed q-values, do BOTH a Tukey multiple
comparison and a SNK multiple comparison; use significance level alpha =
0.05. Summarize your results for each method regarding which means can and
cannot be declared significantly different.

3. The following data set concerns an experiment in which the
growth increments of insect larvae (increases in length in mm) were
measured after one week of feeding on artifical diets under controlled
environmental conditions. There are 3 levels of Dietary Protein [Low,
Medium, and High]. There is also the Alkaloid factor [Absent in the diet,
or Present in the Diet]. There are 4 replicates in each combination of
Dietary Protein and Alkaloid.

      NO ALKALOID      WITH ALKALOID      MEAN      TOTAL

LOW      3,5,4,6           6,7,5,8           5.50      44
PROTEIN

MED.       3,2,4,5           7,8,6,7           5.25      42
PROTEIN

HIGH      7,6,5,4           5,2,3,2           4.25      34
PROTEIN

MEAN 4.5         5.5         5.00
TOTAL 54          66                120

The Sums of Squares for the different sources of variation for the above
data are as follows:

SOURCE OF VARIATION      SS
Level of Protein     7.

Alkaloid

Interaction       39.

Error             28.

TOTAL             80.

a. At the .10 level of significance, test for the presence of INTERACTION.
Be sure to write out both your null and alternative hypotheses. Do a plot
of the means; use both the plot and the result of the statistical test to
interpret the results.

b. At the .05 level of significance, test for the overall effect
of PROTEIN LEVEL. Use both the plot of means [that you did in part 'a'
above] and the test to interpret the results.

c. At the .05 level of significance, test for the overall effect
of ALKALOID. Use both the plot of means and the test to interpret the
results.

d. Do the best available 95% confidence interval for the average growth
increment for the pooled {Low+Medium} Protein levels.

4. Scientists wish to test whether four Varieties of house plants reach
the same maximum height. They are also keenly aware that growing
conditions vary considerably in the greenhouse. Therefore, six green house
benches were set up as blocks (the assumption is that on each bench, the
growing conditions are the same). Each bench had the four Varieties
represented exactly once, assigned at random to a particular place on the
bench. The response variable is maximum plant height (cm.):

Block      Variety 1      Variety 2      Variety 3      Variety 4
1     19.8       21.9       16.4       14.7
2     16.7       19.8       15.4       13.5
3     17.7       21.0       14.8       12.8
4     18.2       21.4       15.6       13.7
5     20.3       22.1       16.4       14.6
6     15.5       20.8       14.6       12.9

Test the null hypothesis (alpha = .05) that all four Varieties of plants
reach the same maximum height. If you end up concluding that there is an
equality somewhere among the Varieties, then follow up with a multiple
comparison technique to find out where the differences are.

Some needed SS are: SS(Varieties) = 188.54; SS(Blocks) = 19.79;
SS(Total, Corrected) = 214.86.

5. Some experimenters are trying to decide what is a good number of
treatment groups for their experiment. They know they can afford about 24
or 25 statistical replicates, total. They must decide between:

Plan A: 5 treatments with 5 replicates per treatment, for a total of N=25;

OR

Plan B: 4 treatments with 6 replicates per treatment, for a total of N=24.

Plan A includes more treatments in the experiment (and has a larger N),
and Plan B has more replicates per treatment (but a smaller N).

They decide to resolve the issue in the following manner: they will find
the Minimum Detectable Difference between the largest and smallest
population mean for 90% power, and they will choose the design that gives
them the "finer" [i.e., smaller] value for the Minimum Detectable
Difference. Using this rule, which design is better? IMPORTANT NOTE: In
your computations, you may assume that the response variable has been
standardized so that MSE = 1.0.