Chapter 13. Introduction to Analysis of Variance (ANOVA)

Statistics for the Behavioral Sciences (5th ed.) Gravetter & Wallnau Chapter 13 Introduction to Analysis of Variance (ANOVA) University of Guelph Psy...
Author: Basil Fisher
37 downloads 2 Views 2MB Size
Statistics for the Behavioral Sciences (5th ed.) Gravetter & Wallnau

Chapter 13 Introduction to Analysis of Variance (ANOVA) University of Guelph Psychology 3320 — Dr. K. Hennig Winter 2003 Term 1

Figure 13-2 (p. 397) A typical situation in which ANOVA would be used. Three separate samples are obtained to evaluate the mean differences among three populations (or treatments) with unknown means.

2

1

Figure 13-3 (p. 399) - mixed model design A research design with two factors. The research study uses two factors: One factor uses two levels of therapy technique (I versus II), and the second factor uses three levels of time (before, after, and 6 months after). Also notice that the therapy factor uses two separate groups (independent measures) and the time factor uses3 the same group for all three levels (repeated measures).

4

Statistics for the Behavioral Sciences, Sixth Edition by Frederick J. Gravetter and Larry B. Wallnau Copyright © 2004 by Wadsworth Publishing, a division of Thomson Learning. All rights reserved.

2

The test statistics t=

difference between sample means difference exp ected by chance (error)

t=

( M 1 − M 2 ) − ( µ1 − µ2 ) s( M 1− M2 )

F=

var iance (differences) betweensample means var iance( differences ) exp ected by chance( error)

n E.g., M1 = 20 M2 = 30; can find the difference. n but if there are three means? Use variance to

measure the size of the difference between groups.

5

Logic of ANOVA n Goal: measure the variability and determine

where it comes from. n Determine the total variability of the data n

break into parts (analyze) the variability/variance, i.e., ANOVA

6

3

Logic (contd.) n Between-treatment variance (note 50º vs. 70º) n Within-treatment variance (e.g., 70º - not all the

same) n When you see “variance,” think “differences”

7

Logic (contd.) n H0: the differences between treatments are

simply due to chance n H1: the differences are significantly greater than can be explained by chance, i.e., the differences are caused by treatment effects n Two primary sources of chance differences: n n

individual differences - people are different experimental error - measurement always involves 8

4

Logic (contd.) n Compare with chance - how big are the

differences when there is no treatment effect? n Within-treatments variance: n n

e.g., in 70º condition individuals were all tested in same condition, yet different scores within-treatment variance is a measure of difference expected just by chance

9

Figure 13-4 (p. 403) The independent- measures analysis of variance partitions, or analyzes, the total variability into two components: variance between treatments and variance within treatments.

10

5

The F-ratio: The test statistic for ANOVA F=

var iancebetween treatments var iance within treatments

=

treatment effect + differences dueto chance differences dueto chance

=

0 + differences dueto chance differences dueto chance

n An F-ratio near 1.00 means differences are

due to chance n In ANOVA the denominator is the error term; the same as the numerator when the treatment effect is zero 11

Steps n 1. Analysis of the SS n SSwithin n SS between n 2. Analysis of the degrees of freedom (df) n df within n df between n 3. Calculation of variances

12

6

1. Analysis of the SS

n Calculate T1,2,3 (? X=) for each of the k (=3)

treatment conditions n Calculate G (grand total) = T1 + T2 + T3 n N =n1 + n2 + n3 Calculate ? X2 for entire N=15 13

Table 13-2 (p. 406) Hypothetical data from an experiment examining learning performance under three temperature conditions.

14

7

1a. SS within treatments Partitioning the sum of squares (SS) for the independent-measures analysis of variance.

(∑ X ) 2 SS = ∑ X − N 2

SS within = SS1 + SS 2 + SS3

15

1b. SS between treatments

n But, only works if samples are all same size

(ns are equal), thus use a compuational formula: T 2 G2

SS between = ∑

n



N

16

8

Step 2: Analysis of the degrees of freedom (df) Partitioning degrees of freedom (df) for the independent-measures analysis of variance.

dftotal = N - 1 = 15 - 1=14

dfbetween = k - 1 = 3 - 1 = 2

dfwithin = N - k = 15 - 3 =12

17

Step 3: Analysis of the variances The structure and sequence of calculations for the analysis of variance. (Recall: s2 = SS/df)

=

MS between MS within

18

9

Results are organized into a table

19

Figure 13-8 (p. 413) The distribution of F-ratios with df = 2.12. Of all the values in the distribution, only 5% are larger than F = 3.88, and only 1% are larger than F = 6.93.

20

10

Table 13-3 (p. 414) A portion of the F distribution table. Entries in roman type are critical values for the .05 level of significance, and bold type values are for the .01 level of significance. The critical values for df = 2.12 21 have been highlighted (see text).

M2=12

M1=8 8

Figure 13-15 (p. 421) A visual representation of the between-treatments variability and the withintreatments variability that form the numerator and denominator, respectively, of the F-ratio. In (a), the difference between treatments is relatively large and easy to see. In (b), the same 4-point difference between treatments is relatively small and is 22 overwhelmed by the within -treatments variability.

11

Figure 13-11 (p. 431) The distribution of t statistics with df = 18 and the corresponding distribution of F-ratios with df = 1.18. Notice that the critical values for α = .05 are t = ±2.101 and that F = 2.1012 = 4.41 23

Post hoc tests n E.g., M1 = 3

M2 = 5 M3 = 10 n 2-pt difference M1 and M2 n experimentwise alpha level n planned vs. unplanned comparisons n Tukey (HSD) MSwithin HSD = q n n Scheffe

24

12