 |
Anthony G.
Greenwald, PhD
IMPLICIT
ASSOCIATION TEST: VALIDITY DEBATES
Note posted
29 Dec 07: An in press (JPSP) meta-analysis of predictive
validity of the IAT is now available on the page listing unpublished
manuscripts. The current draft includes literature available
thru January 2007. The meta-analysis has some surprising findings
about validity of IAT measures in predicting intergroup discrimination
(i.e., prediction of discriminatory behavior from IAT measures of attitudes
and stereotypes involving race, ethnicity, gender, and sexual orientation.
This is the one domain in which IAT measures significantly outperform
self-report measures (which have unexpectedly low predictive validity
in this domain aggregate effect size r of about .09).
I encourage
all interested to send me material for inclusion on this page. I
have not updated the material below recently, in part because there
has been little new activity in debates about construct validity of
IAT measures. Nevertheless, I am sure there are at least a few
relevant papers that I haven't yet gotten around to posting on this
page.
I thank all
of the authors who have provided permission to include either links
to their papers or the PDFs that are downloadable from this page. Very
special thanks to Brian Nosek, whose work in preparing the "age
7" chapter (cited below) brought together some of the sets of findings
collected on this page. I encourage readers to bring to my attention
suggestions for additions or modifications to this page, and also to
let me know about errors, including faulty links.
CONSTRUCT
VALIDITY
salience asymmetry
self-esteem IAT
culture vs. person
INTERNAL VALIDITY
Preface: The IAT has received a remarkable level of
attention to validity concerns in the relatively short period since its
first publication in 1998, increasing both empirical and theoretical understanding
of the IAT. This page was designed to survey the topics that have received
attention, includng almost 50 papers, most of which are either published
or in press, and most of which have been made downloadable with the permission
of their authors.
The topics are divided into categories of construct validity, internal
validity, and statistical conclusion validity. This is by no means a complete
overview of research on the validity of IAT measures. Rather, it focuses
only on topics about which validity-related questions have been raised.
In particular, this page makes no attempt to summarize the research on
predictive and convergent validity of IAT measures. Overview of those
literatures are fortunately available in the meta-analyses by Greenwald
et al. (2007) and by Hofmann
et al. (2005), and in the recent overviews of IAT research by Nosek
et al. (2007) and Lane
et al. (2007), all of which are cited at the end of this brief introduction.
Greenwald,
A. G., Poehlman, T. A., Uhlmann, E., & Banaji, M. R. (JPSP,
in press). Understanding and using the Implicit Association Test: III.
Meta-analysis of predictive validity. (date of draft: 12 Oct 07) This
is NOT a final draft, The data set will be updated,
mainly by replacing previously unpublished reports with published ones
where possible. As a result, almost all numerical results should
change at least slightly.
Hofmann,
W., Gawronski, B., Gschwendner, T., Le, H., & Schmitt, M. (2005).
A meta-analysis on the correlation between the Implicit Association
Test and explicit self-report measures. Personality and Social Psychology
Bulletin, 31, 1369-1385.
Nosek,
B. A., Greenwald, A. G., & Banaji, M. R. (2007). The Implicit
Association Test at age 7: A methodological and conceptual review (pp.
265292). In J. A. Bargh (Ed.), Automatic processes in social
thinking and behavior. Psychology Press.
Lane,
K. A., Banaji, M. R., Nosek, B. A., & Greenwald, A. G. (2007).
Understanding and using the Implicit Association Test: IV. What we know
(so far) (Pp. 59102). In B. Wittenbrink & N. S. Schwarz
(Eds.). Implicit measures of attitudes: Procedures and controversies.
New York: Guilford Press.
Construct
Validity
Back
to Top
(Is the IAT measure properly interpreted as assessing
variations in association strengths?)
The three headings
in this section focus on alternatives to the interpretation that the
IAT measures strengths of associations that they are nominally designed
to assess. The critique for each of these topics was reasonable enough
to justify collecting empirical data. It may be too soon to regard any
of these critiques as a closed question. It also seems apparent that
these construct-validity criticisms have deterred few researchers from
treating the IAT as a useful measure. As mentioned previously, this
page does not dwell on any of the large and growing literature that
supports the validity of the associations strength interpretation of
the IAT. The next paragraph's quote from an in-press chapter by Nosek,
Greenwald, and Banaji (cited just below the quote) indicates some of
the other emphases in construct-validity studies of IAT measures.
"The realization that the IAT and self-report are related
introduces important questions about whether they measure distinct constructs.
In a multitrait-multimethod (MTMM) investigation of the IAT and self-report
across seven attitude domains, Nosek and Smyth (2004) found strong evidence
for both convergent and discriminant validity - IAT attitude measures
were related to their corresponding self-report measure and not measures
of other traits. Further, using structural equation modeling, this MTMM
investigation revealed that the best-fitting models represented the
IAT and self-report as related, but distinct constructs, rather than
as a single attitude construct, even after accounting for common method
variance in both measures (Nosek & Smyth, 2004; see also Cunningham,
Nezlek, & Banaji, 2004). This extends similar findings for individual
constructs such as Greenwald and Farnham (2000) for self-esteem, and
Cunningham et al. (2001) for racial attitudes. Finally, Nosek (2005)
reported evidence that the relationship between the IAT and self-report
is moderated by multiple interpersonal (self-presentation, perceived
distinctiveness from the norm) and intrapersonal (e.g., evaluative strength)
features of attitudes."
Nosek, B. A., Greenwald,
A. G., & Banaji, M. R. (2007). The Implicit Association Test
at age 7: A methodological and conceptual review (pp. 265292).
In J. A. Bargh (Ed.), Automatic processes in social thinking and
behavior. Psychology Press.
- SALIENCE
ASYMMETRY
Back
to Top
Rothermund and Wentura proposed that subjects responding
to the IAT should find it easier to group categories by perceptual salience
- highly salient categories are easily grouped together. Although this
may well be true, especially when there are extreme variations of perceptual
salience, it seems unlikely that this is a stronger influence on IAT
measures than are association strengths, There is also some debate between
Rothermund and Wentura and Greenwald et al. about the meaning of "association".
Greenwald et al. suggest that the question about the meaning of association
is non-central to interpretation of IAT findings.
Rothermund, K., &
Wentura, D. (2004). Underlying processes in the Implicit Association
Test(IAT): Dissociating salience from associations. Journal of Experimental
Psychology: General, 133, 139-165.
Greenwald, A. G., Nosek, B.
A., Banaji, M. R., & Klauer, K. C. (2005). Validity of the salience
asymmetry interpretation of the IAT: Comment on Rothermund and Wentura
(2004). Journal of Experimental Psychology: General, 134,
420-425. GNBK2005.pdf
Rothermund, K.,
Wentura, D., & De Houwer, J. (2005). Validity of the salience
asymmetry account of the Implicit Association Test: Reply to Greenwald,
Nosek, Banaji, and Klauer (2005). Journal of Experimental Psychology:
General, 134, 426-430.
- SELF-ESTEEM
IAT (construct validity)
Back
to Top
Two types of questions have been raised in regard to uses
of the IAT to measure implicit self-esteem. First, how to conceive what
it measures - in other words, what is implicit self-esteem? This question
was raised by Bosson, Swann, and Pennebaker (2000) in their study that
showed weak correlations among several available measures of implicit
self-esteem. Second, what is the most appropriate concept to contrast
with self in an IAT measure of implicit self-esteem? This question was
raised by Karpinski (2004), to which the Pinter and Greenwald (2005)
article was a reply. Perhaps the best answer to this question is that
it is reasonable to consider multiple alternative contrast categories
for self, something being done in various studies that are not
available as circulatable reports.
Perhaps the best evidence for construct validity of the self-esteem
IAT measure was its use to confirm balance-theoretical predictions of
the role of self-esteem and self-concept in implicit social cognition,
by Greenwald et al. (2002).
Bosson, J. K., Swann, W. B. Jr., & Pennebaker, J. W. (2000).
Stalking the perfect measure of implicit self-esteem: The blind men
and the elephant revisited? Journal of Personality & Social Psychology.
79, 631-643.
Karpinski, A. (2004).
Measuring self-esteem using the Implicit Association Test: The role
of the other. Personality and Social Psychology Bulletin, 30,
22-34.
Pinter, B., & Greenwald, A. G. (2005). Clarifying the role of
the "other" category in the self-esteem IAT. Experimental
Psychology, 52, 74-79.
Greenwald, A. G., Banaji, M. R.,
Rudman, L. A., Farnham, S. D., Nosek, B. A., & Mellott, D. S. (2002).
A unified theory of implicit attitudes, stereotypes, self-esteem, and
self-concept. Psychological Review, 109, 3-25.
- CULTURE
VS. PERSON
Back
to Top
The most interesting question raised about the IAT's construct
validity is the suggestion that the IAT measures associations that reside
in the culture, rather than in the person. This alternative view has
appeared, in different forms, in Karpinski and Hilton (2001), Olson
and Fazio (2004), and Arkes and Tetlock (2004). The Nosek and Hansen
(2004) manuscript and the Banaji, Nosek, and Greenwald (2004) article
respond to these. Perhaps the most compelling type of evidence that
the IAT measures something in the person (even if that is derived from
something in the culture) has been the numerous successful uses of the
IAT to measure individual differences. The large collection of predictive
validity studies involving the IAT, reviewed by Poehlman, Uhlmann, Greenwald,
and Banaji (2005) gives an overview of evidence indicating sensitivity
to individual differences, something also supported by the Greenwald
et al. (2002) and Hofmann et al. (2005) articles cited elsewhere on
this page.
Karpinski,
A., & Hilton, J. L. (2001). Attitudes and the implicit association
test. Journal of Personality and Social Psychology, 81,
774-788.
Olson, M. A.,
& Fazio, R. H. (2004). Reducing the Influence of Extrapersonal
Associations on the Implicit Association Test: Personalizing the IAT.
Journal of Personality & Social Psychology. 86, 653-667.
Nosek, B. A., &
Hansen, J. J. (2004). The associations in our heads belong to us:
Searching for attitudes and knowledge in implicit cognition. Unpublished
manuscript.
Arkes, H. R., &
Tetlock, P. E. (2004). Attributions of Implicit Prejudice, or "Would
Jesse Jackson 'Fail' the Implicit Association Test?" Psychological
Inquiry. 15, 257-278. [entire
exchange]
Banaji, M. R., Nosek, B. A., & Greenwald, A. G. (2004). No place
for nostalgia in science: A response to Arkes and Tetlock. Psychological
Inquiry, 15, 279-289. [entire
exchange]
Arkes, H. R., & Tetlock,
P. E. (2004). The Implicit Prejudice Exchange: Islands of Consensus
in a Sea of Controversy: Response. Psychological Inquiry,
15, 311-321. [entire
exchange]
Poehlman, T. A., Uhlmann,
E., Greenwald, A. G., & Banaji, M. R. (2006). Understanding
and using the Implicit Association Test: III. Meta-analysis of predictive
validity. Manuscript submitted for publication.
Internal
Validity
Back
to Top
(Do IAT procedures generate effects that are due to variables
other than the category variations represented in the selections of
stimuli used for IAT measures?)
Most all of the following internal
validity topics were first investigated in response to interpretive
questions raised repeatedly by both psychologists and laypersons who
experienced the IAT on the web in group demonstrations. Several of these
questions were never raised seriously in print because they had been
resolved (in articles cited in this section) very early in the course
of research into the IAT's properties.
- FAMILIARITY
Back
to Top
Comments expressed as questions in pre-1998 conference and
colloquium presentations, and by many laypersons who completed an IAT
on the web, often oberved that, because familiar stimuli might be better
liked, stimulus item familiarity was a possible source of artifact in
IAT attitude measures. Several studies examined this question. Their
findings established that familiarity was not a substantial source of
artifact, so long as the stimuli used to represent a category in the
IAT were not entirely unfamiliar (such as nonsense strings).
Brendl, Markman, and Messner (2001) is an example of a study that used
totally unfamiliar stimuli, intending these to represent pseudo-categories
that should lack associations with other categories. Greenwald and Nosek
(2001) concluded that such uses of totally unfamiliar stimuli would
yield problematic IAT findings and should be avoided. The best explanation
for the problematic nature of this procedure may be that the IAT has
been shown to operate at the level of the categories that contain the
stimuli used in the task. When the task stimuli fall into no existing
category, the IAT appears not to work as desired. The situation might
be similar if items chosen to represent a category in an IAT were poor
exemplars that were difficult to classify into the intended category.
This circumstance should be avoided, as it is known to produce slow
responding and psychometrically inferior IAT measures.
Rudman,
L. A., Greenwald, A. G., Mellott, D. S., & Schwartz, J. L. K. (1999).
Measuring the automatic components of prejudice: Flexibility and generality
of the Implicit Association Test. Social Cognition, 17,
437-465.
Dasgupta, N.,
McGhee, D. E., Greenwald, A. G., & Banaji, M. R. (2000). Automatic
preference for White Americans: Eliminating the familiaritexplanation.
Journal of Experimental Social Psychology, 36, 316-328.
Ottaway,
S. A., Hayden, D. C., & Oakes, M. A. (2001). Implicit attitudes
and racism: Effects of word familiarity and frequency on the implicit
association test. Social Cognition, 19, 97-144.
Brendl,
C. M., Markman, A. B., & Messner, C. (2001). How do indirect
measures of evaluation work? Evaluating the inference of prejudice in
the Implicit Association Test. Journal of Personality and Social
Psychology, 81, 760-773.
Dasgupta, N.,
Greenwald, A. G., & Banaji, M. R. (2003). The first ontological
challenge to the IAT: Attitude or mere familiarity? Psychological
Inquiry, 14, 238-243.
Greenwald, A. G., &
Nosek, B. A. (2001). Health of the Implicit Association Test at
age 3. Zeitschrift für Experimentelle Psychologie, 48,
85-93.
- ORDER
OF COMBINED TASKS
Back
to Top
The first IAT publication (Greenwald et al., 1998) observed
that, other things being equal, strengths of associations used in the
first of the IAT's two combined tasks appeared to be stronger than those
used in the second. This finding was subsequently observed in numerous
other studies that used a procedure of counterbalancing order of presentation
of the IAT's two combined tasks. Nosek,
Greenwald, & Banaji (2005) showed the possibility of reducing this
order effect by increasing the amount of practice on the 5th of the
7 blocks of the standard IAT procedure.
Greenwald, A. G., McGhee,
D. E., & Schwartz, J. L. K. (1998). Measuring individual differences
in implicit cognition: The Implicit Association Test. Journal of
Personality and Social Psychology, 74, 1464-1480.
Nosek, B. A., Greenwald, A. G., & Banaji, M. R. (2005). Understanding
and using the Implicit Association Test: II. Method variables and construct
validity. Personality and Social Psychology Bulletin, 31,
166-180.
- PREVIOUS
EXPERIENCE WITH THE IAT (order of measures)
Back
to Top
Greenwald, Nosek, & Banaji (2003) reported that IAT
effects were slightly reduced in magnitude for subjects who had prior
experience taking the IAT. No interpretation of this effect has yet
been offered.
Greenwald, A. G, Nosek, B. A., &
Banaji, M. R. (2003). Understanding and using the Implicit Association
Test: I. An improved scoring algorithm. Journal of Personality and
Social Psychology, 85, 197-216.
- INTERTRIAL
INTERVAL DURATION
Back
to Top
Greenwald et al. (1998, Experiment 1) reported that the
Interval between successive trials of the IAT, in the range from 100
ms to 700 ms, had virtually no effect on magnitude of IAT effects.
Greenwald, A. G., McGhee,
D. E., & Schwartz, J. L. K. (1998). Measuring individual differences
in implicit cognition: The Implicit Association Test. Journal of
Personality and Social Psychology, 74, 1464-1480.
- ASSIGNMENT
OF CATEGORIES TO RIGHT OR LEFT KEY
Back
to Top
Some have suspected that a perhaps-natural association of
right side with positive valence would make attitudinal IATs vary as
a function of whether the pleasant category was assigned to the left
or the right side. Greenwald et al. (1998) reported no effect of assigning
pleasant and unpleasant categories to the right or left side. Numerous
subsequent studies have similarly counterbalanced side to which categories
are assigned, with no reports that this produced variations in results.
Greenwald, A. G., McGhee,
D. E., & Schwartz, J. L. K. (1998). Measuring individual differences
in implicit cognition: The Implicit Association Test. Journal of
Personality and Social Psychology, 74, 1464-1480.
- HANDEDNESS
Back
to Top
Greenwald & Nosek (2001, p. 87) reported no effects
on IAT scores of subject self-reported handedness in a large web-based
data collection.
Greenwald, A. G., & Nosek, B. A. (2001). Health of the Implicit
Association Test at age 3. Zeitschrift für Experimentelle Psychologie,
48, 85-93.
- SYSTEMATIC
METHOD VARIANCE AND TASK-SWITCHING
Back
to Top
This category includes the role of task-switching; cognitive
fluency; and age on IAT measures. As an overview: It appears that cognitive
fluency and age are both associated with general slowing of responding,
which tends to increase latency difference effects (and therefore IAT
measures). Effects of cognitive fluency and age on IAT measures are
substantially moderated (even if not eliminated) by using the IAT's
improved scoring algorithm for the IAT, which reduces the correlation
of IAT effects with average latency of responding to a small value (Greenwald,
Nosek, & Banaji, JPSP, 2003).
In regard to task switching: It appears that the association
strengths measured by the IAT reduce task-switching costs when associated
categories share a response. Consequently, task switching costs may
be integral to what is measured by the IAT, rather than being an artifactual
contributor to IAT effects.
McFarland,
S. G., & Crouch, Z. (2002). A cognitive skill confound on the
Implicit Association Test. Social Cognition, 20, 483-510.
Mierke, J., & Klauer, K. C. (2001). Implicit association measurement
with the IAT: Evidence for effects of executive control porcesses. Zeitschrift
fuer Experimentelle Psychologie. 48, 107-122. (no PDF available)
Mierke, J. & Klauer, K. C. (2003). Method-specific variance in the
Implicit Association Test. Journal of Personality and Social Psychology,
85, 1180-1192. (no PDF available)
Klauer, K.
C., & Mierke, J. (2005). Task-set inertia, attitude accessibility,
and compatibility-order effects: New evidence for a task-set switching
account of the IAT effect. Personality and Social Psychology Bulletin,
in press.
Back, M. D., Schmukle,
S. C., & Egloff, B. (2005). Measuring task-switching ability
in the Implicit Association Test. Experimental Psychology, 52,
167-179.
Greenwald, A. G, Nosek, B. A., &
Banaji, M. R. (2003). Understanding and using the Implicit Association
Test: I. An improved scoring algorithm. Journal of Personality and
Social Psychology, 85, 197-216.
Cai, H., Sriram, N., Greenwald, A. G., & McFarland, S. G. (2004).
The Implicit Association Test's D measure can minimize a cognitive skill
confound: Comment on McFarland and Crouch (2002). Social Cognition,
22, 673-684.
-
FAKEABILITY
Back
to Top
Several studies have examined the effects of asking subjects
to fake their IAT performances. Findings reveal that it is difficult
to fake IAT performances, although modest effects of faking have been
observed in some of these studies. Clearly, the best strategy for faking
the IAT (although few research subjects discover it on their own) is
to respond slowly in whichever of the two combined tasks is easier.
Research that is currently underway (nothing yet published) is seeking
to identify statistical patterns that will indicate that a respondent
is attempting to fake an IAT result.
Banse, R., Seise, J., & Zerbes, N. (2001). Implicit attitudes toward
homosexuality: Reliability, validity, and controllability of the IAT.
Zeitschrift fur Experimentelle Psychologie, 48, 145-160.
(no PDF available)
Kim, D. Y. (2003).
Voluntary controllability of the Implicit Association Test (IAT). Social
Psychology Quarterly, 66, 83-96.
Steffens,
M. C. (2004). Is the Implicit Association Test Immune to Faking?
Experimental Psychology, 51, 2004, 165-179.
Egloff,
B., & Schmukle, S. C. (2002). Predictive validity of an Implicit
Association Test for measuring anxiety. Journal of Personality and
Social Psychology, 83, 1441-1455.
Fiedler,
K., & Bluemke, M. (in press). Faking the IAT: Aided and Unaided
Response Control on the Implicit Association Tests. Basic and Applied
Social Psychology.
- STIMULUS
ITEM (EXEMPLAR) SELECTION
Back
to Top
Results obtained by Steffens and Plewe (2001), Govan and
Williams (2004), Mitchell, Nosek, & Banaji (2003) and Bluemke &
Friese (2005) establish that it is possible to influence IAT measures
by selecting stimuli in a fashion that confounds a category distinction
of interest with some other category distinction. For example, Mitchell
et al. showed that implicit race attitudes were influenced when category
exemplars were selected to represent either (a) disliked Black and liked
White or, alternately, liked Black and disliked White. Govan and Williams
showed a similar result and also showed that category exemplars of flowers
and insects could be varied by identifying liked insects (e.g., butterfly,
firefly, grasshopper) and disliked flowers (some of which were not exactly
flowers - e.g., poison ivy, nettles, weed). In a gender attitude IAT,
Steffens and Plewe represented valence concepts with adjectives that
were confounded with gender (e.g., beautiful, aesthetic, tasteful [positive]
vs. martial, corrupt, violent [negative] or successful,
logical, independent [positive] vs. prim,
bitchy, and hysterical [negative]). These findings make clear that it
is wise to avoid selecting stimuli so as to permit the subject to the
IAT's category distinctions in more than one way. At the same time,
De Houwer (2001) clearly showed that the IAT effectively tolerated a
good deal of variance in an irrelevant attribute (valence) that was
allowed to vary when subjects were asked to classify names as British
(e.g., Princess Diana, the Queen Mother, and two mass murderers) and
foreign (e.g., Einstein, Ghandi, Hitler, and Saddam Hussein).
Steffens,
M. C., & Plewe, I. (2001). Items' cross-category associations
as a confounding factor in the Implicit Association Test. Zeitschrift
für Experimentelle Psychologie, 48, 123-134.
De Houwer, J.,
(2001). A structural and process analysis of the Implicit Association
Test. Journal of Experimental Social Psychology, 37, 443-451.f
Mitchell,
J. P., Nosek, B. A., & Banaji, M. R. (2003). Contextual variations
in implicit evaluation. Journal of Experimental Psychology: General,
132, 455-469.
Govan, C.
L., & Williams, K. D. (2004). Reversing or eliminating IAT effects
by changing the affective valence of the stimulus items. Journal
of Experimental Social Psychology, 40, 357-365.
Bluemke,
M., & Friese, M. (in press). Do features of stimuli influence
IAT effects? Journal of Experimental Social Psychology
Statistical
Conclusion Validity Back
to Top
(Are the numbers of the IAT measures being interpreted appropriately?)
Questions about interpretation of
numerical values of IAT measures have been raised in two articles (both
by Blanton & Jaccard, in press). The Greenwald et al. (in press)
responses to these articles explain how IAT measures, although less
than perfect, are being used in reasonable ways. The issues in the first
pair of articles have to do mainly with whether the zero point and units
of IAT measures are meaningful. The second set of articles also has
to do with the meaningfulness of the IAT measure's zero point, with
the added consideration of the relevance of this to testing multiplicative
relationships among triads of association-strength measures as theorized
by Greenwald et al. (2002). These issues have little relation to construct
validity and internal validity questions.
- METRIC
PROPERTIES OF IAT MEASURES
Back
to Top
Blanton, H., & Jaccard, J. (2006). Arbitrary metrics in psychology.
American Psychologist., 61, 27-41. (Authors declined to
make PDF available)
Greenwald,
A. G., Nosek, B. A., & Sriram, N. (2006). Consequential validity
of the Implicit Association Test: Comment on the article by Blanton
and Jaccard. American Psychologist, 61, 5661.
- TESTING
MULTIPLICATIVE MODELS
Back
to Top
Greenwald,
A. G., Banaji, M. R., Rudman, L. A., Farnham, S. D., Nosek, B. A., &
Mellott, D. S. (2002). A unified theory
of implicit attitudes, stereotypes, self-esteem, and self-concept. Psychological
Review, 109, 3-25. UnifiedTheory.2002.pdf
Greenwald,
A. G., Rudman, L. A., Nosek, B. A., & Zayas, V. (2006).
Why so little faith? A reply to Blanton and Jaccard's (2006) skeptical
view of testing pure multiplicative theories. Psychological Review,
113, 170180.
Blanton,
H., & Jaccard, J. (2006). Tests of multiplicative models in psychology:
A case study using the unified theory of implicit attitudes, stereotypes,
self-esteem, and self-concept. Psychological Review, 113,
55-65.
(Authors declined to make PDF available)
For
downloadable publications on the IAT, go to the
publications by topic
page
|