Bivariate relationships: Categorical data Assessing relationships in bivariate data
 

1) is there a relationship?

  • statistical independence/dependence
  • 2) strength of relationship? 3) direction of relationship?
     
     
     
     

    2 x 2 tables
     

  • binary/dichotomous variable (yes/no re reference value; two values)

  •  
  • measures of association

  •  
  • relative risk (RR) = % in reference category of dv for one category of iv divided by % in reference category of dv for other category of iv
  • odds ratio (OR) = odds of falling in reference category to not falling in reference category for one category of iv to odds for other category of iv
  • if row variable is i.v. and column variable is d.v., and right column is reference category of d.v. (this format is different from text), then
  •  
    a
    b
    c
    d
     
    RR = (b / (a + b)) / (d / (c + d))

    increased/decreased risk =
    (RR - 1) x 100%
     

    OR = (b/a) / (d/c)
     
     
     

    Perry Preschool (Schweinhart, Barnes, Weikart)

    randomized experiment: preschool vs. control (no preschool)

    outcome at age 27:

     
    never arrested
    arrested 1+ times
     
    preschool
    25 (43%)
    33 (57%)
    58
    control
    20 (31%)
    45 (69%)
    65
     
    45
    78
    113

     
     
     
     
     
     

    Association can be interpreted as causation in experimental studies
     
     

    be careful w/ percentage increases--also important to consider absolute percentage to interpret fully (issue of baseline risk)
     
     
     

    Pitfalls in analyzing associations

    Simpson's paradox


    Ignoring denominators/Incomplete crosstabulations


     
     
     

    Multiple interpretations of associations in observational studies


     
     
     

    Elaboration/partial tables

    Prosecution of child sexual abuse cases (Brewer, Rowe, Brewer)
     
     
     
     
    rejected
    prosecuted
     
    medical evidence
    19 (44%)
    24 (56%)
    43
    no med. evidence
    43 (61%)
    28 (39%)
    71
     
    62
    52
    114

     
     

    elaboration: partial tables re seriousness of abuse (conditional relationship?)
     

    less serious abuse (e.g., fondling)
     
    rejected
    prosecuted
     
    medical evidence
    8 (100%)
    0 (0%)
    8
    no med. evidence
    20 (67%)
    10 (33%)
    30
     
    28
    10
    38

     
     

    more serious abuse (e.g., penetration)
     
    rejected
    prosecuted
     
    medical evidence
    11 (31%)
    24 (69%)
    35
    no med. evidence
    23 (56%)
    18 (44%)
    41
     
    34
    42
    76

     
     

    Be careful about three criteria for causality - association, time order, and nonspuriousness (no confounding)