#############################################################################################
# The data file Vax004data578.12.05.part1.dat contains the following 26 variables,
# with one variable per column. This dataset contains one record per subject (repeated
# measures over time for HIV infected subjects are in the second dataset described below,
# Vax004data578.12.05.part2.dat).
#
# Variables in columns for dataset Vax004data578.12.05.part1.dat:
#
# 1.  Subject identification number
# 2.  Vaccination assignment (1=vaccine; 0=placebo)
# 3.  Sex (0=male; 1=female)
# 4.  Age (1=18-25; 2=26-30; 3=31-40; 4=41-50; 5=>50)
# 5.  White (1=white; 0=o/w)
# 6.  Black (1=black; 0=o/w)
# 7.  Asian (1=asian; 0=o/w)
# 8.  hisp (1=hispanic; 0=o/w)
# 9.  Otherrace (1=another race; 0=o/w)
# 10. Education (1=Less than High School; 2=High School Graduate;
#     3=College Graduate; 4=Advanced Degree)
# 11. US (1=United States; 0=o/w)
# 12. Canada (1=Canada; 0=o/w)
# 13. Netherlands (1=Netherlands; 0=o/w)
# 14. Region (1=Northeast; 2=South; 3=Southwest; 4=West coast; 5=Midwest; 6=Netherlands)
# 15. Baseline risk score (1-7, summarizing the amount of risky behavior as measured by
#     a baseline questionnaire. For many analyses it has been divided into
#     low (score 0), medium (score 1-3), and high (score > 3).
#     It can also be treated as a quantitative score variable.)
# 16. Level of CD4 blocking antibody response 2 weeks after the Month 6 immunization
#     (quantitative, measured in almost all HIV infected vaccinees,
#     and in a random sample of uninfected vaccinees.)
# 17. Level of log10 MN neutralization response 2 weeks after the Month 6 immunization
#     (quantitative, measured in almost all HIV infected vaccinees,
#     and in a random sample of uninfected vaccinees.)
# 18. HIV infection status (1 if infected during follow-up; 0 if not)
# 19. Estimated days to infection (The days between randomization and the estimated
#     date of HIV infection. For subjects who never become HIV infected, it is the
#     total time of follow-up, that is, the censoring time.)
# 20. Days to infection Dx (The days between randomization and the date of HIV infection
#     detection. For subjects who never become HIV infected, it is the total time
#     of follow-up.)
# 21. Calendar time of HIV infection Dx (categorical, 1=1998-1999; 2=2000 Jan 1 to June 30;
#     3=2000 July 1 to Dec 31; 4=2001; 5=2002)
# 22. Start ART (NA if never became HIV infected; 0 if infected and never started
#     antiretroviral therapy (ART); 1 if infected and started
#     ART at some point during follow-up.)
# 23. Days Start ART (NA if never became HIV infected. For HIV infected subjects,
#     the days between HIV infection detection and initiation
#     of ART if treatmentstart=1; the days between HIV infection
#     detection and censoring if treatmentstart=0.
#     This variable is a key time variable for post-infection analyses,
#     as it measures the total time of follow-up after diagnosis
#     of HIV infection.)
# 24. HIV genetic distance 1 (NA if never became HIV infected. For HIV infected subjects,
#     the Neutralizing face core genetic distance of the infecting
#     HIV sequence to the GNE8 HIV strain represented in the vaccine.)
# 25. HIV genetic distance 2 (NA if never became HIV infected. For HIV infected subjects,
#     the Neutralizing face core + V1/V2 genetic distance of the infecting
#     HIV sequence to the GNE8 HIV strain represented in the vaccine.)
# 26. HIV genetic distance 3 (NA if never became HIV infected. For HIV infected subjects,
#     the V3 loop genetic distance of the infecting
#     HIV sequence to the GNE8 HIV strain represented in the vaccine.)
###################################################################################################

#######################################################################
# Second dataset: Vax004postinfectiondata2.11.05.dat
#
# Variables in columns, with repeated measures over time placed in multiple rows
# for each subject.
#
# 1.  Subject identification number (For linking with the first dataset)
# 2.  Group Time (The grouped post-infection visit time in days,
#     with 9 possible values: 15 30 61 122 244 365 487 609 730.)
# 3.  Viral Load (The log10 plasma RNA viral load measured from a
#     blood sample drawn on the visit date defined by
#     Days Since Infection Dx and grouptime.)
# 4.  CD4 (The square-rooted CD4 cell count measured from a blood sample
#     drawn on the visit date defined by Days Since Infection Dx and
#     grouptime.)
###################################################################################################

#################################################################################################
# NOTES
# For all analyses, viral loads and CD4 cell counts measured on or after the
# date of antiretroviral therapy initiation should be excluded.
#
# Some viral loads are left-censored at < log10(400) copies/ml;
# some are right-censored at > log10(750,000) copies/ml.
#
# Sometimes a subject will have multiple viral load and/or CD4 values at a scheduled visit (as
# indicated by the grouptime variable). For such cases a convention can be used, such as taking
# the median value.
###########################################################################################
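#
# EXAMPLE (illustrative sketch only, not part of the original data documentation)
# The rules in the NOTES above could be applied along the following lines in Python.
# The function names, the tuple layout, and the demo records are all hypothetical.
# Two assumptions are made that the documentation does not confirm: (a) grouptime is
# used as a stand-in for the actual visit day when comparing against Days Start ART
# (variable 23), and (b) censored viral loads are stored at the assay limit itself.

```python
import math
from collections import defaultdict
from statistics import median

# Censoring limits from the NOTES, on the log10 copies/ml scale.
LOWER_LIMIT = math.log10(400)
UPPER_LIMIT = math.log10(750_000)


def censoring_flag(log10_vl, tol=1e-6):
    """Return 'left', 'right', or 'none' for a log10 viral load.

    Assumption: a censored value is recorded at (or beyond) the assay limit.
    """
    if log10_vl <= LOWER_LIMIT + tol:
        return "left"
    if log10_vl >= UPPER_LIMIT - tol:
        return "right"
    return "none"


def pre_art_medians(visits, art_info):
    """Drop measurements on or after ART start, then take medians per visit.

    visits:   iterable of (subject_id, grouptime, log10_vl, sqrt_cd4) tuples,
              mirroring the four columns of the second dataset (None = missing).
    art_info: dict mapping subject_id -> (start_art, days_start_art),
              mirroring variables 22 and 23 of the first dataset.
    Returns a dict mapping (subject_id, grouptime) -> (median_vl, median_cd4).
    """
    grouped = defaultdict(lambda: ([], []))
    for subject_id, grouptime, log10_vl, sqrt_cd4 in visits:
        start_art, days_start_art = art_info[subject_id]
        # Assumption: grouptime approximates days since infection diagnosis.
        if start_art == 1 and grouptime >= days_start_art:
            continue
        vl_values, cd4_values = grouped[(subject_id, grouptime)]
        if log10_vl is not None:
            vl_values.append(log10_vl)
        if sqrt_cd4 is not None:
            cd4_values.append(sqrt_cd4)
    return {
        key: (
            median(vl_values) if vl_values else None,
            median(cd4_values) if cd4_values else None,
        )
        for key, (vl_values, cd4_values) in grouped.items()
    }


# Made-up records for illustration; these are not values from the trial.
demo_visits = [
    (101, 15, 4.0, 22.0),
    (101, 15, 4.5, 24.0),  # second measurement at the same grouped visit
    (101, 30, 4.2, 23.0),
    (101, 61, 2.0, 25.0),  # on/after ART start (day 45), so excluded
    (102, 15, 5.0, 20.0),
]
demo_art = {101: (1, 45), 102: (0, 300)}

for key, values in sorted(pre_art_medians(demo_visits, demo_art).items()):
    print(key, values)
```

# The median is only one possible convention for repeated values at a visit; if the
# exact visit day is available, it would be a better basis for the ART exclusion
# than the grouped visit time used in this sketch.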