.BG Fits proportional hazards regression model to case-cohort data .FN cch .DN Returns estimates and standard errors from relative risk regression fit to data from case-cohort studies. A choice is available among the Prentice, Self-Prentice and Lin-Ying methods for estimation of regression coefficients and standard errors. .IP .CS cch_function(formula, data=sys.parent(), cc, id, cohort.size, tenter=NULL, method="Prentice") .RA .AG formula A formula object that must have a Surv object as the response on the left of the ~ operator and model terms separated by + operators on the right. The Surv object must be of type "right", Surv(time,event), or of type counting, Surv(tenter,texit,event). .AG subcoh Vector of indicators for subjects sampled as part of the sub-cohort 0 = not a sub-cohort member, 1 = a sub-cohort member .AG id Vector of unique identifiers .AG cohort.size Scalar with size of original cohort from which subcohort was sampled .OA .AG data An optional data frame in which to interpret the variables occurring in the formula. .AG method Three procedures are available. The default method is "Prentice", with options for "SelfPren" or "LinYing". .RT list of estimated regression coefficients and two estimates of their asymptotic variance-covariance matrix. .RC coef regression coefficients. .RC naive.var Self-Prentice model based variance-covariance matrix. .RC var Lin-Ying empirical variance-covariance matrix. .DT Implements methods for case-cohort data analysis described by Therneau and Li (1999). The three methods differ in the choice of "risk sets" used to compare the covariate values of the failure with those of others at risk at the time of failure. "Prentice" uses the sub-cohort members "at risk" plus the failure if that occurs outside the sub-cohort and is score unbiased. "SelfPren" (Self-Prentice) uses just the sub-cohort membes "at risk". These two have the same asymptotic variance-covariance matrix. "LinYing" (Lin-Ying) uses the all members of the sub-cohort and all failures outside the sub-cohort who are "at risk". The methods also differ in the weights given to different score contributions. .IP The output may be examined by the summary function, which returns the estimated coefficients and their standard errors in a tabular form. .SH REFERENCES Prentice, RL (1986). A case-cohort design for epidemiologic cohort studies and disease prevention trials. Biometrika 73: 1-11. Self, S and Prentice, RL (1988). Asymptotic distribution theory and efficiency results for case-cohort studies. Annals of Statistics 16: 64-81. Lin, DY and Wei, LJ (1989). The robust inference for the Cox proportional hazards model. Journal of the American Statistical Association 84: 1074-1078. Lin, DY and Ying, Z (1993). Cox regression with incomplete covariate measurements. Journal of the American Statistical Association 88: 1341-1349. Barlow, WE (1994). Robust variance estimation for the case-cohort design. Biometrics 50: 1064-1072 Therneau, TM and Li, H (1999). Computing the Cox model for case-cohort designs. Lifetime Data Analysis 5: 99-112. Borgan, O et al. (1999). Exposure stratified case-cohort designs. Lifetime Data Analysis (to appear) .EX options(contrasts=c("contr.treatment","contr.poly")) # Read in the complete Wilms Tumor Data # (Breslow and Chatterjee, Applied Statistics, 1999) # # Read 4088 individual data records and place in data frame # nwtco_read.table("nwtco.asc",header=T) nwtco_nwtco[!is.na(nwtco$edrel),] # Only 4028 subjects have positive observation time attach(nwtco,pos=1) # # Select subcohort by simple random sampling # set.seed(300) id.subcoh_sample(seqno,size=668) subcoh_as.numeric(!is.na(match(seqno,id.subcoh))) selccoh_rel==1|subcoh==1 ccoh.data_data.frame(nwtco[selccoh,]) ccoh.data$subcohort_subcoh[selccoh] ccoh.data$histol_factor(ccoh.data$histol,labels=c("FH","UH")) # Central histology ccoh.data$stage_factor(ccoh.data$stage,labels=c("I","II","III","IV")) # Stage ccoh.data$age_ccoh.data$age/12 # Age in years # # Standard case-cohort analysis: simple random subcohort # fit.ccP <- cch(Surv(edrel, rel) ~ stage + histol + age, data = ccoh.data, subcoh = ccoh.data$subcoh, id = ccoh.data$seqno, cohort.size = 4028) # # Output summary(fit.ccP) $call: cch(formula = Surv(edrel, rel) ~ stage + histol + age, data = ccoh.data, subcoh = ccoh.data$ subcoh, id = ccoh.data$seqno, cohort.size = 4028) $method: [1] "Prentice" $cohort.size: [1] 4028 $subcohort.size: 1 668 $coefficients: Value SE Z p stageII 0.73457084 0.16988715 4.323875 1.533121e-005 stageIII 0.59708356 0.17530637 3.405943 6.593588e-004 stageIV 1.38413197 0.20802431 6.653703 2.858092e-011 histol 1.49806307 0.16430597 9.117521 0.000000e+000 age 0.04326787 0.02427407 1.782473 7.467220e-002 attr(, "class"): [1] "summary.cch"