Set up/Background

Our goal here is to validate the accuracy of the risksetROC package. Specifically, we want to see if we have a "true" AUC trace at a given set of time of interest, how close is the estimated AUC trace if we use the risksetROC package to produce the AUCs. We here describe the background and compare the "true" and simulated AUC traces. For reference, we include the simulation file, two output files and two pdf files. The first output file contains AUCs at time points of interest averaged over simulations and produced using different method of estimation (MLE, Cox, LocalCox, Schoenfeld with order = 0 and order = 1). The second output file contains the integrated AUC (iAUC) traces over simulations. We also provide the link for the documentation file for risksetROC package.

We start with the setup for simulation. We simulate (log) survival times and marker values from a bivariate normal distribution with m = (0, 10)T, r = -0.70, and variance of 1 (please note that time is in log scale if not mentioned otherwise). As per the convention, r is taken to be negative so that higher marker value indicates smaller event time. Censoring time is generated independently as normal, so that, about 40% of the observations are censored. Using this data we estimate ROC curves, AUCs at each of TargetTimes ranging from 7.5 to 12.5. Let n denote the sample size and Nsim denote the number of repetitions. For each of Nsim number of repetitions, we generate data for n individuals and repeat this procedure Nsim times, to get Nsim set of AUCs values at each of TargetTimes. Finally, we estimate the integrated AUC (iAUC) from the estimated AUC and the Kaplan-Meier estimate of the survival time.

As discussed in the paper by Heagerty and Zheng (2005), Biometrics, 61, 92 -- 105, the true positive (TP) and false positive based on a bivariate normal distribution of Marker(M) and Time(T) can be expressed as follows:
TPtI(c) = P(M > c | T = t) = F(r (log(t) - c)/ ((1-r2)1/2))

FPtD(c) = P(M > c | T > t) = S2N (c, log(t); r)/F(-log(t))

The integrated AUC (iAUC or concordance index) is given by: C = sin -1(-r)/p+0.5

The "true" AUC trace and concordance index is obtained as follows. At each of the TargetTimes, we evaluated the TP and FP using the known parametric values and obtained AUC by numerically integrating the area under ROC curve. True concordance index is evaluated at the known r.

The AUC curves are estimates using five methods. We first use MLEs of the parameter along with the known parametric form of TP and FP to estimate the bivariate normal AUC. The iAUC is estimated using the MLE of the correlation coefficient of marker and event time. Estimation for MLEs is discussed below while the details of the rest of the methods (Cox, LocalCox, Schoenfeld with order = 0 and order = 1) are discussed in the aforementioned paper and the risksetROC package is used to estimate AUC traces at TargetTimes as well as iAUC.

The MLEs are estimated as follows. First note that we have observed marker values and (censored) event times. Observed marker mean and variance is used as MLEs for marker distribution. To estimate the expectation and variance from the censored event time we fit a parametric survival model for event time with normal error and marker as the only covariate:

T|M = b0 + b1 M + Error
so, E(T) = b0 + b1 E(M)
and V(T) = b12 V(M) + Error Variance
Also, b1 = r (V(T)/V(M))1/2

Once the estimates of b0, b1 and error variance are obtained, we estimated the mean and variance of event time and correlation using the above relations. R package survival and function survreg was used to estimate the regression coefficients.

Two output files are out1.txt and out2.txt, the first contains the average AUC values at each TargetTimes (averaged over simulations) while the second contains the iAUC values over simulations. The plot associated with out1.txt is figure 1 while figure 2 plots the iAUC traces over simulations. Please see the documentation file for further details of the functions and more background.


mail to: Paramita Saha
Last modified: Mon Aug 27 20:31:21 PDT 2007