KAPLANMEIER procedure
Calculates the Kaplan-Meier estimate of the survivor function (J.T.N.M. Thissen).
Options
Parameters
Description
Survival data are data in which the response variate is the lifetime of a component or the survival time of a patient. Typically these are censored, i.e. the survival time of some units is unknown at the end of the study. The survivor function F(t) is a key element in the analysis of survival data. It is defined as the probability of an individual still surviving at time t. KAPLANMEIER calculates the Kaplan-Meier estimate of the survivor function for two different types of data.
The first type of data occurs when all timepoints are accurately observed. The observed timepoints or the timepoints at which censoring took place are then specified using the TIME parameter. The CENSORED variate contains values 0 and 1 to specify whether the corresponding element of TIME is censored (1) or not (0); if there was no censoring, this need not be set. The GROUPS parameter can be used to specify a factor to indicate different groups whose survivor functions are to be estimated separately. The distinct TIME values can be saved using the EVENT parameter, and the number of deaths and the number of units at risk at each individual EVENT can be saved using parameters NDEATH and NATRISK respectively. The Kaplan-Meier estimate can be saved with the ESTIMATE parameter. The NEWGROUPS parameter can save a factor indicating the group structure of the output variates.
The second type of data is relevant when the units are observed at the end of time-intervals. The exact times are then unknown and input should be specified using parameters EVENT, NDEATH, NATRISK. These specify the timepoints, number of deaths and number of risk at the end of each interval. The GROUPS parameter can again be used to request separate group estimates.
The PRINT option selects the output to be displayed with settings:
The default is PRINT=estimates,graph.
The probability level for the Kaplan-Meier estimate confidence interval can be set using the PROBABILITY option; by default this is 0.95. Percentiles for estimating survival times can be set using the PERCENTILES option; by default this is 25,50,75. If PRINT=graph is set, then the PLOT option can be used to include censored observations and a reference line at S(t)=0.5 to indicate the median survival time. If GRAPHICS=highresolution different lines are drawn for different groups, whereas GRAPHICS=lineprinter produces separate graphs for the different groups. Lower and upper bounds for the x-axis can be set by options XLOWER and XUPPER, the TITLE option can specify a title for the plots. Options WINDOW and KEYWINDOW control the windows used for high-resolution graphs.
Options: PRINT, GRAPHICS, TITLE, WINDOW, KEYWINDOW, SCREEN, PROBABILITY, XLOWER, XUPPER, PLOT, PERCENTILES.
Parameters: TIME, CENSORED, GROUPS, EVENT, NDEATH, NATRISK, ESTIMATE, NEWGROUPS.
Method
When TIME is set, the Kaplan-Meier estimate is calculated according to equation (1.10) in Kalbfleisch & Prentice (1980). When TIME is not set, the Kaplan-Meier estimate is directly calculated from the variates specified by EVENT, NDEATH and NATRISK. If PERCENTILES includes the median (50) then a confidence interval is displayed for the median using the method described in Brookmeyer & Crowley (1982). The mean survival time is calculated by the formula
μ = ∑i=1...k { S(ti-1) × (ti - ti-1) }
where
k is the number of ordered death times,
S(ti-1) is the Kaplan-Meier estimate of the survivor function at the (i-1)th death time,
ti is the death time, where t0 is defined to be zero
Its standard error is calculated using the formula:
se(μ) = √[ (m/m-1) x ∑i=1...k-1 { (Ai ** (2/ni)) × (ni - di) } ]
where
m = ∑i=1...k { di }
Ai = ∑j=1...k-1 { S(tj-1) × (tj+1 - tj) }
Action with
RESTRICT
The input variates and factor GROUPS may be restricted identically. The Kaplan-Meier estimate is based only on the units not excluded by the restriction.
Reference
Brookmeyer, R. & Crowley, J. (1982). A confidence interval for the median survival time. Biometrics, 38, 29-41.
Collett, D. (1994). Modelling Survival Data in Medical Research. Chapman & Hall. London.
Kalbfleisch, J.D. & Prentice, R.L. (1980). The Statistical Analysis of Failure Time Data. Wiley, New York.