RSURVIVAL procedure
Models survival times of exponential, Weibull, extreme-value, log-logistic or lognormal distributions (R.W. Payne & D.A. Murray).
Options
Parameter
Description
RSURVIVAL models survival times assuming that they follow either an exponential, Weibull, extreme-value, log-logistic or lognormal distribution, as indicated by the DISTRIBUTION option. It also caters for right-censored observations, where the subject concerned survived the trial: the CENSORED option can be used to specify a variate with an entry for each subject containing one where the subject survived, otherwise zero. The log-logistic caters for left-censored observations, which they can be specified by an entry of -1 in the CENSORED variate. The model to be fitted to the survival times is specified using the TERMS parameter.
The analysis is performed using the generalized linear models facilities of GenStat. For the exponential, Weibull and extreme-value distributions a y-variate (= 1 - CENSORED) is specified indicating whether the subject died or survived, and an offset variate is included which depends on the time variate (see Chapter 6 of Aitkin et al. 1989). For the exponential distribution this offset is simply the logarithm of the times. With the Weibull distribution it is the Weibull parameter α multiplied by the logarithm of the times, while for the extreme-value distribution it is the parameter α multiplied by the times. The parameters of the TERMS model and α itself are estimated alternately (with number of cycles controlled by the MAXCYCLE option) until successive estimates are within a tolerance specified by the TOLERANCE option. The ALPHA option can input an initial value for α and save the estimated value. By setting the MAXCYCLE option to one, α can be fixed at the initial value; this is useful for comparing one model with another, when the value of α should be fixed at the value estimated from the more complicated model. The log-logistic distribution is fitted using a logistic regression model with number of successes 1-c and binomial denominator 2-c-b (where c is an index for a right-censored observation and b is an index for a left-censored observation) using an offset variate of the logarithm of times divided by σ. The parameters of the TERMS model and σ (shape parameter) are estimated alternately (with number of cycles controlled by the MAXCYCLE option) until successive estimates are within a tolerance specified by the TOLERANCE option. For the lognormal distribution maximization of the log-likelihood is achieved using an EM algorithm details of which are given in Section 6.19 of Aitkin et al. (1989). The SIGMA option can be used to save the estimated value of the shape parameter for both the log-logistic and lognormal distributions. The importance of variables in the lognormal model should be assessed by omitting the variable and comparing -2 times the log-likelihood; this can be saved using the _2LOGLIKELIHOOD option. The SURVIVOR option allows you to save estimates of the survivor function. For the Weibull distribution the PARAMETERIZATION option can be used to choose whether to produce the estimates for the survivor function using the proportional hazards or accelerated failure time parameterization.
The PRINT option controls printed output with similar settings to those of the FIT directive, except that there is an extra setting loglikelihood to print -2 times the log-likelihood. Further information can be printed subsequently by using RDISPLAY in the usual way. Also, the GRAPHICS option can be set to either high or line to produce either high-resolution or line-printer plots of the empirical survivor function against the value predicted by the model when the exponential, Weibull and extreme-value distributions are selected (see Aitken et al. 1989, pages 275-276).
Options: PRINT, TIMES, DISTRIBUTION, CENSORED, GRAPHICS, ALPHA, _2LOGLIKELIHOOD, SIGMA, SURVIVOR, PARAMETERIZATION, MAXCYCLE, TOLERANCE.
Parameter: TERMS.
Method
Full details of the method can be found in Chapter 6 of Aitkin et al. (1989). For the exponential distribution (pages 269-270), the survivor function is
S(t) = exp(-λ t)
with
λ = exp( Σ( bi xi) )
where bi are the parameter estimates, xi are the appropriate values of the
explanatory variates, and t is the time. The Weibull distribution (page 280)
is defined with density function
f(t) = α λ t(α-1) exp(-λ (tα))
and has survivor function
S(t) = exp(-λ tα).
The extreme-value distribution (pages 283-284) has survivor function
S(t) = exp(-λ exp(αt)).
The loglogistic distribution (pages 295-297) has the survivor function
S(t) = 1 / { 1 + (t / θ)a }
with
θ = exp(∑(bi × xi))
and a = 1 / σ.
The lognormal distribution (pages 297-300) has survivor function
S(t) = 1 - CDFNORMAL( log( (t - ∑(bi × xi)) / σ))
Action with
RESTRICT
The vectors involved in the analysis may be restricted as usual for a generalized linear model.
Reference
Aitkin, M., Anderson, A., Francis, B. & Hinde, J. (1989). Statistical Modelling in GLIM. Oxford University Press.