PARAMETRIC SURVIVAL ANALYSIS (EVENT HISTORY ANALYSIS)
Overview
Survival analysis is the study of time-to-event data. Its terminology traces back to medical studies where the event of interest was death, and to industrial studies where the event of interest was failure, such as burn-out of a motor or bulb. The objective in these seminal studies was to understand the correlates of survival, hence survival analysis.
Survival analysis may be parametric or semi-parametric. This volume of the Statistical Associates "Blue Book" series treats parametric survival analysis. "Parametric" means that an essential parameter, the baseline hazard function, must be specified by the researcher in advance. The baseline hazard function defines the chance of experiencing the event of interest (the "hazard", which traditionally was death or failure) when other predictors in the model are held constant. Positing the correct baseline hazard function is quite challenging, often leading the researcher to rely on semi-parametric survival analysis, which does not require this. Cox regression is the prime example of semi-parametric survival analysis and is treated in a separate volume.
A related term is "event history analysis," which is also called duration analysis, hazard model analysis, failure-time analysis, or transition analysis. Event history analysis is an umbrella term for procedures for analyzing duration-to-event data, where events are discrete occurrences. "Event history" studies have been common in the study of international relations, where events may be wars or civil conflicts. Many of the earlier conflict studies utilized Weibull and other parametric survival analysis models and therefore event history analysis is often seen as a type of parametric survival analysis.
Coleman (1981: 1) defined event history analysis in terms of three attributes: (1) data units (ex., individuals or organizations) move along a finite series of states; (2) at any time point, changes (events) may occur, not just at certain time points; and (3) factors influencing events are of two types, time-constant and time-dependent. Event history models focus on the hazard function, which reflects the instantaneous probability that the event of interest will occur at a given time, given that the unit of analysis has not experienced the event up to that time. While duration until death or failure were the classic examples, duration of peace until the outbreak of war was an example in international relations. In the last few decades, survival analysis has been applied to a wide range of events, including "hazards" which a positive meaning, such as duration until the event of adoption of an innovation in diffusion research. Other applications include study of longevity of trade agreements, strike durations, marriage durations, employment durations, and innumerable other subjects.
Table of Contents
Overview7
Key Concepts and Terms8
Types of survival analysis8
Nonparametric models8
Semi-parametric models8
Parametric models9
Count models9
States10
Durations10
Variables11
Time variable11
Covariates12
The event variable12
Data13
Censored data13
Survival and probability density functions13
The survival function13
The cumulative probability function and cumulative hazard plot14
The probability density function15
Hazard rates, hazard functions, and hazard ratios16
Overview16
The hazard function16
Parametric survival modeling17
Duration data17
Parametric model assumptions17
Proportional hazard models17
Accelerated failure time models18
Parametric models in Stata18
Overview18
Types of parametric models using Stata19
Exponential models20
Characteristics20
Exponential models with covariates20
Weibull models21
Scale and shape parameters21
Weibull proportional hazards models23
Weibull accelerated failure time models23
Example: Weibull models with Stata24
and 40 more pages of coverage
See also "Cox Regression"
Overview
Survival analysis is the study of time-to-event data. Its terminology traces back to medical studies where the event of interest was death, and to industrial studies where the event of interest was failure, such as burn-out of a motor or bulb. The objective in these seminal studies was to understand the correlates of survival, hence survival analysis.
Survival analysis may be parametric or semi-parametric. This volume of the Statistical Associates "Blue Book" series treats parametric survival analysis. "Parametric" means that an essential parameter, the baseline hazard function, must be specified by the researcher in advance. The baseline hazard function defines the chance of experiencing the event of interest (the "hazard", which traditionally was death or failure) when other predictors in the model are held constant. Positing the correct baseline hazard function is quite challenging, often leading the researcher to rely on semi-parametric survival analysis, which does not require this. Cox regression is the prime example of semi-parametric survival analysis and is treated in a separate volume.
A related term is "event history analysis," which is also called duration analysis, hazard model analysis, failure-time analysis, or transition analysis. Event history analysis is an umbrella term for procedures for analyzing duration-to-event data, where events are discrete occurrences. "Event history" studies have been common in the study of international relations, where events may be wars or civil conflicts. Many of the earlier conflict studies utilized Weibull and other parametric survival analysis models and therefore event history analysis is often seen as a type of parametric survival analysis.
Coleman (1981: 1) defined event history analysis in terms of three attributes: (1) data units (ex., individuals or organizations) move along a finite series of states; (2) at any time point, changes (events) may occur, not just at certain time points; and (3) factors influencing events are of two types, time-constant and time-dependent. Event history models focus on the hazard function, which reflects the instantaneous probability that the event of interest will occur at a given time, given that the unit of analysis has not experienced the event up to that time. While duration until death or failure were the classic examples, duration of peace until the outbreak of war was an example in international relations. In the last few decades, survival analysis has been applied to a wide range of events, including "hazards" which a positive meaning, such as duration until the event of adoption of an innovation in diffusion research. Other applications include study of longevity of trade agreements, strike durations, marriage durations, employment durations, and innumerable other subjects.
Table of Contents
Overview7
Key Concepts and Terms8
Types of survival analysis8
Nonparametric models8
Semi-parametric models8
Parametric models9
Count models9
States10
Durations10
Variables11
Time variable11
Covariates12
The event variable12
Data13
Censored data13
Survival and probability density functions13
The survival function13
The cumulative probability function and cumulative hazard plot14
The probability density function15
Hazard rates, hazard functions, and hazard ratios16
Overview16
The hazard function16
Parametric survival modeling17
Duration data17
Parametric model assumptions17
Proportional hazard models17
Accelerated failure time models18
Parametric models in Stata18
Overview18
Types of parametric models using Stata19
Exponential models20
Characteristics20
Exponential models with covariates20
Weibull models21
Scale and shape parameters21
Weibull proportional hazards models23
Weibull accelerated failure time models23
Example: Weibull models with Stata24
and 40 more pages of coverage
See also "Cox Regression"