Here we describe how the correlation between progression-free survival (PFS) and overall survival (OS) is computed.
Introduction
The illness-death model used in simIDM
is designed to
jointly model PFS and OS as endpoints in an oncology clinical trial.
Within each treatment arm, the model is specified through the transition
hazards
,
and
.
This approach allows us to consider the joint distribution of the two
endpoints and to derive the correlation between PFS and OS directly from
the assumed transition hazards.
Cor(PFS, OS): Statistical Background
Meller, Beyersmann, and Rufibach (2019) derive a closed formula for . The general formula for the correlation is The expected values of PFS and OS can be derived via the survival functions for PFS and OS, respectively: and The variance of PFS and OS are computed as follows: where and in a similar way for . is derived using together with the general formula for deriving expected survival times, and where This requires the transition probability , which has the form of a standard survival function:
We can compute the correlation based on assumptions or estimate it from data. Both work by simply plugging in assumed or estimated survival functions.
Example: Computing Cor(PFS, OS) directly
We consider three alternative scenarios to model transition hazards within a treatment arm:
- constant (exponential distribution)
- Weibull
- piecewise constant
library(simIDM)
# constant hazards:
transitionExp <- exponential_transition(h01 = 1.2, h02 = 1.5, h12 = 1.6)
# Weibull hazards:
transitionWeib <- weibull_transition(h01 = 1, h02 = 1.2, h12 = 1.3, p01 = 1.1, p02 = 0.8, p12 = 1.2)
# piecewise constant hazards:
transitionPwc <- piecewise_exponential(
h01 = c(1, 1.3), h02 = c(0.8, 1.5), h12 = c(1, 1),
pw01 = c(0, 3), pw02 = c(0, 1), pw12 = c(0, 8)
)
Now, we can compute the PFS-OS correlation with corTrans()
:
Estimating Cor(PFS, OS) from data
In case we are given trial data and want to estimate the PFS-OS correlation from the data, the following approach can be adopted:
- Specify the assumed distribution family.
- Estimate the transition hazards under this assumption from the data.
- Compute the correlation of PFS and OS.
We can estimate the parameters via maximum likelihood (ML), using the log-likelihood based on the counting process notation of Andersen et al. (1993), see also Meller, Beyersmann, and Rufibach (2019): where the sum is over all individuals. is a simplified notation for the transitions 0 1, 0 2 and 1 2, is the corresponding transition hazard and the survival function. is an indicator function, taking the value 1 if the -th individual made the -th transition and and are the time the -th individual enters and exits, respectively, the root state of the -th transition. The indicator function at the end of the formula is equal to 1 if the -th individual is at risk for the transition and 0 otherwise.
Example: Estimating Cor(PFS, OS) from data
Currently, this package supports parameter estimation for assuming
either constant or Weibull transition hazards. The estimateParams()
function expects a data
argument of the same format as used
throughout the package, and a transition
argument of class
TransitionParameters
, specifying the assumed distribution
and desired starting values for ML estimation. To demonstrate this, we
simulate data using constant transition hazards:
transitionExp <- exponential_transition(h01 = 1.2, h02 = 1.5, h12 = 1.6)
simData <- getOneClinicalTrial(
nPat = c(500), transitionByArm = list(transitionExp),
dropout = list(rate = 0.8, time = 12),
accrual = list(param = "time", value = 1)
)
We can estimate the parameters as follows:
# Create TransitionParameters object with starting values for ML estimation:
transition <- exponential_transition(h01 = 1, h02 = 1, h12 = 1)
# Estimate parameters:
est <- estimateParams(data = simData, transition = transition)
# Get estimated transition hazards:
est$hazards
#> $h01
#> [1] 1.25108
#>
#> $h02
#> [1] 1.6258
#>
#> $h12
#> [1] 1.627882
Then, in a final step, we pass est
to corTrans()
to compute the PFS-OS correlation.
Alternatively, one can combine these steps efficiently via corPFSOS()
,
which has an additional bootstrap
argument to quantify the
uncertainty of the correlation estimate:
corPFSOS(data = simData, transition = transition, bootstrap = TRUE, conf_level = 0.95)
#> $corPFSOS
#> [1] 0.5656184
#>
#> $lower
#> 2.5%
#> 0.5045539
#>
#> $upper
#> 97.5%
#> 0.8199518