Function is similar to ard_continuous()
, but allows for more complex
summaries. While ard_continuous(statistic)
only allows for a univariable
function, ard_complex(statistic)
can handle more complex data summaries.
Usage
ard_complex(
data,
variables,
by = dplyr::group_vars(data),
strata = NULL,
statistic,
fmt_fn = NULL,
stat_label = everything() ~ default_stat_labels()
)
Arguments
- data
(
data.frame
)
a data frame- variables
(
tidy-select
)
columns to include in summaries. Default iseverything()
.- by, strata
-
(
tidy-select
)
columns to tabulate by/stratify by for summary statistic calculation. Arguments are similar, but with an important distinction:by
: results are calculated for all combinations of the columns specified, including unobserved combinations and unobserved factor levels.strata
: results are calculated for all observed combinations of the columns specified.Arguments may be used in conjunction with one another.
- statistic
-
(
formula-list-selector
)
The form of the statistics argument is identical toard_continuous(statistic)
argument, except the summary function must accept the following arguments:x
: a vectordata
: the data frame that has been subset such that theby
/strata
columns and rows in which"variable"
isNA
have been removed.data_full
: the full data frameby
: character vector of theby
variablesstrata
: character vector of thestrata
variables It is unlikely any one function will need all of the above elements, and it's recommended the function passed accepts...
so that any unused arguments will be properly ignored. The...
also allows this function to perhaps be updated in the future with more passed arguments. For example, if one needs a second variable from the data frame, the function inputs may look like:foo(x, data, ...)
- fmt_fn
(
formula-list-selector
)
a named list, a list of formulas, or a single formula where the list element is a named list of functions (or the RHS of a formula), e.g.list(mpg = list(mean = \(x) round(x, digits = 2) |> as.character))
.- stat_label
(
formula-list-selector
)
a named list, a list of formulas, or a single formula where the list element is either a named list or a list of formulas defining the statistic labels, e.g.everything() ~ list(mean = "Mean", sd = "SD")
oreverything() ~ list(mean ~ "Mean", sd ~ "SD")
.
Examples
# example how to mimic behavior of `ard_continuous()`
ard_complex(
ADSL,
by = "ARM",
variables = "AGE",
statistic = list(AGE = list(mean = \(x, ...) mean(x)))
)
#> {cards} data frame: 3 x 10
#> group1 group1_level variable stat_name stat_label stat
#> 1 ARM Placebo AGE mean Mean 75.209
#> 2 ARM Xanomeli… AGE mean Mean 74.381
#> 3 ARM Xanomeli… AGE mean Mean 75.667
#> ℹ 4 more variables: context, fmt_fn, warning, error
# return the grand mean and the mean within the `by` group
grand_mean <- function(data, data_full, variable, ...) {
list(
mean = mean(data[[variable]], na.rm = TRUE),
grand_mean = mean(data_full[[variable]], na.rm = TRUE)
)
}
ADSL |>
dplyr::group_by(ARM) |>
ard_complex(
variables = "AGE",
statistic = list(AGE = list(means = grand_mean))
)
#> {cards} data frame: 6 x 10
#> group1 group1_level variable stat_name stat_label stat
#> 1 ARM Placebo AGE mean Mean 75.209
#> 2 ARM Placebo AGE grand_mean grand_me… 75.087
#> 3 ARM Xanomeli… AGE mean Mean 74.381
#> 4 ARM Xanomeli… AGE grand_mean grand_me… 75.087
#> 5 ARM Xanomeli… AGE mean Mean 75.667
#> 6 ARM Xanomeli… AGE grand_mean grand_me… 75.087
#> ℹ 4 more variables: context, fmt_fn, warning, error