Container for Results of resample()

This is the result container object returned by resample().

Note that all stored objects are accessed by reference. Do not modify any object without cloning it first.

ResampleResults can be visualized via mlr3viz's autoplot() function.

S3 Methods

as.data.table(rr, reassemble_learners = TRUE, convert_predictions = TRUE, predict_sets = "test")
ResampleResult -> data.table::data.table()
Returns a tabular view of the internal data.
c(...)
(ResampleResult, ...) -> BenchmarkResult
Combines multiple objects convertible to BenchmarkResult into a new BenchmarkResult.

Active bindings

task_type: (character(1))
Task type of objects in the ResampleResult, e.g. "classif" or "regr". This is NA for empty ResampleResults.
uhash: (character(1))
Unique hash for this object.
iters: (integer(1))
Number of resampling iterations stored in the ResampleResult.
task: (Task)
The task resample() operated on.
learner: (Learner)
Learner prototype resample() operated on. For a list of trained learners, see methods $learners().
resampling: (Resampling)
Instantiated Resampling object which stores the splits into training and test.
learners: (list of Learner)
List of trained learners, sorted by resampling iteration.
data_extra: (list())
Additional data stored in the ResampleResult.
warnings: (data.table::data.table())
A table with all warning messages. Column names are "iteration" and "msg". Note that there can be multiple rows per resampling iteration if multiple warnings have been recorded.
errors: (data.table::data.table())
A table with all error messages. Column names are "iteration" and "msg". Note that there can be multiple rows per resampling iteration if multiple errors have been recorded.

Methods

Method `new()`

Creates a new instance of this R6 class. An alternative construction method is provided by as_resample_result().

Usage

ResampleResult$new(data = ResultData$new(), view = NULL)

Arguments

data: (ResultData | data.table())
An object of type ResultData, either extracted from another ResampleResult, another BenchmarkResult, or manually constructed with as_result_data().
view: (character())
Single uhash of the ResultData to operate on. Used internally for optimizations.

Method `format()`

Helper for print outputs.

Usage

ResampleResult$format(...)

Arguments

...: (ignored).

Method `print()`

Printer.

Usage

ResampleResult$print(...)

Arguments

...: (ignored).

Method `help()`

Opens the corresponding help page referenced by field $man.

Usage

ResampleResult$help()

Method `prediction()`

Combined Prediction of all individual resampling iterations, and all provided predict sets. Note that, per default, most performance measures do not operate on this object directly, but instead on the prediction objects from the resampling iterations separately, and then combine the performance scores with the aggregate function of the respective Measure (macro averaging).

If you calculate the performance on this prediction object directly, this is called micro averaging.

Usage

ResampleResult$prediction(predict_sets = "test")

Arguments

predict_sets: (character())
Subset of {"train", "test"}.

Returns

Prediction or empty list() if no predictions are available.

Method `predictions()`

List of prediction objects, sorted by resampling iteration. If multiple sets are given, these are combined to a single one for each iteration.

If you evaluate the performance on all of the returned prediction objects and then average them, this is called macro averaging. For micro averaging, operate on the combined prediction object as returned by $prediction().

Usage

ResampleResult$predictions(predict_sets = "test")

Arguments

predict_sets: (character())
Subset of {"train", "test", "internal_valid"}.

Returns

List of Prediction objects, one per element in predict_sets. Or list of empty list()s if no predictions are available.

Method `score()`

Returns a table with one row for each resampling iteration, including all involved objects: Task, Learner, Resampling, iteration number (integer(1)), and (if enabled) one Prediction for each predict set of the Learner. Additionally, a column with the individual (per resampling iteration) performance is added for each Measure in measures, named with the id of the respective measure id. If measures is NULL, measures defaults to the return value of default_measures().

Usage

ResampleResult$score(
  measures = NULL,
  ids = TRUE,
  conditions = FALSE,
  predictions = TRUE
)

Arguments

measures: (Measure | list of Measure)
Measure(s) to calculate.
ids: (logical(1))
If ids is TRUE, extra columns with the ids of objects ("task_id", "learner_id", "resampling_id") are added to the returned table. These allow to subset more conveniently.
conditions: (logical(1))
Adds condition messages ("warnings", "errors") as extra list columns of character vectors to the returned table
predictions: (logical(1))
Additionally return prediction objects, one column for each predict_set of the learner. Columns are named "prediction_train", "prediction_test" and "prediction_internal_valid", if present.

Returns

data.table::data.table().

Method `obs_loss()`

Calculates the observation-wise loss via the loss function set in the Measure's field obs_loss. Returns a data.table() with the columns of the matching Prediction object plus one additional numeric column for each measure, named with the respective measure id. If there is no observation-wise loss function for the measure, the column is filled with NA values. Note that some measures such as RMSE, do have an $obs_loss, but they require an additional transformation after aggregation, in this example taking the square-root.

Usage

ResampleResult$obs_loss(measures = NULL, predict_sets = "test")

Arguments

measures: (Measure | list of Measure)
Measure(s) to calculate.
predict_sets: (character())
The predict sets.

Method `aggregate()`

Calculates and aggregates performance values for all provided measures, according to the respective aggregation function in Measure. If measures is NULL, measures defaults to the return value of default_measures().

Usage

ResampleResult$aggregate(measures = NULL)

Arguments

measures: (Measure | list of Measure)
Measure(s) to calculate.

Returns

Named numeric().

Method `filter()`

Subsets the ResampleResult, reducing it to only keep the iterations specified in iters.

Usage

ResampleResult$filter(iters)

Arguments

iters: (integer())
Resampling iterations to keep.

Returns

Returns the object itself, but modified by reference. You need to explicitly $clone() the object beforehand if you want to keeps the object in its previous state.

Method `discard()`

Shrinks the ResampleResult by discarding parts of the internally stored data. Note that certain operations might stop work, e.g. extracting importance values from learners or calculating measures requiring the task's data.

Usage

ResampleResult$discard(backends = FALSE, models = FALSE)

Arguments

backends: (logical(1))
If TRUE, the DataBackend is removed from all stored Tasks.
models: (logical(1))
If TRUE, the stored model is removed from all Learners.

Returns

Returns the object itself, but modified by reference. You need to explicitly $clone() the object beforehand if you want to keeps the object in its previous state.

Method `marshal()`

Marshals all stored models.

Usage

ResampleResult$marshal(...)

Arguments

...: (any)
Additional arguments passed to marshal_model().

Method `unmarshal()`

Unmarshals all stored models.

Usage

ResampleResult$unmarshal(...)

Arguments

...: (any)
Additional arguments passed to unmarshal_model().

Method `set_threshold()`

Sets the threshold for the response prediction of classification learners, given they have output a probability prediction for a binary classification task. This modifies the object in-place.

Usage

ResampleResult$set_threshold(threshold, ties_method = "random")

Arguments

threshold

(numeric(1))
Threshold value.

ties_method

(character(1))
Method to handle ties in probabilities when selecting a class label. Must be one of "random", "first" or "last" (corresponding to the same options in max.col()).

"random": Randomly select one of the tied class labels (default).
"first": Select the first class label among tied values.
"last": Select the last class label among tied values.

Method `clone()`

The objects of this class are cloneable with this method.

Usage

ResampleResult$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

task = tsk("penguins")
learner = lrn("classif.rpart")
resampling = rsmp("cv", folds = 3)
rr = resample(task, learner, resampling)
print(rr)
#> 
#> ── <ResampleResult> with 3 resampling iterations ───────────────────────────────
#>   task_id    learner_id resampling_id iteration     prediction_test warnings
#>  penguins classif.rpart            cv         1 <PredictionClassif>        0
#>  penguins classif.rpart            cv         2 <PredictionClassif>        0
#>  penguins classif.rpart            cv         3 <PredictionClassif>        0
#>  errors
#>       0
#>       0
#>       0

# combined predictions and predictions for each fold separately
rr$prediction()
#> 
#> ── <PredictionClassif> for 344 observations: ───────────────────────────────────
#>  row_ids     truth  response
#>        2    Adelie    Adelie
#>        5    Adelie    Adelie
#>        6    Adelie    Adelie
#>      ---       ---       ---
#>      335 Chinstrap Chinstrap
#>      338 Chinstrap Chinstrap
#>      340 Chinstrap Chinstrap
rr$predictions()
#> [[1]]
#> 
#> ── <PredictionClassif> for 115 observations: ───────────────────────────────────
#>  row_ids     truth  response
#>        2    Adelie    Adelie
#>        5    Adelie    Adelie
#>        6    Adelie    Adelie
#>      ---       ---       ---
#>      341 Chinstrap    Adelie
#>      342 Chinstrap Chinstrap
#>      343 Chinstrap    Gentoo
#> 
#> [[2]]
#> 
#> ── <PredictionClassif> for 115 observations: ───────────────────────────────────
#>  row_ids     truth  response
#>        1    Adelie    Adelie
#>       10    Adelie    Adelie
#>       15    Adelie    Adelie
#>      ---       ---       ---
#>      333 Chinstrap Chinstrap
#>      337 Chinstrap Chinstrap
#>      344 Chinstrap Chinstrap
#> 
#> [[3]]
#> 
#> ── <PredictionClassif> for 114 observations: ───────────────────────────────────
#>  row_ids     truth  response
#>        3    Adelie    Adelie
#>        4    Adelie    Adelie
#>        7    Adelie    Adelie
#>      ---       ---       ---
#>      335 Chinstrap Chinstrap
#>      338 Chinstrap Chinstrap
#>      340 Chinstrap Chinstrap
#> 

# folds scored separately, then aggregated (macro)
rr$aggregate(msr("classif.acc"))
#> classif.acc 
#>   0.9448258 

# predictions first combined, then scored (micro)
rr$prediction()$score(msr("classif.acc"))
#> classif.acc 
#>   0.9447674 

# check for warnings and errors
rr$warnings
#> Empty data.table (0 rows and 2 cols): iteration,msg
rr$errors
#> Empty data.table (0 rows and 2 cols): iteration,msg

S3 Methods

See also

Active bindings

Methods

Public methods

Method new()

Usage

Arguments

Method format()

Usage

Arguments

Method print()

Usage

Arguments

Method help()

Usage

Method prediction()

Usage

Arguments

Returns

Method predictions()

Usage

Arguments

Returns

Method score()

Usage

Arguments

Returns

Method obs_loss()

Usage

Arguments

Method aggregate()

Usage

Arguments

Returns

Method filter()

Usage

Arguments

Returns

Method discard()

Usage

Arguments

Returns

Method marshal()

Usage

Arguments

Method unmarshal()

Usage

Arguments

Method set_threshold()

Usage

Arguments

Method clone()

Usage

Arguments

Examples

Container for Results of `resample()`

Method `new()`

Method `format()`

Method `print()`

Method `help()`

Method `prediction()`

Method `predictions()`

Method `score()`

Method `obs_loss()`

Method `aggregate()`

Method `filter()`

Method `discard()`

Method `marshal()`

Method `unmarshal()`

Method `set_threshold()`

Method `clone()`