R/sperrorest_resampling.R
partition_disc.Rd
partition_disc
partitions the sample into training and tests
set by selecting circular test areas (possibly surrounded by an exclusion
buffer) and using the remaining samples as training samples
(leave-one-disc-out cross-validation). partition_loo
creates training and
test sets for leave-one-out cross-validation with (optional) buffer.
data.frame
containing at least the columns specified by
coords
vector of length 2 defining the variables in data
that
contain the x and y coordinates of sample locations.
radius of test area discs; performs leave-one-out resampling if radius <0.
radius of additional 'neutral area' around test area discs that is excluded from training and test sets; defaults to 0, i.e. all samples are either in the test area or in the training area.
Number of discs to be randomly selected; each disc constitutes a
separate test set. Defaults to nrow(data)
, i.e. one disc around each
sample.
seed1+i
is the random seed that will be used by set.seed in
repetition i
(i
in repetition
) to initialize the random number
generator before sampling from the data set.
If FALSE
, returns only test sample; if TRUE
, also the
training area.
optional argument to sample.
optional argument to sample: sampling with or without replacement?
see partition_cv
; however, see Note below: repetition
should normally be = 1
in this function.
arguments to be passed to partition_disc
A represampling object. Contains length(repetition)
resampling
objects. Each of these contains ndisc
lists with indices of test and (if
return_train = TRUE
) training sets.
Test area discs are centered at (random) samples, not at general
random locations. Test area discs may (and likely will) overlap independently
of the value of replace
. replace
only controls the replacement
of the center point of discs when drawing center points from the samples.
radius < 0
does leave-one-out resampling with an optional buffer.
radius = 0
is similar except that samples with identical coordinates
would fall within the test area disc.
Brenning, A. 2005. Spatial prediction models for landslide hazards: review, comparison and evaluation. Natural Hazards and Earth System Sciences, 5(6): 853-862.
data(ecuador)
parti <- partition_disc(ecuador,
radius = 200, buffer = 200,
ndisc = 5, repetition = 1:2
)
# plot(parti,ecuador)
summary(parti)
#> $`1`
#> n.train n.test
#> 545 729 8
#> 602 734 5
#> 409 712 9
#> 127 714 13
#> 338 688 19
#>
#> $`2`
#> n.train n.test
#> 256 723 12
#> 14 693 19
#> 242 718 10
#> 534 715 6
#> 138 721 7
#>
# leave-one-out with buffer:
parti.loo <- partition_loo(ecuador, buffer = 200)
summary(parti)
#> $`1`
#> n.train n.test
#> 545 729 8
#> 602 734 5
#> 409 712 9
#> 127 714 13
#> 338 688 19
#>
#> $`2`
#> n.train n.test
#> 256 723 12
#> 14 693 19
#> 242 718 10
#> 534 715 6
#> 138 721 7
#>