Occasionally, when there are repeated trials, an analyst will calculate statistics for worst-case results. For example, if a portfolio manager reports results every week, he or she might ask what the worst result will be over a period of 52 weeks. If the distribution of the returns in one week is known, a Monte Carlo simulation can be used to calculate statistics for this worst-case result. For example, one can calculate the expected worst-case result over 52 weeks, the 95th percentile of the worst-case result, and so on.
Below simulations_list will contain five matrices
The first matrix is 10,000 rows by 1 column (H = 1 day; not realistic just illustrative)
The second matrix is 10,000 rows by 5 columns (H = 5 days)
The third matrix is 10,000 rows by 20 columns (H = 20 days)
Note Linda Allen:
In contrast to VaR, WCS focuses on the distribution of the loss during the worst trading period (“period” being, e.g., one day or two weeks), over a given horizon (“horizon” being, e.g., 100 days or one year). The key point is that a worst period will occur with probability one.
So if each row is a trial (i.e., 10,000 rows = 10,000 trials), then we’re retrieving a vector of the worst period within the horizon for each of 10,000 trials. In this simulation, all periods are one day. So, the second matrix will retrieve (a vector of length 10,000 of) the worst one-day period in a five-day horizon. The third matrix will retrieve the worst one-day period in a 20-day horizon. The first matrix has only column, so the worst is the only value in the row: the statistics are the same.
The key function is worst_returns <- map_dbl(1:nrow(simulation), ~ min(simulation[.x, ])). Because it finds the minimum (ie, worst) value in each of the 10,000 rows. That’s the worst_returns vector.
Here is my interpretation, and the numbers are very similar to Linda Allen’s table.
library(tidyverse)# includes purrr, dplyr, tidyr, ggplot2, tibblelibrary(gt)set.seed(73914)# Vector of different numbers of daysdays_vector <-c(1, 5, 20, 100, 250)# Number of trialsY <-10000# trials; aka, sims# A full experiment; e.g., 2nd experiment will be 10,000 rows * 5 columns(= 5 days)# Each experiment has 10,000 rows but they have 5 | 20 | 100 | 250 columnssimulate_trials <-function(X) { simulations <-matrix(rnorm(X * Y), nrow=Y)return(simulations)}# List to of NULLs to store five simulations: 1, 5, 20, 100, 250 dayssimulations_list <-setNames(vector("list", length(days_vector)), days_vector)# Do an experiment for each number of dayssimulations_list <-map(days_vector, ~simulate_trials(.x))# This first LIST item is a matrix with 10,000 rows and 1 column# This second LIST item is a matrix with 10,000 rows and 5 (= horizon days) columnstr(simulations_list[[1]])
num [1:10000, 1] 0.21 0.337 -0.248 -0.744 -2.258 ...
str(simulations_list[[2]])
num [1:10000, 1:5] -0.279 0.752 1.203 1.138 -0.157 ...
# Function: Get the worst return for each row (trial)get_worst_returns <-function(simulation) {# .x is the current row index in the iteration# simulation[.x, ] selects the entire row because [x., ] is all columns# such that ~ min(simulation[.x, ]) is the minimum value in the row worst_returns <-map_dbl(1:nrow(simulation), ~min(simulation[.x, ]))return(worst_returns)}# Get the worst returns for each set of daysworst_returns_list <-map(simulations_list, ~get_worst_returns(.x))# Function: Get percentiles and meanget_percentiles_and_mean <-function(returns) { percentiles <-quantile(returns, probs =c(0.01, 0.05, 0.1, 0.25, 0.5)) mean_val <-mean(returns)c(percentiles, mean = mean_val)}# Get them percentiles_and_mean_list <-map(worst_returns_list, ~get_percentiles_and_mean(.x))# Print percentiles and meanpercentiles_and_mean_list