ESS-NS Experimentation


Jan Strappa

Paola Caymes-Scutari

Germán Bianchini

Supplementary Material


This notebook, together with its source code and data, is provided as supplementary material for the paper “Evolutionary Statistical System with Novelty Search: a Parallel Metaheuristic for Uncertainty Reduction Applied to Wildfire Spread Prediction”. It contains the same results published in the paper, with slight modifications and additional tools that allow for better visualization and exploration of those results.

The first two sections have both a static and an interactive version of the code. The plots shown by default are from the interactive version, made using R with the plotly package.

The primary data and the source code are available in the accompanying repository. The complete source code can be viewed by opening the source for this notebook in an editor, together with the functions.R script, which contains most of the code for the static plots.

Authors and institutional information

1: Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Mendoza, Argentina.

2: Laboratorio de Investigación en Cómputo Paralelo/Distribuido (LICPaD), Facultad Regional Mendoza, Universidad Tecnológica Nacional, Argentina.


Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.


Preliminary requirements

Average Fitness: ESS-NS vs ESS, ESSIM-EA, and ESSIM-DE

All methods are compared over 5 cases of controlled fires, labeled by numbers (751, 520, 533, 519, 534). The x-axis corresponds to prediction steps, while the y-axis shows the average fitness for each step.

Fitness distribution

Fitness distribution over the 30 repetitions of each run. Each method is shown in a different color. The column number corresponds to the prediction step, and the y-axis shows the fitness value.

Appendix: Calibration

Description of the experiment

Calibration was performed by varying two parameters: tournament probability and mutation probability.

The values vary among the following:

\(tour\_prob \in \{ 0.75, 0.8, 0.85, 0.9 \}\)

\(mut\_prob \in \{ 0.1, 0.2, 0.3, 0.4 \}\)
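The two sets above define a full grid of 4 × 4 = 16 configurations, one per row of each calibration table. A minimal sketch of that enumeration follows. It is written in Python purely for illustration (the notebook itself uses R), and the variable names are assumptions rather than identifiers from the authors' code.

```python
from itertools import product

# Candidate values taken from the two sets listed above.
tour_probs = [0.75, 0.8, 0.85, 0.9]
mut_probs = [0.1, 0.2, 0.3, 0.4]

# Full grid: every tournament probability paired with every mutation probability.
configurations = list(product(tour_probs, mut_probs))

print(len(configurations))
for tour_prob, mut_prob in configurations:
    print(f"tour_prob={tour_prob}, mut_prob={mut_prob}")
```

The ordering produced by `product` (tournament probability varying slowest) matches the row order used in the tables.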


Fitness averages over different combinations of parameters

There is one table for each controlled fire. In each table, the rows show the fitness values, averaged over 30 repetitions, for a particular configuration of the parameters. For columns labeled by numbers, the number indicates the prediction step. The \(m\) column is the average fitness over all steps, and the last column, \(t (s)\), shows total runtime values in seconds. The runtimes for each repetition correspond to the whole execution (including all steps); the runtimes shown are averaged over 30 repetitions. The darker the color, the better the results, both for quality (fitness) and runtimes.
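As a quick check on how the \(m\) column relates to the step columns, the sketch below averages the four step values of the first row of the map 533 table (configuration 0.75, 0.1). It is written in Python for illustration (the notebook uses R) and the names are hypothetical. Because the tables show rounded values, the recomputed mean is only expected to agree with \(m\) to about three decimals.

```python
# Step-wise fitness averages for map 533, configuration (0.75, 0.1),
# copied from the first row of the corresponding table.
step_fitness = [0.672, 0.675, 0.731, 0.696]
reported_m = 0.694

# The m column is the mean of the per-step averages.
recomputed_m = sum(step_fitness) / len(step_fitness)

print(f"recomputed m = {recomputed_m:.4f}, reported m = {reported_m}")
```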

Fitness averages and runtimes (in seconds) for map 533.
tour_prob, mut_prob 1 2 3 4 m t (s)
0.75, 0.1 0.672 0.675 0.731 0.696 0.694 2122.970
0.75, 0.2 0.754 0.773 0.743 0.751 0.755 2239.330
0.75, 0.3 0.709 0.784 0.722 0.755 0.742 2148.000
0.75, 0.4 0.737 0.790 0.769 0.784 0.770 2284.330
0.8, 0.1 0.706 0.722 0.745 0.736 0.727 2003.330
0.8, 0.2 0.737 0.707 0.703 0.765 0.728 2192.670
0.8, 0.3 0.743 0.737 0.731 0.762 0.743 2165.670
0.8, 0.4 0.737 0.741 0.745 0.769 0.748 2247.000
0.85, 0.1 0.739 0.801 0.770 0.781 0.773 2176.670
0.85, 0.2 0.719 0.806 0.784 0.766 0.769 2392.000
0.85, 0.3 0.707 0.762 0.738 0.740 0.737 2262.000
0.85, 0.4 0.703 0.781 0.723 0.769 0.744 2296.000
0.9, 0.1 0.785 0.801 0.766 0.751 0.776 2128.330
0.9, 0.2 0.717 0.817 0.728 0.758 0.755 2271.670
0.9, 0.3 0.717 0.743 0.700 0.757 0.730 2317.330
0.9, 0.4 0.782 0.784 0.721 0.780 0.767 2304.000
Fitness averages and runtimes (in seconds) for map 519.
tour_prob, mut_prob 1 2 3 m t (s)
0.75, 0.1 0.886 0.931 0.783 0.867 1738.670
0.75, 0.2 0.881 0.926 0.834 0.880 1835.000
0.75, 0.3 0.897 0.910 0.771 0.859 1879.330
0.75, 0.4 0.897 0.882 0.812 0.863 1949.000
0.8, 0.1 0.893 0.912 0.728 0.845 1770.000
0.8, 0.2 0.890 0.924 0.800 0.871 1844.000
0.8, 0.3 0.901 0.926 0.779 0.869 1902.330
0.8, 0.4 0.875 0.923 0.811 0.870 1912.000
0.85, 0.1 0.865 0.834 0.782 0.827 1787.670
0.85, 0.2 0.872 0.925 0.741 0.846 1810.330
0.85, 0.3 0.896 0.901 0.772 0.856 1874.670
0.85, 0.4 0.904 0.907 0.765 0.859 1965.000
0.9, 0.1 0.864 0.914 0.751 0.843 1804.000
0.9, 0.2 0.897 0.906 0.805 0.869 1822.670
0.9, 0.3 0.863 0.923 0.755 0.847 1892.000
0.9, 0.4 0.898 0.917 0.724 0.846 1888.000
Fitness averages and runtimes (in seconds) for map 751.
tour_prob, mut_prob 1 2 3 m t (s)
0.75, 0.1 0.893 0.888 0.805 0.862 992.433
0.75, 0.2 0.942 0.864 0.854 0.886 1064.730
0.75, 0.3 0.938 0.888 0.848 0.891 1042.970
0.75, 0.4 0.950 0.897 0.843 0.897 1048.870
0.8, 0.1 0.954 0.896 0.821 0.890 1011.500
0.8, 0.2 0.899 0.875 0.803 0.859 1056.670
0.8, 0.3 0.948 0.884 0.836 0.889 1034.700
0.8, 0.4 0.924 0.875 0.832 0.877 1064.630
0.85, 0.1 0.933 0.861 0.803 0.866 963.733
0.85, 0.2 0.941 0.888 0.841 0.890 1002.300
0.85, 0.3 0.937 0.900 0.855 0.897 1017.730
0.85, 0.4 0.932 0.876 0.822 0.877 1088.230
0.9, 0.1 0.898 0.888 0.794 0.860 1043.030
0.9, 0.2 0.932 0.892 0.841 0.889 1021.170
0.9, 0.3 0.947 0.887 0.843 0.892 1039.770
0.9, 0.4 0.947 0.883 0.834 0.888 1055.770
Fitness averages and runtimes (in seconds) for map 534.
tour_prob, mut_prob 1 2 3 4 5 m t (s)
0.75, 0.1 0.738 0.573 0.588 0.804 0.768 0.694 1593.330
0.75, 0.2 0.697 0.590 0.700 0.796 0.751 0.707 1665.000
0.75, 0.3 0.762 0.575 0.692 0.839 0.757 0.725 1655.330
0.75, 0.4 0.762 0.575 0.699 0.822 0.742 0.720 1698.670
0.8, 0.1 0.696 0.566 0.662 0.809 0.756 0.698 1643.330
0.8, 0.2 0.778 0.590 0.687 0.822 0.765 0.728 1644.000
0.8, 0.3 0.738 0.582 0.679 0.829 0.723 0.710 1654.670
0.8, 0.4 0.793 0.547 0.692 0.811 0.734 0.716 1628.330
0.85, 0.1 0.754 0.590 0.708 0.825 0.753 0.726 1599.000
0.85, 0.2 0.770 0.547 0.667 0.786 0.780 0.710 1642.000
0.85, 0.3 0.692 0.563 0.676 0.839 0.759 0.706 1672.330
0.85, 0.4 0.744 0.590 0.668 0.841 0.759 0.720 1702.000
0.9, 0.1 0.667 0.547 0.669 0.808 0.756 0.689 1578.500
0.9, 0.2 0.778 0.590 0.700 0.811 0.770 0.730 1652.000
0.9, 0.3 0.771 0.582 0.708 0.795 0.731 0.717 1686.000
0.9, 0.4 0.762 0.582 0.700 0.825 0.755 0.725 1673.330
Fitness averages and runtimes (in seconds) for map 520.
tour_prob, mut_prob 1 2 3 4 5 m t (s)
0.75, 0.1 0.879 0.720 0.864 0.817 0.883 0.833 2813.000
0.75, 0.2 0.882 0.769 0.861 0.837 0.884 0.847 3091.330
0.75, 0.3 0.882 0.777 0.855 0.807 0.881 0.840 3202.670
0.75, 0.4 0.883 0.769 0.862 0.823 0.882 0.844 3176.670
0.8, 0.1 0.888 0.713 0.866 0.765 0.882 0.823 3358.330
0.8, 0.2 0.888 0.785 0.862 0.763 0.883 0.836 3087.670
0.8, 0.3 0.884 0.777 0.863 0.786 0.883 0.838 3058.000
0.8, 0.4 0.882 0.755 0.861 0.790 0.881 0.834 3232.000
0.85, 0.1 0.882 0.760 0.862 0.801 0.885 0.838 2760.670
0.85, 0.2 0.888 0.781 0.860 0.815 0.879 0.845 2977.000
0.85, 0.3 0.883 0.781 0.855 0.741 0.884 0.829 3067.330
0.85, 0.4 0.879 0.759 0.857 0.816 0.880 0.838 3162.670
0.9, 0.1 0.880 0.711 0.860 0.734 0.884 0.814 2800.330
0.9, 0.2 0.882 0.710 0.862 0.786 0.881 0.824 2821.000
0.9, 0.3 0.882 0.775 0.866 0.824 0.880 0.845 3055.000
0.9, 0.4 0.885 0.779 0.864 0.810 0.882 0.844 3131.000

We computed the mean squared error (MSE) for each combination, using \(1-\bar{f}\) (where \(\bar{f}\) is the average fitness over all prediction steps, shown in column \(m\) in the tables above) as the error. Formally, for each combination of parameters \(\{tour\_prob,mut\_prob\}\), we computed:

\[ MSE_{tour,mut} = \frac{1}{n} \sum_{i=1}^{n} (1 - \bar{f}_i)^2 \]

where \(\bar{f}_i\) is the fitness average over all steps for map \(i\) (column \(m\) of the corresponding table), and \(n\) is the number of experiments, in this case 5, one for each controlled fire.

The best combination by this criterion is:
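The selection criterion can be sketched from the rounded \(m\) columns of the five tables. The Python code below is an illustration only, not the notebook's R code, and all names are assumptions. With these rounded inputs it selects \(tour\_prob = 0.75\), \(mut\_prob = 0.4\), with an MSE of about 0.037; the notebook works from unrounded data, so its figures may differ slightly.

```python
# Parameter grid in table row order: (tour_prob, mut_prob).
configs = [(tp, mp) for tp in (0.75, 0.8, 0.85, 0.9) for mp in (0.1, 0.2, 0.3, 0.4)]

# Column m (average fitness over all steps) copied from the five tables,
# one list per map, rows in the same order as `configs`.
mean_fitness = {
    533: [0.694, 0.755, 0.742, 0.770, 0.727, 0.728, 0.743, 0.748,
          0.773, 0.769, 0.737, 0.744, 0.776, 0.755, 0.730, 0.767],
    519: [0.867, 0.880, 0.859, 0.863, 0.845, 0.871, 0.869, 0.870,
          0.827, 0.846, 0.856, 0.859, 0.843, 0.869, 0.847, 0.846],
    751: [0.862, 0.886, 0.891, 0.897, 0.890, 0.859, 0.889, 0.877,
          0.866, 0.890, 0.897, 0.877, 0.860, 0.889, 0.892, 0.888],
    534: [0.694, 0.707, 0.725, 0.720, 0.698, 0.728, 0.710, 0.716,
          0.726, 0.710, 0.706, 0.720, 0.689, 0.730, 0.717, 0.725],
    520: [0.833, 0.847, 0.840, 0.844, 0.823, 0.836, 0.838, 0.834,
          0.838, 0.845, 0.829, 0.838, 0.814, 0.824, 0.845, 0.844],
}


def mse(fitness_values):
    """Mean squared error, taking 1 - f as the error of each map."""
    return sum((1.0 - f) ** 2 for f in fitness_values) / len(fitness_values)


# One MSE per configuration, computed across the five maps.
mse_by_config = {
    config: mse([mean_fitness[fire][row] for fire in mean_fitness])
    for row, config in enumerate(configs)
}

# The best configuration is the one with the lowest MSE.
best_config = min(mse_by_config, key=mse_by_config.get)
print(best_config, round(mse_by_config[best_config], 4))
```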

Runtime analysis

The small, jittered black dots show the distribution of ESS-NS runtimes over the 16 parameter combinations. The bigger black dots are the average runtime of ESS-NS over these combinations. The remaining points (in different shapes and colors) represent the average runtimes of ESS, ESSIM-EA and ESSIM-DE over 30 repetitions. Each label on the x-axis is a different controlled fire case. The y-axis shows the runtime in human-readable format (hours, minutes, seconds).
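To make the axis format concrete, the sketch below averages the 16 ESS-NS runtimes listed for map 751 in the calibration tables and converts the result from seconds to hours, minutes and seconds. It is an illustrative Python example, not the notebook's R plotting code; the helper `to_hms` and the exact `HH:MM:SS` layout are assumptions.

```python
# Average runtimes (s) of ESS-NS for map 751, one value per parameter
# combination, copied from the t (s) column of the calibration table.
runtimes_751 = [
    992.433, 1064.730, 1042.970, 1048.870,
    1011.500, 1056.670, 1034.700, 1064.630,
    963.733, 1002.300, 1017.730, 1088.230,
    1043.030, 1021.170, 1039.770, 1055.770,
]


def to_hms(seconds):
    """Format a duration in seconds as HH:MM:SS (rounded to whole seconds)."""
    total = int(round(seconds))
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


# Mean over the 16 combinations (the quantity the bigger black dot represents).
mean_runtime = sum(runtimes_751) / len(runtimes_751)
print(to_hms(mean_runtime))
```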

Funding and Acknowledgments

This supplementary material and the corresponding article have been supported by:

  • Universidad Tecnológica Nacional under the project SIUTIME0007840TC,
  • FONCyT (Fondo para la Investigación Científica y Tecnológica, Agencia Nacional de Promoción de la Investigación, el Desarrollo Tecnológico y la Innovación, Argentina) under the project UUMM-2019-00042,
  • CONICET (Consejo Nacional de Investigaciones Científicas y Técnicas) through a postdoctoral scholarship for the first author.

We wish to thank María Laura Tardivo (ORCiD ID: 0000-0003-1268-7367, Universidad Nacional de Río Cuarto, Argentina) for providing primary results for the fitness and average runtimes of the methods ESS, ESSIM-EA and ESSIM-DE. These results were first published in summarized form in (Tardivo, Caymes Scutari, Bianchini, Méndez Garabetti, Cencerrado, et al. 2017) and (Tardivo, n.d., chap. 5), and are used here with her permission.

Thanks are also due to the LIDIC laboratory (Laboratorio de Investigación y Desarrollo en Inteligencia Computacional), Universidad Nacional de San Luis, Argentina, for providing the hardware equipment for the experimentation.


