Tuesday, May 9 |
07:00 - 09:00 |
Breakfast (Restaurant - Hotel Granada Center) |
09:00 - 09:30 |
Abhi Datta: Combining Machine Learning with Traditional Geospatial Models ↓ Spatial generalized linear mixed-models, consisting of a linear covariate effect and a Gaussian Process (GP) distributed spatial random effect, are widely used for analyses of geospatial data. We consider the setting where the covariate effect is non-linear and propose modeling it using a flexible machine learning algorithm like random forests or deep neural networks. We propose well-principled extensions of both random forests and neural networks, for estimating non-linear covariate effects in spatial mixed models where the spatial correlation is still modeled using GP. The basic principle is guided by how ordinary least squares extends to generalized least squares for linear models to account for dependence. We demonstrate how the same extension can be done for these machine learning approaches like random forests and neural networks. We provide extensive theoretical and empirical support for the methods and show how they fare better than naïve or brute-force approaches to use machine learning algorithms for spatially correlated data. We demonstrate the RandomForestsGLS R-package that implements this extension for random forests. (Main Meeting Room - Calle Rector López Argüeta) |
09:30 - 10:00 |
Andrew Zammit Mangion: Neural Point Estimation for Fast Optimal Likelihood-Free Inference ↓ Neural point estimators are neural networks that map data to parameter point estimates. They are fast, likelihood free and, due to their amortised nature, amenable to fast bootstrap-based uncertainty quantification. In this talk I give an overview of this relatively new inferential tool, giving particular attention to the ubiquitous problem of making inference from replicated data, which we address in the neural setting using permutation-invariant neural networks. Through extensive simulation studies we show that these neural point estimators can quickly and optimally (in a Bayes sense) estimate parameters in weakly-identified and highly-parameterised models, such as models of spatial extremes, with relative ease. We demonstrate their applicability through an analysis of extreme sea-surface temperature in the Red Sea where, after training, we obtain parameter estimates and bootstrap-based confidence intervals from hundreds of spatial fields in a fraction of a second. This is joint work with Matthew Sainsbury-Dale and Raphaël Huser. (Main Meeting Room - Calle Rector López Argüeta) |
10:00 - 10:30 |
Mikael Kuusela: Neural Likelihood Surface Estimation for Intractable Spatial Models ↓ Likelihood-based inference tends to be computationally intensive or wholly intractable for many common models in spatial statistics. Examples include Gaussian processes for large data sets and models for spatial extremes. Recent work has used neural networks to predict parameters in these models, circumventing the intractability of likelihood computations. Prediction, however, depends on the choice of a prior on the parameters and does not provide a straightforward means for frequentist uncertainty quantification. In this talk, I will demonstrate how to use tools from likelihood-free inference to learn the likelihood function of intractable spatial processes using convolutional neural networks. In cases where the exact likelihood is available, the method provides similar point estimation and uncertainty quantification performance as exact likelihood computations at a fraction of the computational cost. When the likelihood is unavailable, this method can learn the otherwise intractable likelihood function, providing inferences that are superior to existing approximations. The method is applicable to any spatial process on a regular grid for which fast forward simulations are available. (Main Meeting Room - Calle Rector López Argüeta) |
10:30 - 11:00 |
Coffee Break (Main Meeting Room - Calle Rector López Argüeta) |
11:00 - 11:06 |
Sweta Rai: Fast Parameter Estimation of GEV Distribution Using Neural Networks ↓ The generalized extreme-value (GEV) distribution is commonly used to model extreme events, such as floods, precipitation, and maximum temperature due to its heavy-tailed behavior. It is classified into three forms based on its shape: Fréchet, Gumbel, and Weibull. The goal is to fit the GEV distribution to the sample of extreme values and model these values using estimated parameters. The maximum likelihood (ML) method
is the conventional approach for parameter estimation, but it can be computationally intensive for large simulation studies. To overcome this limitation, we use a neural network for efficient and likelihood-free estimation. The network is trained using a set of chosen extreme quantiles, along with the Q1, Q2, and Q3 as inputs. The NN provides GEV parameter estimates with similar accuracy to ML but with a computational speedup. This NN estimator is applied to 1000−year annual maximum temperature from the Community Climate System Model version 3 (CCSM3) across
North America for three atmospheric concentrations: pre-industrial (289 ppm CO2), future conditions 700 ppm CO2, and 1400 ppm CO2, and compared with the ML approach. To account for estimation uncertainty, we employ parametric bootstrapping, inherent in the trained network. (Main Meeting Room - Calle Rector López Argüeta) |
11:06 - 11:12 |
Jordan Richards: Neural Bayes Estimators for Fast and Efficient Inference with Spatial Peaks-Over-Threshold Models ↓ Likelihood-based inference for spatial extremal dependence models is often infeasible in moderate or high dimensions, due to an intractable likelihood function and/or the need for computationally-expensive censoring to reduce estimation bias. Neural Bayes estimators are a promising recent approach to inference that use neural networks to transform data into parameter estimates. They are likelihood free, inherit the optimality properties of Bayes estimators, and substantially faster than classical methods. In this work, we adapt neural Bayes estimators for peaks-over-threshold dependence models; in particular, we develop methodology for coping with the computational challenges often encountered when modelling spatial extremes (e.g., censoring). We demonstrate substantial improvements in computational and statistical efficiency relative to conventional likelihood-based approaches using popular extremal dependence models, including max-stable, and r-Pareto, processes, and random scale mixture models. (Main Meeting Room - Calle Rector López Argüeta) |
11:12 - 11:18 |
Jonathan Koh: Predicting Risks of Temperature Extremes using Large-scale Circulation Patterns with r-Pareto Processes ↓ Many severe weather patterns in the mid-latitudes have been found to be connected to a particular atmospheric pattern known as blocking. This pattern obstructs the prevailing westerly large-scale atmospheric flow, changing flow anomalies in the vicinity of the blocking system to sustain weather conditions in the immediate region of its occurrence. Blockings’ presence and characteristics are thus important for the development of temperature extremes, which are rarely isolated in space, so one must not just account for their occurrence probabilities and intensities but also their spatial dependencies when assessing their associated risk. Here we propose a methodology that does so by combining tools from the spatial extremes and machine learning literature, to incorporate 500hPa geopotential (Z500) anomalies over the North Atlantic and European region as covariates to predict surface temperature extremes. This involves fitting Generalized r-Pareto processes with appropriate risk functionals to high-impact positive and negative temperature anomaly events across central Europe from 1979–2020, using loss functions motivated by extreme-value theory in a boosting algorithm. We check by simulation that the model parameters are identifiable and can be estimated adequately. We find which circulation patterns in the Euro-Atlantic sector are most important in determining the characteristics of these extremes, and show how they affect it. (Main Meeting Room - Calle Rector López Argüeta) |
11:18 - 11:24 |
Lydia Kakampakou: Modelling Temporal Changes in Spatial Extremal Dependence via a Conditional Framework ↓ With climate change being one of the biggest crises of our time, concentrated efforts are being made to develop statistical models able to adequately capture and predict the behaviour of natural processes affected by this phenomenon. These efforts are of particular significance when such processes are potentially catastrophic at extreme levels. In the case of spatio-temporal environmental datasets, the effect of climate change on marginal trends is well documented and several methods have been proposed to capture this kind of non-stationarity. However, this is not the case for changes in the dependence structure. Most available spatio-temporal models for extremes assume stationarity in this feature, which may be unrealistic in a changing climate. We propose an extension of the spatial conditional extremes modelling framework of Wadsworth and Tawn (2022) to accommodate for non-stationary spatial dependence and apply this extended framework to a range of spatio-temporal environmental datasets. (Main Meeting Room - Calle Rector López Argüeta) |
11:24 - 11:30 |
Silius Mortensønn Vandeskog: Efficient and Robust Modelling of High-Dimensional Spatial Conditional Extremes ↓ A successful general modelling framework for spatial extremes should be able to describe both weakening extremal dependence at increasing levels and changes in the type of extremal dependence class. It should also allow for computationally efficient inference in high dimensions, and it should be robust towards large deviations from the model assumptions in the data. We develop a general modelling framework for spatial extremes, based upon the spatial conditional extremes model. Inference is performed using integrated nested Laplace approximations (INLA), which allows for computationally efficient inference for high-dimensional problems. A post hoc transformation is applied after inference, which adjusts for model misspecification and leads to more robust estimates. The modelling framework is applied in a simulation study and in a case study of modelling extreme precipitation, and it displays great success in both settings. (Main Meeting Room - Calle Rector López Argüeta) |
11:30 - 11:36 |
Man Ho Suen: Aggregated Data Approach with inlabru ↓ It is not uncommon to have spatial misalignment in observed responses and covariates data in a point data setting. The poster is to present a novel approach to aggregate them within the INLA-SPDE framework. We will discuss how to conceptualize the domain and samplers during the mesh construction. (Main Meeting Room - Calle Rector López Argüeta) |
11:36 - 11:42 |
Xuanjie Shao: Deep Compositional Models for Nonstationary Extremal Dependence in Space ↓ Modeling the nonstationarity and anisotropy that often prevails in the extremal dependence of spatial data can be challenging. Inference for stationary, and isotropic models, is considerably easier, but the assumptions that underpin these models are not typically met by data observed over large, or topographically-complex, domains. A simple approach to accommodating spatial non-stationarity in Gaussian processes, proposed by Sampson and Guttorp (1992), is to warp the original spatial domain to a latent space where stationarity and isotropy can be reasonably assumed. However, estimation of the warping function can be computationally expensive and the transformation is not guaranteed to be injective, which can lead to physically-unrealistic transformations. Zammit-Mangion et al. (2021) overcame these issues by exploiting deep Gaussian processes, where the transformation is constructed using a deep composition of injective mappings. We present an extension of this methodology to model non-stationarity in extremal dependence of data, by leveraging popularly-applied parametric models for spatial extremal processes. (Main Meeting Room - Calle Rector López Argüeta) |
11:42 - 11:48 |
Lambert De Monte: A Geometric Investigation of the Hüsler–Reiss Family of Distributions ↓ Recent developments in the probability theory and statistical inference of extremal dependence structures via geometric approaches exploit gauge functions and their associated limit sets. Under the current scheme of study, the Hüsler–Reiss family of distributions forms a subclass of distributions that leads to degenerate limit sets, and statistical inference methods fail to capture their properties. In this line of work, we consider new transformations and scalings that lead to non-degenerate limit sets and possible avenues for statistical inference. (Main Meeting Room - Calle Rector López Argüeta) |
11:48 - 11:54 |
Maggie Bailey: Temporal Downscaling for Solar Radiation Using a Diurnal Template Model ↓ Global and regional climate model projections are useful for gauging future patterns of climate variables, including solar radiation, but data from these models is often too spatio-temporally course for local use. Within the context of solar radiation, the changing climate may have an effect on photo-voltaic (PV) production, especially as the PV industry moves to extend plant lifetimes to 50 years. Predicting PV production while taking into account a changing climate requires data at a resolution that is useful for building PV plants. We present a novel method to downscale global horizontal irradiance (GHI) data from daily averages to hourly profiles, while maintaining spatial correlation of parameters characterizing the diurnal profile of GHI. The method focuses on the use of a diurnal template which can be shifted and scaled according to the time or year and location. Variability in the profile is later added to account for clouds if the daily average value indicates a cloudy day. This analysis is applied to data from the National Solar Radiation Database provided by the National Renewable Energy Lab and a case study of the mentioned methods over California is presented. (Main Meeting Room - Calle Rector López Argüeta) |
11:54 - 12:00 |
Zhongwei Zhang: Extremal Dependence of Stochastic Processes Driven by Exponential-Tailed Lévy Noise ↓ Stochastic processes driven by exponential-tailed Lévy noise constitute important extensions of their Gaussian counterparts in order to capture deviations from Gaussianity, more flexible dependence structures, and sample paths with jumps. Popular examples include non-Gaussian Ornstein-Uhlenbeck (OU) processes and type-G Matérn stochastic partial differential equation (SPDE) random fields. This paper is concerned with the open problem of determining the extremal dependence induced by these processes. Both process types admit stochastic integral representations and have approximations on grids or triangulations that are used in practice for efficient simulations or inference. We first show that these approximations can be expressed as special cases of a class of linear transformations of independent, exponential-tailed random variables that bridges asymptotic dependence and independence in a novel, tractable way. This result is of independent interest since models that can capture both extremal dependence regimes are scarce and the construction of such flexible models is an active area of research. Based on this fundamental result, we show that the exponential-tailed non-Gaussian OU process is asymptotically independent, but with a different residual tail dependence function than its Gaussian counterpart. Furthermore, we show that the finite element approximation of the type-G SPDE model is asymptotically independent provided that the mesh is fine enough, and we conjecture that asymptotic independence is preserved in the limiting process. The computational advantage of the SPDE-based formulation of non-Gaussian processes is thus readily applicable to modeling spatial extremes. Our results are illustrated by a small simulation study. (Main Meeting Room - Calle Rector López Argüeta) |
12:00 - 12:30 |
Open Forum ↓ Time for participants to engage in open discussion, ask questions, share ideas, and make connections with others in attendance. (Main Meeting Room - Calle Rector López Argüeta) |
13:00 - 14:30 |
Lunch (Restaurant - Hotel Granada Center) |
14:30 - 16:00 |
Poster Presentations (Main Meeting Room - Calle Rector López Argüeta) |
16:00 - 16:30 |
Coffee Break (Main Meeting Room - Calle Rector López Argüeta) |
16:30 - 17:30 |
Roundtable Discussion II: Modeling Spatial Data and Extremes with Machine Learning (Leads: Sebastian Engelke and Andrew Zammit-Mangion) (Main Meeting Room - Calle Rector López Argüeta) |
19:30 - 21:30 |
Dinner (Restaurant - Hotel Granada Center) |