Spatial ARCH Process

Generalised spatial and spatiotemporal autoregressive conditional heteroscedasticity

Otto, P., Schmid, W., Garthoff, R. (2018): Generalised Spatial and Spatiotemporal Autoregressive Conditional Heteroscedasticity. Spatial Statistics 26 (DOI)

Abstract

In this paper, we introduce a new spatial model that incorporates heteroscedastic variance depending on neighbouring locations. The proposed process is considered as the spatial equivalent to the temporal autoregressive conditional heteroscedasticity (ARCH) model. We also show how the newly introduced spatial ARCH model can be used in spatiotemporal settings. In contrast to the temporal ARCH model, in which the distribution is known given the full information set for the prior periods, the distribution is not straightforward in the spatial and spatiotemporal setting. However, the model parameters can be estimated using the maximum-likelihood approach. Via Monte Carlo simulations, we demonstrate the performance of the estimator for a specific spatial weighting matrix. Moreover, we combine the known spatial autoregressive model with the spatial ARCH model assuming heteroscedastic errors. Eventually, the proposed autoregressive process is illustrated by an empirical example. Specifically, we model lung cancer mortality in 3108 U.S. counties and compare the newly introduced model with four benchmark approaches.

Background

ARCH and GARCH models are widely used for time series with volatility clustering, for example in finance. In our 2018 paper we introduced a spatial analogue, the spatial ARCH (spARCH) process. Here, conditional variances depend on neighbouring observations, rather than on past values in time. This provides a natural extension of the ARCH idea to spatial data, where uncertainty itself often shows spatial structure.

Key takeaways

Novelty: spARCH generalises ARCH/GARCH from time series to space by allowing local variances to depend on neighbouring values.
Two uses: as an error process (SARspARCH, improving regression models) or as a direct model for observed processes (e.g. financial networks).
Interpretation: extends Tobler’s law (“near things are more related”) from means to variances — volatility also clusters in space.
Practical gain: reduces residual variance clustering, improves fit, and yields more reliable inference and risk maps.
Flag to watch for: significant spatial dependence (e.g., measured by Moran's I for areal data) on squared residuals of your mean model is a clear signal that spARCH may be appropriate.

Main idea

In time series, ARCH captures that today’s volatility depends on yesterday’s squared return. In spARCH, a location’s variance depends on the squared residuals of its neighbours. If your neighbours are noisy, your location is more likely to be noisy as well. This mirrors how pollution, disease risk, or financial shocks spread across connected regions or agents.

Model representation

A compact formulation is:

\[ Y = m(X, \theta) + \xi, \quad \xi = \mathrm{diag}(h)^{1/2}\varepsilon, \quad h = \alpha + W\,\mathrm{diag}(\xi)\,\xi. \]

Here \(Y\) is the observed vector, \(m(X, \theta)\) is any mean model (e.g., linear regression, GAM, (deep) neural networks, etc.), \(\xi\) the residuals, and \(h\) their variances. The spatial weights matrix \(W\) defines neighbours (contiguity, distance, or network). This is directly analogous to ARCH, with “space” replacing “time”.

Interpretation and intuition

spARCH translates Tobler’s law of geography from the mean to the variance: uncertainty is also spatially related. Variance is not homogeneous, but clustered across space. This can be used either:

As an error process: SARspARCH models residual heteroscedasticity in regression settings, improving inference when covariates miss local drivers.
As a direct process: spARCH can model observed outcomes directly, e.g. volatility in financial networks.

Applications

In U.S. county-level lung-cancer mortality, SARspARCH removed residual variance clustering, improved fit statistics, and identified regions with higher uncertainty. More broadly, spARCH is relevant for financial returns in networks, environmental monitoring, or regional economics — any setting where volatility itself shows spatial patterns.

Research interests

I am interested in statistical data science and statistical methodology for data in multidimensional spaces, e.g., geo-referenced data. Most of my papers have one theory in common - Tobler's first law of geography: "everything is related to everything else, but near things are more related than distant things."