Scale-dependent parameter estimation

Scale-dependent parameter estimation is a wonderful idea that seems to be super-useful and yet super-underused in ecology. The basic idea is that we have a response variable (e.g., a population density) measured at some points in space (\(y(z)\)) and one or more predictor variables measured at the same, or other, points in space (\(x(z)\)). If we assume that the values of \(x\) determine the values of \(y\) at exactly the same point in space, then this becomes a standard regression problem. If the values of \(x\) affect nearby values of \(y\), however, this doesn’t work. The underlying deterministic (linear) relationship, rather than \(E[y(z)] = \beta_0 + \beta_1 x(z)\), is \(E[y(z)] = \beta_0 + \beta_1 \int K(|z'-z|) x(z') \, dz'\). In other words, \(y\) depends on a spatial convolution of \(x\) with a kernel \(K\) (sometimes written as \((K * x)(z)\)).
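As a rough illustration of the convolution model, here is a minimal Python sketch (NumPy only) for the case where \(K\) is known. Everything in it is an assumption made for illustration: the simulated point locations, the Gaussian kernel and its width `sigma`, the helper names `kernel_smooth` and `gaussian_kernel`, and the choice to normalize the kernel weights so that the convolution becomes a weighted average of \(x\) (which amounts to assuming that \(K\) integrates to 1).

```python
import numpy as np

def gaussian_kernel(d, sigma=0.1):
    """Hypothetical kernel K: Gaussian decay with distance d."""
    return np.exp(-0.5 * (d / sigma) ** 2)

def kernel_smooth(resp_xy, pred_xy, x, kernel):
    """Discrete stand-in for (K * x)(z): kernel-weighted mean of x
    around each response location."""
    # Pairwise distances |z' - z|, shape (n_response, n_predictor)
    d = np.linalg.norm(resp_xy[:, None, :] - pred_xy[None, :, :], axis=2)
    w = kernel(d)
    return (w @ x) / w.sum(axis=1)

rng = np.random.default_rng(1)

# Simulated predictor: a smooth trend plus noise, at 500 random points
pred_xy = rng.uniform(0, 1, size=(500, 2))
x = np.sin(4 * pred_xy[:, 0]) + rng.normal(scale=0.5, size=500)

# Simulated response at 100 other points, driven by the smoothed predictor
resp_xy = rng.uniform(0, 1, size=(100, 2))
kx = kernel_smooth(resp_xy, pred_xy, x, gaussian_kernel)
beta0, beta1 = 1.0, 2.0
y = beta0 + beta1 * kx + rng.normal(scale=0.1, size=100)

# With K known, the model is ordinary least squares on the smoothed predictor
X = np.column_stack([np.ones_like(kx), kx])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(coef)  # estimates of (beta0, beta1)
```

Once the smoothed predictor `kx` is in hand, nothing spatial is left in the fit; `coef` should land close to the simulated values of \(\beta_0\) and \(\beta_1\).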

If we happen to know \(K\), we can compute the convolution directly and put the result into some regression framework. But what if we don’t know \(K\)?

One standard approach in spatial ecology (REFS) is to assume a shape of the kernel \(K\) (typically uniform or “top-hat”, i.e. all locations at a distance less than some radius \(\rho\) affect the response \(y\) equally, while locations farther away have no effect). Then for each scale or range \(\rho\), compute the convolution (i.e. the average of the environmental variable at all points within a distance \(\rho\) of each response), fit the model, and compute some goodness-of-fit statistic (\(R^2\), AIC, etc.). Pick the \(\rho\) with the best goodness-of-fit and use it in the final model. In my opinion this is fine as far as it goes, but (1) it’s expensive (you need to compute a convolution for every possible scale and fit a model for each one); (2) it assumes a particular and unrealistic shape for \(K\); (3) it’s hard to account for uncertainty in the scale, or do inference on the scale itself (although these problems could certainly be overcome, e.g. by doing model averaging across models with different scales).
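The scan over scales can be sketched in a few lines of Python (NumPy only). This is an illustration under stated assumptions, not a reference implementation: the landscape is simulated, the true top-hat radius (`true_rho = 0.15`), the grid of candidate radii, and the use of \(R^2\) as the goodness-of-fit statistic are all arbitrary choices, and `tophat_mean` and `r_squared` are hypothetical helper names.

```python
import numpy as np

rng = np.random.default_rng(42)

# Simulated landscape: predictor x at 2000 random points in the unit square
pred_xy = rng.uniform(0, 1, size=(2000, 2))
x = rng.normal(size=2000)
# Response points kept away from the edges so every disc lies inside the landscape
resp_xy = rng.uniform(0.3, 0.7, size=(200, 2))
d = np.linalg.norm(resp_xy[:, None, :] - pred_xy[None, :, :], axis=2)

def tophat_mean(d, x, rho):
    """Mean of x over all predictor points within distance rho of each response.
    Assumes every response point has at least one neighbor within rho."""
    inside = (d < rho).astype(float)
    return (inside @ x) / inside.sum(axis=1)

def r_squared(pred, y):
    """R^2 of a simple linear regression of y on pred."""
    X = np.column_stack([np.ones_like(pred), pred])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return 1 - resid.var() / y.var()

# Simulate a response that truly depends on a top-hat average of radius 0.15
true_rho = 0.15
y = 1.0 + 10.0 * tophat_mean(d, x, true_rho) + rng.normal(scale=0.2, size=200)

# One convolution and one model fit per candidate scale
radii = np.array([0.05, 0.10, 0.15, 0.20, 0.25, 0.30])
r2 = np.array([r_squared(tophat_mean(d, x, rho), y) for rho in radii])
best_rho = radii[np.argmax(r2)]
print(dict(zip(radii, r2.round(3))), best_rho)
```

The loop makes the cost visible: every candidate radius needs its own smoothed predictor and its own fit, and the output is a single selected `best_rho` with no direct measure of its uncertainty.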

Until about 2012 or so, I spent a lot of time and effort trying to come up with correlation-based methods to extract information about the scale and shape of convolution kernels, using the idea of deconvolution and Fourier transform methods (Bolker 2009, 2011; Seabloom et al. 2005), but mostly didn’t get it published. It is theoretically elegant, and is in principle useful for combining the effects of different spatial processes (dispersal, competition, environmental filtering), but there are lots of technical hurdles that kept it from being practical.

Since then I have learned about scale-dependent parameter estimation, which gets at the idea of estimating \(K\) in a different way. It’s related to, or could be considered a variant of, varying-coefficient or geographic regression (Gelfand et al. 2003; Finley and Banerjee 2020): in varying-coefficient regression, we assume that \(E[y(z_1)] = \beta_0(z_1) + \beta_1(z_1) x(z_1)\), where \(\beta_0\) and \(\beta_1\) are smoothly varying functions of spatial position. In other words, the regression coefficients are spatial variables, e.g. the effects of \(x\) on \(y\) could change (smoothly) from place to place. In scale-dependent parameter estimation, in contrast, we use the same basic tools to estimate \(E[y(z_1)] = \beta_0 + \beta_1 (K*x)(z_1)\) (the same equation that we introduced at the beginning).

The basic trick is to compute the average values of \(x\) within annuli around each response point (i.e. rings where \(\rho < |z_1-z_2| < \rho + \Delta \rho\)), include all of these columns in the set of predictors, each with its own coefficient \(\beta(\rho)\), and use the machinery of GAMs to penalize the fit so that \(\beta(\rho)\) varies smoothly with \(\rho\).
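A minimal Python sketch (NumPy only) of the annulus idea follows. It does not use a real GAM: in place of a penalized smooth with a data-driven smoothing parameter, it applies a fixed second-difference penalty (weight `lam`) to the coefficients of adjacent annuli, which is a simple stand-in and an assumption of this sketch. The simulated landscape, the annulus width `delta_rho`, the Gaussian-shaped `beta_true`, and the helper name `annulus_means` are likewise hypothetical.

```python
import numpy as np

rng = np.random.default_rng(7)

# Simulated landscape; responses kept far enough from the edges that
# every annulus lies inside the unit square
pred_xy = rng.uniform(0, 1, size=(4000, 2))
x = rng.normal(size=4000)
resp_xy = rng.uniform(0.35, 0.65, size=(300, 2))
d = np.linalg.norm(resp_xy[:, None, :] - pred_xy[None, :, :], axis=2)

# Annulus j covers distances from edges[j] up to edges[j + 1]
delta_rho = 0.04
edges = np.arange(9) * delta_rho          # 0.00, 0.04, ..., 0.32
mids = 0.5 * (edges[:-1] + edges[1:])

def annulus_means(d, x, edges):
    """One column per annulus: mean of x among points in that ring.
    Assumes no ring is empty for any response point."""
    cols = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        ring = ((d >= lo) & (d < hi)).astype(float)
        cols.append((ring @ x) / ring.sum(axis=1))
    return np.column_stack(cols)

A = annulus_means(d, x, edges)            # shape (300, 8)

# Simulated truth: coefficients that decay smoothly with distance
beta_true = 5.0 * np.exp(-(mids / 0.15) ** 2)
y = 1.0 + A @ beta_true + rng.normal(scale=0.05, size=300)

# Penalized least squares: minimize ||y - X b||^2 + lam * ||D beta||^2,
# where D takes second differences of the annulus coefficients
X = np.column_stack([np.ones(len(y)), A])
D = np.diff(np.eye(A.shape[1]), n=2, axis=0)
P = np.zeros((X.shape[1], X.shape[1]))
P[1:, 1:] = D.T @ D                       # the intercept is not penalized
lam = 0.1
coef = np.linalg.solve(X.T @ X + lam * P, X.T @ y)
beta_hat = coef[1:]
print(np.column_stack([mids, beta_true, beta_hat]).round(2))
```

All scales enter a single fit, and the estimated `beta_hat` traces out how the effect of \(x\) changes with distance. Under the convolution model, each annulus coefficient is roughly \(\beta_1 K(\rho)\) times the area of that annulus, so dividing by the area would in principle give an estimate of the kernel shape up to the constant \(\beta_1\).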

I got this idea from presentations of Thomas Cornulier and Alexandre Villers (Cornulier and Villers 2015; Cornulier 2016) (also not published!). They cite an earlier (less technically elegant) version that uses a similar idea to estimate the temporal scale of environmental influences (Sims et al. 2007).

I asked eco-stats tweeps about this in a Twitter thread earlier this year but didn’t get much response.

I think this is a fantastic method that deserves to be better developed. Please let me know if you’re interested in working on it! (Or if you find a good reference showing that it has already been well developed and explicated by someone, somewhere …)

References

Bolker, Benjamin. 2011. “Inferring the Spatial Scale of Environmental Variation in Sapling Recruitment of Pinus Elliotti (Slash Pine).” Oral presentation. Austin, TX. http://www.math.mcmaster.ca/bolker/misc/esa2011_pines.pdf.

Bolker, Benjamin M. 2009. “Spatial Correlation and Deconvolution: (Attempting to) Estimate Spatial Process from Pattern.” Seminar. Case Western University, Cleveland OH. http://www.math.mcmaster.ca/bolker/misc/CW-deconv.pdf.

Cornulier, Thomas. 2016. “Modelling Habitat Selection Across Multiple Spatial Scales Using Varying Coefficient Regression.” International Statistical Ecology Conference. Seattle, Washington.

Cornulier, Thomas, and Alexandre Villers. 2015. “Modelling Resource Selection Across Multiple Spatial Scales Using Varying Coefficient Regression.” In Spatial Statistics: Emerging Patterns. Avignon, France. https://www.researchgate.net/profile/Thomas-Cornulier/publication/345694591_Modelling_resource_selection_across_multiple_spatial_scales_using_varying_coefficient_regression/links/5fab006b4585150781077f37/Modelling-resource-selection-across-multiple-spatial-scales-using-varying-coefficient-regression.pdf.

Finley, Andrew O., and Sudipto Banerjee. 2020. “Bayesian Spatially Varying Coefficient Models in the spBayes R Package.” Environmental Modelling & Software 125: 104608. https://doi.org/10.1016/j.envsoft.2019.104608.

Gelfand, Alan E, Hyon-Jung Kim, C. F Sirmans, and Sudipto Banerjee. 2003. “Spatial Modeling with Spatially Varying Coefficient Processes.” Journal of the American Statistical Association 98 (462): 387–96. https://doi.org/10.1198/016214503000170.

Seabloom, Eric, Ottar Bjørnstad, Benjamin Bolker, and Omar J. Reichman. 2005. “The Spatial Signature of Environmental Heterogeneity, Dispersal, and Competition in Successional Grasslands.” Ecological Monographs 75 (2): 199–214.

Sims, Michelle, David A. Elston, Ann Larkham, Daniel H. Nussey, and Steve D. Albon. 2007. “Identifying When Weather Influences Life-History Traits of Grazing Herbivores.” Journal of Animal Ecology 76 (4): 761–70. https://www.jstor.org/stable/4539182.


Ben Bolker

20 Aug 2021
