Navigating challenges in spatial machine learning: Validation, uncertainty, algorithms, and reproducibility
DOI:
https://doi.org/10.3112/erdkunde.2026.04.01Keywords:
model validation, model performance metrics, uncertainty, machine learning algorithms, geospatial software, standardized protocolsAbstract
Spatial machine learning has matured considerably in recent years, providing powerful approaches for mapping and prediction. However, many challenges remain in achieving methodological robustness, transparency, and comparability. This paper examines six key methodological themes – validation, assessment, uncertainty quantification, algorithm design, software implementation, and reporting – and highlights the need for greater rigor to ensure reliable spatial predictions. We emphasize the importance of domain knowledge, appropriate sampling design, and prediction-domain adaptive cross-validation for realistic model evaluation. When training data fail to represent the prediction domain, areas beyond the model’s area of applicability should be masked, and performance should be reported only within unmasked regions. Uncertainty remains a prominent challenge, requiring methodological innovation to account for spatial dependencies and improved strategies for quantifying and communicating it to diverse audiences. Furthermore, while numerous methods now incorporate spatial information into machine learning, the lack of theoretical grounding and systematic benchmarking continues to constrain comparability and progress. Finally, we highlight the critical role of open, reproducible software tools and call for standardized reporting protocols tailored to spatial prediction workflows. Addressing these methodological and practical gaps will enhance reliability, transparency, and reproducibility in spatial machine learning applications.
References
Ahn S, Ryu D-W, Lee S (2020) A machine learning-based approach for spatial estimation using the spatial features of coordinate information. ISPRS International Journal of Geo-information 9: 587. https://doi.org/10.3390/ijgi9100587
An Y, Shim W, Jeong G (2023) High-resolution digital soil maps of forest soil nitrogen across South Korea using three machine learning algorithms. Forests 14: 1141. https://doi.org/10.3390/f14061141
Baltensweiler A, Walthert L, Hanewinkel M, Zimmermann S, Nussbaum M (2021) Machine learning based soil maps for a wide range of soil properties for the forested area of Switzerland. Geoderma Regional 27: e00437. https://doi.org/10.1016/j.geodrs.2021.e00437
Bastin J-F, Finegold Y, Garcia C, Mollicone D, Rezende M, Routh D, Zohner CM, Crowther TW (2019) The global tree restoration potential. Science 365: 76–79. https://doi.org/10.1126/science.aax0848
Becker M, Schratz P (2025) mlr3spatial: Support for spatial objects within the ’mlr3’ ecosystem. https://doi.org/10.32614/CRAN.package.mlr3spatial
Beguin J, Fuglstad G-A, Mansuy N, Paré D (2017) Predicting soil properties in the Canadian boreal forest with limited data: Comparison of spatial and non-spatial statistical approaches. Geoderma 306: 195–205. https://doi.org/10.1016/j.geoderma.2017.06.016
Benito BM (2021) spatialRF: Easy spatial regression with random forest. https://doi.org/10.5281/zenodo.4745208
Bouasria A, Bouslihim Y, Gupta S, Taghizadeh-Mehrjardi R, Hengl T (2023) Predictive performance of machine learning model with varying sampling designs, sample sizes, and spatial extents. Ecological Informatics 78: 102294. https://doi.org/10.1016/j.ecoinf.2023.102294
Brodie SJ, Thorson JT, Carroll G, Hazen EL, Bograd S, Haltuch MA, Holsman KK, Kotwicki S, Samhouri JF, Willis-Norton E, Selden RL (2020) Trade-offs in covariate selection for species distribution models: A methodological comparison. Ecography 43: 11–24. https://doi.org/10.1111/ecog.04707
Brugere L, Kwon Y, Frazier AE, Kedron P (2023) Improved prediction of tree species richness and interpretability of environmental drivers using a machine learning approach. Forest Ecology and Management 539: 120972. https://doi.org/10.1016/j.foreco.2023.120972
Bruin S de, Brus DJ, Heuvelink GBM, Ebbenhorst Tengbergen T van, Wadoux AMJ-C (2022) Dealing with clustered samples for assessing map accuracy by cross-validation. Ecological Informatics 69: 101665. https://doi.org/10.1016/j.ecoinf.2022.101665
Chen, W., Li, Y., Reich, B. J. a. Sun, Y. (2022) DeepKriging: Spatially dependent deep neural networks for spatial prediction. arXiv. https://arxiv.org/abs/2007.11972
Chrysanthopoulos E, Kallioras A (2025) Temporal and geographic extrapolation of soil moisture using machine learning algorithms. Catena 257: 109156. https://doi.org/10.1016/j.catena.2025.109156
Collins GS, Reitsma JB, Altman DG, Moons K (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD Statement. BMC Medicine 13: 1. https://doi.org/10.1186/s12916-014-0241-z
Courteille L (2025) Impact of cartographic representation of uncertainty on decision-making : The case of soil quality in land-use planning. PhD thesis. https://doi.org/10.13140/RG.2.2.19285.95205
Cressie NAC (1993) Statistics for spatial data. New York
Davison A, Hinkley D (1997) Bootstrap methods and their application. Cambridge
Dormann C, M. McPherson J, B. Araújo M, Bivand R, Bolliger J, Carl G, G. Davies R, Hirzel A, Jetz W, Daniel Kissling W, Kühn I, Ohlemüller R, Peres-Neto PR, Reineking B, Schröder B, Schurr FM, Wilson R (2007) Methods to account for spatial autocorrelation in the analysis of species distributional data: A review. Ecography 30: 609–628. https://doi.org/10.1111/j.2007.0906-7590.05171.x
Fernández-Delgado M, Cernadas E, Barro S, Amorim D (2014) Do we need hundreds of classifiers to solve real world classification problems? Journal of Machine Learning Research 15: 3133–3181. https://doi.org/10.5555/2627435.2697065
Gareth J, Witten D, Hastie T, Tibshirani R (2017) An introduction to statistical learning: With applications in R. Springer texts in statistics. New York; Heidelberg; Dordrecht; London
Gaspard G, Kim D, Chun Y (2019) Residual spatial autocorrelation in macroecological and biogeographical modeling: A review. Journal of Ecology and Environment 43: 19. https://doi.org/10.1186/s41610-019-0118-3
Ghariani, W. (2023) Spatial-kfold: A python package for spatial resampling toward more reliable cross-validation in spatial studies. https://github.com/WalidGharianiEAGLE/spatial-kfold
Goetz JN, Brenning A, Petschko H, Leopold P (2015) Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Computers & Geosciences 81: 1–11. https://doi.org/10.1016/j.cageo.2015.04.007
Goodling P, Belitz K, Stackelberg P, Fleming B (2024) A spatial machine learning model developed from noisy data requires multiscale performance evaluation: Predicting depth to bedrock in the Delaware river basin, USA. Environmental Modelling & Software 179: 106124. https://doi.org/10.1016/j.envsoft.2024.106124
Grimm V, Berger U, Bastiansen F, Eliassen S, Ginot V, Giske J, Goss-Custard J, Grand T, Heinz SK, Huse G, Huth A, Jepsen JU, Jørgensen C, Mooij WM, Müller B, Pe’er G, Piou C, Railsback SF, Robbins AM, Robbins MM, Rossmanith E, Rüger N, Strand E, Souissi S, Stillman RA, Vabø R, Visser U, DeAngelis DL (2006) A standard protocol for describing individual-based and agent-based models. Ecological Modelling 198: 115–126. https://doi.org/10.1016/j.ecolmodel.2006.04.023
Heuvelink GBM, Burrough PA, Stein A (1989) Propagation of errors in spatial modelling with GIS. International Journal of Geographical Information Systems 3: 303–322. https://doi.org/10.1080/02693798908941518
Heuvelink G, Webster R (2022) Spatial statistics and soil mapping: A blossoming partnership under pressure. Spatial Statistics 50: 100639. https://doi.org/10.1016/j.spasta.2022.100639
Jacob C, Masilamani US (2025) Performance evaluation of different machine learning algorithms for prediction of nitrate in groundwater in Thiruvannamalai District. CLEAN – Soil, Air, Water 53: 2400060. https://doi.org/10.1002/clen.202400060
Jemeļjanova M, Kmoch A, Uuemaa E (2024) Adapting machine learning for environmental spatial data - A review. Ecological Informatics 81: 102634. https://doi.org/10.1016/j.ecoinf.2024.102634
Jonard F, Feldman AF, Short Gianotti DJ, Entekhabi D (2022) Observed water and light limitation across global ecosystems. Biogeosciences 19: 5575–5590. https://doi.org/10.5194/bg-19-5575-2022
Kapoor S, Cantrell EM, Peng K, Pham TH, Bail CA, Gundersen OE, Hofman JM, Hullman J, Lones MA, Malik MM, Nanayakkara P, Poldrack RA, Raji ID, Roberts M, Salganik MJ, Serra-Garcia M, Stewart BM, Vandewiele G, Narayanan A (2024) REFORMS: Consensus-based Recommendations for Machine-learning-based Science. Science Advances 10: eadk3452. https://doi.org/10.1126/sciadv.adk3452
Kattenborn T, Schiefer F, Frey J, Feilhauer H, Mahecha MD, Dormann CF (2022) Spatially autocorrelated training and validation samples inflate performance assessment of convolutional neural networks. ISPRS Open Journal of Photogrammetry and Remote Sensing 5: 100018.https://doi.org/10.1016/j.ophoto.2022.100018
Kim H-J, Mawuntu KBA, Park T-W, Kim H-S, Park J-Y, Jeong Y-S (2023) Spatial autocorrelation incorporated machine learning model for geotechnical subsurface modeling. Applied Sciences 13: 4497. https://doi.org/10.3390/app13074497
Kmoch A, Harrison CT, Choi J, Uuemaa E (2025) Spatial autocorrelation in machine learning for modelling soil organic carbon. Ecological Informatics 86: 103057. https://doi.org/10.1016/j.ecoinf.2025.103057
Koldasbayeva D, Tregubova P, Gasanov M, Zaytsev A, Petrovskaia A, Burnaev E (2024) Challenges in data-driven geospatial modeling for environmental research and practice. Nature Communications 15: 10700. https://doi.org/10.1038/s41467-024-55240-8
Kuhn M (2008) Building predictive models in R using the caret package. Journal of Statistical Software 28: 1–26. https://doi.org/10.18637/jss.v028.i05
Kuhn M, Wickham H (2020) Tidymodels: A collection of packages for modeling and machine learning using tidyverse principles. https://www.tidymodels.org
Kumar C, Walton G, Santi P, Luza C (2025) Random cross-validation produces biased assessment of machine learning performance in regional landslide susceptibility prediction. Remote Sensing 17. https://doi.org/10.3390/rs17020213
Lang M, Binder M, Richter J, Schratz P, Pfisterer F, Coors S, Au Q, Casalicchio G, Kotthoff L, Bischl B (2019) mlr3: A modern object-oriented machine learning framework in R. Journal of Open Source Software. https://doi.org/10.21105/joss.01903
Legendre P (1993) Spatial autocorrelation: Trouble or new paradigm? Ecology 74: 1659–1673. https://doi.org/10.2307/1939924
Liemohn MW, Shane AD, Azari AR, Petersen AK, Swiger BM, Mukhopadhyay A (2021) RMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physics. Journal of Atmospheric and Solar-Terrestrial Physics 218: 105624. https://doi.org/10.1016/j.jastp.2021.105624
Linnenbrink J, Milà C, Ludwig M, Meyer H (2024) kNNDM CV: K-fold nearest-neighbour distance matching cross-validation for map accuracy estimation. Geoscientific Model Development 17: 5897–5912. https://doi.org/10.5194/gmd-17-5897-2024
Liu FT, Ting KM, Zhou Z-H (2008) Isolation Forest. Proceedings of the 2008 eighth IEEE international conference on data mining: 413–422. ICDM ’08. USA. https://doi.org/10.1109/ICDM.2008.17
Ludwig M, Moreno-Martinez A, Hölzel N, Pebesma E, Meyer H (2023) Assessing and improving the transferability of current global spatial prediction models. Global Ecology and Biogeography 32: 356–368. https://doi.org/10.1111/geb.13635
Luo W, Phung D, Tran T, Gupta S, Rana S, Karmakar C, Shilton A, Yearwood J, Dimitrova N, Ho TB, Venkatesh S, Berk M (2016) Guidelines for developing and reporting machine learning predictive models in biomedical research: A multidisciplinary view. Journal of Medical Internet Research 18: e323. https://doi.org/10.2196/jmir.5870
Lusk D, Wolf S, Svidzinska D, Dormann CF, Kattge J, Bruelheide H, Sabatini FM, Damasceno G, Moreno Martínez Á, Violle C, Hending D, Hähn GJA, Tabeni S, Phartyal S, Gonçalves F, Kreft H, Schmidt M, Chen H, Güler B, Dolezal J, Pielech R, Guido A, Dwyer C, Napoleone F, Willie J, Gasper AL, Macía MJ, Chytry M, Lenoir J, Thakur D, Dengler J, Świerszcz S, Altman J, Mucina L, Nerlekar AN, Kakinuma K, Rawat P, Stančić Z, Testolin R, Hatim MZ, Rodrigues F, Homeier J, Marques MCM, McCarthy JK, El-Sheikh MA, Korznikov K, Gerberding K, Kattenborn T (2026) Crowdsourced biodiversity monitoring fills gaps in global plant trait mapping. Nature Communications 17: 1203. https://doi.org/10.1038/s41467-026-68996-y
Mahoney, M. J. (2023) Waywiser: Ergonomic methods for assessing spatial models. arXiv. https://doi.org/10.48550/arXiv.2303.11312
Mahoney, M. J., Johnson, L. K., Silge, J., Frick, H., Kuhn, M. a. Beier, C. M. (2023) Assessing the performance of spatial cross-validation approaches for models of spatially structured data. arXiv. https://doi.org/10.48550/arXiv.2303.07334
Malone BP, Minasny B, McBratney AB (2016) Using R for digital soil mapping. Progress in soil science. Cham, Switzerland
Maxwell TL, Spalding MD, Friess DA, Murray NJ, Rogers K, Rovai AS, Smart LS, Weilguny L, Adame MF, Adams JB, Austin WEN, Copertino MS, Cott GM, Duarte De Paula Costa M, Holmquist JR, Ladd CJT, Lovelock CE, Ludwig M, Moritsch MM, Navarro A, Raw JL, Ruiz-Fernández A-C, Serrano O, Smeaton C, Van De Broek M, Windham-Myers L, Landis E, Worthington TA (2024) Soil carbon in the world’s tidal marshes. Nature Communications 15: 10265. https://doi.org/10.1038/s41467-024-54572-9
Meinshausen N (2006) Quantile regression forests. Journal of Machine Learning Research 7: 983–999. http://jmlr.org/papers/v7/meinshausen06a.html
Meyer H, Milà C, Ludwig M, Linnenbrink J, Schumacher F (2025) CAST: ’Caret’ applications for spatial-temporal models. https://doi.org/10.32614/CRAN.package.CAST
Meyer H, Pebesma E (2021) Predicting into unknown space? Estimating the area of applicability of spatial prediction models. Methods in Ecology and Evolution 12: 1620–1633. https://doi.org/10.1111/2041-210X.13650
Meyer H, Pebesma E (2022) Machine learning-based global maps of ecological variables and the challenge of assessing them. Nature Communications 13: 2208. https://doi.org/10.1038/s41467-022-29838-9
Meyer H, Reudenbach C, Hengl T, Katurji M, Nauss T (2018) Improving performance of spatio-temporal machine learning models using forward feature selection and target-oriented validation. Environmental Modelling & Software 101: 1–9. https://doi.org/10.1016/j.envsoft.2017.12.001
Meyer H, Reudenbach C, Woellauer S, Nauss T (2019) Importance of spatial predictor variable selection in machine learning applications - moving from data reproduction to spatial prediction. Ecological Modelling 411. https://doi.org/10.1016/j.ecolmodel.2019.108815
Mila C, Ludwig M, Pebesma E, Tonne C, Meyer H (2024) Random forests with spatial proxies for environmental modelling: Opportunities and pitfalls. EGUsphere 2024: 1–30. https://doi.org/10.5194/egusphere-2024-138
Mila C, Mateu J, Pebesma E, Meyer H (2022) Nearest neighbour distance matching Leave-One-Out Cross-Validation for map validation. Methods in Ecology and Evolution 13: 1304–1316. https://doi.org/10.1111/2041-210X.13851
Milne AE, Glendining MJ, Lark RM, Perryman SAM, Gordon T, Whitmore AP (2015) Communicating the uncertainty in estimated greenhouse gas emissions from agriculture. Journal of Environmental Management 160: 139–153. https://doi.org/10.1016/j.jenvman.2015.05.034
Minasny B, Bandai T, Ghezzehei TA, Huang Y-C, Ma Y, McBratney AB, Ng W, Norouzi S, Padarian J, Rudiyanto, Sharififar A, Styc Q, Widyastuti M (2024) Soil science-informed machine learning. Geoderma 452: 117094. https://doi.org/10.1016/j.geoderma.2024.117094
Mitchell M, Wu S, Zaldivar A, Barnes P, Vasserman L, Hutchinson B, Spitzer E, Raji ID, Gebru T (2019) Model cards for model reporting. FAT* ‘19: Proceedings of the Conference on Fairness, Accountability, and Transparency: 220–229. https://doi.org/10.1145/3287560.3287596
Molnar C (2021) Interpretable machine learning: A guide for making black box models explainable. Victoria, Canada
Mongan J, Moy L, Kahn CE (2020) Checklist for artificial intelligence in medical imaging (CLAIM): A guide for authors and reviewers. Radiology: Artificial Intelligence 2: e200029. https://doi.org/10.1148/ryai.2020200029
Naveau P, Allard D (2005) Modeling skewness in spatial data analysis without data transformation. In: Geostatistics Banff 2004: 929–937. https://doi.org/10.1007/978-1-4020-3610-1
Nikparvar B, Thill J-C (2021) Machine learning of spatial data. ISPRS International Journal of Geo-Information 10. https://doi.org/10.3390/ijgi10090600
Nowosad J, Meyer H (2025) Investigating Moran’s I properties for spatial machine learning: A preliminary analysis. AGILE: GIScience Series 6: 1–6. https://doi.org/10.5194/agile-giss-6-40-2025
Nussbaum M, Spiess K, Baltensweiler A, Grob U, Keller A, Greiner L, Schaepman ME, Papritz A (2018) Evaluation of digital soil mapping approaches with large sets of environmental covariates. SOIL: 1–22. https://doi.org/10.5194/soil-4-1-2018
Pabon-Moreno DE, Migliavacca M, Reichstein M, Mahecha MD (2022) On the potential of Sentinel-2 for estimating gross primary production. IEEE Transactions on Geoscience and Remote Sensing 60: 1–12
Patelli L, Cameletti M, Golini N, Ignaccolo R (2024) A path in regression Random Forest looking for spatial dependence: A taxonomy and a systematic review. Knoth S, Okhrin Y, Otto P (eds) Advanced statistical methods in process monitoring, finance, and environmental science: 467–489. Cham. https://doi.org/10.1007/978-3-031-69111-9
Patriche CV, Roşca B, Pîrnău RG, Vasiliniuc I (2023) Spatial modelling of topsoil properties in Romania using geostatistical methods and machine learning. Plos One 18: e0289286. https://doi.org/10.1371/journal.pone.0289286
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay Édouard (2011) Scikit-learn: Machine learning in python. Journal of Machine Learning Research 12: 2825–2830. http://jmlr.org/papers/v12/pedregosa11a.html
Piikki K, Wetterlind J, Söderström M, Stenberg B (2021) Perspectives on validation in digital soil mapping of continuous attributes - A review. Soil Use and Management 37: 7–21. https://doi.org/10.1111/sum.12694
Ploton P, Mortier F, Réjou-Méchain M, Barbier N, Picard N, Rossi V, Dormann C, Cornu G, Viennois G, Bayol N, Lyapustin A, Gourlet-Fleury S, Pélissier R (2020) Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nature Communications 11. https://doi.org/10.1038/s41467-020-18321-y
Rasmussen CE, Williams CKI (2008) Gaussian processes for machine learning. Adaptive computation and machine learning. Cambridge, Mass.
Rey SJ, Anselin L (2007) PySAL: A Python library of spatial analytical methods. The Review of Regional Studies 37: 5–27. https://doi.org/10.52324/001c.8285
Roberts DR, Bahn V, Ciuti S, Boyce MS, Elith J, Guillera-Arroita G, Hauenstein S, Lahoz-Monfort JJ, Schröder B, Thuiller W, Warton DI, Wintle, BA, Hartig F, Dormann, CF (2017) Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40: 913–929. https://doi.org/10.1111/ecog.02881
Ryo M, Angelov B, Mammola S, Kass JM, Benito BM, Hartig F (2021) Explainable artificial intelligence enhances the ecological interpretability of black-box species distribution models. Ecography 44: 199–205. https://doi.org/10.1111/ecog.05360
Saha A, Basu S, Datta A (2023) Random forests for spatially dependent data. Journal of the American Statistical Association 118. https://doi.org/10.1080/01621459.2021.1950003
Saha A, Basu S, Datta A (2024) RandomForestsGLS: Random forests for dependent data. https://doi.org/10.32614/CRAN.package.RandomForestsGLS
Schmidinger J, Heuvelink GBM (2023) Validation of uncertainty predictions in digital soil mapping. Geoderma 437: 116585. https://doi.org/10.1016/j.geoderma.2023.116585
Schratz P, Becker M, Lang M, Brenning A (2024) mlr3spatiotempcv: Spatiotemporal resampling methods for machine learning in R. Journal of Statistical Software 111: 1–36. https://doi.org/10.18637/jss.v111.i07
Schratz P, Muenchow J, Iturritxa E, Richter J, Brenning A (2019) Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data. Ecological Modelling 406: 109–120. https://doi.org/10.1016/j.ecolmodel.2019.06.002
Schumacher FL, Knoth C, Ludwig M, Meyer H (2025) Estimation of local training data point densities to support the assessment of spatial prediction uncertainty. Geoscientific Model Development 18: 10185–10202. https://doi.org/10.5194/gmd-18-10185-2025
Sekulić A, Kilibarda M, Heuvelink GBM, Nikolić M, Bajat B (2020) Random forest spatial interpolation. Remote Sensing 12: 1687. https://doi.org/10.3390/rs12101687
Simoes R, Camara G, Queiroz G, Souza F, Andrade P, Santos L, Carvalho A, Ferreira K (2021) Satellite image time series analysis for big earth observation data. Remote Sensing 13: 2428. https://doi.org/10.3390/rs13132428
Sterlacchini S, Ballabio C, Blahut J, Masetti M, Sorichetta A (2011) Spatial agreement of predicted patterns in landslide susceptibility maps. Geomorphology 125: 51–61. https://doi.org/10.1016/j.geomorph.2010.09.004
Stock A, Gregr EJ, Chan KMA (2023) Data leakage jeopardizes ecological applications of machine learning. Nature Ecology & Evolution 7: 1743–1745. https://doi.org/10.1038/s41559-023-02162-1
Stojanova D, Ceci M, Appice A, Malerba D, Džeroski S (2013) Dealing with spatial autocorrelation when learning predictive clustering trees. Ecological Informatics 13: 22–39. https://doi.org/10.1016/j.ecoinf.2012.10.006
Sweet L, Müller C, Anand M, Zscheischler J (2023) Cross-validation strategy impacts the performance and interpretation of machine learning models. Artificial Intelligence for the Earth Systems 2. https://doi.org/10.1175/aies-d-23-0026.1
Takoutsing B, Heuvelink GBM, Stoorvogel JJ, Shepherd KD, Aynekulu E (2022) Accounting for analytical and proximal soil sensing errors in digital soil mapping. European Journal of Soil Science 73: e13226. https://doi.org/10.1111/ejss.13226
Tian X, Bruin S de, Simoes R, Isik MS, Minarik R, Ho Y-F, Şahin M, Herold M, Consoli D, Hengl T (2025) Spatiotemporal prediction of soil organic carbon density in Europe (2000–2022) using earth observation and machine learning. PeerJ 13: e19605. https://doi.org/10.7717/peerj.19605
Tobler WR (1970) A computer movie simulating urban growth in the Detroit region. Economic Geography 46: 234–240
Valavi R, Elith J, Lahoz-Monfort JJ, Guillera-Arroita G (2019) blockCV: An R package for generating spatially or environmentally separated folds for k-fold cross-validation of species distribution models. Methods in Ecology and Evolution 10: 225–232. https://doi.org/10.1111/2041-210X.13107
Verma V (2025) A comprehensive framework for residual analysis in regression and machine learning. Journal of Information Systems Engineering and Management 10: 34–46. https://doi.org/10.52783/jisem.v10i31s.4958
Wadoux AM-C, Heuvelink GB, De Bruin S, Brus DJ (2021) Spatial cross-validation is not the right way to evaluate map accuracy. Ecological Modelling 457: 109692
Wadoux AMJ-C, Heuvelink GBM (2023) Uncertainty of spatial averages and totals of natural resource maps. Methods in Ecology and Evolution 14: 1320–1332. https://doi.org/10.1111/2041-210X.14106
Wang S, Gao K, Zhang L, Yu B, Easa SM (2024) Geographically weighted machine learning for modeling spatial heterogeneity in traffic crash frequency and determinants in US. Accident Analysis & Prevention 199: 107528. https://doi.org/10.1016/j.aap.2024.107528
Wang Y, Khodadadzadeh M, Zurita-Milla R (2025) A dissimilarity-adaptive cross-validation method for evaluating geospatial machine learning predictions with clustered samples. Ecological Informatics 90: 103287. https://doi.org/10.1016/j.ecoinf.2025.103287
Webster R, Oliver MA (2007) Geostatistics for environmental scientists. Chichester, England. https://doi.org/10.1002/9780470517277
Wei J, Huang W, Li Z, Xue W, Peng Y, Sun L, Cribb M (2019) Estimating 1-km-resolution PM2.5 concentrations across China using the space-time random forest approach. Remote Sensing of Environment 231: 111221. https://doi.org/10.1016/j.rse.2019.111221
Yates KL, Bouchet PJ, Caley MJ, Mengersen K, Randin CF, Parnell S, Fielding AH, Bamford AJ, Ban S, Barbosa AM, others (2018) Outstanding challenges in the transferability of ecological models. Trends in Ecology & Evolution 33: 790–802. https://doi.org/10.1016/j.tree.2018.08.001
Zeevi T, Lieffrig EV, Staib LH, Onofrey JA (2025) Spatially-aware evaluation of segmentation uncertainty. arXiv. https://doi.org/10.48550/arXiv.2506.16589
Zhao W, Li Z, Li H, Li X, Yang P (2024) Soil salinity prediction in an arid area based on long time-series multispectral imaging. Agriculture 14. https://doi.org/10.3390/agriculture14091539
Zurell D, Franklin J, König C, Bouchet PJ, Dormann CF, Elith J, Fandos G, Feng X, Guillera-Arroita G, Guisan A, Lahoz-Monfort JJ, Leitão PJ, Park DS, Peterson AT, Rapacciuolo G, Schmatz DR, Schröder B, Serra-Diaz JM, Thuiller W, Yates KL, Zimmermann NE, Merow C (2020) A standard protocol for reporting species distribution models. Ecography 43: 1261–1277. https://doi.org/10.1111/ecog.04960
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Jakub Nowosad, Carmelo Bonannella, Darius Görgen, Marta Jemeljanova, Teja Kattenborn, Jan Linnenbrink, Hanna Meyer, Madlene Nussbaum, Luca Patelli, Rolf Simoes, Evelyn Uuemaa

This work is licensed under a Creative Commons Attribution 4.0 International License.