Predicting persistent forest fire refugia using machine learning models withtopographic, microclimate, and surface wind variables

Abstract

Persistent forest fire refugia are areas within fire-prone landscapes that remain fire-free over long periods of time and are crucial for ecosystem resilience. Modelling to develop maps of these refugia is key to informing fire and land use management. We predict persistent forest fire refugia using variables linked to the fire triangle (aspect, slope, elevation, topographic wetness, convergence and roughness, solar irradiation, temperature, surface wind direction, and speed) in machine learning algorithms (Random Forest, XGBoost; two ensemble models) and K-Nearest Neighbour. All models were run with and without ADASYN over-sampling and grid search hyperparameterisation. Six iterations were run per algorithm to assess the impact of omitting variables. Aspect is twice as influential as any other variable across all models. Solar radiation and surface wind direction are also highlighted, although the order of importance differs between algorithms. The predominant importance of aspect relates to solar radiation received by sun-facing slopes and resultant heat and moisture balances and, in this study area, the predominant fire wind direction. Ensemble models consistently produced the most accurate results. The findings highlight the importance of topographic and microclimatic variables in persistent forest fire refugia prediction, with ensemble machine learning providing reliable forecasting frameworks.

Description

DATA AVAILABILITY STATEMENT : Codes used are supplied in the Appendices in Supplementary Materials. Open-source data are used, and links to public archives thereof are supplied. SUPPLEMENTARY MATERIAL :APPENDIX S1. Patterns based validation, APPENDIX S2. Fire model pattern based validation, APPENDIX S3. PCA, APPENDIX S4. Random forest script with grid search and ADASYN sampling, APPENDIX S5. XGBoost implementation in Python, APPENDIX S6. The KNN implementation was performed in Python, APPENDIX S7. Ensemble script soft, APPENDIX S8. Ensemble hard script, APPENDIX S9. Experimental design showing experiment number, dataset and oversampling, X indicate dataset was used in experiments, APPENDIX S10. Best parameters per experiment.

Keywords

Fire refugia, Surface wind, Microclimate, Topography, Extreme gradient boosting (XGBoost), Aspect, Random forest, Ensembles, Machine learning, Persistent forest fire refugia

Sustainable Development Goals

SDG-15: Life on land
SDG-13: Climate action

Citation

Christ, S.; Kraaij, T.; Geldenhuys, C.J.; de Klerk, H.M. Predicting Persistent Forest Fire Refugia Using Machine Learning Models with Topographic, Microclimate, and Surface Wind Variables. ISPRS International Journal of Geo-Information 2025, 14, 480: 1-19. https://doi.org/10.3390/ ijgi14120480.