We collected soil data from multiple public information at disparate administrative levels. Soil data typically included information about soil profile and horizon properties. Environmental covariates encompassed a range of factors that influence soil characteristics according to the SCORPAN conceptual spatial inference model (McBratney et al., 2003). Spatial modeling focused on fitting a statistical model to estimate SOCc and SOCs across peninsular Spain using three distinct supervised learning approaches: quantile regression forest, ensemble machine learning, and auto-machine learning. The last step included spatial prediction once the spatial models for SOCc and SOCs were validated. To do that, each pixel was assigned the prediction from the most accurate model, i.e., the model that achieved the lowest uncertainty.
The used database comprised 8,361 georeferenced soil profiles, containing 27,931 pedogenetic soil horizons. We collected soil data from public domain resources or were facilitated by national institutions responsible for the information. Specifically, the Red Carbosol database contributed 78% of the samples, compiled through a collaborative network of Spanish soil experts across multiple research centers and universities, aggregating data from 635 different sources (Llorente et al., 2018). The second major source (18% of profiles) was the Consejería de Sostenibilidad, Medio Ambiente y Economía Azul (Andalusian Government, personal communication). The remaining 4% of the data were extracted from the LUCDEME database, which was compiled by various regional institutions, including Región de Murcia (Alias and Ortiz, 1986), the Agrarian Technological Institute (Junta de Castilla y León), and the University of Castilla La-Mancha (Bravo et al., 2019). Sampling periods spanned from 1954 to 2018, with most samples collected between 1965 and 2000.
REFERENCES
Alias, L. and Ortiz, R.: Memorias y mapas de suelos de las hojas del MTN a escala 1:100.000, 1986.
Bravo, S., García-Ordiales, E., García-Navarro, F. J., Amorós, J. Á., Pérez-de-los-Reyes, C., Jiménez-Ballesta, R., Esbrí, J. M., García-Noguero, E. M., and Higueras, P.: Geochemical distribution of major and trace elements in agricultural soils of Castilla-La Mancha (central Spain): finding criteria for baselines and delimiting regional anomalies, Environmental Science and Pollution Research, 26, 3100–3114, https://doi.org/10.1007/s11356-017-0010-6, 2019.
Llorente, M., Rovira, P., Merino, A., Rubio, A., Turrión, M., Bad\’\ia, D., Romanya, J., and González, J. C. J. A.: The CARBOSOL Database: a georeferenced soil profile analytical database for Spain, https://doi.org/10.1594/PANGAEA.884517, 2018.
McBratney, A. B. B., Mendonça Santos, M. L. L., and Minasny, B.: On digital soil mapping, Geoderma, 117, 3–52, https://doi.org/10.1016/S0016-7061(03)00223-4, 2003.