Welcome to visit Geoscience!

Geoscience ›› 2022, Vol. 36 ›› Issue (03): 972-978.DOI: 10.19657/j.geoscience.1000-8527.2022.03.17

• Geochemistry • Previous Articles    

Spatial Prediction of Surface Soil Ore-forming Elements Based on Machine Learning: Taking Rare Metal Rubidium as An Example

DAI Liangliang1(), NIE Xiaoli1(), GUO Jun1, GONG Hao1, WU Huanhuan2, ZHANG Tao1, TANG Yuanyuan1, MAO Cong1, PENG Zhigang1, HE Can1   

  1. 1. Changsha Natural Resources Comprehensive Survey Center of China Geological Survey, Changsha,Hunan 410600,China
    2. Xi’an Mineral Resources Survey Center, China Geological Survey,Xi’an,Shaanxi 710000,China
  • Received:2021-09-02 Revised:2021-10-18 Online:2022-06-10 Published:2022-07-19
  • Contact: NIE Xiaoli

Abstract:

Mass geochemical data of surface soil have been obtained in recent years with the development of geochemical surveys of land quality. However, there is an obvious defect in the dataset of 1∶50 000 large-scale surface samples, i.e., the lack of ore-forming elements. In view of the important role of ore-forming elements in the prospecting of mineral resources, this article attempts to provide a supplementary plan based on existing data. Taking the rare metal rubidium as an example, 2,548 groups of 1∶250 000 small-scale surface soil data in the same area were divided into two groups randomly using the random forest algorithm according to the ratio of 8∶2, with 80% of the data for model training and 20% of the data for model verifying. The combination of variable importance metric ranking and learning curve construction was used to select 8 elements (K, B, Ni, V, Zn, As, Co, Cu) as predictors. The goodness of fitness(R2)of the model to the training data and test data reached 0.983 2 and 0.895 6, respectively, indicating that the optimal method of predictor variables is effective. Subsequently, the above-mentioned predictive variable data of 1∶50 000 surface soil was imported into the model as input variables, and the predicted Rb element content was obtained. The predicted results were in line with the actual characteristics. This study indicating that it is feasible to introduce the big data machine learning random forest algorithm into the spatial quantitative prediction of surface soil geochemical element content, and the service application dimension of land quality geochemical data can be further expanded.

Key words: machine learning, random forest, surface soil, prediction of ore-forming elements

CLC Number: