Biome reconstruction on the Tibetan Plateau since the Last Glacial Maximum using a machine learning method

This dataset is the biome change data of the Tibetan Plateau since the last glacial maximum which was reconstructed by using a new method. Firstly, a random forest algorithm was applied to establish a pollen-biome classification model for reconstructing past vegetation changes of the Tibetan Plateau, and 1802 modern pollen assemblages from 17 vegetation zones in and around the Tibetan Plateau were used as the training set for the model development. The random forest model showed a reliable performance (accuracy > 76%) in predicting modern biomes from modern pollen assemblages based on a comparison with the observed biomes. Moreover, the random forest model had a significantly higher accuracy than the traditional biomization method. Then, the newly established random forest model is applied to the paleovegetation reconstruction of 51 fossil pollen sequences of the Tibetan Plateau. New age-depth models were developed for these fossil pollen records using the Bayesian method, and all fossil pollen records were linearly interpolated to 500-year time slices. Finally, the spatiotemporal changes of biomes on the Tibetan plateau over the past 22,000 years at an interval of 500 years were reconstructed by using the random forest model. This dataset can provide evidence for understanding the past variation of alpine vegetation and its mechanism; provide the basis for studying the impact of past climate change on vegetation on the Tibetan Plateau; and provide boundary conditions for climate simulation.

0 2022-02-14