Photosynthesis is a key process linking carbon and water cycles, and satellite-retrieved solar-induced chlorophyll fluorescence (SIF) can be a valuable proxy for photosynthesis. The TROPOspheric Monitoring Instrument (TROPOMI) on the Copernicus Sentinel-5P mission enables significant improvements in providing high spatial and temporal resolution SIF observations, but the short temporal coverage of the data records has limited its applications in long-term studies. We use machine learning to reconstruct TROPOMI SIF (RTSIF) over the 2001–2020 period in clear-sky conditions with high spatio-temporal resolutions (0.05°, 8-day). Our machine learning model achieves high accuracies on the training and testing datasets (R^2 = 0.907, regression slope = 1.001). The RTSIF dataset is validated against TROPOMI SIF and tower-based SIF, and compared with other satellite-derived SIF (GOME-2 SIF and OCO-2 SIF). Comparing RTSIF with Gross Primary Production (GPP) illustrates the potential of RTSIF for estimating gross carbon fluxes. We anticipate that this new dataset will be valuable in assessing long-term terrestrial photosynthesis and constraining the global carbon budget and associated water fluxes.
CHEN Xingan , HUANG Yuefei , NIE Chong , ZHANG Shuo , WANG Guangqian , CHEN Shiliu , CHEN Zhichao
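The RTSIF abstract above reports model accuracy as R^2 and a regression slope. As a minimal sketch of how those two numbers can be computed from paired values, the function below fits an ordinary least-squares line. The choice of regressing predicted on observed values, and the function and argument names, are assumptions for illustration and are not taken from the dataset authors.

```python
def r2_and_slope(observed, predicted):
    """Return (R^2, slope) of the least-squares line predicted = a + b * observed.

    Assumption: R^2 is the squared Pearson correlation between the two series.
    """
    n = len(observed)
    mean_x = sum(observed) / n
    mean_y = sum(predicted) / n
    sxx = sum((x - mean_x) ** 2 for x in observed)
    syy = sum((y - mean_y) ** 2 for y in predicted)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(observed, predicted))
    slope = sxy / sxx
    r2 = (sxy * sxy) / (sxx * syy)
    return r2, slope


if __name__ == "__main__":
    # Made-up illustrative values, not data from the RTSIF product.
    print(r2_and_slope([0.1, 0.4, 0.8, 1.2], [0.12, 0.38, 0.83, 1.18]))
```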
Global solar radiation and diffuse horizontal solar radiation at Dome C (Antarctica) are measured by radiation sensors (CM22 pyranometers, Kipp & Zonen Inc., The Netherlands), and water vapor pressure (hPa) at the ground is obtained from the IPEV/PNRA Project “Routine Meteorological Observation at Station Concordia”, http://www.climantartide.it. This dataset includes hourly solar radiation and its absorbing and scattering losses caused by absorbing and scattering atmospheric substances (MJ m-2, 200-3600 nm), and the albedos at the top of the atmosphere and at the surface. These solar radiation quantities are calculated using an empirical model of global solar radiation (Bai, J.; Zong, X.; Lanconelli, C.; Lupi, A.; Driemel, A.; Vitale, V.; Li, K.; Song, T. 2022. Long-Term Variations of Global Solar Radiation and Its Potential Effects at Dome C (Antarctica). Int. J. Environ. Res. Public Health, 19, 3084. https://doi.org/10.3390/ijerph19053084). The observed global solar radiation and meteorological parameters are available at https://doi.org/10.1594/PANGAEA.935421. The dataset can be used to study solar radiation and its attenuation at Dome C, Antarctica.
BAI Jianhui
Global solar radiation at Qomolangma station (Tibetan Plateau) is measured by a radiation sensor (CM22 pyranometer, Kipp & Zonen Inc., The Netherlands), and water vapor pressure (hPa) at the ground is measured by an HMP45C-GM (Vaisala Inc., Vantaa, Finland). This dataset includes hourly solar radiation and its absorbing and scattering losses caused by absorbing and scattering atmospheric substances (MJ m-2, 200-3600 nm), and the albedos at the top of the atmosphere and at the surface. These solar radiation quantities are calculated using an empirical model of global solar radiation (Bai, J.; Zong, X.; Ma, Y.; Wang, B.; Zhao, C.; Yang, Y.; Guang, J.; Cong, Z.; Li, K.; Song, T. 2022. Long-Term Variations in Global Solar Radiation and Its Interaction with Atmospheric Substances at Qomolangma. Int. J. Environ. Res. Public Health, 19, 8906. https://doi.org/10.3390/ijerph19158906). The observed global solar radiation and meteorological variables are available at https://data.tpdc.ac.cn/zh-hans/data/b9ab35b2-81fb-4330-925f-4d9860ac47c3/. The dataset can be used to study solar radiation and its attenuation in the Qomolangma region.
BAI Jianhui
The active layer thickness dataset for the three-pole permafrost regions combines two data products. The main reference data are annual active layer thickness values from 1990 to 2015 generated by GCM simulation, in NetCDF4 format, with a spatial resolution of 0.5° and an annual temporal resolution. The reference correction dataset is the mean active layer thickness from 2000 to 2015 simulated by statistical and machine learning (ML) methods, in GeoTIFF format, with a spatial resolution of 0.1° and units of m. Through post-processing such as data format conversion, spatial interpolation and data correction, this work generates permafrost active layer thickness data in NetCDF4 format, with a spatial resolution of 0.1°, an annual temporal resolution, a time range of 1990-2015, and units of cm.
YE Aizhong
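The active layer thickness entry above describes refining a 0.5° grid to 0.1° and reporting the result in cm while the correction data are in m. The sketch below illustrates those two steps only. The source does not state which interpolation method was used, so nearest-neighbour replication is an assumption here, and both function names are hypothetical.

```python
def refine_grid(coarse, factor=5):
    """Nearest-neighbour refinement: each coarse cell becomes factor x factor fine cells.

    With factor=5 a 0.5-degree grid becomes a 0.1-degree grid.
    Assumption: the real product may use a different interpolation scheme.
    """
    fine = []
    for row in coarse:
        fine_row = [value for value in row for _ in range(factor)]
        # Repeat the refined row so the cell is replicated in both directions.
        fine.extend(list(fine_row) for _ in range(factor))
    return fine


def metres_to_centimetres(grid):
    """Convert every cell of a 2-D grid from m to cm."""
    return [[value * 100.0 for value in row] for row in grid]


if __name__ == "__main__":
    coarse_m = [[1.2, 1.8], [2.4, 3.0]]  # made-up thickness values in m
    fine_cm = metres_to_centimetres(refine_grid(coarse_m))
    print(len(fine_cm), len(fine_cm[0]))
```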
The original carbon flux data for the three-pole permafrost regions are generated by GCM simulation and come from http://www.cryosphere.csdb.cn/portal/metadata/5abef388-3f3f-4802-b3de-f4d233cb333b. This dataset contains projections for 2046-2065 under different Representative Concentration Pathways (RCPs), including the RCP2.6, RCP4.5 and RCP8.5 scenarios. The original data include carbon flux parameters such as NPP and GPP in the permafrost region of the Qinghai-Tibet Plateau, in NetCDF4 format, with a spatial resolution of 0.5° and an annual temporal resolution. Through data format conversion, spatial interpolation and other post-processing, NPP and GPP data for the permafrost region are generated in NetCDF4 format. The spatial resolution is 0.1°, the temporal resolution is annual, the time range is 2046-2065, and the unit is gC/m2/yr.
YE Aizhong
The original active layer thickness data for the three-pole permafrost regions are generated by GCM simulation and come from http://www.cryosphere.csdb.cn/portal/metadata/5abef388-3f3f-4802-b3de-f4d233cb333b. This dataset contains projections for 2046-2065 under different Representative Concentration Pathways (RCPs), including the RCP2.6, RCP4.5 and RCP8.5 scenarios. The original data describe the thickness of the active layer in the permafrost area of the Qinghai-Tibet Plateau, in NetCDF4 format, with a spatial resolution of 0.5° and an annual temporal resolution. Through data format conversion, spatial interpolation and other post-processing, the active layer thickness in the permafrost area is generated in NetCDF4 format, with a spatial resolution of 0.1°, an annual temporal resolution, a time range of 2046-2065, and units of cm.
YE Aizhong
The original permafrost extent data for the three-pole regions are generated by GCM simulation and come from http://www.cryosphere.csdb.cn/portal/metadata/5abef388-3f3f-4802-b3de-f4d233cb333b. This dataset contains projections for 2046-2065 under different Representative Concentration Pathways (RCPs), including the RCP2.6, RCP4.5 and RCP8.5 scenarios. The original data describe the spatial extent of permafrost and seasonally frozen soil on the Qinghai-Tibet Plateau, in NetCDF4 format, with a spatial resolution of 0.5° and an annual temporal resolution. Through data format conversion, spatial interpolation and other post-processing, this work generates the permafrost extent data in NetCDF4 format, with a spatial resolution of 0.1°, an annual temporal resolution, and a time range of 2046-2065. Permafrost is represented by 1, and seasonally frozen soil is represented by 0.
YE Aizhong
Based on Landsat TM 30 m remote sensing data, and through manual interpretation and machine learning algorithms, the project extracted the spatial pattern of six ecosystem types in the Qilian Mountains from 1990 to 2015: forest, farmland, grassland, wetland, settlements/cities and desert. This dataset can be used to study the evolution of the regional ecosystem macro pattern, to evaluate ecosystem service functions, and to plan and assess major ecological restoration projects. The evolution of the ecosystem macro pattern is a macro response to natural processes driven by coupled climate and socio-economic factors, a direct reflection of land use and land cover change, and an important data basis for evaluating the effectiveness of regional sustainable development. The data can support the evaluation of the green development index in the Qilian Mountains area.
WU Feng
Based on the non-survey method, and referring to the provincial input-output tables and county-level statistical data of the Qilian Mountains region, the project compiled the input-output table of the Qilian Mountains region for 2017. This table provides a data basis for analyzing the production and consumption of the regional economy and the virtual water embodied in its products or services. It draws on the 2017 input-output tables of Qinghai Province, Inner Mongolia Autonomous Region and Gansu Province, and analyzes the industrial production, residents' consumption and interregional trade of the districts and counties within the Qilian Mountains. The input-output table characterizes the regional macroeconomic structure and the level of regional products or services.
WU Feng
A dataset of spatio-temporal change of physical and virtual water in the Qilian Mountains: using the single-region input-output method and the 2012 input-output table of the Qilian Mountains, we developed a physical water-virtual water conversion model and explored the law of virtual water flow among different sectors in the Qilian Mountains in 2012. The results provide a theoretical basis for the optimal allocation of water resources in the natural-social complex system and for research on the optimal allocation of "mountains, waters, forests, fields, lakes, grass and sand" in the Qilian Mountains. Verification shows that the dataset balances the physical water consumption against the total virtual water consumption of the various sectors in the Qilian Mountains in 2012, indicating that the data are reliable. The data can provide a basis for the optimal allocation of water resources in the Qilian Mountains.
LIU Junguo
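The virtual water entry above rests on a single-region input-output calculation and on a balance between physical and virtual water totals. The sketch below shows the standard Leontief formulation for a toy two-sector economy and checks that balance. It is an illustration under stated assumptions, not the authors' conversion model: the two-sector size, the coefficient values and all function names are hypothetical.

```python
def leontief_inverse_2x2(a):
    """Return (I - A)^-1 for a 2x2 technical-coefficient matrix A."""
    m00, m01 = 1.0 - a[0][0], -a[0][1]
    m10, m11 = -a[1][0], 1.0 - a[1][1]
    det = m00 * m11 - m01 * m10
    return [[m11 / det, -m01 / det], [-m10 / det, m00 / det]]


def virtual_water(a, final_demand, direct_water_coeff):
    """Return (physical, virtual) water use per sector.

    physical[i] = d_i * x_i, where x = (I - A)^-1 y is total output.
    virtual[j]  = (sum_i d_i * L_ij) * y_j, the water embodied in final demand j.
    """
    inv = leontief_inverse_2x2(a)
    output = [sum(inv[i][j] * final_demand[j] for j in range(2)) for i in range(2)]
    multipliers = [sum(direct_water_coeff[i] * inv[i][j] for i in range(2))
                   for j in range(2)]
    physical = [direct_water_coeff[i] * output[i] for i in range(2)]
    virtual = [multipliers[j] * final_demand[j] for j in range(2)]
    return physical, virtual


if __name__ == "__main__":
    # Made-up coefficients: sector 0 is water-intensive, sector 1 is not.
    physical, virtual = virtual_water(
        a=[[0.2, 0.1], [0.3, 0.4]],
        final_demand=[100.0, 200.0],
        direct_water_coeff=[0.5, 0.05],
    )
    # The two totals agree up to rounding, which is the balance check.
    print(sum(physical), sum(virtual))
```

The totals match because summing d_i L_ij y_j over both indices gives the same quantity whichever sum is taken first; only the attribution to sectors differs.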
The dataset of Cenozoic plant macrofossils on the Qinghai-Tibet Plateau covers leaves, seeds and fruits. It includes the Latin and Chinese names of families, genera and species, ages, localities, morphological descriptions, discussions, specimens and references. Species names are assigned according to the original literature; for fossil records revised by later research, the revised records were chosen. The age of each locality (fossil site) is assigned according to the latest literature. The terms and conventions used to describe leaf morphology follow the book "Leaf Structure Manual"; lengths, angles and other measurements in the descriptions are taken from the original literature. The fossil records are sorted alphabetically by the Latin names of families and genera. The data can provide important clues for studying the coupling between environmental and climate change and the evolution of vegetation and plant diversity on the Cenozoic Qinghai-Tibet Plateau.
ZHOU Zhekun , LIU Jia , CHEN Linlin , ROBERT Spicer , LI Shufeng , HUANG Jian , ZHANG Shitao , HUANG Yongjiang , JIA Linbo , HU Jinjin , SU Tao
This dataset contains phenology camera observations from the Alpine meadow and grassland ecosystem Superstation of the Qinghai Lake integrated observatory network, from January 1 to December 31, 2021. Data from January 31 to April 14 are missing due to storage overflow. The site (98°35′41.62″E, 37°42′11.47″N) is located in the alpine meadow and alpine grassland ecosystem, near the SuGe Road in Tianjun County, Qinghai Province. The elevation is 3718 m. The phenology camera collects data looking vertically downward, at a resolution of 2592*1944. The photos in this dataset were taken at 12:10 each day, with a time error of ±10 min. Images are named BSDCJZ BEIJING_IR_Year_Month_Day_Time.
Li Xiaoyan
The Qinghai-Tibet Plateau is known as the "Asian water tower". Its runoff, an important and easily accessible water resource, supports the production and livelihoods of billions of people in the surrounding regions and sustains ecosystem diversity. Accurately estimating the runoff of the Qinghai-Tibet Plateau and revealing how it varies support water resources management and disaster risk avoidance in the plateau and its surrounding areas. The glacier runoff segmentation dataset covers the five river source areas of the Qinghai-Tibet Plateau (the sources of the Yellow River, the Yangtze River, the Lancang River, the Nujiang River and the Yarlung Zangbo River) from 1971 to 2015, with an annual temporal resolution and a spatial resolution at the watershed scale. Based on multi-source remote sensing and measured data, it is simulated using the distributed hydrological model VIC-CAS coupled with a glacier module. The simulation results are verified against measured station data, and all the data are subject to quality control.
WANG Shijin
This dataset contains the monthly evaporation rates and volumes for 7242 reservoirs across the world from March 1984 to December 2016. The evaporation rate was calculated using three datasets: (1) TerraClimate; (2) ERA5; (3) Princeton Global Forcings. The surface area of these reservoirs is obtained from the Global Reservoir Surface Area Dataset (GRSAD). Detailed descriptions of this dataset are presented in Tian et al. (2021, 2022). The basic information on the global reservoirs was provided by the Global Reservoir and Dam Database (GRanD).
TIAN Wei , LIU Xiaomang , WANG Kaiwen , BAI Peng , LIU Changming
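The reservoir entry above provides both evaporation rates and volumes, with surface area taken from GRSAD. A plausible reading is that the monthly volume is the rate multiplied by the surface area. The sketch below shows that arithmetic only. The units (mm per month for the rate, km2 for the area) and the function name are assumptions, so check the dataset documentation before relying on them.

```python
def evaporation_volume_m3(rate_mm_per_month, area_km2):
    """Monthly evaporation volume in cubic metres.

    Assumed units: rate in mm/month, surface area in km^2.
    1 mm over 1 km^2 is 0.001 m * 1,000,000 m^2 = 1000 m^3.
    """
    return rate_mm_per_month * area_km2 * 1000.0


if __name__ == "__main__":
    # Made-up values for a single reservoir-month.
    print(evaporation_volume_m3(85.0, 12.5))
```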
As an important part of the global carbon pool, Arctic permafrost is one of the most sensitive regions to global climate change. The rate of warming in the Arctic is twice the global average, causing rapid changes in Arctic permafrost. The NDVI change dataset for different types of permafrost regions in the Northern Hemisphere from 1982 to 2015 has a temporal resolution of five years and a spatial resolution of 8 km, and covers all the Arctic Rim countries. Based on multi-source remote sensing, simulation, statistical and measured data, GIS and ecological methods are used to quantify the regulation and service functions that permafrost in the Northern Hemisphere provides to the ecosystem, and all the data are subject to quality control.
WANG Shijin
Monthly surface (7 cm) soil moisture data for China. The time range includes the historical period 1850-2014 and the future period 2015-2100 (the future period includes four Shared Socioeconomic Pathways: SSP1-2.6, SSP2-4.5, SSP3-7.0 and SSP5-8.5). The spatial resolution is 0.25°. The data are produced with a deep learning method that takes the 7 cm surface soil moisture of ERA5-Land as the reference and integrates the surface soil moisture of 25 downscaled CMIP6 models. In the context of climate change, the data can be used for drought and vegetation correlation analysis.
FENG Donghan
This dataset contains the flux measurements from the Qinghai Lake eddy covariance (EC) system, which belongs to the Qinghai Lake basin integrated observatory network, from January 1 to December 31, 2021. The site (100° 29' 59.726'' E, 36° 35' 27.337'' N) is located on the Yulei Platform in the Erlangjian scenic area, Qinghai Province. The elevation is 3209 m. The EC system was installed at a height of 16.1 m, and the sampling rate was 10 Hz. The sonic anemometer faced north, and the separation distance between the sonic anemometer and the CO2/H2O gas analyzer (Gill & Li7500A) was about 0.17 m. The raw data acquired at 10 Hz were processed using the EddyPro post-processing software, including spike detection, lag correction of H2O/CO2 relative to the vertical wind component, sonic virtual temperature correction, coordinate rotation (2-D rotation), correction for density fluctuations (Webb-Pearman-Leuning correction), and frequency response correction. The EC data were subsequently averaged over 30 min periods. The quality of the observations was graded into nine classes according to the stationarity test (Δst) and the integral turbulent characteristics test (ITC): classes 1-3 (high quality), classes 4-6 (good), classes 7-8 (poor, but better than gap-filled data), and class 9 (rejected). In addition to the above processing steps, the half-hourly flux data were screened in a four-step procedure: (1) data from periods of sensor malfunction were rejected; (2) data collected within 1 h before or after precipitation were rejected; (3) incomplete 30 min data were rejected when the missing data constituted more than 3% of the 30 min raw record; and (4) data were rejected at night when the friction velocity (u*) was less than 0.1 m/s. There were 48 records per day, and the missing data were replaced with -6999.
The released data contain the following variables: DATE/TIME, wind direction (Wdir, °), wind speed (Wnd, m/s), the standard deviation of the lateral wind (Std_Uy, m/s), virtual temperature (Tv, ℃), H2O mass density (H2O, g/m3), CO2 mass density (CO2, mg/m3), friction velocity (ustar, m/s), stability (z/L), sensible heat flux (Hs, W/m2), latent heat flux (LE, W/m2), carbon dioxide flux (Fc, mg/(m2 s)), quality assessment of the sensible heat flux (QA_Hs), quality assessment of the latent heat flux (QA_LE), and quality assessment of the carbon flux (QA_Fc). The quality marks of the sensible heat flux, latent heat flux and carbon flux have three levels (marks 0 and 1 indicate good data quality, and 2 indicates poor data quality). In this dataset, a time of 0:30 corresponds to the average over the period from 0:00 to 0:30; the data are stored in *.xls format. Detailed information can be found in the suggested references.
Li Xiaoyan
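The four-step screening described for the eddy covariance flux data can be sketched as a filter over half-hourly records. This is a minimal illustration, not the processing code used for the dataset. The record field names (sensor_fault, precip_mm, missing_fraction, is_night, ustar, flux) are hypothetical, and treating "1 h before or after precipitation" as two half-hour steps on either side of a rainy record is an assumption. Only the thresholds (3%, 0.1 m/s) and the fill value -6999 come from the description.

```python
MISSING = -6999  # fill value stated in the dataset description


def screen_flux(records):
    """Apply the four screening rules to half-hourly records in time order.

    Each record is a dict with hypothetical keys: sensor_fault (bool),
    precip_mm (float), missing_fraction (0..1), is_night (bool),
    ustar (m/s) and flux. Returns the flux series with rejected
    values replaced by MISSING.
    """
    n = len(records)
    rain = [r["precip_mm"] > 0 for r in records]
    screened = []
    for i, r in enumerate(records):
        # Two half-hour steps either side approximate "within 1 h of precipitation".
        near_rain = any(rain[max(0, i - 2):min(n, i + 3)])
        reject = (
            r["sensor_fault"]                           # rule 1: sensor malfunction
            or near_rain                                # rule 2: precipitation window
            or r["missing_fraction"] > 0.03             # rule 3: more than 3% raw data missing
            or (r["is_night"] and r["ustar"] < 0.1)     # rule 4: weak night-time turbulence
        )
        screened.append(MISSING if reject else r["flux"])
    return screened
```

Keeping the rules as one boolean expression makes it easy to see that a record is rejected as soon as any single rule applies, which matches the wording of the description.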
This dataset contains the flux measurements from the eddy covariance (EC) system of the Alpine meadow and grassland ecosystem Superstation, which belongs to the Qinghai Lake basin integrated observatory network, from January 1 to October 31, 2021. The site (98°35′41.62″E, 37°42′11.47″N) is located in the alpine meadow and alpine grassland ecosystem, near the SuGe Road in Tianjun County, Qinghai Province. The elevation is 3718 m. The EC system was installed at a height of 4.5 m, and the sampling rate was 10 Hz. The sonic anemometer faced north, and the separation distance between the sonic anemometer and the CO2/H2O gas analyzer (CSAT3A & EC150) was about 0.17 m. The raw data acquired at 10 Hz were processed using the EddyPro post-processing software, including spike detection, lag correction of H2O/CO2 relative to the vertical wind component, sonic virtual temperature correction, coordinate rotation (2-D rotation), correction for density fluctuations (Webb-Pearman-Leuning correction), and frequency response correction. The EC data were subsequently averaged over 30 min periods. The quality of the observations was graded into nine classes according to the stationarity test (Δst) and the integral turbulent characteristics test (ITC): classes 1-3 (high quality), classes 4-6 (good), classes 7-8 (poor, but better than gap-filled data), and class 9 (rejected). In addition to the above processing steps, the half-hourly flux data were screened in a four-step procedure: (1) data from periods of sensor malfunction were rejected; (2) data collected within 1 h before or after precipitation were rejected; (3) incomplete 30 min data were rejected when the missing data constituted more than 3% of the 30 min raw record; and (4) data were rejected at night when the friction velocity (u*) was less than 0.1 m/s. There were 48 records per day, and the missing data were replaced with -6999. Data from December 18 to December 24, 2018 are missing due to a data collector failure.
The released data contain the following variables: DATE/TIME, wind direction (Wdir, °), wind speed (Wnd, m/s), the standard deviation of the lateral wind (Std_Uy, m/s), virtual temperature (Tv, ℃), H2O mass density (H2O, g/m3), CO2 mass density (CO2, mg/m3), friction velocity (ustar, m/s), stability (z/L), sensible heat flux (Hs, W/m2), latent heat flux (LE, W/m2), carbon dioxide flux (Fc, mg/(m2 s)), quality assessment of the sensible heat flux (QA_Hs), quality assessment of the latent heat flux (QA_LE), and quality assessment of the carbon flux (QA_Fc). The quality marks of the sensible heat flux, latent heat flux and carbon flux have three levels (marks 0 and 1 indicate good data quality, and 2 indicates poor data quality). In this dataset, a time of 0:30 corresponds to the average over the period from 0:00 to 0:30; the data are stored in *.xls format. Detailed information can be found in the suggested references.
Li Xiaoyan
This dataset contains the flux measurements from the Subalpine shrub eddy covariance (EC) system, which belongs to the Qinghai Lake basin integrated observatory network, from January 1 to October 13, 2021. The site (100°6'3.62"E, 37°31'15.67"N) is located near Dasi, Shaliuhe Town, Gangcha County, Qinghai Province. Data are missing due to instrument failure. The elevation is 3495 m. The EC system was installed at a height of 2.5 m, and the sampling rate was 10 Hz. The sonic anemometer faced north, and the separation distance between the sonic anemometer and the CO2/H2O gas analyzer (Gill & Li7500A) was about 0.17 m. The raw data acquired at 10 Hz were processed using the EddyPro post-processing software, including spike detection, lag correction of H2O/CO2 relative to the vertical wind component, sonic virtual temperature correction, coordinate rotation (2-D rotation), correction for density fluctuations (Webb-Pearman-Leuning correction), and frequency response correction. The EC data were subsequently averaged over 30 min periods. The quality of the observations was graded into nine classes according to the stationarity test (Δst) and the integral turbulent characteristics test (ITC): classes 1-3 (high quality), classes 4-6 (good), classes 7-8 (poor, but better than gap-filled data), and class 9 (rejected). In addition to the above processing steps, the half-hourly flux data were screened in a four-step procedure: (1) data from periods of sensor malfunction were rejected; (2) data collected within 1 h before or after precipitation were rejected; (3) incomplete 30 min data were rejected when the missing data constituted more than 3% of the 30 min raw record; and (4) data were rejected at night when the friction velocity (u*) was less than 0.1 m/s. There were 48 records per day, and the missing data were replaced with -6999.
The released data contain the following variables: DATE/TIME, wind direction (Wdir, °), wind speed (Wnd, m/s), the standard deviation of the lateral wind (Std_Uy, m/s), virtual temperature (Tv, ℃), H2O mass density (H2O, g/m3), CO2 mass density (CO2, mg/m3), friction velocity (ustar, m/s), stability (z/L), sensible heat flux (Hs, W/m2), latent heat flux (LE, W/m2), carbon dioxide flux (Fc, mg/(m2 s)), quality assessment of the sensible heat flux (QA_Hs), quality assessment of the latent heat flux (QA_LE), and quality assessment of the carbon flux (QA_Fc). The quality marks of the sensible heat flux, latent heat flux and carbon flux have three levels (marks 0 and 1 indicate good data quality, and 2 indicates poor data quality). In this dataset, a time of 0:30 corresponds to the average over the period from 0:00 to 0:30; the data are stored in *.xls format. Detailed information can be found in the suggested references.
Li Xiaoyan