一. Data overview This data interchange is the second data interchange of "genomics research on drought tolerance mechanism of typical desert plants in heihe basin", a key project of the major research program of "integrated research on eco-hydrological processes in heihe basin".The main research goal of this project is a typical desert sand Holly plants as materials, using the current international advanced a new generation of gene sequencing technology to the whole genome sequence and gene transcription of Holly group sequence decoding, so as to explore related to drought resistance gene and gene groups, and transgenic technology in model plants such as arabidopsis and rice) verify its drought resistance. 二, data content 1.Sequencing of the genome and transcriptome of lycophylla SPP. The genome size of Mongolian Holly was about 926 Mb, GC content 36.88%, repeat sequence proportion 66%, genome heterozygosity rate 0.56%, which indicated that the genome has many repeat sequences, high heterozygosity and belongs to a complex genome.Based on the predicted sequence results, we subsequently carried out in-depth sequencing of the genome of lysiopsis SPP. The obtained data were assembled to obtain a 937 Mb genome sequence (table 1), which was basically the same as the predicted genome size.Through to the sand Holly transcriptome sequencing and sequence assembly (table 2), received more than 77000 genes coding sequence (Unigene), these sequences are comments found that most of the gene sequence and legumes and soybean, garbanzo beans and bean has a higher similarity (figure 1), consistent with the fact of sand ilex leguminous plants. 一), and the sand Holly is a leguminous plants consistent with the fact. 2.Discovery of simple repeat sequence (SSR) molecular markers of sand Holly: There is a transcriptome data set of sand Holly in the network public database, and the sample collection site is zhongwei city, ningxia.But this is the location of the project team samples in minqin county, gansu province, in order to study whether this sand in different areas of the Holly sequence has sequence polymorphism, we first identify the minqin county plant samples in the genomes of simple sequence repeat (SSR) markers (table 3), and then, compares the transcriptome sequences of plant sample, found in part of SSR molecular marker polymorphism (table 4), these molecular markers could be used for the species of plant genetic map construction, QTL mapping and genetic diversity analysis in the study. 三, data processing instructions Sample collection place: minqin county, gansu province, latitude and longitude: N38 ° 34 '25.93 "E103 ° 08' 36.77".Genome sequencing: a total of 8 genomic DNA libraries of different sizes were constructed and determined by Illumina HiSeq 2500 instrument.Transcriptome sequencing: a library of 24 transcriptome mrnas was constructed and determined by Illumina HiSeq 4000. 四, the use of data and meaning We selected a typical desert plant as the research object, from the Angle of genomics, parse the desert plant genome and transcriptome sequences, excavated its precious drought-resistant gene resources, and to study their drought resistance mechanism of favorable sand Holly this ancient and important to the utilization of plant resources, as well as the heihe river basin of drought-resistant plant genetic breeding, ecological restoration and sustainable development.
The data set includes the river level observation data of point 2 in the dense runoff observation of the middle reaches of Heihe River from January 1, 2015 to December 31, 2015. The observation point is located in Heihe bridge, 312 National Road, Zhangye City, Gansu Province. The riverbed is sandy gravel with unstable section. The longitude and latitude of the observation point are n38.996667 °, e100.427222 °, altitude 1485m, river width 70m and 20m. Sr50 ultrasonic range finder is used for water level observation, with acquisition frequency of 30 minutes. The data includes the following parts: Water level observation, observation frequency 30 minutes, unit (CM); In 2015, the section of bridge no.2-312 was frequently disturbed by human beings. The dam was built within 1km of the upstream and downstream of the section. The unstable area of the hydrological section led to the disorder of the water level and flow curve. During the measurement, the stable flow and water level curve could not be obtained. The observation of water level is based on the manual observation of water level at 0:00 on January 1, 2015. In the later stage, the hydrological section of river undercut changes. The result is that the datum water level changes and negative value appears; Refer to Li et al. (2013) for hydrometeorological network or station information, and he et al. (2016) for observation data processing
The vegetation phenology data set of Heihe River basin provides remote sensing phenology products from 2012 to 2015. The spatial resolution is 1km and the projection type is sinusoidal. MODIS Lai product mod15a2 is used as the phenological remote sensing monitoring data source, and MODIS land cover classification product mcd12q1 is used as the auxiliary data set for extraction. The product algorithm first uses the time series data reconstruction method (bise method) to control the data quality of the input time series; then uses the main algorithm (logistic function fitting method) and the backup algorithm (piecewise linear fitting method) to extract the vegetation phenological parameters, realizes the complementary calculation method, guarantees the accuracy and improves the inversion rate. The algorithm can extract up to three growth cycles in a year, each growth cycle contains six data sets, including the start point of vegetation growth, the start point of growth peak, the end point of growth peak, the end point of growth, the fastest growth and the fastest decline. At the same time, it records the growth cycle type, growth season length, quality identification, etc., a total of 25 data sets. The phenology product reduces the missing rate of inversion, improves the stability of the product, and the data set is relatively reliable with rich information.
This data set includes the observation data of 25 water net sensor network nodes in Babao River Basin in the upper reaches of Heihe River from January 2015 to December 2015. 4cm and 20cm soil moisture / temperature is the basic observation of each node; some nodes also include 10cm soil moisture / temperature, surface infrared radiation temperature, snow depth and precipitation observation. The observation frequency is 5 minutes. The data set can be used for hydrological simulation, data assimilation and remote sensing verification. For details, please refer to "2015 data document 20160501. Docx of water net of Babao River in the upper reaches of Heihe River"
This data set is typical specific emissivity data set of Heihe River Basin. Data observation is from March 25, 2014 to June 30, 2015. Instrument: Portable Fourier transform infrared spectrometer (102f), hand-held infrared thermometer Measurement method: 102f was used to measure the radiation values of cold blackbody, warm blackbody, observation target and gold plate. Using the radiation value of the cold and warm blackbody, the 102f is calibrated to eliminate the influence of the instrument's own emission. By using the iterative inversion algorithm based on smoothness, the specific emissivity and the object temperature are inversed. The specific emissivity range is 8-14 μ m, and the resolution is 4cm-1. This data set contains the original radiation curves (in ASCII format) and recording files of cold blackbody, warm blackbody, measured target and gold plate obtained by 102f.
The 1km / 5day vegetation index (NDVI / EVI) data set of Heihe River basin provides a 5-day resolution NDVI / EVI composite product in 2015. The data uses the characteristics of China's domestic FY-3 satellite data with high time resolution (1 day) and spatial resolution (1km) to construct a multi angle observation data set. Based on the analysis of the multi-source data set and the existing composite vegetation index products and algorithms A global synthetic vegetation index product algorithm system based on multi-source data set is proposed. The vegetation index synthesis algorithm of MODIS is basically adopted, that is, the algorithm system of BRDF angle normalization method, cv-mvc method and MVC method based on the semi empirical walthal model. Using the algorithm system, the composite vegetation index is calculated for the first level data and the second level data, and the quality is identified. Multi-source data sets can provide more angles and more observations than a single sensor in a limited time. However, due to the difference of on orbit running time and performance of sensors, the observation quality of multi-source data sets is uneven. Therefore, in order to make more effective use of multi-source data sets, the algorithm system first classifies the quality of multi-source data sets, which can be divided into primary data, secondary data and tertiary data according to the observation rationality. The third level data are observations polluted by thin clouds and are not used for calculation. In the middle reaches of Heihe River, the verification results of farmland and forest areas show that the NDVI / EVI composite results of combined multi temporal and multi angle observation data are in good agreement with the ground measured data (RMSE = 0.105). Compared with the time series of MODIS mod13a2 product, it fully shows that when the time resolution is increased from 16 days to 5 days, a stable and high-precision vegetation index can describe the details of vegetation growth in detail. In a word, the NDVI / EVI data set of Heihe River Basin, which is 1km / 5day, comprehensively uses multi temporal and multi angle observation data to improve the estimation accuracy and time resolution of parameter products and better serves the application of remote sensing data products.
The 1 km / 5-day FVC data set of Heihe River basin provides the 5-day FVC synthesis results in 2015. The data uses the data of Terra / MODIS, Aqua / MODIS, and domestic satellites fy3a / MERSI and fy3b / MERSI to build a multi-source remote sensing data set with a spatial resolution of 1 km and a time resolution of 5 days. The whole country is divided into different vegetation divisions and land types, and the conversion coefficient of NDVI and FVC is calculated respectively. The conversion coefficient look-up table and 1km / 5-day synthetic NDVI product production area 1km / 5-day synthetic FVC product are used. In the Heihe River Basin, 1 km / 5-day synthetic FVC products can directly obtain vegetation coverage ratio through high-resolution data to reduce the impact of low-resolution data heterogeneity; in addition, select the typical period of vegetation growth and change, obtain the corresponding growth curve parameters of each pixel by fitting the vegetation index of each pixel time series; and then cooperate with land use map and vegetation classification map, To find the representative uniform pixel of the region to train the conversion coefficient of vegetation index. Compared with the results of high-resolution aster reference FVC in Heihe River Basin, the first step is to aggregate the aster products in Heihe River basin to 1km scale by combining the measured ground data and using the scale up method, and to obtain the aster aggregate FVC data, which is based on spot vegetation remote sensing data released by geoland 2 project (geov1 for short) The results show that the results of geov1 are higher than those of ASTER image combined with ground measurement, and the results of 1 km / 5-day synthetic FVC products in Heihe River Basin are between the two, and the results of 1 km / 5-day synthetic FVC products in Heihe River Basin in the experimental area are better than those of geov1 products. In a word, the comprehensive utilization of multi-source remote sensing data to improve the estimation accuracy and time resolution of FVC parameter products can better serve the application of remote sensing data products.
The 5-day Lai synthesis results in 2015 are provided by the 1 km / 5-day Lai data set of Heihe River Basin. The data set is constructed by using the data of Terra / MODIS, Aqua / MODIS, as well as the domestic satellites fy3a / MERSI and fy3b / MERSI to construct the multi-source remote sensing data set with a spatial resolution of 1 km and a time resolution of 5 days. Multi-source remote sensing data sets can provide more angles and more observations than a single sensor in a limited time. However, due to the difference of on orbit running time and performance of sensors, the observation quality of multi-source data sets is uneven. Therefore, in order to make more effective use of multi-source data sets, the algorithm first classifies the quality of multi-source data sets, which can be divided into first level data, second level data and third level data according to the observation rationality. The third level data are observations polluted by thin clouds and are not used for calculation. The purpose of quality evaluation and classification is to provide the basis for the selection of the optimal data set and the design of inversion algorithm flow. Leaf area index product inversion algorithm is designed to distinguish mountain land and vegetation type, using different neural network inversion model. Based on global DEM map and surface classification map, PROSAIL model is used for continuous vegetation such as grassland and crops, and gost model is used for forest and mountain vegetation. Using the reference map generated by the measured ground data of the forests in the upper reaches of Heihe River and the oasis in the middle reaches, and scaling up the corresponding high-resolution reference map to 1km resolution, compared with the Lai product, the product has a good correlation between the farmland and the forest area and the reference value, and the overall accuracy basically meets the accuracy threshold of 0.5%, 20% specified by GCOS. By cross comparing this product with Lais products such as MODIS, geov1 and glass, the accuracy of this Lai product is better than that of similar products compared with reference value. In a word, the synthetic Lai data set of 1km / 5 days in Heihe River Basin comprehensively uses multi-source remote sensing data to improve the estimation accuracy and time resolution of Lai parameter products, so as to better serve the application of remote sensing data products.
This data set includes the 2015 observation data of 9 water net nodes in the 5.5km × 5.5km observation matrix (red box in the thumbnail) of Yingke / Daman irrigation area in the middle reaches of Heihe River. The nine nodes contain 4cm and 10cm two-layer hydro probe II probes to observe the main variables such as soil moisture, soil temperature, conductivity and complex permittivity; the si-111 infrared temperature probe is set up at 4m height to observe the surface radiation infrared temperature of the underlying surface. The observation time frequency is 5 minutes. This data set can provide spatiotemporal continuous observation data set for remote sensing estimation of key water and heat variables of heterogeneous surface, remote sensing authenticity test, ecological hydrology research, irrigation optimization management and other research.
1、 Data Description: the data includes the river flow data at the outlet of No.2 catchment of hulugou small watershed from May 11, 2015 to September 3, 2015. 2、 Sampling location: the coordinates of river flow monitoring section are located at the outlet of No. 2 catchment near the red wall, with the coordinates of 99 ° 52 ′ 58.40 ″ E and 38 ° 14 ′ 36.85 ″ n.