Journal of Global Change Data & Discovery2025.9(1):59-70

[PDF] [DATASET]

Citation:Li, S. S., Li, J. L., Du, W. B., et al.Dataset Development on the Temporal and Spatial Distribution of Surface Reservoirs in Xinjiang Uygur Autonomous Region (1942–2022)[J]. Journal of Global Change Data & Discovery,2025.9(1):59-70 .DOI: 10.3974/geodp.2025.01.08 .

Dataset Development on the Temporal and Spatial Distribution of Surface Reservoirs in Xinjiang Uygur Autonomous Region (1942?C2022)

Li, S. S.1,2  Li, J. L.1,3*  Du, W. B.1,2  Liu, S. Q.1,3,4  Wang, H. Y.1,3,4  Jin, J. Y.1,3,4

1. Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China;

2. School of Surveying and Mapping and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, China;

3. Key Laboratory of GIS & RS Application, Xinjiang Uygur Autonomous Region, Urumqi 830011, China;

4. University of Chinese Academy of Sciences, Beijing 100049, China

 

Abstract: Obtaining the temporal and spatial distribution of reservoirs in Xinjiang is crucial for analyzing the migration of water resources and the ecological evolution of oases and deserts. Based on the Sentinel-2 data in 2022, this study extracted the spatial distribution range of reservoirs larger than 0.001 km2 in Xinjiang, and collected the attribute information of reservoirs by integrating sources such as county chronicles and yearbooks, mainly including the name, longitude and latitude, average elevation, completion time, total storage capacity and maximum area, and basin of the reservoir. Based on to the water conservancy project standards and storage capacity, the reservoirs were classified into large, medium and small reservoirs. The results indicate that as of 2022, a total of 804 reservoirs have been constructed in Xinjiang, with a total storage capacity of 24.16 km3. Of these, there are 37 large, 175 medium and 592 small reservoirs, and 461 plain reservoirs and 343 mountain reservoirs. After 1980, the proportion of large mountain reservoirs has significantly increased. This dataset includes: (1) spatial distribution data of reservoirs in Xinjiang, 1942-2022; (2) reservoir attribute data, which includes records of attributes such as the reservoir name, area, capacity, built year, elevation, location, and river basin, etc. The dataset is archived in .shp and .xlsx formats, and consists of 9 data files with data size of 21.4 MB (compressed into 1 file with 5.86 MB).

Keywords: reservoirs; spatio-temporal changes; Xinjiang; remote sensing

DOI: https://doi.org/10.3974/geodp.2025.01.08

Dataset Availability Statement:

The dataset supporting this paper was published and is accessible through the Digital Journal of Global Change Data Repository at: https://doi.org/10.3974/geodb.2025.01.01.V1.

1 Introduction

Reservoirs are artificial water storage facilities built by damming on rivers to retain water. By storing and releasing water, they enable the spatio-temporal allocation of water resources and play crucial roles in agricultural irrigation, flood control, power generation, and urban water supply[1]. Xinjiang is situated in arid and semi-arid regions with scarce precipitation, limited water resources, and uneven spatio-temporal distribution. Reservoirs are an critical means to ensure sustainable water supply[2]. Xinjiang began constructing reservoirs in the 1940s, and since the founding of the People??s Republic of China, the scale of reservoir and water conservancy projects has continuously expanded[2,3]. Mountain control reservoirs, reservoirs in the middle and lower reaches of rivers, and irrigation canals have gradually developed into a multi-objective reservoir joint scheduling system[4], significantly improving the efficiency of water resource utilization, contributing to the expansion of artificial oases and cultivated land. Reservoirs have become a vital water source for the development of oasis agriculture, as well as production and daily life. Analyzing the historical changes in reservoir construction in Xinjiang is of crucial for understanding the development and utilization of water resources, as well as the expansion of artificial oases.

Remote sensing technology is a crucial tool for obtaining reservoir information[5?C7]. A number of spatial reservoir datasets based on remote sensing technology have been released domestically and internationally, including the Global Georeferenced Database of Dams (GOODD)[8], the Global Reservoir and Lake Surface Area Dataset (ReaLSAT)[9], the Georeferenced Global Dam and Reservoir Dataset (GeoDAR)[10], the Global Dam Tracker Database (GDAT)[11], the China Lake, Dam, Reservoir, and Large waterbody Dataset (China-LDRL)[12], the China Reservoir Dataset (CRD)[13], and the 2016?C2021 China Reservoir List[14]. Li[15] analyzed the characteristics and distribution patterns of reservoir dams in Xinjiang using the most recent and comprehensive reservoir data. However, this dataset only presents statistical data, lacking spatial attributes. Reservoir datasets derived from remote sensing not only include information on reservoir area and spatial location, but also attributes such as capacity, function, and type, which have been instrumental in research on water resource management and allocation[16]. However, most of these datasets contain incomplete descriptions of reservoir information in Xinjiang. For instance, GeoDAR provides information for only 16 reservoirs in Xinjiang, while the CRD dataset includes 673 records and is currently the most comprehensive. However, this dataset is based on published reservoir dam products and does not account for reservoirs constructed in recent years or some smaller reservoirs built in earlier stages. The primary reason for the discrepancy in the number of reservoirs across datasets is that many reservoirs in Xinjiang are small in area and capacity[17,18], leading to missed extractions when using the Landsat 30-m data source. Furthermore, the past decade has seen rapid development in reservoir construction in Xinjiang, and many current datasets lack recent updates on reservoir information.

To this end, this study uses Sentinel-2 images10-m data from the Xinjiang region in 2022 as the data source to extract the most recent reservoir distribution information. Based on this, combined with Landsat series satellite images since 1986 onwards, yearbooks, local chronicles, and other sources, attributes such as the construction time, reservoir capacity, elevation, and area are gathered. The spatio-temporal characteristics of reservoir construction in Xinjiang across different historical periods are then analyzed.

2 Metadata of the Dataset

The metadata of the Spatio-temporal distribution dataset of surface reservoirs in Xinjiang Uygur Autonomous Region (1942-2022)[19] is summarized in Table 1, which includes the name, author, geographical area, year of the dataset, spatial resolution, dataset files, data publisher, sharing service platform, and data sharing policy, etc.

 

Table 1  Metadata summary of the Spatio-temporal distribution dataset of surface reservoirs in Xinjiang Uygur Autonomous Region (1942-2022)

Items

Description

Dataset full name

Spatio-temporal distribution dataset of surface reservoirs in Xinjiang Uygur Autonomous Region (1942-2022)

Dataset short name

ReservoirXinjiang_1942-2022

Authors

Li, S. S., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, School of Surveying and Mapping and Land Information Engineering, Henan Polytechnic University, 212104020037@home.hpu.edu.cn
Li, J. L., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Key Laboratory of GIS & RS Application, Xinjiang Uygur Autonomous Region, lijl@ms.xjb.ac.cn

Du, W. B., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, School of Surveying and Mapping and Land Information Engineering, Henan Polytechnic University, dwb@hpu.edu.cn
Liu, S. Q., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Key Laboratory of GIS & RS Application
??Xinjiang Uygur Autonomous Region, University of Chinese Academy of Sciences, liushuaiqi22@mails.ucas.ac.cn
Wang, H. Y., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Key Laboratory of GIS & RS Application
??Xinjiang Uygur Autonomous Region, University of Chinese Academy of Sciences, haoyu.wang@ugent.be
Jin, J. Y., Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Key Laboratory of GIS & RS Application, Xinjiang Uygur Autonomous Region, University of Chinese Academy of Sciences, jinjingyu22@mails.ucas.ac.cn

Geographical region

Xinjiang

Year

1942-2022

Spatial resolution

10 m (existing reservoirs); 30 m (disappeared reservoirs)

Data format

.shp, .xlsx

Data size

5.86 MB (after compression)

Data files

The total list and the spatial distribution of reservoirs in Xinjiang (1942-2022)

Foundations

Natural Science Foundation of Xinjiang Uygur Autonomous Region (2022D01E18); National Natural Science Foundation of China (U2003201, 41671034)

Computing platform

ArcGIS

Data publisher

Global Change Research Data Publishing & Repository, http://www.geodoi.ac.cn

Address

No. 11A, Datun Road, Chaoyang District, Beijing 100101, China

Data sharing policy

(1) Data are openly available and can be free downloaded via the Internet; (2) End users are

encouraged to use Data subject to citation; (3) Users, who are by definition also value-added service providers, are welcome to redistribute Data subject to written permission from the GCdataPR Editorial Office and the issuance of a Data redistribution license; and (4) If Data are used to compile new datasets, the ??ten percent principal?? should be followed such that Data records utilized should not surpass 10% of the new dataset contents, while sources should be clearly noted in suitable places in the new dataset[20]

Communication and

searchable system

DOI, CSTR, Crossref, DCI, CSCD, CNKI, SciEngine, WDS, GEOSS, PubScholar, CKRSC

3 Methods   

3.1 Data Sources

The research generates a spatiotemporal distribution dataset of surface reservoirs in Xinjiang (1942-2022) by integrating remote sensing images and yearbook archives. The primary data source is Sentinel-2 imagery from April to October 2022, used to capture the maximum spatial extent of reservoirs, with full coverage of the entire Xinjiang region achieved at least once per month. Additionally, combined with the collected chronicles and yearbooks from all administrative levels in Xinjiang, attributes such as reservoir capacity and construction time are extracted. A time series of reservoir area is generated using Landsat series remote sensing images to supplement the construction time of reservoirs lacking documentation records. The average elevation of reservoirs is derived from SRTM DEM. The data sources and their applications in this study are shown in Table 2.

 

Table 2  Data sources and applications

Data source

Temporal resolution

Spatial resolution

Count

Applications

Sentinle-2

5 days

10 m

2,392

Obtain the maximum water surface of the reservoir

Chronicles[21?C26]

year

/

103

Obtain the reservoir capacity and construction year

Landsat imagery

16 days

30 m

13,056

Obtain the construction time of the reservoir

SRTM DEM

/

20 m

/

Obtain the average elevation of the reservoir

 

For this study, cloud-free or less cloudy and snow-free remote sensing images were selected, including Sentinel-2 and Landsat series images. Sentinel-2 images[1] were acquired from April to October 2022, with a spatial resolution of 10 m and a revisit period of 5 days, which facilitates the monitoring of small water bodies with higher precision. The Level-1C products used in the study are available from the ESA official website. Landsat images were acquired from 1985 to 2022, including all available images from Landsat 5 TM, Landsat 7 ETM+, Landsat 8 OLI, and Landsat 9 OLI-2, forming a 40-years time series. The L1TP (Collection 2 T1 Level 1 Precision Terrain Corrected) data[2] used have undergone radiation calibration and high-precision orthorectification based on DEM data and are available from the Earth Explorer website of the United States Geological Survey (USGS).

3.2 Methodological Framework

The research and development process of the Spatio-temporal distribution dataset of surface reservoirs in Xinjiang (1942-2022) involves several key steps, including the extraction of the full water body range using a deep learning model, reservoir discrimination, determination of the reservoir construction time based on the long-time series, establishment of the attribute database, and integration of the attribute database with the spatial database (Figure 1).

3.2.1 Extraction of Water Body

U-Net is a widely used water body mapping model[27], particularly effective for mapping small water bodies in high-resolution images[28]. In this study, the U-Net model was employed to generate the water body map of Xinjiang from 2,392 Sentinel-2 images from April to October 2022. Initially, 300 water body samples covering the entire Xinjiang were created, each with dimensions of 1,500 ?? 1,500 pixels, with 80% used for model training and 20% for validation. The images and corresponding labels were cropped to a size of 256 ?? 256 pixels. To increase the number of training samples, the images were enhanced 108 times using techniques such as horizontal flipping, transposition, vertical flipping, rotation. The U-Net model was then trained with an initial learning rate of 0.005, a batch size of 8, and 300 epochs per network. The loss function used during training was FocalLoss, which

 

Figure 1  Flowchart of reservoir dataset development

 

effectively addresses class imbalance in segmentation compared to other classification functions. The U-Net model was executed on a workstation equipped with an NVIDIA Quadro RTX A6,000 graphics card, running under the Cuda framework in Python. The precision, overall accuracy, and recall rates of this model were 93%, 91%, and 88% respectively, demonstrating strong performance.

Based on the trained U-Net model, the area time series of all types of water bodies was generated using all cloud-free Sentinel-2 images from April to October 2022 and subsequently verified through manual editing. Finally, the maximum area was calculated using the area range from the time series. The goal was to uniformly represent its boundary using the maximum extent corresponding to the largest reservoir capacity. Although poor image quality and extensive cloud cover may lead to the omission of the maximum reservoir area, this was overlooked due to the high temporal resolution of Sentinel-2.

3.2.2 Reservoir Identification

Based on the water body extraction results, this study differentiates reservoirs from other water bodies, such as lakes and rivers, using auxiliary data, including river network data and high-resolution base maps. Reservoir identification in this study follows three steps: initial identification based on river network data, visual interpretation using high-resolution online images, and cross-validation with other datasets.

First, a 500-m buffer zone was created based on the high-resolution river network and used as a mask to intersect with the extracted water body data, thereby isolating reservoir elements. Additionally, to avoid overlooking diversion reservoirs located more than 500 m from the river, the study intersected the 500-m buffer zone mask, created from the river network data, with the water body dataset once more. According to the dataset results, only 58 reservoirs were located outside the buffer zone after this step.

Subsequently, the study refined and corrected the initial reservoir screening results using higher-resolution online images. A visual inspection was conducted to assess whether a dam was near the water body polygon. If no dam was found, the water body was classified as a lake or other types. If a dam near was present near the water body, it was added to the dataset as a reservoir.

Finally, the spatial range of reservoirs obtained was cross-validated with existing datasets. The publicly available reservoir dam datasets GeoDAR, China-LDRL, and CRD were selected and spatially matched with the dataset of this study. These datasets were used to supplement the missing spatial records in this study. Additionally, it was confirmed whether the spatial elements identified in this study that were absent in the reference datasets were reservoirs. Ultimately, the exact locations and extents of all reservoirs were manually verified.

3.2.3 Attribute Compilation

Information on characteristics such as reservoir storage capacity, completion time, and catchment area are obtained from local chronicles, yearbooks, literature records, and other sources. The total storage capacity values of reservoirs are primarily derived from yearbooks, county annals, and online search results. The completion times of reservoirs are obtained using 2 methods. First, the completion times of reservoirs are gathered from textual records, with accurate completion years identified for 716. Second, the completion or abandonment times of 81 reservoirs is determined based on Landsat time-series images. It is assumed that the year in which the annual average area of the reservoir water surface exceeds 0 represents the completion year of the reservoir. Information on the completion or abandonment times of only 7 reservoirs cannot be obtained. The elevation values of the reservoirs are derived from SRTM DEM data. The average elevation corresponding to the spatial extent of each reservoir is extracted from the SRTM DEM data and used as the elevation value of the reservoir.

Additionally, for the purpose of statistical analysis, reservoirs are classified according to their capacity and water inflow characteristics. According to the reservoir capacity[29], reservoirs with a capacity greater than 108 m3 are classified as large reservoirs, those with a capacity less than 107 m3 as small reservoirs, and those with a capacity between the two as medium reservoirs. Based on flow characteristics and referencing Chinese mountainous area spatial range data[30,31], reservoirs are classified into two types: mountainous reservoirs and plain reservoirs.

3.2.4 Bridging Attributes and Geolocation

The name of a reservoir is a crucial attribute that links the spatial and the attribute databases, serving as a marker to connect spatial location with attribute information. The names of reservoirs, along with attributes describing their geographical locations, such as admini­strative regions and associated river basins, are sourced from Google Map or AMAP. The names are then assigned to the reservoir vector polygons at their corresponding spatial locations. Finally, using the reservoir names, a spatial join tool is employed to link the spatial extent with the attribute information, including reservoir name, total storage capacity, maximum area, elevation, completion year, and reservoir type, among others.

4 Data Results and Validation

4.1 Dataset Composition

The dataset is the Spatio-temporal distribution dataset of surface reservoirs in Xinjiang (1942-2022), comprising both spatial distribution data and tabular data. The spatial data includes the distribution data (.shp) of large, medium-sized and small reservoirs in Xinjiang from 1942 to 2022. The tabular data contains the general inventory of Xinjiang Reservoirs (1942-2022) with the attributes. 

4.2 Data Results

4.2.1 Spatial Distribution Characteristics of Reservoirs

A total of 804 reservoirs (area > 0.001 km2) have been constructed in Xinjiang, with a total storage capacity of 24.16 km3. Among them, 37 are large reservoirs, 175 are medium reservoirs, and 592 are small reservoirs, with storage capacities of 16.583 km3, 6.025 km3, and 1.552 km3 respectively. The statistical results (Figure 2, 3) indicate that in the northern Xinjiang prefectures, Altay, Ili, Tacheng, Changji, Bortala, and Urumqi have constructed 119, 45, 123, 125, 11, and 37 reservoirs, respectively, with corresponding storage capacities of 5.290 km3, 5.093 km3, 1.478 km3, 0.834 km3, 0.140 km3, and 0.715 km3. In eastern Xinjiang, Hami and Turpan have 62 and 15 reservoirs, respectively, with storage capacities of 0.206 km3 and 0.220 km3. In southern Xinjiang, Kashgar, Hotan, Aksu, Bayingolin Mongol Autonomous Prefecture, and Kezilesu Kirgiz Autonomous Prefecture have 79, 57, 30, 29, and 23 reservoirs, respectively, with storage capacities of 4.861 km3, 0.899 km3, 1.526 km3, 1.184 km3, and 1.107 km3. The prefecture-level cities of Cocodala, Beitun, Karamay, Shuanghe, Aral, Huyanghe, Wujiaqu, Kunyu, and Tiemenguan have constructed a total of 49 reservoirs, with a total storage capacity of 0.608 km3. Overall, the number of reservoirs in northern Xinjiang exceeds that in southern Xinjiang, and the distribution of reservoirs in northern Xinjiang is more concentrated, primarily in Changji, Bortala, Altay, and Ili. In contrast, reservoirs in southern Xinjiang are mainly distributed near the tributaries of the Tarim River.

The number of reservoirs ineach river basin is ranked as follows: Manas Lake Basin (134), Irtysh River Basin (101), Bogda Region (79), Kunlun Region (60), Ili River Basin (56), Abey Lake Basin (56), Kashgar River Basin (55), Yarkant River Basin (48), Aral Lake Basin (45), Barkol-Yiwu Basin (37), Ulungur River Basin (30), Hami Basin (25), Tarim River Mainstream (22), Ayding Lake Basin (18), Aksu River Basin (13), Lop Nur Region (13), Ugan River Basin (9), and Qarqan River Basin (3). The proportion of small reservoirs is relatively high in each river basin. For example, in the Manas Lake Basin, which has the highest number of reservoirs, the proportion of small reservoirs reaches 71%. The storage capacity of each river basin is ranked as follows: Ili River Basin (5.082 km3), Irtysh River Basin (4.583 km3), Yarkant River Basin (4.437 km3), Manas Lake Basin (2.241 km3), Kashgar River Basin (1.533 km3), Kunlun Region (0.987 km3), Ugan River Basin (0.875 km3), Tarim River Mainstream (0.836 km3), Ulungur River Basin (0.803 km3), Abey Lake Basin (0.637 km3), Aksu River Basin (0.438 km3), Lop Nur Region (0.430 km3), Aral Lake Basin (0.365 km3), Bogda Region (0.312 km3), Ayding Lake Basin (0.223 km3), Qarqan River Basin (0.171 km3), Hami Basin (0.114 km3), and Barkol-Yiwu Basin (0.092 km3). The proportion of storage capacity held by large reservoirs is relatively high in each river basin. In river basins with large runoff, such as the Ili River Basin, Irtysh River Basin, and Yarkant River Basin, the proportion of storage capacity held by large reservoirs exceeds 80%. This is because in these areas with large runoff, large reservoirs possess higher regulation and storage capabilities.

4.2.2 Distribution of Reservoir Construction Years

From 1942 to 2022, the number of reservoirs in Xinjiang has steadily increased. Based on the quantity and characteristics of reservoir construction, this period can be divided into four stages: 1940-1960, 1960-1980, 1980-2000, and 2000-2022. The 1960-1980 and 2000-2010 periods were particularly peak years of reservoir construction. During 1940-1960, a total of 73 reservoirs were built, with a cumulative storage capacity of 1.18 km3. Small and medium reservoirs accounted for 96% of the total number, but only 68% of the total capacity. In the 1960-1980 period, 292 reservoirs were constructed, with a total

 

Figure 2  Spatial distribution map of reservoirs in Xinjiang

 

 

Figure 3  Statistical diagram of the number and volume of reservoirs in Xinjiang

 

storage capacity of 3.24 km3. Among these, large, medium, and small reservoirs accounted for 2%, 22%, and 76% of the total number, respectively. The corresponding proportions of storage capacity were 30%, 55%, and 15%. During this period, the construction of small and medium reservoirs still dominated. From 1980 to 2000, 192 reservoirs were built, with a total storage capacity of 3.30 km3. The proportions of large, medium, and small reservoirs in terms of number were 77%, 20%, and 3%, respectively. In terms of storage capacity, the proportions were 51%, 39%, and 10%, respectively. During 2000-2022, 224 reservoirs were built, including 21 large reservoirs, 54 medium reservoirs, and 149 small reservoirs. Compared with the previous periods, the number of large reservoirs has increased significantly. The total storage capacity during this period is 16.36 km3, with the proportions of large, medium, and small reservoirs being 83%, 14%, and 3%, respectively. The storage capacity of large reservoirs has also risen considerably. Overall, while the number of reservoirs has not increased dramatically, the total storage capacity has seen significant growth, particularly after 2005. This is mainly due to the construction of large reservoirs. It can be observed that large reservoirs, particularly in mountainous areas, play a crucial role in water regulation and storage. The construction of reservoirs in Xinjiang has gradually shifted towards large reservoirs in these areas. The trend of reservoir construction in Xinjiang over the years is illustrated in Figure 4.

 

Figure 4  Statistical charts on the number and volume of reservoirs built over the years

 

4.2.3 Construction of Plains and Mountain Reservoirs

From 1942 to 2022, a total of 461 plain reservoirs and 343 mountainous reservoirs were constructed in Xinjiang, with total storage capacities of 8.59 km3 and 15.57 km3, respectively. As shown in the statistical charts (Figure 5), the number of plain reservoirs generally follows a decreasing trend, while the number of mountainous reservoirs shows a steady increase, along with their corresponding storage capacities. Before the 1980s, due to limited construction techniques and economic conditions, the focus was primarily on building plain reservoirs. Between 1940 and 1980, 273 plain reservoirs were constructed, with a total storage capacity of 3.77 km3. In comparison, only 92 mountainous reservoirs were built, with a total storage capacity of 0.71 km3. After 1980, the construction of mountainous reservoirs accelerated. From 1980 to 2000, 111 plain reservoirs were built, with a total storage capacity of 1.25 km3, while 81 mountainous reservoirs were constructed, with a total storage capacity of 1.99 km3. During this period, the numbers of plain and mountainous reservoirs became more comparable, but the storage capacity of mountainous reservoirs was nearly twice that of plain reservoirs. Since 2000, the construction of mountains reservoirs has far outpaced that of plain reservoirs. In this period, 59 plain reservoirs were built, with a storage capacity of 3.49 km3, while 165 mountainous reservoirs were constructed, with a storage capacity of 12.87 km3. Both the number and storage capacity of mountainous reservoirs now significantly exceed those of plain reservoirs.

 

 

Figure 5  Statistics of the number and volume of plain and mountain reservoirs (1940?C2022)

4.3 Comparison with Other Datasets

The spatial distribution and attribute information of reservoirs are essential for effective water resource management and ensuring sustainability. Remote sensing technology has become a key tool for monitoring reservoirs dynamically, both at the global and regional scales. In addition to remote sensing, public statistical reports provide valuable record of reservoir data. In this study, a spatio-temporal distribution dataset of surface reservoirs in Xinjiang was developed by integrating high-resolution remote sensing imagery with records from water conservancy project archives.

Most existing surface water body change datasets conflate reservoirs with other types of water bodies, such as lakes, or fail to separately identify reservoirs, leading to an underestimating of the number of reservoirs in Xinjiang. For instance, GLAKES[28] dataset only labels 19 water body elements as reservoirs. While some inventories specifically focus on reservoirs in Xinjiang, their practical application is often limited due to the lack of geographical references. Li??s study[15], which analyzed the spatio-temporal distribution of reservoirs in Xinjiang using statistical data, reported that by 2022, 751 reservoirs had been built, with a total storage capacity of 29.776 km3. However, the findings of this study indicate that, excluding abandoned or non-functional reservoirs, there are currently 776 operating reservoirs, with a total storage capacity of 23.97 km3. Although the results are similar, Li??s study did not provide spatial locations on these reservoirs. Global reservoir datasets with spatial locations often overlook local reservoirs, resulting in significant discrepancies in the number of reservoirs in Xinjiang. For example, the GeoDAR[10] dataset only identifies 16 reservoirs in Xinjiang, and most of their attribute information is not publicly available. The CRD[13] dataset is the most comprehensive, listing 673 reservoirs with the spatial locations and some attribute like storage capacity and area. However, it lacks information on the construction time of reservoirs, making it difficult to analyze temporal changes in reservoir characteristics. In contrast (Table 3), this study, based on high- resolution remote sensing imagery, provides a more comprehensive map of Xinjiang??s reservoirs, visually illustrating their spatial distribution. Furthermore, this dataset integrates records from water conservancy project archives, assigning attributes like storage capacity, area, and elevation to each reservoir, and analyzing the spatio-temporal changes in reservoir characteristics by incorporating the construction time data.

 

Table 3  Comparison of ReservoirXinjiang_1942?C2022 with other datasets

Dataset

Domain

Production time

Number of reservoirs (Xinjiang)

Total volume
(Xinjiang)

Attributes

GLAKES[28]

Global

2022

19

/

coordinate, area, water source, type

GeoDAR[10]

Global

2022

16

9.17 km3 (Accessible)

coordinate, area, capacity, reference data sources

CRD[13]

China

2022

673

30.41 km3

name, coordinate, prefecture, area, storage, type, shape, length

Li??s study[15]

Xinjiang

2022

751

29.78 km3

distribution, total capacity

ReservoirXin-
jiang_1942
-2022

Xinjiang

2022

776

23.97 km3

name, coordinate, area, volume, shape length, altitude, built year, river, basin, prefecture, type

5 Discussion and Conclusion

Reservoirs are key indicators of a region??s water resource regulation and storage capacity, making their spatial distribution and characteristics essential data for water resource management. In this paper, the extent of reservoirs in Xinjiang was mapped using high-resolution remote sensing images, and an attribute database was established by integrating historical archives. This dataset covers the spatial distribution of reservoirs built in Xinjiang from 1942 to 2022 and illustrates the development of reservoir construction over time by incorporating construction dates. Through data analysis, it was found that since the founding of the People??s Republic of China, a total of 804 reservoirs have been built in Xinjiang, with a total storage capacity of 24.16 km3. Small and medium reservoirs dominate, making up approximately 95% of the total number of reservoirs. However, after 2005, the proportion of large reservoirs has increased. In terms of reservoir types, plain reservoirs predominated before 2010, accounting for 67% of the total. Since 2010, the construction of mountainous reservoirs has accelerated with these reservoirs now accounting for 83% of the total number of reservoirs built during this period.

Compared to existing datasets, the greatest advantage of this dataset is its inclusion of time attribute for reservoirs through two methods: historical records and long-time remote sensing time series. This approach addresses the limitations of single-phase remote sensing, where monitoring the water surface changes of reservoirs can be hindered by insufficient temporal resolution and the quality constraints of remote sensing images. With the provided time attribute, this dataset effectively captures the spatio-temporal changes of reservoirs in Xinjiang and the development of water conservancy, offering valuable insights for water resource utilization and planning. Additionally, by integrating multi source remote sensing images and historical archive materials, the dataset combines the strengths of both data sources, creating a completer and more reliable spatial and attribute database for Xinjiang??s reservoirs. The Spatio-temporal distribution dataset of surface reservoirs in Xinjiang (1942-2022) provides comprehensive data support for water resource allocation and enhancing water resource utilization efficiency, while also serving as a reference for future reservoir construction in Xinjiang.

 

Author Contributions

Li, J. L. and Du, W. B. made the overall design for the dataset development; Li, S. S. and Jin, J. Y. collected and processed data on the max extent and attributes of reservoirs in Xinjiang; Li, S. S. and Wang, H. Y. designed the models and algorithms; Li, S. S. and Liu, S. Q. completed the data validation; Li, S. S. and Li, J. L. wrote the paper.

 

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1]        Chinese Academy of Engineering, ??Research on water resources strategy for sustainable development of China in the 21st century?? project team. Comprehensive report on China??s sustainable development water resources strategy [J]. China Water Resources, 2000(8): 5?C17.

[2]        Tang, Q. C. The development in oases and rational use of water resources [J]. Journal of Arid Land Resources and Environment, 1995, 9(3): 107?C112.

[3]        Deng, M. J. ??Three Water Lines?? strategy: its spatial patterns and effects on water resources allocation in northwest China [J]. Acta Geographica Sinica, 2018, 73(7): 1189?C1203.

[4]        Liu, T. Research on joint-dispatching of mountain-plain reservoirs in the Yarkant River Basin [J]. Hydro Science and Cold Zone Engineering, 2023, 6(4): 68?C71.

[5]        Perin, V., Tulbure, M. G., Gaines, M. D., et al. A multi-sensor satellite imagery approach to monitor on-farm reservoirs [J]. Remote Sensing of Environment, 2022, 270: 112796.

[6]        Vu, D. T., Dang, T. D., Galelli, S., et al. Satellite observations reveal 13 years of reservoir filling strategies, operating rules, and hydrological alterations in the Upper Mekong River basin [J]. Hydrology and Earth System Sciences, 2022, 26(9): 2345-2364.

[7]        Liu, C., Tang, H. R., Ji, L. Y., et al. Spatial-temporal water area monitoring of the Miyun Reservoir using remote sensing imagery from 1984 to 2020 [J]. National Remote Sensing Bulletin, 2023, 27(2): 335?C350.

[8]        Mulligan, M., van Soesbergen, A., S??enz, L. GOODD, a global dataset of more than 38,000 georeferenced dams [J]. Scientific Data, 2020, 7(1): 31.

[9]        Khandelwal, A., Karpatne, A., Ravirathinam, P., et al. ReaLSAT, a global dataset of reservoir and lake surface area variations [J]. Scientific Data, 2022, 9(1): 356.

[10]     Wang, J., Walter, B. A., Yao, F., et al. GeoDAR: georeferenced global dams and reservoirs dataset for bridging attributes and geolocations [J]. Earth System Science Data, 2022, 14(4): 1869-1899.

[11]     Zhang, A. T., Gu, V. X. Global Dam Tracker: a database of more than 35,000 dams with location, catchment, and attribute information [J]. Scientific Data, 2023, 10(1): 111.

[12]     Wang, X., Xiao, X., Qin, Y., et al. Improved maps of surface water bodies, large dams, reservoirs, and lakes in China [J]. Earth System Science Data, 2022, 14(8): 3757-3771.

[13]     Song, C., Fan, C., Zhu, J., et al. A comprehensive geospatial database of nearly 100,000 reservoirs in China [J]. Earth System Science Data, 2022, 14(9): 4017-4034.

[14]     Zhang, H., Chen, F., Wang, L., et al. Reservoir inventory for China in 2016 and 2021 [J]. Scientific Data, 2023, 10(1): 609.

[15]     Li, J., Liu, Y., Peng, Z. X., et al. Characteristics and construction managementideas of reservoir dams in Xinjiang [J]. China Water Resources, 2023(16): 41-46.

[16]     Lehner, B., Liermann, C. R., Revenga, C., et al. High-resolution mapping of the world??s reservoirs and dams for sustainable river-flow management [J]. Frontiers in Ecology and the Environment, 2011, 9(9): 494-502.

[17]     Li, J., Liu, Y., Wu, T., et al. 70 years of dam construction in Xinjiang [J]. Journal of China Institute of Water Resources and Hydropower Research, 2020, 18(5): 322-330.

[18]     Deng, M. J., Yu, H. M., Li, X. Q., et al. Dam construction progress in Xinjiang [J]. Water Resources and Hydropower Engineering, 2010, 41(7): 29-35.

[19]     Li, S. S., Li, J. L., Du, W. B., et al. Spatio-temporal dataset of reservoirs in Xinjiang Uygur Autonomous Region, China (1942-2022) [J/DB/OL]. Digital Journal of Global Change Data Repository, 2025. https://doi.org/10.3974/geodb.2025.01.01.V1.

[20]     GCdataPR Editorial Office. GCdataPR data sharing policy [OL]. https://doi.org/10.3974/dp.policy.2014.05 (Updated 2017).

[21]     Fuyun County Party History Local Chronicles Compilation Committee. Fuyun County Chronicle [M]. Urumqi: Xinjiang People??s Publishing House, 2003.

[22]     Qiemo County Local Chronicles Compilation Committee. Qiemo County Chronicle [M]. Urumqi: Xinjiang People??s Publishing House, 1996.

[23]     Kuitun City Local Chronicles Compilation Committee. Kuitun City Chronicle [M]. Beijing: Zhonghua Book Company, 1999.

[24]     Kashgar Local Chronicles Compilation Committee. Kashgar Prefecture Chronicles [M]. Urumqi: Xinjiang People??s Publishing House, 2004.

[25]     Korla City Local Chronicles Compilation Committee. Korla City Chronicle 1991?C2005 [M]. Urumqi: Xinjiang People??s Publishing House, 2017.

[26]     Xinjiang Aksu River Basin Management Office. Chronicles of the Aksu River Basin [M]. Beijing: Local Records Publishing House, 2006.

[27]     Zhao, X., Wang, H., Liu, L., et al. A method for extracting lake water using ViTenc-UNet: taking typical lakes on the Qinghai-Tibet Plateau as examples [J]. Remote Sensing, 2023, 15(16): 4047.

[28]     Pi, X., Luo, Q., Feng, L., et al. Mapping global lake dynamics reveals the emerging roles of small lakes [J]. Nature Communications, 2022, 13(1): 5777.

[29]     Ministry of Water Resources of the People??s Republic of China. Standard for rank classification and flood protection criteria of water and hydropower projects (SL 252??2017) [S]. Beijing: China Water & Power Press, 2017.

[30]     Deng, W., Li, A. N., Nan, X., et al. Digital Mountain Map of China [M]. Beijing: China Map Publishing House, 2015.

[31]     Nan, X., Li, A. N., Deng, W. Dataset of ??Digital mountain map of China?? (2015) [DB/OL]. A Big Earth Data Platform for Three Poles, 2022.



[1] European Space Agency. https://scihub.copernicus.eu/.

[2] NASA, USGS. https://earthexplorer.usgs.gov/.

Co-Sponsors
Superintend