Journal of Global Change Data & Discovery2022.6(1):58-64

[PDF] [DATASET]

Citation:Lai, X. L., Han, N. L., Huang, P. J.Spatial-Temporal Population Dataset of Hainan Island (2013 -2018)[J]. Journal of Global Change Data & Discovery,2022.6(1):58-64 .DOI: 10.3974/geodp.2022.01.08 .

Spatial-Temporal Population Dataset of Hainan Island (2013-2018)

Lai, X. L.  Han, N. L.*  Huang, P. J.

Department of Land Resources Management, School of Public Administration, Hainan University, Haikou 570228, China

 

Abstract: The spatial-temporal population dataset of Hainan Island (2013-2018) is based on the 2013-2018 NPP/VIIRS night light data, and performs Image synthesis, unstable light sources removal, and year-by-year corrections to eliminate the effects of transient light sources and background noise. Based on the correlation analysis between the processed night lighting data and population statistics data, a regression model is established, and the population spatial modeling is carried out according to the error classification, so as to eliminate the influence of mountain topography, small population and other factors on the simulation accuracy. Finally, the spatial distribution dataset of population in Hainan Island is generated. The dataset is archived in .tif format with a spatial resolution of 500 m and consists of 26 data files, 11.0 MB (compressed into one file, 168 KB), reflecting the characteristics of temporal and spatial distribution of population in Hainan Island from 2013 to 2018.

Keywords: Hainan Island; population; NPP/VIIRS night light data; regression model; population spatialization

DOI: https://doi.org/10.3974/geodp.2022.01.08

CSTR: https://cstr.escience.org.cn/CSTR:20146.14.2022.01.08

Dataset Availability Statement:

The dataset supporting this paper was published and is accessible through the Digital Journal of Global Change Data Repository at: https://doi.org/10.3974/geodb.2021.08.02.V1 or https://cstr.escience.org.cn/CSTR:20146.11.2021.08.02.V1.

1 Introduction

Nightlight remote sensing images can provide important basis for the estimation of socio-economic parameters such as GDP, population, electricity consumption, greenhouse gas emissions, poverty index and Gini coefficient and so on[1]. At present, the commonly used night light data are DMSP/OLS night light data and NPP/VIIRS night light data, both of which come from the National Oceanic and Atmospheric Administration (NOAA). The difference between them is that the time series of DMSP/OLS night light remote sensing image is from 1992 to 2013, and the spatial resolution is 1 km, while the time series of NPP/VIIRS night light remote sensing image is from 2012 to now, and the spatial resolution is 500 m. At the same time, NPP/VIIRS night light data were captured by Suomi-NPP sate­llite using VIIRS, using polar orbit, and obtained by stitching together multiple cloud-free images[2]. The NPP/VIIRS sensor has 22 bands, with a wavelength range of 0.4-12??m, covering the visible and infrared spectrum. Spectral resolution is 16 bit, and radiation detector with wider band and on-orbit radiometric correction technology effectively improve the quality of light image[3]. Thanks to its stronger light capture sensitivity, higher spatial and temporal resolution, NPP/VIIRS data is more suitable for small and medium-scale human activities research[4], and the spatial distribution of population information extracted from this can be intuitive reflects the characteristics of the temporal and spatial evolution of the population. Therefore, this dataset is processed based on NPP/VIIRS night lighting data, and the spatial distribution information of population in Hainan Island is obtained by correlation analysis and regression modeling with the demographic data of 18 cities and counties of Hainan Island, in order to provide data basis and decision-making basis for Hainan??s territorial spatial planning or scientific research.

2 Metadata of the Dataset

The metadata of Night lights and statistical data fusion of Hainan Island population 500-m grid dataset (2013-2018)[5]dataset is showed in Table 1.

 

Table 1  Metadata summary of the Night lighting and census integrated 500 m raster population dataset of Hainan Island (2013-2018)

Items

Description

Dataset full name

Night lighting and census integrated 500 m raster population dataset of Hainan Island (2013-2018)

Dataset short name

PopulationHaiNan_2013-2018

Authors

Lai, X. L., Hainan University, 1309842628@qq.com

Han, N.L., Hainan University, nlhan@hainu.edu.cn

Huang, P. J., Hainan University, 1538541468@qq.com

Geographical region

Hainan island                 Year      2013-2018

Temporal resolution

Year                        Spatial resolution   500 m

Data format

.tif

 

 

Data size

168 KB (After compression)

 

 

Data files

Regional stable NPP/VIIRS nighttime light dataset of Hainan island, 2013-2018
Hainan island population spatial dataset based on NPP/VIIRS night light data from 2013 to 2018

Foundations

Hainan Province (HNSK(ZD)19-119); Natural Science Foundation of Hainan Province (2019RCO16); Hainan University (kyqdsx1962)

Data publisher

Global Change Research Data Publishing & Repository, http://www.geodoi.ac.cn

Address

No. 11A, Datun Road, Chaoyang District, Beijing 100101, China

Data sharing policy

Data from the Global Change Research Data Publishing & Repository includes metadata, datasets (in the Digital Journal of Global Change Data Repository), and publications (in the Journal of Global Change Data & Discovery). Data sharing policy includes: (1) Data are openly available and can be free downloaded via the Internet; (2) End users are encouraged to use Data subject to citation; (3) Users, who are by definition also value-added service providers, are welcome to redistribute Data subject to written permission from the GCdataPR Editorial Office and the issuance of a Data redistribution license; and (4) If Data are used to compile new datasets, the ??ten per cent principal?? should be followed such that Data records utilized should not surpass 10% of the new dataset contents, while sources should be clearly noted in suitable places in the new dataset[6]

Communication and searchable system

DOI, CSTR, Crossref, DCI, CSCD, CNKI, SciEngine, WDS/ISC, GEOSS

3 Methodology

3.1 Study Area and Data Sources

The study area of ​​this paper is the 18 cities and counties in the land area of ​​Hainan Island (excluding Sansha city), which are Haikou city, Sanya city and Danzhou city, and 15 province-level counties. Hainan Island is located between 108??37¢E-111??03¢E and 18??10¢N-20??10¢N. The terrain of the whole island is low and flat all around, with a high tower in the middle, presenting a dome-long mountain topography. The original night light data of this dataset are mainly the NPP/VIIRS night light remote sensing monthly composite mean image during 2013-2018, which comes from the NPP/VIIRS sensor dataset in the National Oceanic and Atmospheric Administration (NOAA)[1], with a resolution of 500 m. Hainan Island??s administrative division data comes from the China Natural Resources Database[2]; Hainan Island??s demographic data comes from the Hainan Provincial Statistical Yearbook[7].

3.2 Technical Route

The technical route of this study includes three parts: NPP/VIIRS night light data acquisition, data processing and population spatialization modeling (Figure 1).

Although the monthly data of NPP/VIIRS night light data have eliminated the effects of lightning, moonlight irradiation and cloud coverage, there are still transient light sources such as aurora, fire, and ship lights and background noise. Therefore, it is necessary to process the original data to obtain stable annual average data. Therefore, NPP/VIIRS night light data processing includes the process of annual mean image synthesis, elimination of negative and extremely high values, removal of unstable light sources, and continuous correction[8,9].

Population spatialization modeling includes correlation analysis, regression modeling and correction. We toke the city and county to analyze the correlation between the night light brightness from 2013 to 2017 and the corresponding demographic data. The regression modeling analysis of demographic data and night light brightness values ​​shows that the R2 of the cubic linear function model is the highest, so the population spatialization model uses a cubic linear function. The 2018 demographic data was used to verify the 2018 population simulation results of the model. The results show that the average error of the overall simulated population of Hainan Island is 36.55%, and the error of individual cities and counties is much larger than the average error.

 

 

Figure 1  Data development technology roadmap

4 Data Results and Validation

4.1 Data Composition

The population spatial dataset of Hainan Island from 2013 to 2018 is composed of 26 data files, and the data is named in the form of ??subject + time??.

4.2 Data Products

After processing the original data through the above technical route, the Night lighting and census integrated 500 m raster population dataset of Hainan Island (2013-2018) with a resolution of 500 m´500 m is obtained, in which the data unit is the total population on the 500m grid. The characteristics of the spatial and temporal distribution of Hainan Island??s population from 2013 to 2018 from the dataset show that the population distribution of Hainan Island is scattered on the whole. The population is mainly distributed in coastal cities and counties, and mainly concentrated in the two cities of Haikou and Sanya. The population of central cities and counties is relatively sparse, and formed the spatial polar distribution pattern of north Haikou and south Sanya (Figure 2, 3). From 2013 to 2018, the population of Hainan Island has a relatively obvious growthtrend. The population increment is mainly distributed in the suburbs of cities and counties, and spreads from the urban center to the periphery, such as Haikou and Sanya are particularly obvious. On the whole, the growth population is mainly distributed in the urban areas of coastal cities and counties, the areas along the high-speed railways and the high-speed roads around the island (Figure 4).

 

Figure 2  Population spatialization of Hainan Island in 2013

 

4.3 Data Validation

The NPP/VIIRS night light data from 2013 to 2018 is used to retrieve the population of the past years through the model, and the errors are verified according to the statistical data, and the large errors are grouped and then modeled again. Based on the secondary modeling after error grouping, the accuracy of population simulation is significantly improved compared with the accuracy of the first simulation. The average relative error of the model inversion

 

Figure 3  Population spatialization of Hainan Island in 2018

 

 

Figure 4  Population growth of Hainan Island from 2013‒2018

 

results is 18.19%, and the regression coefficient R2 is above 0.9. The regression coefficient R2 results are close to the results of Li et al.[10] and Cao et al.[11]. The model error is similar to that of Zhao et al.[12], and higher than the average relative error of Wang et al.[13] and Chen et al.[14].

At the level of cities and counties, Wuzhishan city has a large error in the retrieve results . The reason is that Wuzhishan city is located in the hinterland of the central mountainous area of Hainan Island, and its urban topography is undulating and its population is sparse, so it is difficult to accurately retrieve its population spatial distribution only by night light data. According to the administrative divisions of Hainan province, the cities and counties are divided into prefecture-level cities, county-level cities, ordinary counties and ethnic minority autonomous counties. It can be found that the population simulation error of prefecture-level cities is the smallest, followed by ethnic minority autonomous counties. The error of county-level cities is slightly higher than that of minority autonomous counties, and the error of ordinary counties is the largest. There is no regular difference in population simulation accuracy between cities and counties with different administrative levels.

At the same time, Fu et al. conducted population regionalization based on the spatial distribution characteristics and regional differences of the population in China, and screened out land use types with strong correlation with population distribution as model parameters, established a multiple regression model to achieve population spatialization, and four factors of urban population density, traffic conditions, DEM and total amount control are used to calibrate the model[15], and finally obtained the national population distribution dataset. Among them, the average relative error of the population spatialization data of Hainan Island in 2010 was only 0.92%[16]. Therefore, these methods can be referenced in the future to combine night light data for population spatialization to improve accuracy.

 

Table 2  Model simulation results

Classification

Cities and counties

Statistical
population data
(ten thousand people)

First population simulation
(ten thousand people)

Error

Second population simulation
(ten thousand people)

Error

Prefecture
-level city

Haikou city

230.23

258.12

12.12%

194.61

15.47%

Sanya city

 77.39

121.09

56.46%

 74.51

 3.72%

Danzhou city

 99.84

 90.34

 9.52%

117.89

18.08%

Average error

26.03%

??

12.42%

County
-level city

Wenchang city

 56.89

 42.34

25.58%

 51.57

 9.35%

Qionghai city

 51.57

 43.34

15.97%

 52.66

 2.11%

Wanning city

 57.86

 47.28

18.28%

 57.13

 1.27%

Wuzhishan city

 10.71

 26.27

145.25%

 16.36

52.77%

Dongfang city

 42.97

 62.63

45.74%

 51.95

20.89%

Average error

50.16%

??

17.28%

Ordinary
county

Dingan county

 29.76

 30.05

 0.99%

 39.55

32.88%

Tunchang county

 26.85

 24.67

 8.11%

 35.09

30.68%

Chengmai county

 49.44

 57.02

15.33%

 69.19

39.94%

Lingao county

45.1

 35.25

21.84%

 44.32

 1.73%

Average error

11.57%

??

26.31%

Minority Autonomous County

Ledong county

 48.27

 48.94

 1.39%

 59.08

22.39%

Qiongzhong county

 18.02

 24.86

37.95%

 14.87

17.49%

Baoting county

 15.28

 26.68

74.59%

16.8

 9.93%

Lingshui county

 33.39

 52.65

57.67%

 43.06

28.95%

Baisha county

 17.34

 26.71

54.05%

 16.83

 2.91%

Changjiang county

 23.35

 36.68

57.07%

 27.26

16.75%

Average error

47.12%

??

16.40%

Overall average error

36.55%

??

18.19%

5 Discussion and Conclusion

NPP/VIIRS night lighting data population spatial modeling can basically reflect the spatial distribution of population, but the simulation accuracy of Wuzhishan city is poor. The reason may be due to the fact that Wuzhishan city is located in the central mountainous area, with large terrain undulations, which greatly affects the night light data, and the population of Wuzhishan city is small, which is far from other cities and counties[14].

To solve this problem, more detailed population zoning of the study area will be considered in the future. At the same time, combined with land use data, POI and other multi-source data[17], and multi geographical factor weighting method, spatial weighted regression or neural network models will be adopted to improve the inversion accuracy. This dataset is the 500-m resolution data of Hainan Island??s population based on NPP/VIIRS night light data retrieve. This dataset can reflect the spatial and temporal distribution of Hainan Island??s population from 2013 to 2018, and can provide an auxiliary basis for the current Hainan territorial spatial planning and related research.

 

Author Contributions

Han, N. L. designed the algorithms of dataset. Lai, X. L. collected and processed the night light data and designed the model.  Huang,P. J. made data verification.  Lai, X. L. wrote the data paper.

 

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1]      Li, D. R., Li, X. On noctilucent remote sensing data mining [J]. Journal of Surveying and mapping, 2015, 44(6): 591?C601.

[2]      Li, F., Mi, X. N., Liu, J., et al. Spatialization of GDP in Beijing based on NPP-VIIRS nighttime light data [J]. Remote sensing for land and resources, 2016, 28(3): 19?C24.

[3]      Li, F., Sun, G. T., Wang, Q. L., et al. Spatialization of power consumption at county level based on NPP-VIIRS nighttime light data [J]. Geomatics & Spatial Information, 2018, 41(4): 8?C11, 15, 18.

[4]      Gao, Y., Wang, H., Wang, P. T. Spatial analysis of coastal population based on population census and multi-source nighttime light data [J]. Resources science, 2013, 35(12): 2517?C2523.

[5]      Lai, X. L., Han, N. L., Huang, P. J. Night lighting and census integrated 500 m raster population dataset of Hainan Island (2013-2018) [J]. Digital Journal of Global Change Data Repository, 2021. https://doi.org/10.3974/geodb.2021.08.02.V1. https://cstr.escience.org.cn/CSTR:20146.11.2021.08.02.V1.

[6]      GCdataPR Editorial Office. GCdataPR data sharing policy [OL]. https://doi.org/10.3974/dp.policy.2014.05 (Updated 2017).

[7]      Hainan Bureau of Statistics, National Bureau of Statistics Hainan Survey Team. Hainan statistical yearbook [OL]. 2014-2019. http://www.hainan.gov.cn/hainan/tjnj/list3.shtml.

[8]      Hu, W. A., Liu, C. L., Zhan, Q. W. Synthesis method and comparative verification of nPP-VIIRS annual night light data in China [J/OL]. Journal of Guilin University of Technology, 2020: 1?C10 [2020-12-23].

[9]      Zhou, Y., Chen, Y., Liu, Y., et al. Generation and verification of NPP-VIIRS annual nighttime light data [J]. Remote sensing information, 2019, 34(2): 62?C68.

[10]    Li, X., Chen, Z. J., Wu, J. X., et al. Research on grid method of urban resident population based on night light data and spatial regression model [J]. Journal of geo-information science, 2017, 19(10): 1298?C1305.

[11]    Cao, L. Q., Li, P. X., Zhang, L. P. Urban population estimation based on DMSP/OLS nighttime light data: a case study of counties in Hubei province [J]. Remote Sensing Information, 2009(1): 83?C87.

[12]    Zhao, Y., Wang, L., Li, X. X., et al. Spatialization of population in Dalian based on NPP/VI RS night light and land use [J]. Resources development and market, 2018, 34(10): 1372?C1376.

[13]    Wang, X. X., Zhao, M. S, Xie, Y., et al. Spatialization of population data in Jiangsu province based on NPP- VIIRS night light remote sensing [J]. Journal of heilongjiang institute of technology, 2020, 34(3): 31?C35.

[14]    Chen, Q., Hou, X. Y., Wu, L. Comparative analysis of spatialization model of population based on land use data and night light data: a case study of high efficiency ecological economic zone in the Yellow River Delta [J]. Human geography, 2014, 29(5): 94?C100.

[15]    Fu, J. Y., Jiang, D., Huang, Y. H. Dataset of kilometer-grid population distribution in China [J]. Acta Geographica Sinica, 2014, 69(Sup.): 41?C44.

[16]    Fu, J. Y., Jiang, D., Huang, Y. 1 km grid population dataset of China [DB/OL]. Global Change Data Repository, 2014. https://doi.org/10.3974/geodb.2014.01.06.V1.

[17]    Zhao, X., Song, Y. Q., Liu, Y. L., et al. Spatialization of population based on satellite remote sensing and POI data: a case study of Guangzhou [J]. Tropical geography, 2020, 40(1): 101?C109.



[1] National Oceanic and Atmospheric Administration, U.S. https://ngdc.noaa.gov.

[2] China Natural Resources Database. http://www.naturalresources.csdb.cn/index.asp.

Co-Sponsors
Superintend