Journal of Global Change Data & Discovery2025.9(1):87-95

[PDF] [DATASET]

Citation:Zhan, L. F., Dong, B. H., Xu, B., et al.Dataset Development on Centennial Homogenized Monthly/Yearly Mean Temperature at Jiujiang Meteorological Station, Jiangxi Province of China (1924–2023)[J]. Journal of Global Change Data & Discovery,2025.9(1):87-95 .DOI: 10.3974/geodp.2025.01.10 .

Dataset Development on Centennial Homogenized Monthly/Yearly Mean Temperature at Jiujiang Meteorological Station, Jiangxi Province of China (1924?C2023)

Zhan, L. F.1,2  Dong, B. H.1,2  Xu, B.1,2*   Li, Y.1,2   Xin, J. J.1,2   Wang, L. Y.3

1. Jiangxi Provincial Climate Center, Nanchang 330096, China;

2. Nanchang National Climate Observatory, Nanchang 330200, China;

3. Jiangxi Vocational and Technical College of Information Application, Nanchang 330043, China

 

Abstract: Long-term homogenized observational time series are crucial for accurate assessment and attribution of climate change. However, at various times, most meteorological stations in China are affected by factors such as station relocation, instrument replacement, and environmental change, resulting in heterogeneity in observational data series. In this study, based on multisource monthly mean temperature data, the monthly mean temperature data recorded at Jiujiang Meteorological Station in Jiangxi Province (China) during 1924?C2023 were interpolated using the standardized sequential method, taking data from Wuhan Meteorological Station as reference. The interpolated data were subjected to a homogenization test using the penalized maximal F test method, and then corrected using the quantile matching method. Thus, the Centennial homogenized monthly/yearly mean temperature dataset of Jiujiang Meteorological Station, Jiangxi Province, China (1924?C2023) was constructed. Comparative analysis with centennial homogenized temperature data of neighboring meteorological stations revealed a correlation coefficient of >0.9, verifying that the construction method is scientific and that this dataset has certain reliability. The dataset is archived in .txt data format, and consists of 2 data files with data size of 16.5 KB.

Keywords: Jiujiang; temperature; centennial series; interpolated; homogenized

DOI: https://doi.org/10.3974/geodp.2025.01.10

Dataset Availability Statement:

The dataset supporting this paper was published and is accessible through the Digital Journal of Global Change Repository at: https://doi.org/10.3974/geodb.2024.12.03.V1.

1 Introduction

In the context of global climate change, study of regional climatic variability is of great importance for improved understanding of the spatial characteristics of climate change and the associated potential impacts[1,2]. The East Asian monsoon region is one area that is highly sensitive to the effects of global climate change, where regional climate change is driven directly by global warming and multiple other factors such as the regional atmospheric circulation and changes in land use cover[3]. Jiujiang is an important city in the middle-lower reaches of the Yangtze River (China) that has typical East Asian monsoon climate characteristics. Its long-term climate records provide a valuable basis for studying the characteristics of the regional climate and their relationship with global climate change. However, owing to the diverse sources of historical meteorological observational data, a number of missing measurements, and the inhomogeneity inherent in the time series, high-quality datasets that accurately reflect the long-term trend of climate change in the Jiujiang region are scarce[4].

Constructing continuous, reliable, and long-term time series of homogenized meteorological data forms the basis of climate change research[5,6]. Homogenization is the process of systematically correcting and unifying multisource meteorological observational data to eliminate systematic differences and nonhomogeneous characteristics between different data elements so that they reflect the true characteristics of climate change. In recent years, many studies have achieved remarkable results using homogenization techniques to correct historical meteorological data[7?C10]. However, in specific applications, it remains necessary to combine regional characteristics and historical data characteristics for more detailed and targeted processing.

In this study, taking the Jiujiang Meteorological Station (Jiangxi Province, China) as the research object, multisource meteorological data from 1924?C1938 and from 1951?C2023 were integrated with homogenized data from neighboring stations from 1924?C2016. Then, a homogenized monthly temperature dataset for Jiujiang Meteorological Station from 1924?C2023 was constructed using the standardized sequential method, a homogenization test, and breakpoint correction technology. Finally, the temperature change characteristics at Jiujiang Meteorological Station over the past 100 years were revealed. This study provides important data support for climate change research in the Jiujiang region, and represents a reference for the processes of homogenization and sequence construction of meteorological data in other regions.

2 Metadata of the Dataset

The name, authors, geographical region, data period, temporal resolution, spatial resolution, dataset composition, data publication and sharing service platform, and data sharing policies of the Centennial homogenized monthly/yearly mean temperature dataset of Jiujiang Meteorological Station, Jiangxi Province, China (1924?C2023)[11] are provided in Table 1.

3 Data Development Methods

3.1 Data Sources

The Jiujiang Meteorological Station data from January 1924 to March 1938 were obtained from the ??China Temperature Data??[13] record compiled by the Joint Data Office of the

Table 1  Metadata summary of the Centennial homogenized monthly/yearly mean temperature dataset at Jiujiang Meteorological Station, Jiangxi Province, China (1924?C2023)

Items

Description

Dataset full name

Centennial homogenized monthly/yearly mean temperature dataset at Jiujiang Meteorological Station, Jiangxi Province of China (1924?C2023)

Dataset short name

MeanTempJiujiang1924?C2023

Authors

Zhan, L. F., Jiangxi Provincial Climate Center, lf.zhan@foxmail.com

Xu, B., Jiangxi Provincial Climate Center, 1176325432@qq.com

Dong, B. H., Jiangxi Provincial Climate Center, dongbaohua_jx@163.com

Li, Y., Jiangxi Provincial Climate Center, 908791309@qq.com

Geographical region

Jiujiang

Year

1924?C2023

Temporal resolution

Month, year

Spatial resolution

Meteorological station

Data format

.txt

Data size

16.5 KB

Data files

The monthly/yearly mean air temperature (unit ??) of Jiujiang Meteorological Station from 1924 to 2023

Foundations

China Meteorological Administration (CMA2024QN15); Science and Technology Department of Jiangxi Province (20223BBG71019, 2023KYG01001); Shanghai Meteorological Service (QYHZ202106); Guangdong Meteorological Service (ZJLY202312); Nanchang National Climatic Observatory (JX2023Z09); Jiangxi Meteorological Service (JX2022ZHHFXPC06)

Computing environment

Python, R

Data publisher

Global Change Research Data Publishing & Repository, http://www.geodoi.ac.cn

Address

No. 11A, Datun Road, Chaoyang District, Beijing 100101, China

Data sharing policy

(1) Data are openly available and can be free downloaded via the Internet; (2) End users are encouraged to use Data subject to citation; (3) Users, who are by definition also value-added service providers, are welcome to redistribute Data subject to written permission from the GCdataPR Editorial Office and the issuance of a Data redistribution license; and (4) If Data are used to compile new datasets, the ??ten percent principal?? should be followed such that Data records utilized should not surpass 10% of the new dataset contents, while sources should be clearly noted in suitable places in the new dataset[12]

Communication and searchable system 

DOI, CSTR, Crossref, DCI, CSCD, CNKI, SciEngine, WDS, GEOSS, PubScholar, CKRSC

 

China Central Meteorological Administration and the Institute of Geology and Geophysics (Chinese Academy of Sciences), which was published in 1954. The data from 1951?C2023 were extracted from the ??Tianqing?? meteorological big data cloud platform[14] developed by the National Meteorological Information Center. The centennial homogenized data of neighboring reference stations were taken from the dataset established by the Institute of Atmospheric Physics of the Chinese Academy of Sciences, after correcting the centennial homogenized monthly temperature series of 32 stations in China. This dataset well represents the characteristics of large-scale climate change in China over the past century[15]. The geographical location of Jiujiang Meteorological Station is shown in Figure 1.

3.2 Algorithm Principles

3.2.1 Data Interpolation

The temperature series of Jiujiang Meteorological Station prior to 1951 was constructed

based on monthly mean temperature data in the ??China Temperature Data??. However, the ??????:  

Figure 1   Map of the geo-location of Jiujiang Meteorological Station
temperature data record of Jiujiang Meteo­rological Station after preliminary integration showed continuous missing measurements from April 1938 to December 1950, with a rate of missing data up to 12.8%. To restore data integrity, the missing data were interpolated using the standardized sequential method, and neighboring stations with complete long-term time series sequences were taken as reference stations.

The calculation under the standardized sequential method can be expressed as follows[16]:

                                                                                                   (1)

                                                                                                       (2)

                                                                                                  (3)

where denotes the standardized sequence,  denotes the mean standardized sequence of the reference stations, represents the data to be interpolated in the ith month, j denotes the jth reference station, refers to the data of the ith month of the jth station,  and are the multiyear mean and the standard deviation of the data of the ith month of the jth station, respectively, n represents the number of reference stations, and  and  are the multiyear mean and the standard deviation of the data of the ith month of the station to be interpolated, respectively.

According to the requirements of reference[17], the Jiujiang Meteorological Station should be used as the reference when selecting neighboring stations, and the data of neighboring meteorological stations within a horizontal distance of 300 km should be selected. Regarding the selected meteorological station, the starting observation year must be earlier than 1924, the data integrity must be high, and the site elevation should be similar to that of Jiujiang Meteorological Station. After comprehensive consideration, Wuhan Meteorological Station was finally selected as the reference station for the neighboring stations.

3.2.2 Homogenization Test and Correction

(1) Breakpoint test method

It was not until the early 1950s that China had relatively complete and systematic observational data. Prior to 1951, owing to the diverse sources of data recorded by China??s meteorological observation stations, the continuity, consistency, and standardization of the time series were inadequate. The lack of a reliable basis on which to evaluate the rationality of nonhomogeneous test results before 1951 further increased the difficulty of constructing a reference time series that truly reflected local climate change.

RHtest V4 software package is a statistical tool designed for homogeneity testing and adjustment of climate data. It is primarily used to detect and correct biases in climatic time series data (e.g., temperature, precipitation, etc.) caused by non-climatic factors (such as station relocations, instrument changes, observation method updates, etc.), thereby improving the reliability and consistency of the data. The penalized maximum F test in the RHtest V4 software package is a test method that can be performed without a reference sequence. Its primary advantage is that it can substantially reduce test bias caused by nonhomogeneous reference sequences and incomplete metadata information. Previous studies fully demonstrated the effectiveness and reliability of this method. Therefore, based on the monthly mean temperature series, this study adopted the penalized maximum F test method to conduct a homogenization test on the monthly mean temperature series of Jiujiang Meteorological Station from 1924?C2023. The test procedure followed the method discussed in reference[18].

(2) Breakpoint correction method

The quantile matching method[19] was used to correct the monthly data series. This method can ensure that the segments in the sequence to be tested match each other in the empirical distribution after eliminating the linear trend. In this study, 2 types of breakpoints were corrected: those before 1954 and those after 1954.

1) Breakpoints before 1954. Detected in both the yearly and the monthly series, these breakpoints represented notably discontinuous points. However, owing to lack of supporting detailed metadata, the correction positions were determined based on the breakpoint times detected in the monthly data series.

2) Breakpoints detected after 1954. Found in the yearly series or in the monthly series, these breakpoints were identified with supported from corresponding metadata. If the time of occurrence of these breakpoints differed from the information recorded in the station metadata by no more than one year, the position of the breakpoint was replaced and adjusted according to the specific time of the metadata record[6].

??????:  

Figure 2  Flowchart of the dataset development
The test adopted a 95% confidence level to ensure the reliability of the test results. This method is helpful for reducing test errors caused by nonhomogeneous reference sequences and lack of detailed metadata information, thereby improving the accuracy of the sequence correction.

3.3 Technical Roadmap

The technical roadmap for the development of the Centennial homogenized monthly/yearly mean temperature dataset of Jiujiang Meteorological Station, Jiangxi Province, China (1924?C2023) is shown in Figure 2. The meteorological observational data from 1924?C1938 and from 1951?C2023 were integrated with the statistically homogenized correction data from 1924?C2016. Then, the standardized sequential method was used to interpolate missing data in the Jiujiang Meteor­ological Station record. Based on the homogeni­zation test and the correction procedure, the yearly homogenized meteorological data series of the Jiujiang Meteorological Station from 1924?C2023 was constructed.

4 Data Results and Validation

4.1 Dataset Composition

This dataset contains monthly/yearly mean temperature data of the Jiujiang Meteorological Station from 1924?C2023. The monthly average temperature lists the data for year, month, and corresponding monthly average temperature (??C); the annual average temperature lists the year and corresponding annual average temperature (??C). The dataset is archived in the .txt format, and consists of 2 data files with data size of 16.5 KB.

4.2 Data Results

In this study, a complete centennial time series temperature dataset was constructed. As shown in Figure 3, prior to the 1950s, the yearly mean temperature at Jiujiang Meteorological Station showed a downward trend, whereas from the 1950s to the early 1990s there was no obvious change. From the 1990s, the yearly mean temperature began to show a clear upward trend. Overall, the rate of change during 1924?C2023 was approximately 1.0 ??/100a. As shown in Figure 4, obvious seasonal changes occurred in the distribution of the monthly mean temperature at Jiujiang Meteorological Station. The temperature was markedly higher in summer (June?CAugust) and notably lower in winter (December?CFebruary), showing typical monsoon climate characteristics. Additionally, from the late 20th century to the early 21st century, both the duration and the intensity of high temperatures in summer increased. This change is consistent with the characteristics of regional climate change under the background of global warming.

 

Figure 3  Statistical analysis of the yearly mean temperature at Jiujiang Meteorological Station (1924?C2023)

Figure 4  Statistics of the homogenized monthly mean temperature data at Jiujiang Meteorological Station (1924?C2023)

??????:  

Figure 5  Statistical analysis of temperature change trend at Jiujiang Meteorological Station (1924?C2023)

The trend of temperature change in different periods and the results of the statistical significance test are shown in Figure 5. Red columns indicate significant temperature change at the significance level of ??=0.05; blue columns indicate temperature change that did not reach the significance level. It is evident from Figure 5 that temperature showed a statistically significant upward trend during January?CJune (especially during February?CApril), October?CNovember, and throughout the entire year. This may be related to the intensified trend of climate warming, with faster warming in winter and spring, and delayed heat release in autumn, leading to significant temperature increases during these periods. During July?CSeptember and in December, the temperature changed only a little, and it even showed a downward trend in August. In these periods, the change trend did not reach the significance level. This is likely due to the fact that in summer, the region is under the control of stable weather systems such as the subtropical high, resulting in relatively small interannual variations in temperature. In December, which is early winter, cold air activities are frequent but their intensity and duration vary from year to year, leading to non-significant temperature changes.

4.3 Data Validation

The correlation coefficient, root mean square error, and standard deviation can reflect the performance of this dataset in different periods. As shown in Table 2, the correlation coefficient of this dataset for the annual average data from 1924 to 2023 was 0.94, indicating strong correlation between this dataset and the reference station. The full-year root mean square error of 0.27 indicates that the mean deviation between this dataset and the reference station was small; the standard deviation of 0.59 indicates that the volatility of the full-year data was low.

For the monthly data, the range of the correlation coefficient was 0.88?C0.98, and the correlation coefficient of the winter half year was higher than that of the summer half year, indicating that this dataset has stronger correlation with the reference station in the winter half year. The root mean square error range was 0.46?C1.13, with the smallest (largest) value in spring (winter), indicating that the value in spring is closest to that of the reference station, while the error is largest in winter. This is due to the fact that in spring, temperatures gradually warm up and the changes are relatively smooth, resulting in smaller root-mean-square errors (RMSE) compared to the reference station. In winter, temperatures are lower and are affected by cold air, leading to greater temperature fluctuations and larger RMSE compared to the reference station. The standard deviation range was 0.97?C1.9, with the lowest (highest) value in June (February), indicating less (more) fluctuation of the data.

 

Table 2  Comparison of monthly and yearly mean temperatures of Jiujiang Meteorological Station  (1924?C2023) with those of reference stations

Monthly/Yearly

Correlation coefficient

Root mean square error (??C)

Standard deviation (??C)

1

0.97

1.09

1.42

2

0.98

0.77

1.90

3

0.97

0.46

1.60

4

0.94

0.50

1.30

5

0.93

0.50

1.18

6

0.91

0.82

0.97

7

0.92

0.53

1.22

8

0.90

0.63

1.11

9

0.88

0.55

1.09

10

0.91

0.63

1.09

11

0.95

0.89

1.24

12

0.97

1.13

1.39

Annual average

0.94

0.27

0.59

5 Discussion and Conclusion

In this study, a homogenized monthly/yearly mean temperature dataset for 1924?C2023 was constructed for Jiujiang Meteorological Station in Jiangxi Province, China. The ??China Temperature Data?? record, ??Tianqing?? meteorological big data cloud platform of the National Meteorological Information Center, and centennial homogenized data of neighboring stations were comprehensively applied to interpolate missing data using the standardized sequential method. Additionally, RHtest V4 software was used to test and correct the homogeneity of the sequence, thereby ensuring continuity and reliability of the time series. Verification results showed that this dataset has strong correlation with the reference station (yearly correlation coefficient: 0.94) and a small root mean square error (0.27). Therefore, it can better reflect the characteristics of climate change in the Jiujiang region over the past century.

This study found that temperature change at the Jiujiang Meteorological Station has obvious seasonal characteristics, with statistically significant higher temperatures in summer (June?CAugust) and statistically significant lower temperatures in winter (December?C February), reflecting the typical characteristics of the East Asian monsoon climate. From the long-term trend (except in August), the mean temperature at Jiujiang Meteorological Station in each month showed an upward trend, especially during February?CApril when the most statistically significant temperature rise occurred, which is consistent with the regional characteristics of large temperature rise in spring and winter under the background of global warming. However, the temperature change during July?CSeptember was small, especially in August when there was a downward trend, and it failed to reach the significance level, which might reflect the complexity of the region being affected by local climate change or monsoon variability in summer. Additionally, both the duration and the intensity of high temperatures in summer from the late 20th century to the early 21st century increased, further indicating that the regional climate is being affected profoundly by global warming.

Despite major progress being achieved, this study had certain limitations. For example, owing to the diverse sources and insufficient metadata information of meteorological data prior to 1951, the verification and correction of breakpoints depended on the statistical characteristics of the series, which might have had certain impact on the accuracy of the results. Moreover, the potential impact of extreme climatic events on data homogeneity needs further exploration. In future studies, higher-resolution climate simulations should be combined with observational data to examine the driving mechanism and response characteristics of regional climate change, thereby providing more precise scientific support for the formulation of policies intended to address climate change.

 

Author Contributions

Xu, B. and Dong, B. H. completed the overall design for the development of the dataset. Zhan, L. F. collected and processed the Jiujiang temperature data, and designed the model and the algorithm. Zhan, L. F. and Li, Y. completed the data verification. Zhan, L. F., Xin, J. J. and Wang, L. Y. wrote the data paper.

 

Conflicts of Interest

The authors declare no conflicts of interest.

 

References

[1]        Arnella, N. W., Lowe, J. A., Challinor, A. J., et al. Global and regional impacts of climate change at different levels of global temperature increase [J]. Climatic Change, 2019, 155(3): 377?C391.

[2]        Pan, X. D. A review of future climate change research based on regional climate models [J]. Plateau Meteorology, 2018, 37(5): 1440?C1448.

[3]        Ren, G., Chan, J. C. L., Kubota, H., et al. Historical and recent change in extreme climate over East Asia [J]. Climatic Change, 2021, 168(3): 1?C19.

[4]        Zhang, J. L., Liu, B., Ren, S. Q., et al. A 4 km daily gridded meteorological dataset for China from 2000 to 2020 [J]. Scientific Data, 2024, 11(1): 1230.

[5]        Tang, G. L., Ding, Y. H., Wang, S. W., et al. Comparative analysis of temperature curves in China over the past century [J]. Advances in Climate Change Research, 2009, 5(2): 71?C78.

[6]        Si, P., Hao, L. S., Luo, C. J., et al. Interpolation and homogeneity correction of missing temperature records at Baoding Meteorological Station in Hebei [J]. Advances in Climate Change Research, 2017, 13(1): 41?C51.

[7]        Li, Q. X. Current status and prospects of homogeneity research in Chinese climate data [J]. Meteorological Science and Technology, 2016, 6(3): 67?C74.

[8]        Shen, Y. L. Construction and evaluation of a high-resolution meteorological element grid dataset for Qinghai Province [J]. Meteorological Science and Technology, 2023, 13(4): 29?C33.

[9]        Wang, H. J., Tu, S. Y., Chen, Z. H. Experiment and error analysis of interpolation methods for missing daily temperature data [J]. Meteorology, 2008, 34(7): 83?C91.

[10]     Wang, Y. P. Comparison and homogeneity analysis of observation data from old and new meteorological stations in Gansu Province [J]. Plateau Meteorology, 2023, 42(2): 506?C514.

[11]     Zhan, L. F., Xu, B., Dong, B. H., et al. Centennial homogenized monthly/yearly mean temperature dataset at Jiujiang Meteorological Station, Jiangxi Province of China (1924?C2023) [J/DB/OL]. Digital Journal of Global Change Data Repository, 2024. https://doi.org/10.3974/geodb.2024.12.03.V1.

[12]     GCdataPR Editorial Office. GCdataPR data sharing policy [OL]. https://doi.org/10.3974/dp.policy.2014.05 (Updated 2017).

[13]     Central Weather Bureau. China Temperature Data [M]. Beijing: Central Weather Bureau, 1954.

[14]     China Meteorological Administration. CAAC meteo data services [OL]. http://www.weatherdt.com. cn/ datasource.html.

[15]     Institute of Atmospheric Physics, Chinese Academy of Sciences. CAAC meteo data services [OL]. http://www.weatherdt.com.cn/datasource.html.

[16]     Li, Z., Yan, Z. W., Cao, L. J., et al. Further-adjusted long-term temperature series in China based on MASH [J]. Advances in Atmospheric Sciences, 2018, 35(8): 909?C917.

[17]     Yu, Y., Li, J., Ren, Z. H., et al. Application of standard series method in interpolation of missing daily average temperature data [J]. Meteorology, 2012, 38(9): 1135?C1139.

[18]     Si, P., Xie, Y. Y. Homogeneity analysis of total solar radiation data in Tianjin [J]. Climate and Environment Research, 2015, 20(3): 269?C276.

[19]     Wang, X. L., Chen, H., Wu, Y., et al. New techniques for the detection and adjustment of shifts in daily precipitation data series [J]. Journal of Applied Meteorology and Climatology, 2010, 49(12): 2416?C2436.

 

 

 

 

Co-Sponsors
Superintend