Table 9 Statistics of foundation of the dataset
Foundation
|
Dataset
|
%
|
No fund
|
53
|
15.45
|
One fund
|
162
|
47.23
|
More than one fund
|
128
|
37.32
|
Total
|
343
|
100
|
3.4 Statistics of Datasets by Founding Agencies
With or without foundations (Table 9): Of all the published datasets, 84.55% of them were founded, 15.45% of them were conducted by self support. Among the with-founding group, 47.23% of the datasets were funded by a single funding project; 37.32% of the datasets were funded by two or more funding projects, those datasets were usually large, broad-covered, time-serial datasets.
Foundation details (Table 10): Of all 343 datasets from 532 projects (programs) , 139 projects (26.13%) were from Ministry of Science and Technology of P. R. China; 123 projects (23.12%) were from Chinese Academy of Sciences and 118 projects (22.18%) were from Natural Science Foundation of China.
Table 10 Statistics of funds supporting datasets development
Foundations
|
Number of Funds
|
%
|
Foundations
|
Number of Funds
|
%
|
Ministry of Science and
Technology
|
139
|
26.13
|
National Development and Reform Commission
|
2
|
0.38
|
Chinese Academy of Sciences
|
123
|
23.12
|
Ministry of Environmental Protection
|
2
|
0.38
|
National Natural Science
Foundation of China
|
118
|
22.18
|
State Forestry Administration
|
2
|
0.38
|
Province/Company
|
91
|
17.11
|
National Space Administration
|
2
|
0.38
|
Ministry of Education
|
19
|
3.57
|
China Earthquake Administration
|
2
|
0.38
|
State Oceanic Administration
|
10
|
1.88
|
Ministry of Water Resources
|
1
|
0.19
|
National Social Science Fund
|
6
|
1.13
|
Ministry of Personnel
|
1
|
0.19
|
Ministry of Land and Resources
|
3
|
0.56
|
China National Tourism Administration
|
1
|
0.19
|
China Meteorological
Administration
|
2
|
0.38
|
|
|
|
Abroad
|
8
|
1.50
|
Total
|
532
|
100
|
4 Association of Datasets, Research Papers, and Data Papers
In order to promote the informed research data re-use, products in three dimensions have been considered: research datasets, research data paper and original research paper[7]. GCdataPR was the first case in World Data System (WDS) of International Council of Sciences (ICSU) and China GEO[8].
The publication of research data through the platform of Global Change Research Data Publishing and Repository (GCdataPR, http://www.geodoi.ac.cn) gives authors a new choice to protect their data and enjoy relevant credit. By the end of 2017, 37 academic journals have participated GCdataPR partner team, they can enhance their visibility through recommendation of research data from the papers published in their journals. GCdataPR was on the top 50 of Big Data Products, Services and Solutions of China (the only one from scientific research and education) in 2016[9] and searchable list of Data Citation Index of Clarivate Analytics since 2016.
Development of research data can be very complex and may not be fully instructed in original research papers. Therefore, it is necessary to provide more information to new users in order to apply the data in new research fields. This is so called data paper, published through Journal of Global Change Data and Discovery.
To give potential data users more insight about research data, the online hot link for each dataset to original research paper is provided at the GCdataPR platform. This means that data can be re-used on a transparent background, and credits and responsibilities of each part can be clarified in the very beginning.
It is worth noting that the above three steps have not been taken shoulder by shoulder. GCdataPR was launched in 2014, part of its data papers were published in Journal of Geography. This lag of data paper publication remained until March, 2017, when the new journal delivered its first issue. With the completion of trinity, more data papers will be published through the new journal. Table 11 gives a brief for this trend.
5 Statistics of Data Sharing
Table 11 Statistics of datasets with its informaiton
Year
|
Dataset
|
Data Paper
|
Res. Paper
|
Total Paper
|
2014
|
36
|
20
|
1
|
21
|
2015
|
34
|
0
|
0
|
0
|
2016
|
190
|
0
|
95
|
95
|
2017
|
83
|
73
|
23
|
96
|
Total
|
343
|
93
|
119
|
212
|
5.1 Datasets
Table 12 presented the major static items of data sharing during 2014-2017. More than 880,000 users visited the system, accumulatively. Through over 30,000 IPs, users have downloaded more than 2TB datasets in 120,000 times. The data users (IP) and the data file downloaded increased year by year (Figure 1, Figure 2).
Table 12 Statistics of data sharing through the GCdataPR
Year
|
Visitors
|
Accum.
Visitors
|
Added Data
Users (IP)
|
Accum. Data Users(IP)
|
Data Files
Downloaded
|
Accum. Data
File downloaded
|
Data Downloaded (GB)
|
Accum. Data Downloaded(GB)
|
2014
|
332,846
|
332,846
|
174
|
174
|
822
|
822
|
25.79
|
25.79
|
2015
|
124,668
|
457,514
|
9,764
|
9,938
|
23,726
|
24,548
|
976.11
|
1,001.90
|
2016
|
339,870
|
797,384
|
10,701
|
20,639
|
47,867
|
72,415
|
703.31
|
1,705.21
|
2017
|
83,434
|
880,818
|
16,158
|
36,797
|
53,493
|
125,908
|
970.49
|
2,675.70
|
Figure 1 Statistics of annual added and cumulative data users of GCdataPR (2014–2017)
Figure 2 Yearly and cumulative data file downloaded from GCdataPR (2014-2017)
5.2 Data Users by Countries
Data users from 76 countries all over the world have downloaded the datasets. Most of the users were from China (32,749, 89%), followed by the United States, Australia, Japan, and Malaysia (Figure 3).
Figure 3 Map of the GcdataPR data users (IP)
6 Discussion
Global change research data is comprehensive and inter-disciplinary, just like Global change research. There is no doubt that the datasets and the data paper publishing together with the citation of the original research paper, could provide the scientific community with a new possibility to facilitate data quality assurance and informed research data re-use.
However, at its preliminary and exploratory stage, several key issues should be tackled further in a wide range of fields, such as data publishing, archiving, disseminating, sharing, application, citation, measurement of impact, etc. The most marked issues are: scientific community as a whole has not given enough credit to research data published; lack of enforcement of intellectual property protection for research data; inconsistency of data citation, the data content may need cover more regions and the data authors may expand to more communities.
Although the IGSNRR/CAS, SGC try hard to enhance the capacity building and communication in research communities and universities under the support of Chinese Academy of Sciences, the awareness of research data publishing should be more involved, especially from founding agencies; research institutes, universities. The due credit should be given to the each of stakeholders related to the published datasets.
There are still many confusing issues about data intellectual property protection among the scientific communities and the society. For instance, what makes up an innovative dataset and what should be protected by law? How to balance the protection and sharing? It is valuable to have such course as a basic education in universities about the data intellectual property.
Unlike paper citation, there are no common understanding the guidelines and standards for data citation. Usually, the data used in a paper may be specifically cited in terms of “acknowledge”, or in “data and method”, or just overlooked. In short, there was no such a tradition as paper citation before. That was history. In our new system, the data users are requested to formally cite both dataset according to the dataset citation format and the data paper according to the data paper citation format.
References
[1] Future Earth (2013). Future earth initial design: report of the transition team [R]. Paris: International Council for Science (ICSU).
[2] Experts Group of National Major Research Program on Global Change Research. Global change strategy research report of China [R]. 2009.
[3] USGCRP. The national global change research plan 2012–2021: a strategic plan for the U.S. global change research program [R]. 2012.
[4] Editorial office of Journal of Global Change Data & Discovery. Guidelines of global change research data publishing and repository [J]. Journal of Global Change Data & Discovery, 2017, 1(3): 253-261. DOI: 10.3974/geodb.2017.03.01.
[5] Geographical Society of China. Global change research data publishing & sharing rankings (Top 10) [J]. Journal of Global Change Data & Discovery, 2017, 1(2): 249-251. DOI: 10.3974/geodp.2017.02.23.
[6] Jiang, D., Song, X. F., Zhang, G. Y. A new milestone of scientific data sharing in China [R]. Journal of Global Change Data and Discovery, 2017, 1(2): 246-248. DOI: 10.3974/geodp.2017.02.22.
[7] Liu, C. Global change research data publishing and repository [J]. Acta Geographica Sinica, 2014, 69(Sup): 3-11.
[8] Liu, C., Guo, H. D., Uhlir, P. F., et al. GCdataPR: infrastructure for data publishing repository & sharing in/for/with developing countries [J]. Journal of Global Change Data & Discovery, 2017, 1(1): 3-11. DOI: 10.3974/geodp.2017.01.02.
[9] Shi, R. X., Zhu, Y. Q., Jiang, D., et al. GCdataPR: an outstanding case in “Excellent Products, Services and Applications of Big Data in China” [R]. Journal of Global Change Data & Discovery, 2017, 1(2): 245. DOI: 10.3974/geodp.2017.02.21.