Journal:Big data management for cloud-enabled geological information services

From LIMSWiki
Revision as of 22:00, 19 February 2018 by Shawndouglas (talk | contribs) (Saving and adding more.)
Jump to navigationJump to search
Full article title Big data management for cloud-enabled geological information services
Journal Scientific Programming
Author(s) Zhu, Yueqin; Tan, Yongjie; Luo, Xiong; He, Zhijie
Author affiliation(s) China Geological Survey, Ministry of Land and Resources, University of Science and
Technology Beijing, Beijing Key Laboratory of Knowledge Engineering for Materials Science
Editors Liu, A.
Year published 2018
Volume and issue 2018(2018)
Page(s) 1327214
DOI 10.1155/2018/1327214
ISSN 1875-919X
Distribution license Creative Commons Attribution 4.0 International
Website https://www.hindawi.com/journals/sp/2018/1327214/
Download http://downloads.hindawi.com/journals/sp/2018/1327214.pdf (PDF)

Abstract

Cloud computing as a powerful technology of performing massive-scale and complex computing plays an important role in implementing geological information services. In the era of big data, data are being collected at an unprecedented scale. Therefore, to ensure successful data processing and analysis in cloud-enabled geological information services (CEGIS), we must address the challenging and time-demanding task of big data processing. This review starts by elaborating the system architecture and the requirements for big data management. This is followed by the analysis of the application requirements and technical challenges of big data management for CEGIS in China. This review also presents the application development opportunities and technical trends of big data management in CEGIS, including collection and preprocessing, storage and management, analysis and mining, parallel computing-based cloud platforms, and technology applications.

Introduction

In the era of big data, the data-driven modeling method enables us to exploit the potential of massive amounts of geological data easily.[1][2][3] In particular, by mining the data scientifically, one can offer new services that bring higher value to customers. Furthermore, it is now possible to implement the transition from digital geology to intelligent geology by integrating multiple systems in geological research through the use of big data and other technologies.[4]

The application of geological data management in the cloud makes it possible to fully utilize structured and unstructured data, including geology, minerals, geophysics, geochemistry, remote sensing, terrain, topography, vegetation, architecture, hydrology, disasters, and other digital geological data distributed in every place on the surface of the earth.[4][5] Moreover, the geological cloud will enable the integration of data collection, resource integration, data transmission, information extraction, and knowledge mining, which will pave the way for the transition from data to information, from information to knowledge, and from knowledge to wisdom. In addition, it supports data analysis, mining, organization, and management services for the scientific management of land resources, prospecting breakthrough strategic action and social services, while conducting multilevel, multiangle, and multiobjective demonstration applications on geological data for government decision-making, scientific research, and public services.[5]

Big data technologies are bringing unprecedented opportunities and challenges to various application areas, especially to geological information processing.[2][6][7] Under these circumstances, there are some advancements achieved in the development of this area.[8][9] Furthermore, from various disciplines of science and engineering, there has been a growing interest in this research field related to geological data generated in the geological information services (GIS). We analyzed the number of those documents indexed in the “Web of Science” research database.[10] In Figures 1 and 2, we can easily find that, in the past ten years, the number of those documents in which “geological data” is in the title and in the topic is increasing, respectively. Hence, geological data analysis in GIS is an interesting and important research topic currently.


Fig1 ZhuSciProg2018 2018-2018.png

Figure 1. The trend of the number of documents in which “geological data” is in the title, from 2007 to 2016

Fig2 ZhuSciProg2018 2018-2018.png

Figure 2. The trend of the number of documents in which “geological data” is in the topic, from 2007 to 2016.

Considering the development status of cloud-enabled geological information services (CEGIS) and the application requirements of big data management analysis, this article describes the significant impact and revolution on GIS brought by the advancement of big data technologies. Furthermore, this article outlines the future application development and technology development trend of big data management analysis in CEGIS.

The remainder of this article is organized as follows. In the next section we provide a review on CEGIS, with an emphasis on the descriptions for the system architecture and those requirements from big data management. Then, the challenges for big data management in CEGIS are presented. The key technologies and trends on big data management in CEGIS are analyzed afterwards, and finally we draw conclusions from the research.

Review on cloud-enabled geological information services

The construction of a geological cloud differs from the current big data analysis based on the internet of things (IoT). Having a deep understanding of data characteristics is necessary to collect, process, analyze, and interpret data in different fields, because the nature and types of data vary in different fields and in different problems. Geology is a data-intensive science, and geological data are characterized with multisource heterogeneity, spatiotemporal variation, correlation, uncertainty, fuzziness, and nonlinearity. Therefore, the geological cloud has a certain degree of confidentiality and it is highly domain-specific; meanwhile, it is developed on the basis of a large amount of geological data accumulated over a long period of time.[5][11] There are many real-time data generated from geological disasters and the geological environment. The geological cloud includes core basic data, which can be divided into three parts: an existing structured database, some unstructured data, and public application data. Therefore, it is important to take good advantage of the existing traditional structured data, use the big data technologies to deal with the relevant unstructured data, and also consider the peripheral public data.

Geological big data are multidimensional, and they consist of both structured and unstructured data.[12] The technical methods of big data analysis differ greatly from those of professional databases. Long-term geological survey and study have yeilded years of geological information, forming a rich and professional database, which is an important fundamental assurance for land and resources science management, geological survey, and geological information public service.[13] This “professional cloud” objectively requires technology research and development, such as the construction of a professional local area network, a data sharing platform, and geological big data visualization services. Hence, the construction of a geological cloud service is closely related to land resource management, deployment decisions, and the application demand of public service. The key technologies of research and development include the following: unstructured data extraction and mining analysis, structured and unstructured data mixed storage and management, big data sharing platform, data transmission, and visualization.[11]

References

  1. Vermeesch, P.; Garzenti, E. (2015). "Making geological sense of ‘Big Data’ in sedimentary provenance analysis". Chemical Geology 409: 20-27. doi:10.1016/j.chemgeo.2015.05.004. 
  2. 2.0 2.1 Chen, J.; Xiang, J.; Hu, Q. et al. (2016). "Quantitative Geoscience and Geological Big Data Development: A Review". Acta Geologica Sinica 90 (4): 1490–1515. doi:10.1111/1755-6724.12782. 
  3. Zhu, Y.; Tan, Y.; Li, R. et al. (2016). "Cyber-physical-social-thinking modeling and computing for geological information service system". International Journal of Distributed Sensor Networks 12 (11). doi:10.1177/1550147716666666. 
  4. 4.0 4.1 Kim, Y.-H.; Yarlagadda, P. (2013). "Cloud Computing Model for Big Geological Data Processing". Applied Mechanics and Materials 475–476: 306-311. doi:10.4028/www.scientific.net/AMM.475-476.306. 
  5. 5.0 5.1 5.2 Chen, J.; Li, J.; Cui, N.; Yu, P. (2015). "The construction and application of geological cloud under the big data background". Geological Bulletin of China 34 (7): 1260–1265. http://caod.oriprobe.com/articles/46629977/The_construction_and_application_of_geological_cloud_under_the_big_dat.htm. 
  6. Li, C. (2010). [10.1109/GEOINFORMATICS.2010.5567743 "The technical infrastructure of geological survey information grid"]. Proceedings from the 18th International Conference on Geoinformatics 2010: 1–6. 10.1109/GEOINFORMATICS.2010.5567743. 
  7. Wu, L.; Xue, L.; Li, C. et al. (2015). "A Geospatial Information Grid Framework for Geological Survey". PLoS One 10 (12): e0145312. doi:10.1371/journal.pone.0145312. 
  8. Evangelidis, K.; Ntouros, K.; Makridis, S.; et al. (2014). "Geospatial services in the Cloud". Computers & Geosciences 63: 116–122. doi:10.1016/j.cageo.2013.10.007. 
  9. Huang, M.; Liu, A.; Wang, T.; Huang, C. (2017). "Green data gathering under delay differentiated services constraint for internet of things". Wireless Communications and Mobile Computing. https://www.hindawi.com/journals/wcmc/aip/9715428/. 
  10. "Web of Science". Clarivate Analytics. https://www.webofknowledge.com/. 
  11. 11.0 11.1 Yang, C.; Yu, M.; Hu, F. et al. (2017). "Utilizing cloud computing to address big geospatial data challenges". Computers, Environment and Urban Systems 61 (Part B): 120–128. doi:10.1016/j.compenvurbsys.2016.10.010. 
  12. Wu, L.; Xue, L.; Li, C. et al. (2017). "A Knowledge-Driven Geospatially Enabled Framework for Geological Big Data". International Journal of Geo-Information 6 (6): 166. doi:10.3390/ijgi6060166. 
  13. Tan, Y. (2016). "Architecture and Key Issues of Geological Big Data and Information Service Project". Geomatics World 23 (1): 1–6. http://caod.oriprobe.com/articles/48928882/Architecture_and_Key_Issues_of_Geological_Big_Data_and_Information_Ser.htm. 

Notes

This presentation is faithful to the original, with only a few minor changes to presentation. In some cases important information was missing from the references, and that information was added. Grammar has been updated to make the content more readable.