Exploratory research on data lakes in the era of big data is a prominent topic in both academia and industry. One of the main motivations is that companies must cope with more data than ever before, and the problems of how to analyze, and even how to store, data are becoming increasingly challenging in many industries. The emergence of the data lake concept to address such big data problems is instructive, and it will most likely be considered in any relevant big data strategy. This review paper summarizes some currently popular data lake concepts, followed by their advantages, potential risks, and criticisms from some professionals. Additionally, a general process in a data lake is described.
Table of Contents
Abstract
1. Introduction
2. Background
3. Process in a data lake
3.1 Data Ingestion
3.2 Data Storage
3.3 Data Analytics
4. Criticisms and Suspicion
5. Potential risks
6. Conclusion
References
Keywords
Big Data, Data Lake, Data Analysis, Data Ingestion, Data Visualization
Authors
Ajit Singh [ Department of Computer Science, Patna Women's College, Patna, India ]
Corresponding Author
Sultan Ahmad [ Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia ]
Gouse Pasha Mohammed [ Department of Computer Science, Deanship of Preparatory Year, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia ]
The Korean Academic Society of AI Digital Convergence (한국AI디지털융합학회; formerly 한국디지털융합학회)
Year Founded
2015
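Field
Social Sciences > Business Administration
About
The society aims to build an academic foundation and practical guidelines for advancing digital management and for Korea's growth into a world-class digital power. It pursues this through interdisciplinary research and pragmatic application in areas related to digital management: digital media, digital communications, digital broadcasting, digital content, digital culture, digital society, digital distribution, digital finance, digital logistics, digital policy, digital technology, digital education, and comparisons between digital and analog.
Publication
Publication Title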
IJICTDC [International Journal of Information Communication Technology and Digital Convergence]