Electrical and Computer Engineering Publications
Document Type
Article
Publication Date
7-22-2024
Volume
13
Issue
7
Journal
Computers
First Page
1
URL with Digital Object Identifier
https://doi.org/10.3390/computers13070183
Last Page
25
Abstract
This paper presents a comprehensive literature review on the evolution of data-lake technology, with a particular focus on data-lake architectures. By systematically examining the existing body of research, we identify and classify the major types of data-lake architectures that have been proposed and implemented over time. The review highlights key trends in the development of data-lake architectures, identifies the primary challenges faced in their implementation, and discusses future directions for research and practice in this rapidly evolving field. We have developed diagrammatic representations to highlight the evolution of various architectures. These diagrams use consistent notations across all architectures to further enhance the comparative analysis of the different architectural components. We also explore the differences between data warehouses and data lakes. Our findings provide valuable insights for researchers and practitioners seeking to understand the current state of data-lake technology and its potential future trajectory.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.