A4 Article in conference proceedings
Migrating from a Centralized Data Warehouse to a Decentralized Data Platform Architecture (2021)
Loukiala, A., Joutsenlahti, J.-P., Raatikainen, M., Mikkonen, T., & Lehtonen, T. (2021). Migrating from a Centralized Data Warehouse to a Decentralized Data Platform Architecture. In L. Ardito, A. Jedlitschka, M. Morisio, & M. Torchiano (Eds.), PROFES 2021 : 22nd International Conference on Product-Focused Software Process Improvement, Proceedings. Product-Focused Software Process Improvement (pp. 36-48). Springer International Publishing. Lecture Notes in Computer Science, 13126. https://doi.org/10.1007/978-3-030-91452-3_3
JYU authors or editors
Publication details
All authors or editors: Loukiala, Antti; Joutsenlahti, Juha-Pekka; Raatikainen, Mikko; Mikkonen, Tommi; Lehtonen, Timo
Parent publication: PROFES 2021 : 22nd International Conference on Product-Focused Software Process Improvement, Proceedings. Product-Focused Software Process Improvement
Parent publication editors: Ardito, Luca; Jedlitschka, Andreas; Morisio, Maurizio; Torchiano, Marco
Place and date of conference: Turin, Italy, 26.11.2021
ISBN: 978-3-030-91451-6
eISBN: 978-3-030-91452-3
Journal or series: Lecture Notes in Computer Science
ISSN: 0302-9743
eISSN: 1611-3349
Publication year: 2021
Number in series: 13126
Pages range: 36-48
Number of pages in the book: 308
Publisher: Springer International Publishing
Place of Publication: Cham
Publication country: Switzerland
Publication language: English
DOI: https://doi.org/10.1007/978-3-030-91452-3_3
Publication open access: Not open
Publication channel open access:
Publication is parallel published (JYX): https://jyx.jyu.fi/handle/123456789/79781
Additional information: Also part of the Programming and Software Engineering book sub series (LNPSE, volume 13126).
Abstract
To an increasing degree, data is a driving force for digitization, and hence also a key asset for numerous companies. In many businesses, various sources of data exist, which are isolated from one another in different domains, across a heterogeneous application landscape. Well-known centralized solution technologies, such as data warehouses and data lakes, exist to integrate data into one system, but they do not always scale well. Therefore, robust and decentralized ways to manage data can provide the companies with better value give companies a competitive edge over a single central repository. In this paper, we address why and when a monolithic data storage should be decentralized for improved scalability, and how to perform the decentralization. The paper is based on industrial experiences and the findings show empirically the potential of a distributed system as well as pinpoint the core pieces that are needed for its central management.
Keywords: enterprises; information management; information management systems; data systems; data warehouses; information administration; data; decentralisation; distributed systems; centralisation
Free keywords: data warehousing; data platform architecture; distributed data management; data decentralization
Contributing organizations
Ministry reporting: Yes
Reporting Year: 2021
JUFO rating: 1