Vis enkel innførsel

dc.contributor.authorCorodescu, Andrei-Alin
dc.contributor.authorNikolov, Nikolay
dc.contributor.authorKhan, Akif Quddus
dc.contributor.authorSoylu, Ahmet
dc.contributor.authorMatskin, Mihhail
dc.contributor.authorPayberah, Amir
dc.contributor.authorRoman, Dumitru
dc.date.accessioned2022-03-03T09:52:52Z
dc.date.available2022-03-03T09:52:52Z
dc.date.created2021-11-26T11:33:25Z
dc.date.issued2021-11-09
dc.identifier.isbn978-1-4503-8314-1
dc.identifier.urihttps://hdl.handle.net/11250/2982739
dc.description.abstractThe development of the Edge computing paradigm shifts data processing from centralised infrastructures to heterogeneous and geographically distributed infrastructure. Such a paradigm requires data processing solutions that consider data locality in order to reduce the performance penalties from data transfers between remote (in network terms) data centres. However, existing Big Data processing solutions have limited support for handling data locality and are inefficient in processing small and frequent events specific to Edge environments. This paper proposes a novel architecture and a proof-of-concept implementation for software container-centric Big Data workflow orchestration that puts data locality at the forefront. Our solution considers any available data locality information by default, leverages long-lived containers to execute workflow steps, and handles the interaction with different data sources through containers. We compare our system with Argo workflow and show significant performance improvements in terms of speed of execution for processing units of data using our data locality aware Big Data workflow approach.en_US
dc.description.sponsorshipThis work was partly funded by the EC H2020 project “DataCloud” (Grant nr. 101016835) and the NFR project “BigDataMine” (Grant nr. 309691).en_US
dc.language.isoengen_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.ispartofMEDES '21: Proceedings of the 13th International Conference on Management of Digital EcoSystems
dc.relation.ispartofseriesMEDES: Management of Emergent Digital EcoSystems;MEDES '21: Proceedings of the 13th International Conference on Management of Digital EcoSystems
dc.subjectBig data workflowsen_US
dc.titleLocality-Aware Workflow Orchestration for Big Dataen_US
dc.typeConference objecten_US
dc.description.versionpublishedVersionen_US
cristin.ispublishedtrue
cristin.fulltextoriginal
dc.identifier.doihttps://doi.org/10.1145/3444757.3485106
dc.identifier.cristin1959647
dc.source.volume21en_US
dc.source.issue21en_US
dc.relation.projectNorges forskningsråd: 309691en_US
dc.relation.projectEC/H2020/101016835en_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel