Skip to main navigation Skip to search Skip to main content

Flexible data partitioning schemes for parallel merge joins in semantic web queries.

Abstract

In the context of the Semantic Web, large amounts of data must be preprocessed and stored so that they can be queried efficiently later. The key technology in this topic are triple stores, in which all information is stored in the form of (subject, predicate and object) triple patterns. Depending on the triple patterns used within the queries, very different value distributions can be observed within these datasets. Currently, these properties are only exploited implicitly during join optimization in the form of histograms or similar technologies. This paper proposes a new way to take advantage of these different distributions using different partitioning schemes at runtime. This means that an optimal partitioning scheme can be used depending on the data access in order to improve query performance. In the experiments we achieve speedups up to a factor of 5.92 in comparison to no partitioning, and a performance improvement of up to 81% compared to a not optimal number of partitions.
Original languageEnglish
Pages237-256
DOIs
Publication statusPublished - 2021

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 4 - Quality Education
    SDG 4 Quality Education
  3. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure
  4. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities
  5. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production
  6. SDG 14 - Life Below Water
    SDG 14 Life Below Water
  7. SDG 15 - Life on Land
    SDG 15 Life on Land

Research Areas and Centers

  • Centers: Center for Artificial Intelligence Luebeck (ZKIL)
  • Research Area: Intelligent Systems

Fingerprint

Dive into the research topics of 'Flexible data partitioning schemes for parallel merge joins in semantic web queries.'. Together they form a unique fingerprint.

Cite this