Federated SPARQL Queries Processing with Replicated Fragments

Abstract : Federated query engines allow to consume linked data from SPARQL endpoints. Replicating data fragments from different sources allows to reorganize data to better fit federated query processing of data consumers. However, existing federated query engines poorly support replication. In this paper, we propose a replication-aware federated query engine that extends state-of-art federated query engine ANAPSID and FedX with Fedra, a source selection strategy that approximates the source selection problem with fragments replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to find the endpoints to contact in order to minimize the number of tuples to transfer from endpoints to the federated query engines. We devise the Fedra source selection algorithm that approximates SSP-FR. We implement Fedra in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that Fedra efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.
Complete list of metadatas

http://hal.univ-nantes.fr/hal-01169601
Contributor : Gabriela Montoya <>
Submitted on : Monday, June 29, 2015 - 6:10:16 PM
Last modification on : Thursday, October 10, 2019 - 9:42:03 PM
Long-term archiving on : Tuesday, April 25, 2017 - 7:47:07 PM

Files

paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01169601, version 1

Collections

Citation

Gabriela Montoya, Hala Skaf-Molli, Pascal Molli, Maria-Esther Vidal. Federated SPARQL Queries Processing with Replicated Fragments. The Semantic Web - ISWC 2015 - 14th International Semantic Web Conference, Oct 2015, Bethlehem, United States. ⟨hal-01169601⟩

Share

Metrics

Record views

1817

Files downloads

1275