Linked Open Data Integration Benchmark (LODIB) Specification - V1.0

Carlos R. Rivero (University of Sevilla, Spain)
Andreas Schultz (Freie Universität Berlin, Germany)
Chris Bizer (Freie Universität Berlin, Germany)
This version:
Latest version:
Publication Date: 02/20/2012


Linked Data sources on the Web use a wide range of vocabularies to represent data describing the same type of entity. For some types of entities, such as people or bibliographic records, common vocabularies have emerged that are used by multiple data sources. But even for these common types, different user communities use competing vocabularies. Linked Data applications that want to understand as much data from the Web as possible thus need to overcome vocabulary heterogeneity and translate the original data into a single target vocabulary. To support application developers with this integration task, several Linked Data translation systems have been developed. These systems provide languages to express declarative mappings that are used to translate heterogeneous Web data into a single target vocabulary. This document specifies the Linked Open Data Integration Benchmark (LODIB), a benchmark for comparing the expressivity as well as the runtime performance of Linked Data translation systems. The benchmark aims to reflect the real-world heterogeneities that exist on the Web of Linked Data and has thus been designed based on statistics derived from the LOD Cloud.
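To make the idea of translating heterogeneous data into a single target vocabulary concrete, the following is a minimal sketch of the simplest possible case: renaming properties. It is not the mapping language of any actual translation system. The dictionary `PROPERTY_MAPPING`, the function `translate`, the example.org resources, and the choice of `rdfs:label` as the target property are all assumptions made for illustration. Real mappings typically also cover class renaming, value transformation, and structural changes.

```python
# Illustrative only: a toy property-renaming mapping, not the mapping
# language of any real Linked Data translation system.

FOAF = "http://xmlns.com/foaf/0.1/"
DC = "http://purl.org/dc/elements/1.1/"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"

# Hypothetical declarative mapping: source property -> target property.
PROPERTY_MAPPING = {
    FOAF + "name": RDFS + "label",
    DC + "title": RDFS + "label",
}


def translate(triples, mapping):
    """Rewrite predicates according to `mapping`; unmapped triples pass through."""
    return [(s, mapping.get(p, p), o) for (s, p, o) in triples]


# Made-up source data using two different source vocabularies.
source_triples = [
    ("http://example.org/person/1", FOAF + "name", "Alice"),
    ("http://example.org/paper/7", DC + "title", "A Benchmark"),
    ("http://example.org/person/1", FOAF + "mbox", "mailto:alice@example.org"),
]

translated = translate(source_triples, PROPERTY_MAPPING)
for triple in translated:
    print(triple)
```

In this sketch the mapping is plain data and the translation logic is generic, which mirrors the declarative style described above: supporting another source vocabulary means adding entries to the mapping, not changing code.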

Table of Contents

Appendix A: Changes

Appendix B: Acknowledgements

This work was supported by the EU FP7 grant LOD2 - Creating Knowledge out of Interlinked Data (Grant No. 257943), the European Commission (FEDER), and the Spanish and Andalusian R&D&I programmes (grants P07-TIC-2602, P08-TIC-4100, TIN2008-04718-E, TIN2010-21744, TIN2010-09809-E, TIN2010-10811-E, and TIN2010-09988-E).