Old Release

This documentation relates to an old version of DSpace, version 5.x. Looking for another version? See all documentation.

Support for DSpace 5 ended on January 1, 2023.  See Support for DSpace 5 and 6 is ending in 2023

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Exchanging repository contents

The most sites on the Internet are oriented towards human consumption. While HTML may be a good format to create websites it is not a good format to export data in a way a computer can work with. Like the most software for repositories DSpace supports OAI-PMH as an interface to export the stored data. While OAI-PMH is well known in the field of repositories it is rarely known elsewhere (e.g. Google retired its support for OAI-PMH in 2008). The Semantic Web is an generic approach to publish data on the Internet together with information about its semantics. The W3C released standards like RDF or SPARQL for publishing structured data on the Web in a way computers can easily work with. The data stored in repositories is particularly suited to be used in the Semantic Web, as metadata is already available. It doesn’t have to be generated or entered manually for publication as Linked Data. For most repositories, at least for Open Access repositories, it is quite important to share their stored content. Linked Data is a rather big chance for repositories to present their content in a way it can easily be accessed, interlinked and (re)used.

Terminology

We don't want to give a full introduction into the Semantic Web and its technologies here as there can by found many on the web. Nevertheless we want to give a short glossar about the terms used most often in this content to make the following documentation more readable.

 

Semantic WebThe term "Semantic Web" refers to the part of the Internet containing Linked Data. As in the World Wide Web the Semantic Web is created by links between the data.

Linked Data

Linked Open Data

Linked Data is used for data in RDF, following the Linked Data Principles. The Linked Data Principles describes expected behavior by data publishers that shall ensure that the data published is easy to find, easy to retrieve, can be linked easily and links to other data as well.

Linked Open Data is Linked Data published using an open license. Technically there is no difference between Linked Data and Linked Open Data (often abbreviated as LOD), it is only a question of the license used to publish.

RDF
RDF/XML
Turtle
N-Triples
N3-Notation
RDF is an acronym for Resource Description Framework, a meta data model. Don't think of RDF as a format, as it is a model. Nevertheless there are different formats to serialize data following RDF. RDF/XML, Turtle, N-Triples and N3-Notation are probably the most known formats to serialize data in RDF.
Triple StoreA triple store is a database to natively store data following the RDF approach.
SPARQLThe SPARQL Protocol and RDF Query Language is a protocol to query triple stores. Since SPARQL version 1.1 it can be used to manipulate triple stores as well, to store, delete or updata data in triple stores.
SPARQL endpointA SPARQL endpoint is an SPARQL interface of a triple store. Since SPARQL 1.1 a SPARQL endpoint can be read-only, allowing to query the stored data only or it can be read-writable allowing to modified stored data as well.

Linked (Open) Data Support within DSpace

Starting with DSpace 5.0 DSpace supports to provide stored contents as Linked (Open) Data.

 

 

  • No labels