This document is under development.

DOI support

We (at Technische Universität Berlin) want to use DOIs for Items within DSpace. We are also thinking about using DOIs for Communities and Collections, but at first we will concentrate on Items. A DOI is a well-known persistent identifier, and with the external identifier support that atMire introduced in DSpace 3.0 along with the versioning support, it should be possible to add support for minting, registering and deleting DOIs from within DSpace.

Registration agencies, DataCite and EZID

To register a DOI, one has to sign a contract with a DOI registration agency; several agencies exist. Some offer DOI registration especially or exclusively for academic institutions, others only for publishing companies. Most registration agencies charge fees for registering DOIs, and each of them has its own rules describing what kinds of items a DOI can be registered for. When implementing DOI support, we have to take into account that every registration agency has its own API (see below).

DataCite is an organization that aims to support access to, acceptance of and archiving of research data. One of the services offered by DataCite members is DOI registration. DataCite has several members that act as DOI registration agencies. Some members tell their customers to use the DataCite API directly, others offer their own APIs. So registering a DOI with a member of DataCite does not automatically mean using the DataCite API directly.

We will register our DOIs using the service of TIB Hannover, a German member of DataCite, and we will use the DataCite API directly. EZID is a DOI registration agency in the U.S. that is also part of DataCite. EZID offers its own API, so EZID customers cannot profit directly from our development.

DOIIdentifierProvider, DOIConnector and DataCiteConnector

Given this situation, we developed a DOIIdentifierProvider that should perform everything necessary on the DSpace side to support DOIs. For example, after minting and registering a DOI it saves the DOI as a metadata value of the Item. To keep the DOIIdentifierProvider extensible, we put a DOIConnector between it and the API of the registration agency. The DOIConnector has to support seven methods and should be quite easy to implement for the API of any DOI registration agency. The seven methods are:

  • one method to check if a DOI is already reserved,
  • one method to check if a DOI is reserved for a given DSO,
  • one method to check if a DOI is already registered,
  • one method to check if a DOI is registered for a given DSO,
  • one method to reserve a DOI for a given DSO,
  • one method to register a DOI for a given DSO,
  • one method to delete a DOI for a given DSO.

We have already developed a DataCiteConnector that implements these methods for everyone who uses the DataCite API directly. As mentioned above, EZID has its own API, but it should be quite simple to implement a DOIConnector that provides these seven methods on top of the EZID API.
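The seven methods listed above could be sketched as a Java interface roughly like the one below. This is only an illustration: the method names, the signatures, the use of a plain String id in place of a DSpace object, and the in-memory implementation are all assumptions made for this sketch and are not taken from the code in our repository. The sketch also assumes that a DOI has to be reserved before it can be registered and that a registered DOI stays reserved.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of the connector contract. All names and signatures are
// assumptions; a plain String id stands in for a DSpace object (DSO).
interface DOIConnector {
    boolean isDOIReserved(String doi);
    boolean isDOIReserved(String dsoId, String doi);
    boolean isDOIRegistered(String doi);
    boolean isDOIRegistered(String dsoId, String doi);
    void reserveDOI(String dsoId, String doi);
    void registerDOI(String dsoId, String doi);
    void deleteDOI(String dsoId, String doi);
}

// In-memory stand-in for a registration agency, for illustration only.
// A real connector would call the agency's API instead of using maps.
class InMemoryDOIConnector implements DOIConnector {
    private final Map<String, String> reserved = new HashMap<>();   // doi -> dsoId
    private final Map<String, String> registered = new HashMap<>(); // doi -> dsoId

    public boolean isDOIReserved(String doi) {
        return reserved.containsKey(doi);
    }

    public boolean isDOIReserved(String dsoId, String doi) {
        return dsoId.equals(reserved.get(doi));
    }

    public boolean isDOIRegistered(String doi) {
        return registered.containsKey(doi);
    }

    public boolean isDOIRegistered(String dsoId, String doi) {
        return dsoId.equals(registered.get(doi));
    }

    public void reserveDOI(String dsoId, String doi) {
        String owner = reserved.get(doi);
        if (owner != null && !owner.equals(dsoId)) {
            throw new IllegalStateException("DOI is reserved for another object: " + doi);
        }
        reserved.put(doi, dsoId);
    }

    public void registerDOI(String dsoId, String doi) {
        // Assumption of this sketch: registration requires a prior reservation.
        if (!isDOIReserved(dsoId, doi)) {
            throw new IllegalStateException("DOI is not reserved for this object: " + doi);
        }
        registered.put(doi, dsoId);
    }

    public void deleteDOI(String dsoId, String doi) {
        if (!isDOIReserved(dsoId, doi)) {
            throw new IllegalStateException("DOI does not belong to this object: " + doi);
        }
        reserved.remove(doi);
        registered.remove(doi);
    }
}
```

With such an interface, a DOIIdentifierProvider would only talk to a DOIConnector, and supporting another agency such as EZID would mean writing one more implementation of the same seven methods.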

Metadata

DataCite requires metadata for the objects that the DOIs address. The DataCite Schema (http://schema.datacite.org) defines an XML structure to describe the metadata of an object. We developed a DIM2DataCite crosswalk that takes the metadata of a DSpace Item and transforms it into XML conforming to DataCite Schema 2.2. As far as I know, EZID does not use this XML, so another crosswalk is probably needed. It should be discussed (see below or in the JIRA ticket) how we want to deal with metadata updates, as the API for external identifiers does not yet define a mechanism to update the metadata of an external identifier.
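To give an idea of what such a crosswalk has to produce, here is a minimal Java sketch that builds a DataCite-style XML record from a few values. It is not the DIM2DataCite crosswalk itself: the class and method names are hypothetical, the choice of input values (authors, title, publisher, year) is an assumption about which DSpace metadata would be used, and the namespace URI and element names reflect our reading of the 2.2 kernel and should be checked against the published schema.

```java
import java.io.StringWriter;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper, for illustration only.
class DataCiteSketch {
    // Assumed namespace of the 2.2 kernel; verify against the schema.
    static final String NS = "http://datacite.org/schema/kernel-2.2";

    static String toDataCiteXml(String doi, List<String> authors, String title,
                                String publisher, String year) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        Document doc = factory.newDocumentBuilder().newDocument();

        Element resource = doc.createElementNS(NS, "resource");
        doc.appendChild(resource);

        // The DOI itself, marked with its identifier type.
        Element identifier = child(doc, resource, "identifier", doi);
        identifier.setAttribute("identifierType", "DOI");

        // One creator element per author.
        Element creators = child(doc, resource, "creators", null);
        for (String author : authors) {
            Element creator = child(doc, creators, "creator", null);
            child(doc, creator, "creatorName", author);
        }

        Element titles = child(doc, resource, "titles", null);
        child(doc, titles, "title", title);
        child(doc, resource, "publisher", publisher);
        child(doc, resource, "publicationYear", year);

        // Serialize the DOM tree; special characters are escaped by the serializer.
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    // Creates a namespaced child element, optionally with text content.
    private static Element child(Document doc, Element parent, String name, String text) {
        Element element = doc.createElementNS(NS, name);
        if (text != null) {
            element.setTextContent(text);
        }
        parent.appendChild(element);
        return element;
    }
}
```

A call such as `DataCiteSketch.toDataCiteXml("10.5072/example-1", List.of("Doe, Jane"), "Some title", "Some publisher", "2013")` returns the record as a string; the DOI and the other values here are made up for illustration.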

Status

What's done already?

A first version of a DOIIdentifierProvider is complete. An interface for a DOIConnector is defined. The reservation, registration and deletion of a DOI have been tested here. All the code can be found in our DSpace repository on GitHub, in the branch DOI: https://github.com/tuub/DSpace/tree/DOI

 

What's still to do?

 

 
