Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 4.0

Description

The Fedora-Kepler Integration demonstration project is an investigation of methods of interaction between the Kepler Scientific Workflow Engine and a Fedora Commons Repository. The goal of the project is to create a Kepler workflow that accesses data from a Fedora Repository, performs operations on the data and then stores the result back in a Fedora Repository.

The Fedora-Kepler investigation was performed by Rick Moore for Fedora Commons and the Cornell University Library. Funding for this project was provided under a grant from the Mellon Foundation.

Project Technologies

Fedora Digital Repository

The project used Fedora Repository V3.2.1, released May 8, 2009. It was chosen because it contained the first available release of a streamlined REST API for Fedora. This version is available from Fedora Commons at http://www.fedora-commons.org/software/current. The Fedora REST API is documented at http://fedora-commons.org/confluence/display/FCR30/REST+API.

Kepler Scientific Workflow

Kepler is a java-based application for building and executing scientific workflows. It is a product of the Kepler Project originated by UC Davis, UC Santa Barbara, and UC San Diego.

A detailed analysis of Kepler is available on the Kepler Scientific Workflow Engine page in this wiki. It provides a basic understanding of Kepler terminology and the issues encountered. The page also contains information on downloading Kepler and navigating it's documentation maze.

Jython

Much of the code developed for this project was written in Jython, an Java implementation of the Python programming language. It supplements Java with powerful dynamic language support and gives developers access to the strengths of both Python and Java. Jython is available from The Jython Project (http:www.jython.org) web site.

To learn more about Jython visit one of the following sites:

JyFedoREST

JyFedoREST is a Jython package for creating and managing objects in a Fedora Repository. It is a minimal Jython wrapper around V3.2.1 of the Fedora REST API. It is described in detail on it's own project page.

FCRepoKepler package

FCRepoKepler is a Jython package containing a few helper classes that make life easier when developing actors for Kepler. The project page contains complete documentation of the classes available in the package.

Test Case - REST API

Several actors and workflows were developed to test Fedora's REST API using JyFedorREST from within Kepler. They are described in detail on the REST API Tests page. The most interesting of these is the Ingest With Datastream workflow.

Test Case - DataSTAR

DataSTAR is a Cornell Library initiative aimed at providing a staging archive for scientific data that is to be shared among several collaborators, both within the Cornell community and external to it.The actors and workflows developed for this part of the project are descibed in detail on the DataSTAR Use Case page.