The repository home for this project is: http://scm.dspace.org/svn/repo/sandbox/SimpleArchiveFormat_Builder-prototype/

The input for a command-line batch ingest of materials into DSpace is well documented and is called "Simple Archive Format". However, there do not seem to be tools available that make it easy to create a Simple Archive Format package. The Simple Archive Format Packager assumes that someone is tracking all of the items of a future collection in a folder and keeping metadata about them in a spreadsheet.

Thus the input to the Simple Archive Format Packager is a spreadsheet/CSV with a filename column that names the bitstream/file, plus other columns whose headers are fully qualified metadata attributes, such as dc.description or dc.contributor.author.
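As a rough illustration of that input, the sketch below maps one spreadsheet row to a filename plus a list of metadata values. It is not the sandbox project's code: the class and method names (CsvRowSketch, parseRow, parseField) are hypothetical. It assumes a plain comma-separated line with no quoted fields, and it assumes that a column name without a qualifier maps to the qualifier "none".

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the sandbox project's code.
class CsvRowSketch {

    /** One metadata value with its fully qualified name split into parts. */
    static final class MetadataValue {
        final String schema, element, qualifier, value;
        MetadataValue(String schema, String element, String qualifier, String value) {
            this.schema = schema;
            this.element = element;
            this.qualifier = qualifier;
            this.value = value;
        }
    }

    /** One spreadsheet row: the bitstream filename plus its metadata. */
    static final class Item {
        final String filename;
        final List<MetadataValue> metadata;
        Item(String filename, List<MetadataValue> metadata) {
            this.filename = filename;
            this.metadata = metadata;
        }
    }

    /** Splits e.g. "dc.contributor.author" into schema, element, and qualifier. */
    static MetadataValue parseField(String columnName, String value) {
        String[] parts = columnName.trim().split("\\.");
        if (parts.length < 2 || parts.length > 3) {
            throw new IllegalArgumentException("Not a qualified metadata name: " + columnName);
        }
        // Assumption: an unqualified name such as dc.title gets the qualifier "none".
        String qualifier = parts.length == 3 ? parts[2] : "none";
        return new MetadataValue(parts[0], parts[1], qualifier, value.trim());
    }

    /** Assumption: no quoted fields, so a comma never appears inside a value. */
    static Item parseRow(String headerLine, String rowLine) {
        String[] header = headerLine.split(",", -1);
        String[] cells = rowLine.split(",", -1);
        if (header.length != cells.length) {
            throw new IllegalArgumentException("Row and header have different column counts");
        }
        String filename = null;
        List<MetadataValue> metadata = new ArrayList<>();
        for (int i = 0; i < header.length; i++) {
            String name = header[i].trim();
            String cell = cells[i].trim();
            if (name.equals("filename")) {
                filename = cell;
            } else if (!cell.isEmpty()) {
                metadata.add(parseField(name, cell)); // empty cells are skipped
            }
        }
        if (filename == null || filename.isEmpty()) {
            throw new IllegalArgumentException("Row has no filename");
        }
        return new Item(filename, metadata);
    }

    public static void main(String[] args) {
        Item item = parseRow("filename,dc.title,dc.contributor.author",
                             "report.pdf,Annual Report,Jane Doe");
        System.out.println(item.filename);
        for (MetadataValue m : item.metadata) {
            System.out.println(m.schema + "." + m.element + "." + m.qualifier + " = " + m.value);
        }
    }
}
```

A real spreadsheet export will usually contain quoted fields, so a proper CSV library would be a safer choice than String.split for anything beyond a quick test.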

h2. To get started with the code in the sandbox repo
# Check out the directory from the svn repository.
# In an IDE (tested in NetBeans), create a Java Application with Existing Sources.
# Add the source directory 'src'.
# Download and add the third-party libraries (.jars) mentioned in the README.

You will then need to edit BatchProcess.java so that inputDir and metaFile point to the sample_data, or to whatever collection data you are processing.

The source tree includes a sample_data directory to help kick-start testing and development of this tool.

The expected output of this tool is a package that satisfies the "Simple Archive Format" specification.
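For orientation, a Simple Archive Format package is a directory of item subdirectories, each holding a dublin_core.xml file, a contents file that lists the bitstream filenames one per line, and the bitstreams themselves. The sketch below writes one such item directory. It is a minimal illustration rather than the packager's actual implementation: the class and method names (SafItemWriterSketch, writeItem, dublinCoreXml) and the item_000 directory name are hypothetical, and it handles only the dc schema and a single bitstream per item.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch, not the sandbox project's code.
class SafItemWriterSketch {

    /** Escapes characters that are unsafe in XML text and attribute values. */
    static String escapeXml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    /** Builds dublin_core.xml content from {element, qualifier, value} triples. */
    static String dublinCoreXml(List<String[]> values) {
        StringBuilder xml = new StringBuilder();
        xml.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<dublin_core>\n");
        for (String[] v : values) {
            xml.append("  <dcvalue element=\"").append(escapeXml(v[0]))
               .append("\" qualifier=\"").append(escapeXml(v[1])).append("\">")
               .append(escapeXml(v[2])).append("</dcvalue>\n");
        }
        return xml.append("</dublin_core>\n").toString();
    }

    /** Writes one item directory: the bitstream, the contents file, and dublin_core.xml. */
    static Path writeItem(Path archiveDir, String itemDirName, Path bitstream,
                          List<String[]> values) throws IOException {
        Path itemDir = Files.createDirectories(archiveDir.resolve(itemDirName));
        String fileName = bitstream.getFileName().toString();
        Files.copy(bitstream, itemDir.resolve(fileName), StandardCopyOption.REPLACE_EXISTING);
        // The contents file names each bitstream on its own line.
        Files.write(itemDir.resolve("contents"),
                    (fileName + "\n").getBytes(StandardCharsets.UTF_8));
        Files.write(itemDir.resolve("dublin_core.xml"),
                    dublinCoreXml(values).getBytes(StandardCharsets.UTF_8));
        return itemDir;
    }

    public static void main(String[] args) throws IOException {
        // Temporary files stand in for a real input folder and output archive.
        Path archive = Files.createTempDirectory("saf_archive");
        Path bitstream = Files.createTempFile("report", ".txt");
        Files.write(bitstream, "sample".getBytes(StandardCharsets.UTF_8));
        Path item = writeItem(archive, "item_000", bitstream,
                Collections.singletonList(new String[] {"title", "none", "Annual Report"}));
        System.out.println("Wrote " + item);
    }
}
```

Building the XML by string concatenation keeps the sketch short; a real tool would more likely use an XML library so that escaping and encoding are handled in one place.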

The goal of this sandbox project is to clean up the current process for command-line batch loading of materials into DSpace. The tool could possibly be plugged in as a "packager", so that an admin loading materials can use the DSpace launcher to create a package from their input materials. The created package is then in the proper format for direct ingestion into DSpace via import.
