...
Among the accomplishments of the project are the following:
Software Tools
Checksum Checker
A tool for verifying the integrity of bitstreams in the asset store, developed in conjunction with the University of Rochester. This tool is now part of the DSpace 1.4 codebase.
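As an illustration of the general idea behind checksum-based integrity checking, here is a minimal Python sketch. It is not the Checksum Checker's actual code: the function names `compute_checksum` and `verify_bitstream` are hypothetical, and the MD5 default is only an assumption made for the example. The sketch recomputes a file's digest and compares it with a previously recorded value.

```python
import hashlib


def compute_checksum(path, algorithm="md5", chunk_size=65536):
    """Return the hex digest of the file at `path`, reading it in chunks."""
    digest = hashlib.new(algorithm)  # algorithm choice is an assumption
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large bitstreams never load fully into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_bitstream(path, recorded_checksum, algorithm="md5"):
    """Return True if the file still matches the checksum recorded for it."""
    return compute_checksum(path, algorithm) == recorded_checksum.lower()
```

A mismatch means only that the stored bytes differ from what was recorded; working out why (corruption, tampering, or a stale record) is a separate step.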
TechMDExtractor
A tool for validating the formats of stored bitstreams and, optionally, for extracting technical metadata from the bitstreams. Harvard University's JHOVE provides the underlying functionality. This tool is awaiting integration into the DSpace codebase; its documentation is available at TechMDExtractor.
Workflow Pre-ingest Step
An optional DSpace workflow step that validates the format of every bitstream upon ingest and provides the system administrator with extracted metadata for files that are either invalid or not well-formed. Currently JHOVE provides all underlying functionality, although I'm hopeful that someone will expand this step to provide additional functionality, such as virus-checking and migration-on-ingest. This step is also awaiting integration into the DSpace codebase; its documentation is available at PreIngest.
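The reporting rule described above (flag files that are invalid or not well-formed) can be sketched in Python as follows. This is only an illustration under stated assumptions: `ValidationResult` and `report_for_admin` are hypothetical names, and the record layout does not reflect JHOVE's actual output or the step's real implementation.

```python
from dataclasses import dataclass, field


@dataclass
class ValidationResult:
    """Hypothetical record of one validation run (not JHOVE's real output)."""
    filename: str
    well_formed: bool
    valid: bool
    metadata: dict = field(default_factory=dict)


def report_for_admin(results):
    """Keep only the results for files that are invalid or not well-formed."""
    return [r for r in results if not (r.well_formed and r.valid)]
```

For example, given one result that is well-formed and valid and another that is well-formed but not valid, only the second would be passed on to the administrator along with its extracted metadata.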
Documents
General Documents
- IST_final.pdf Exploring Strategies for Digital Preservation for DSpace@Cambridge, in Proceedings of the Archiving 2005 Conference, Society for Imaging Science and Technology, April 2005.
- JhoveLNZComp Why We Chose JHOVE
Format Background Reports
As part of the preservation planning process, we compiled background documents on several formats. These are formats we are very interested in preserving and, perhaps just as importantly, formats for which little information yet seems to be available in the digital preservation community. The background documents depend heavily on foundations laid by three sources:
...
- backgrd-HTML.pdf HTML 4.01
- backgrd-XHTML.pdf XHTML 1.0
- backgrd-XLS.pdf MS Excel 10.0
- backgrd-PPT.pdf MS PowerPoint 10.0
- backgrd-MSWord.pdf MS Word 10.0
Preservation Options Reports
These reports summarize the pros and cons of various migration options.
...