This documentation refers to an earlier version of Islandora. https://wiki.duraspace.org/display/ISLANDORA/Start is current.

Skip to end of metadata
Go to start of metadata

Overview

The Web Archive Solution Pack adds all required Fedora objects to allow users to ingest and retrieve web archives through the Islandora interface.

Dependencies

Downloads

Release Notes and Downloads

Configuration

The Web Archive Solution Pack configuration options can be accessed at http://path.to.your.site/admin/islandora/solution_pack_config/web_archive. Set the paths for warcindex and warcfilter here:



Content Models, Prescribed Datastreams and Forms

The Web Archive Solution Pack comes with the following objects in http://path.to.your.site/admin/islandora/solution_pack_config/solution_packs:

A file ingested using the Web Archive Solution Pack's content model and LAME will have the following datastreams:

RELS-EXT

Default Fedora relationship metadata

MODS

MODS record filled out during ingest

DC

Dublin Core record

OBJ

Original WARC file uploaded

TNDefault thumbnail icon for audio objects

The Web Archive Solution Pack comes with the Web Archive MODS form.

Solr Indexing

If you are using Solr 4+, the WARC_FILTERED datastream will automatically be indexed via Apache Tika. You will need to add ds.WARC_FILTERED^1 to the Query fields form in Adminstration » Islandora » Solr Index » Solr Settings (admin/islandora/search/islandora_solr/settings).


  • No labels