Current Release
This documentation covers the latest release of Islandora 7.x. For the very latest in Islandora, we recommend Islandora 8.

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Current »

Overview

The Web Archive Solution Pack adds all required Fedora objects to allow users to ingest and retrieve web archives through the Islandora interface.

Dependencies

Downloads

Release Notes and Downloads

Configuration

The Web Archive Solution Pack configuration options can be accessed at http://path.to.your.site/admin/islandora/solution_pack_config/web_archive. Set the paths for warcindex and warcfilter here:


If you are using Solr 4+, the WARC_FILTERED datastream will automatically be indexed via Apache Tika. You will need to add ds.WARC_FILTERED^1 to the Query fields form in http://path.to.your.site/admin/islandora/search/islandora_solr/settings.

Content Models, Prescribed Datastreams and Forms

The Web Archive Solution Pack comes with the following objects in http://path.to.your.site/admin/islandora/solution_pack_config/solution_packs:

  • Islandora Web Archive Content Model (islandora:sp_web_archive)
  • Web Archive Collection (islandora:sp_web_archive_collection)

A file ingested using the Web Archive Solution Pack's content model will have the following datastreams:

RELS-EXT

Default Fedora relationship metadata

MODS

MODS record filled out during ingest

DC

Dublin Core record

OBJ

Original WARC file uploaded

TNDefault thumbnail icon for WARC objects
PNGOptional screenshot to represent the WARC
PDFOptional pdf to store with the WARC
WARC_CSVWARC Index
WARC_FILTEREDWARC filtered for Solr index

The Web Archive Solution Pack comes with the Web Archive MODS form.

  • No labels