Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Generally, Islandora performs best when each ZIP directory is smaller than 2 GB.
  • Each directory within the ZIP file represents an issue.
  • Files within the issue directory will become datastreams on the issue object.
  • Directories within the issue directory contain files that will become newspaper page objects.  
  • File names must match the Islandora datastream IDs that match each file. This means that every page image needs to be renamed "OBJ.tifftif" in order to be treated as a newspaper page object by Islandora.

...

└── issue1
    ├── 1
    │   ├── JP2.jp2
    │   ├── JPG.jpegjpg
    │   ├── OBJ.tif
    │   ├── OCR.asctxt
    │   └── TN.jpegjpg
    ├── 2
    │   ├── JP2.jp2
    │   ├── JPG.jpegjpg
    │   ├── OBJ.tif
    │   ├── OCR.asctxt
    │   └── TN.jpegjpg
    └── MODS.xml

Descriptive Metadata

...

  • Zip file - Upload the ZIP file for batch ingest.
  • Create PDFs? - Checking this box creates a PDF derivative that contains all the pages associated with a newspaper issue.
  • Namespace for created objects - Set the namespace for the issue and page objects created for this batch ingest.
  • Generate OCR? - Checking this box causes OCR to be generated for each Page object. OCR will be attached as a datastream to each page. If checked, another option appears below it, "Aggregate OCR?".
  • Aggregate OCR? Check this box to create an OCR datastream in the issue object that aggregates the OCR datastreams from all of the page-level objects in that issue.
  • Notify admin after ingest? - Check this box to send an email to the site admin (user 1) that a newspaper batch ingest has completed. This requires the Drupal Rules module and a rule for newspaper batch notifications.
  • Ingest immediately? - Checking this box will cause the batch to go through both steps of the ingest (pre-processing and actual ingest) immediately.
    • If you do not check "Ingest Immediately", the files will be pre-processed only and added to the Islandora batch queue for an administrator to approve.
    • To approve the batch, go to Administration > Reports > Islandora Batch Sets and select "View Items in Set" next to an unprocessed set. To process the set, click "Process Set" and process all items.
      Pre-processed newspaper batch items in the batch queue

...