Archive of digitised newspapers

Title (goal): Archive of digitised newspapers
Primary Actor: The State and University Library, Denmark
Scope: Organization (black-box)
Level: Very high level summary
(Story): 30 million pages of newspapers ingested over a 3 year period. Data consists of JPEG2 files externally referenced. Metadata consists of JPEG2 file analysis (using jpylizer), MODS metadata, MIX technical metadata and ALTO OCR files. Access is by daily harvesting of new and changed objects to dissemination system.

Page tree