Inventory of Hypatia Collections

Stanford

Xanadu

A MARC record for the collection is available in SearchWorks.

The collection consists of 6 hard drives. A very basic finding aid describing its contents is available at http://findingaids.stanford.edu/xtf/view?docId=ead/mss/m1292.xml;chunk.id=headerlink;brand=default;query=xanadu

File Structure of the Collection on disk

Contents of the collection are currently stored on \\sul-wallaby\ForensicsLab\01-OBJECT_POOL\M1292 Xanadu

Directory Structure is as follows:

  • Disk Images
  • Photo Images of Drives
  • EAD

The Disk Images folder contains 3 forensic disk images from 3 physical hard drives. The forensic disk images are named CMxx.dd, with "CM" standing for computer media. This folder also contains two additional metadata files for each forensic disk image. The first is a .txt file that contains technical metadata about the forensic imaging process (example CM01.001\). The second is a .csv file that lists the partitions and files contained on the hard drive (example CM01.001\). This file also records the root path, creation dates, and whether each file was deleted on the media and subsequently recovered.
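As a rough illustration of how the .csv file listing described above could be summarized, here is a minimal Python sketch. The column names (`path`, `created`, `deleted`) and the sample rows are assumptions made up for the example; the real headers produced by the imaging software are not documented on this page and would need to be substituted.

```python
import csv
import io


def summarize_listing(csv_text):
    """Count all entries and those flagged as deleted in a file listing.

    Assumes a hypothetical 'deleted' column holding yes/no style values.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    total = 0
    deleted = 0
    for row in reader:
        total += 1
        if row["deleted"].strip().lower() in ("yes", "true", "1"):
            deleted += 1
    return {"total": total, "deleted": deleted}


# Invented sample rows, for illustration only.
SAMPLE = """path,created,deleted
/docs/a.txt,1999-01-01,no
/docs/b.txt,1999-01-02,yes
"""

summary = summarize_listing(SAMPLE)
print(summary)
```

A count like this gives a quick sense of how much of a drive's content was recovered rather than live, which may matter when deciding on the level of description.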

The Photo Images of Drives folder contains digital photographs (JPEG) of the source media, in this case images of the front and back of the hard drives. Currently, this folder only contains images of a single hard drive.

The EAD folder contains the Encoded Archival Description file for the Xanadu collection (example EAD\). This file does not currently contain any pointers to where the hard drives are physically located in the collection. The finding aid is also missing reference identifiers for the computer media. I believe this is just an oversight, but I'm following up with Special Collections to determine why they are missing.

Yale

| Collection title | Number of files/objects | Extent in (mega/giga)bytes | EAD filename | Level of description of born-digital material |
|---|---|---|---|---|
| James Tobin papers | 27 disk images + metadata (approx 80 files total) | 36 MB | mssa.ms.1746.bpg.xml | Disks are described individually within EAD as separate components |
| Henry Ashby Turner papers | ~5-10 | TBC | mssa.ms.1691.bpg.xml | Components represent individual digital objects within a specific subseries |
| Love Makes a Family records | TBC | TBC | mssa.ms.1962.bpg.xml | Only described at high-level aggregations |
| Pelli Clarke Pelli records | TBC | TBC | mssa.ms.1939.bpg.xml | Currently completely undescribed |
| New Haven Oral Histories | TBC | TBC | mssa.ru.1055.bpg.xml | Described as individual "interviews" - audio file + MS Word document |
| James Welch papers (Beinecke) | TBC | TBC | beinecke.welch.bpg.xml | TBC |

Virginia

What I have to submit is some EAD for the Cheuse collection and 4 zip files, which match the id of <c02> elements in the EAD. The zip files contain images of each disk and PDF files. I can't actually image the disks yet because I don't have the hardware. For the purposes of the tests, what I did was:

  1. Took pictures of the floppies
  2. Created a directory structure that matched the structure in the EAD and put the images of each disk in the appropriate folder
  3. Added a dummy PDF to each folder
  4. Zipped up each folder and ran it through Rubymatica, which:
    1. Unzips the archive
    2. Creates some technical metadata within a METS.xml file
    3. Rezips the archive

So the .zip archives included .txt, .xml, .jpg, and .pdf files.
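The folder-zipping step in the list above could be scripted along these lines. This is only a sketch of that one step, not the workflow actually used: the function name `zip_component_folders`, the folder ids `comp-a` and `comp-b`, and the placeholder file contents are all hypothetical, and the Rubymatica processing is not reproduced here.

```python
import tempfile
import zipfile
from pathlib import Path


def zip_component_folders(root, out_dir):
    """Create one zip per immediate subfolder of root, named after the folder."""
    root, out_dir = Path(root), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    archives = []
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        archive = out_dir / f"{folder.name}.zip"
        with zipfile.ZipFile(archive, "w", zipfile.ZIP_DEFLATED) as zf:
            for f in sorted(folder.rglob("*")):
                if f.is_file():
                    # Store paths relative to root so the folder name is kept.
                    zf.write(f, f.relative_to(root).as_posix())
        archives.append(archive)
    return archives


# Demo with throwaway folders; ids and file contents are placeholders.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / "components"
    for comp_id in ("comp-a", "comp-b"):
        d = root / comp_id
        d.mkdir(parents=True)
        (d / "disk_front.jpg").write_bytes(b"placeholder")
        (d / "dummy.pdf").write_bytes(b"placeholder")
    archives = zip_component_folders(root, Path(tmp) / "zips")
    archive_names = [a.name for a in archives]
    with zipfile.ZipFile(archives[0]) as zf:
        first_members = sorted(zf.namelist())

print(archive_names, first_members)
```

Naming each archive after its folder keeps the link between a zip file and the matching component id in the EAD, which is the property the test submission relies on.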

Hull

| Collection title | Number of files/objects | Extent (mega/giga)bytes | EAD filename | Level of description of born-digital material |
|---|---|---|---|---|
| Stephen Gallagher | paper records (7.5m); 14,320 digital files; 39 Amstrad disks still to read | n/a; 13.6 GB; tbc | U DGA.xml - current (beta) structure of the collection ONLY | Currently working through the material, with detailed series descriptions (novel, screenplay, etc.) being created in CALM |
| Socialist Health Association | paper records (6.5m); 2558 digital files | n/a; 670 MB | U DSM.xml - paper based material ONLY | Preliminary cursory look only - scheduled to start this shortly; focus has been Stephen Gallagher due to the larger volume & complexity |
