Inventory of Hypatia Collections

Stanford

Stephen Jay Gould

The collection was re-processed due to a change in storage location and new ideas on relationships between files and EAD.

Processed files are currently stored in

\\sul-wallaby\ForensicsLab\01-OBJECT_POOL\M1437 Stephen Jay Gould\Gould 06_27_2011

and in Sul-Brick/sulguest/Stanford/Gould 06_27_2011

Directory Structure is as follows:

Computer Media Photo
EAD
FTK html
FTK xml
Logical Image
Transit Solution

The "FTK html" folder stores the report from AccessData FTK in HTML format.

The "FTK xml" folder stores the report from AccessData FTK in XML format.

"Logical Image" folder is used to store the logical images and the audit logs of disk imaging.

"Transit Solution" folder is used to store the html version of the original files created by Transit Solution.
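
As a quick sanity check on a copy of the collection, the layout listed above could be verified with a few lines of Python. This is only a sketch: the folder names are taken from the list and descriptions on this page, while the function name `missing_folders` and the idea of passing in a collection root are assumptions made for illustration.

```python
from pathlib import Path

# Folder names as listed in the directory structure above.
EXPECTED_FOLDERS = [
    "Computer Media Photo",
    "EAD",
    "FTK html",
    "FTK xml",
    "Logical Image",
    "Transit Solution",
]


def missing_folders(collection_root):
    """Return the expected folders that are absent under collection_root."""
    root = Path(collection_root)
    return [name for name in EXPECTED_FOLDERS if not (root / name).is_dir()]
```

Calling `missing_folders` on the folder that holds the processed files returns an empty list when every expected subfolder is present.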

Xanadu

The collection consists of 6 hard drives. A MARC record for the collection is available in SearchWorks; a very basic finding aid describes the contents of the collection.

Contents of the collection are currently stored on \\sul-wallaby\ForensicsLab\01-OBJECT_POOL\M1292 Xanadu

Xanadu EAD and Hypatia fixture objects

Directory Structure is as follows:

Disk Images
Photo Images of Drives
EAD

The Disk Images folder contains 3 forensic disk images from 3 physical hard drives. The forensic disk images are named CMxx.dd, with "CM" standing for computer media. This folder also contains two additional metadata files for each forensic disk image. The first is a .txt file that contains technical metadata about the forensic imaging process (example CM01.001\). The second is a .csv file that lists the partitions and files contained on the hard drive (example CM01.001\). This file also records the root path, creation dates, and whether each file was deleted on the media and subsequently recovered.

The Photo Images of Drives folder contains digital photographs of the source media (JPEG), in this case images of the front and back of the hard drives.

The EAD folder contains the Encoded Archival Description file for the Xanadu collection (example EAD\). This file currently does not contain any pointers to where the hard drives are physically located in the collection. We are also currently missing reference identifiers to the computer media in the finding aid. I believe this is just an oversight but I'm following up with Special Collections to determine why they are missing.

Yale

Collection title	Number of files/objects	Total Extent in (mega/giga)bytes	Extent to be transferred for development	EAD filename	Level of description of born-digital material
James Tobin papers	27 disk images + metadata (approx 80 files total)	36 MB	36 MB	mssa.ms.1746.bpg.xml	Disks are described individually within EAD as separate components
Henry Ashby Turner papers	~5-10	~200 MB	~80 MB	mssa.ms.1691.bpg.xml	Components represent individual digital objects within a specific subseries
Love Makes a Family records	TBC	~36 GB	TBC	mssa.ms.1962.bpg.xml	Only described at high-level aggregations
Pelli Clarke Pelli records	TBC	~6 GB	TBC	mssa.ms.1939.bpg.xml	Currently completely undescribed
New Haven Oral Histories	TBC	~101 GB	TBC	mssa.ru.1055.bpg.xml	Described as individual "interviews" - audio file + MS Word document
James Welch papers (Beinecke)	TBC	TBC	TBC	beinecke.welch.bpg.xml	TBC

Virginia

What I have to submit is some EAD for the Cheuse collection, and 4 zip files which match the id number of <c02> elements in the EAD. The zip files contain images of each disk and PDF files. I can't actually image the disks; I don't have the hardware yet. For the purposes of the tests, what I did was:

Took pictures of the floppies
Created a directory structure that matched the structure in the EAD and put the images of each disk in the appropriate folder
Added a dummy pdf to each folder
Zipped up each folder and ran it through Rubymatica, which:
1. Unzips
2. Creates some technical metadata within a METS.xml file
3. Rezips

So the .zip archives included .txt, .xml, .jpg, and .pdf files.
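
The unzip, describe, rezip round trip can be illustrated with a short Python sketch. This is not Rubymatica's code and does not produce a valid METS document: the function name `repackage`, the output file name `metadata.xml`, and the simplified XML layout (file name, size, SHA-256 checksum) are all assumptions chosen to show the shape of the workflow.

```python
import hashlib
import tempfile
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path


def repackage(zip_path, out_path):
    """Unzip an archive, add a simple metadata file, and rezip it.

    Returns the sorted list of entry names in the new archive.
    """
    with tempfile.TemporaryDirectory() as tmp:
        work = Path(tmp)

        # 1. Unzip the submitted archive into a scratch directory.
        with zipfile.ZipFile(zip_path) as zf:
            zf.extractall(work)

        # 2. Record basic technical metadata for every extracted file.
        #    (Hypothetical stand-in for the METS.xml step.)
        root = ET.Element("technicalMetadata")
        for p in sorted(q for q in work.rglob("*") if q.is_file()):
            ET.SubElement(
                root,
                "file",
                name=p.relative_to(work).as_posix(),
                size=str(p.stat().st_size),
                sha256=hashlib.sha256(p.read_bytes()).hexdigest(),
            )
        ET.ElementTree(root).write(
            str(work / "metadata.xml"), encoding="utf-8", xml_declaration=True
        )

        # 3. Rezip the original files together with the metadata file.
        with zipfile.ZipFile(out_path, "w", zipfile.ZIP_DEFLATED) as zf:
            for p in sorted(q for q in work.rglob("*") if q.is_file()):
                zf.write(p, p.relative_to(work).as_posix())

    with zipfile.ZipFile(out_path) as zf:
        return sorted(zf.namelist())
```

The output archive should be written outside the scratch directory, as here, so that it is not picked up by the rezip step itself.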

Hull

Files transferred via external hard drive/USB pen drive so no physical media to photograph

Collection title	Number of files/objects	Total Extent (mega/giga) bytes	Extent to be transferred for development	EAD filename	Level of description of born-digital material
Stephen Gallagher	paper records (7.5m) 14,320 digital files (excluding 39 Amstrad disks still to be read)	n/a 13.6 GB tbc	~200-300 MB	U DGA.xml - current (beta) structure of the collection ONLY	Currently working through the material, with detailed series descriptions - novel/screenplay etc being created in CALM
Socialist Health Association	paper records (6.5m) 2558 digital files	n/a 670MB	TBC	U DSM.xml - paper based material ONLY	Preliminary cursory look only - scheduled to start this shortly - focus has been Stephen Gallagher due to the larger volume & complexity
