Inventory of Hypatia Collections
Preparation of Collections for Hypatia
Collection Name / Institution | All Files on SUL-BRICK | Analysis | Prototype Fixture Objects | Hooks from item to file objects | Ingest Processor Outputs | Hypatia App | Collection Processed into | Collection Processed into | Hypatia App Has Data |
---|---|---|---|---|---|---|---|---|---|
Xanadu / Stanford
|
|
|
| | Stanford | Stanford | Stanford | Stanford | Stanford |
Gould / Stanford
| |
| | | Stanford | Stanford | Stanford | Stanford | Stanford |
Koch / Stanford
| |
| Stanford | Stanford | Stanford | Stanford | Stanford | Stanford | Stanford |
Creeley /Stanford
| |
| Stanford | Stanford | Stanford | Stanford | Stanford | Stanford | Stanford |
Gallagher / Hull
|
|
| Uva |
| Uva | Stanford |
|
|
|
Socialist Health / Hull
|
|
| Uva |
| Uva | Stanford |
|
|
|
Tobin / Yale
|
|
| |
| Uva | Stanford |
|
|
|
Turner / Yale
|
|
| Uva |
| Uva | Stanford |
|
|
|
Cheuse / UVa
| |
| Uva |
| Uva | Stanford |
|
|
|
General conversion and data mapping
Stanford
Collection Name | Estimated Size of Collection in Hypatia |
---|---|
M1437 Gould | 2.5 GB |
M1292 Xanadu | 5.0 GB |
M0662 Creeley | 3.0 GB |
M1584 Koch | 35 GB |
Stephen Jay Gould
The collection was re-processed due to a change in storage location and new ideas on relationships between files and EAD.
Stanford FTK to Hypatia object mapping
Processed files are currently stored in
...
Contents of the collection are currently stored on \\sul-wallaby\ForensicsLab\01-OBJECT_POOL\M1292 Xanadu
Xanadu EAD and Hypatia fixture objects
Directory Structure is as follows:
- Disk Images
- Computer Media Photo Images of Drives
- EAD
The Disk Images folder contains 3 forensic disk images from 3 physical hard drives. The forensic disk images are named CMxx.dd with the "CM" standing for computer media. This folder also contains two additional metadata files for each forensic disk image. The first is a .txt file that contains technical metadata about the forensic imaging process (example CM01.001\). The second is a .csv file that lists the partitions and files contained on the hard drive (example CM01.001\). This file also contains the root path, creation dates, and whether the file was deleted on the media and subsequentially recovered.
...
- Assets loaded on sul-brick; in directory /home/sulguest3/Yale/mssa.ms.1691 - there are only 2 files.
- Each file asset is associated with a specific component; in other words, only two components have assets associated with them. The assets are a Microsoft Access database and a FileMaker Pro database.
- The components that have an asset associated with them contain a dao element. This element's xlink:href attribute is a file URI that points to the location on sul-brick (this is a hack, but it should be sufficient)
Virginia
What I have to submit is some EAD for the Cheuse collection, and 4 zip files which match the id number of <co2> elements in the EAD. The zip files contain images of each disk and pdf files. I can't actually image the disks...I don't have the hardware yet. For the purposes of the tests, what I did was:
- Took pictures of the floppies
- Created a directory structure that matched the structure in the EAD and put the images of each disk in the appropriate folder
- Added a dummy pdf to each folder
- Zipped up each folder and ran it through Rubymatica which:
- unzips
- Creates some technical metadata within a METS.xml file
- Rezips
...
Summary
Collection title | Number of files/objects | Total Extent in (mega/giga)bytes | Extent to be transferred for development | EAD filename | Level of description of born-digital material |
---|---|---|---|---|---|
Alan Cheuse papers | EAD + FTK output (metadata, plus approx 1,400 files) | approx 55 MB | approx 55 MB | uva10726.xml | disk images were processed using FTK. Labels assigned to FTK objects correspond with values in <unitid> tags. those <unitid>s are listed below. |
unitids:
- e002001
- e002002
- e002003
- e002004
- e002005
- e002006
- e002007
- e002007b
- e007
- e0100 – e0144
- EXCEPT e0136…this disk is unreadable, no FTK content
- e0557-- e0557t
- EXCEPT e0557r…the disk is unreadable
- e0422 – e0429
- EXCEPT e0421, e0421a and e0423…unreadable disks
Hull
Files transferred via external hard drive/USB pen drive so no physical media to photograph
Collection title | Number of files/objects | Total Extent (mega/giga) | Extent to be transferred for development | EAD filename | Level of description of born-digital material |
---|---|---|---|---|---|
Stephen Gallagher | paper records (7.5m) | n/a | ~200 -300 MB MB | U DGA.xml | Currently working through the material, with detailed series descriptions |
Socialist Health | paper records (6.5m) | n/a | TBC | U DSM.xml | Preliminary cursory look only - scheduled to start this shortly |