The Stanford directory output for the Gould collection includes a locally transformed version of the FTK output (Report_transformed.xml):
- M1437 Gould
  - Computer Media Photo
  - EAD
  - FTK xml
    - files
    - Report.fo
    - Report.xml
    - Report_transformed.xml
  - Disk Image
Sample of transformed output:
<?xml version="1.0" encoding="UTF-8"?>
<ftk_report xmlns:fo="http://www.w3.org/1999/XSL/Format">
<Series>Series 6</Series>
<file>
<filename>BU3A5</filename>
<Item_Number>1004</Item_Number>
<filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath>
<filesize>35654</filesize>
<file_creation_date>n/a</file_creation_date>
<file_accessed_date>n/a</file_accessed_date>
<file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date>
<MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash>
<Restricted>False</Restricted>
<Label>[Medium] 5.25 inch Floppy Disks</Label>
<Label>[Type] Books</Label>
<Label>[Title]The Burgess Shale and the Nature of History</Label>
<filetype>WordPerfect 4.2</filetype>
<Duplicate_File> </Duplicate_File>
<Export_path>files\BU3A5.wp</Export_path>
</file>
<file>
<filename>BUR3-1</filename>
<Item_Number>3005</Item_Number>
<filepath>CM005.001/NONAME [FAT12]/[root]/BUR3-1</filepath>
<filesize>92745</filesize>
<file_creation_date>n/a</file_creation_date>
<file_accessed_date>n/a</file_accessed_date>
<file_modified_date>12/8/1988 6:35:06 AM (1988-12-08 14:35:06 UTC)</file_modified_date>
<MD5_Hash>D3EB7E35856697B8F697193DB2CB4D83</MD5_Hash>
<Restricted>False</Restricted>
<Label>[Medium] 5.25 inch Floppy Disks</Label>
<Label>[Type] Books</Label>
<Label>[Title]The Burgess Shale and the Nature of History</Label>
<filetype>WordPerfect 4.2</filetype>
<Duplicate_File> </Duplicate_File>
<Export_path>files\BUR3-1.wp</Export_path>
</file>
Each <file> segment will be the basis of a separate Hypatia Digital Object. This is an example where the atomistic model adds overhead (separate metadata and content objects); an integrated object combining commonMetadata and genericContent could be considered instead.
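As a rough illustration of how each <file> segment could become one record per prospective object, the sketch below reads a shortened copy of the sample with Python's standard xml.etree.ElementTree module. The function name file_records and the dictionary layout are assumptions made for this example; they are not part of FTK or Hypatia.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the sample above (XML declaration and some fields omitted).
SAMPLE = r"""<ftk_report xmlns:fo="http://www.w3.org/1999/XSL/Format">
<Series>Series 6</Series>
<file>
<filename>BU3A5</filename>
<Item_Number>1004</Item_Number>
<filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath>
<filesize>35654</filesize>
<MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash>
<Label>[Medium] 5.25 inch Floppy Disks</Label>
<Label>[Type] Books</Label>
<Duplicate_File> </Duplicate_File>
<Export_path>files\BU3A5.wp</Export_path>
</file>
<file>
<filename>BUR3-1</filename>
<Item_Number>3005</Item_Number>
<filepath>CM005.001/NONAME [FAT12]/[root]/BUR3-1</filepath>
<Export_path>files\BUR3-1.wp</Export_path>
</file>
</ftk_report>"""


def file_records(root):
    """Return one dict per <file> element (hypothetical helper).

    Repeated <Label> elements are collected in a list; every other
    child element is stored under its tag name with whitespace stripped.
    """
    records = []
    for file_el in root.findall("file"):
        record = {"Label": []}
        for child in file_el:
            text = (child.text or "").strip()
            if child.tag == "Label":
                record["Label"].append(text)
            else:
                record[child.tag] = text
        records.append(record)
    return records


records = file_records(ET.fromstring(SAMPLE))
for record in records:
    print(record["Item_Number"], record["filename"], record["Export_path"])
```

Keeping the repeated <Label> values in a list avoids one label overwriting another, and stripping whitespace means a blank <Duplicate_File> comes back as an empty string.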
Information from: Disk Image // CMnnn.001.txt | maps to | notes |
---|---|---|
<collection_title>Stephen J. Gould Papers | Collection object | Equivalent to EAD <archdesc><title> |
<series>Series 6: Born Digital Materials | Series set object | Equivalent to EAD series <c><unittitle> |
<note>5.25 inch Floppy Disks</note> | Series object | Corresponds to EAD <physdesc> |
<callnumber>M1437</callnumber> | Collection object | Corresponds to EAD <archdesc><did><unitid> |

Information from: FTK xml // Report_transformed.xml | maps to (all within item objects) | notes |
---|---|---|
<filename>BU3A5</filename> | n/a | This is just the file name without the file extension, if any. <export_path> (below) has the full name of the file. |
<Item_Number>1004</Item_Number> | n/a | Internal reference only, to disambiguate references within the FTK report |
<filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath> | | original file in FTK xml // files |
<disk_image_no>CM006</disk_image_no> | descMetadata | This token, taken from the head of the <filepath>, is the only data link between the FTK output for a file object and the corresponding media object. We want a data link in descriptive metadata as well as an RDF link to the corresponding object. |
<filesize>35654</filesize> | | Could the conversion compare this against the locally computed file size, as a quick check prior to checksum validation? |
<filesize_unit>B</filesize_unit> | | Needed to correctly interpret <filesize>, if used |
<file_creation_date>n/a</file_creation_date> | note? | |
<file_accessed_date>n/a</file_accessed_date> | note? | |
<file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date> | note? | |
<MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash> | | Used for checksum validation of a file during processing. This value will eventually be part of contentMetadata. |
<Restricted>False</Restricted> | | True = visible to staff only, not discoverable; Hypatia only |
<label name="Medium">5.25 inch Floppy Disks</label> | | Part of <location> (1) |
<label name="Type">Books</label> | | tag |
<label name="Title">The Burgess Shale and the Nature of History</label> | descMetadata | |
<filetype>WordPerfect 4.2</filetype> | note? | |
<Duplicate_File> </Duplicate_File> | | Blank, null value, or empty string = original file, not a duplicate |
<export_path>files\BU3A5.wp</export_path> | | Take the fully qualified object filename from the name shown after "files\" |
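The file size and MD5 checks suggested in the table could look roughly like the following sketch. It assumes that <filesize> is expressed in bytes and that exported files sit in a local directory; the helper names object_filename and check_file are hypothetical, and the demonstration uses stand-in bytes rather than the real BU3A5.wp.

```python
import hashlib
import ntpath
import os
import tempfile


def object_filename(export_path):
    """Return the name shown after 'files\\' in an export path value.

    ntpath splits on backslashes regardless of the platform running the code.
    """
    return ntpath.basename(export_path)


def check_file(path, expected_size, expected_md5):
    """Compare the size first (cheap), then the MD5 digest (reads the whole file)."""
    if os.path.getsize(path) != int(expected_size):
        return "size mismatch"
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    # FTK reports the hash in upper case, so compare case-insensitively.
    if digest.hexdigest().upper() != expected_md5.strip().upper():
        return "checksum mismatch"
    return "ok"


# Demonstration with stand-in content, not the real exported file.
content = b"stand-in bytes, not the real BU3A5.wp"
with tempfile.TemporaryDirectory() as workdir:
    path = os.path.join(workdir, object_filename(r"files\BU3A5.wp"))
    with open(path, "wb") as handle:
        handle.write(content)
    good_md5 = hashlib.md5(content).hexdigest().upper()
    result_ok = check_file(path, len(content), good_md5)
    result_bad_size = check_file(path, len(content) + 1, good_md5)
    result_bad_md5 = check_file(path, len(content), "0" * 32)

print(result_ok, result_bad_size, result_bad_md5)
```

Checking the size before hashing lets an obviously truncated or padded file fail without reading its contents, which is the quick check the <filesize> note asks about.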
(1) Location/container information
<location>Stephen J. Gould Papers - Series 6: Born Digital Materials - CM006 (5.25 inch Floppy Disks)
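One way the location string might be assembled from the fields above is sketched here. It assumes the disk image token is the first path segment of <filepath> with its ".001" suffix removed (inferred from CM006.001 and CM006 in the table), and that labels arrive in the "[Name] value" form seen in the sample. The helper names are hypothetical.

```python
import re

# Matches "[Medium] 5.25 inch Floppy Disks" and also "[Title]The Burgess ..."
# (the space after the closing bracket is optional in the sample data).
LABEL_PATTERN = re.compile(r"^\[(?P<name>[^\]]+)\]\s*(?P<value>.*)$")


def parse_label(text):
    """Split a '[Name] value' label into a (name, value) pair."""
    match = LABEL_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised label: {text!r}")
    return match.group("name"), match.group("value")


def disk_image_no(filepath):
    """Return the token at the head of the file path, e.g. 'CM006'."""
    head = filepath.split("/", 1)[0]  # "CM006.001"
    return head.split(".", 1)[0]      # "CM006" (assumed: drop the image suffix)


def build_location(collection_title, series, filepath, labels):
    """Join collection, series, disk image token and medium into one string."""
    named = dict(parse_label(label) for label in labels)
    return (
        f"{collection_title} - {series} - "
        f"{disk_image_no(filepath)} ({named['Medium']})"
    )


location_string = build_location(
    "Stephen J. Gould Papers",
    "Series 6: Born Digital Materials",
    "CM006.001/NONAME [FAT12]/[root]/BU3A5",
    [
        "[Medium] 5.25 inch Floppy Disks",
        "[Type] Books",
        "[Title]The Burgess Shale and the Nature of History",
    ],
)
print(location_string)
```

For the BU3A5 sample this reproduces the text of the location example shown above, without the surrounding tag.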