You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Stanford directory output for Gould collection contains a locally transformed version of the FTK output:

  • M1437 Gould
    • Computer Media Photo
    • EAD
    • FTK xml
      • files
      • Report.fo
      • Report.xml
      • Report_transformed.xml
      • Disk Image

Sample of transformed output:

<?xml version="1.0" encoding="UTF-8"?>
<ftk_report xmlns:fo="http://www.w3.org/1999/XSL/Format">
    <Series>Series 6</Series>
    <file>
        <filename>BU3A5</filename>
        <Item_Number>1004</Item_Number>
        <filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath>
        <filesize>35654</filesize>
        <file_creation_date>n/a</file_creation_date>
        <file_accessed_date>n/a</file_accessed_date>
        <file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date>
        <MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash>
        <Restricted>False</Restricted>
        <Label>[Medium] 5.25 inch Floppy Disks</Label>
        <Label>[Type] Books</Label>
        <Label>[Title]The Burgess Shale and the Nature of History</Label>
        <filetype>WordPerfect 4.2</filetype>
        <Duplicate_File> </Duplicate_File>
        <Export_path>files\BU3A5.wp</Export_path>
    </file>
    <file>
        <filename>BUR3-1</filename>
        <Item_Number>3005</Item_Number>
        <filepath>CM005.001/NONAME [FAT12]/[root]/BUR3-1</filepath>
        <filesize>92745</filesize>
        <file_creation_date>n/a</file_creation_date>
        <file_accessed_date>n/a</file_accessed_date>
        <file_modified_date>12/8/1988 6:35:06 AM (1988-12-08 14:35:06 UTC)</file_modified_date>
        <MD5_Hash>D3EB7E35856697B8F697193DB2CB4D83</MD5_Hash>
        <Restricted>False</Restricted>
        <Label>[Medium] 5.25 inch Floppy Disks</Label>
        <Label>[Type] Books</Label>
        <Label>[Title]The Burgess Shale and the Nature of History</Label>
        <filetype>WordPerfect 4.2</filetype>
        <Duplicate_File> </Duplicate_File>
        <Export_path>files\BUR3-1.wp</Export_path>
    </file>

Each <file> segment will be a the basis of a separate Hypatia Digital Object.  This is an example where the atomistic model adds overhead (separate metadata and content objects) and an integrated object combining commonMetadata and genericContent could be considered.

Information from: Disk Image // CMnnn.001.txt

maps to

notes

<collection_title>Stephen J. Gould Papers

Collection object
   descMetadata
      <mods:title>
Series & item objects
   descMetadata
      <mods:location> (1)

Equivalent to EAD <archdesc><title>

<series>Series 6: Born Digital Materials

Series set object
   descMetadata
      <mods:title>
Item objects
   descMetadata
      <mods:location> (1)

Equivalent to EAD series <c><unittitle>

<note>5.25 inch Floppy Disks</note>

Series object
   descmetadata
      <mods:physicalDescription>
          <mods:extent>

Corresponds to EAD <physdesc>
Value will be the same as "Medium" at the file level.

<callnumber>M1437</callnumber>

Collection object
   descMetadata
      <mods:identifier type="unitid" displayLabel="Call Number:">

Corresponds to EAD <archdesc><did><unitid>


Information from: FTK xml // Report_transformed.xml

maps to (all within item objects)

notes

<filename>BU3A5</filename>

n/a

this is just the file name prefix, minus the file extension, if any. <export_path> (below) has the full name of the file.

<Item_Number>1004</Item_Number>

n/a

internal reference only, to disambiguate reference in the FTK report

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="0b9f3892-902b-4b50-b010-843285929c14"><ac:plain-text-body><![CDATA[

<filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath>

 

original file in FTK xml // files
]]></ac:plain-text-body></ac:structured-macro>
display derivatives in Display Derivatives named using <item_number>

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="d2920bc0-7f5a-4d0f-acef-f369d2576308"><ac:plain-text-body><![CDATA[take object filepath for fully qualified object filename from portion after [root], up to but not including the final filename token

]]></ac:plain-text-body></ac:structured-macro>

<disk_image_no>CM006</disk_image_no>

descMetadata
   <mods:location> (1)

This token, taken from the head of the <filepath>, is the only data link between the FTK output for a file object and the corresponding media object. We want a data link in descriptive metadata as well as an RDF link to the corresponding object.

<filesize>35654</filesize>

 

Could be used by conversion to compare against the file size as computed locally, a quick check prior to checksum validation?

<filesize_unit>B</filesize_unit>

 

Needed to correctly interpret <filesize>, if used

<file_creation_date>n/a</file_creation_date>

note?

 

<file_accessed_date>n/a</file_accessed_date>

note?

 

<file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date>

note?

 

<MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash>

 

Used for checksum validation of a file during processing. This value will eventually be part of contentMetadata

<restricted>False</Restricted>

 

true=visible staff only, not discoverable .... Hypatia only

<label name="Medium">5.25 inch Floppy Disks</label>

 

Part of <location> (1)

<label name="Type">Books</label>

 

tag

<label name="Title">The Burgess Shale and the Nature of History</label>

descMetadata
   <mods:title>

 

<filetype>WordPerfect 4.2</filetype>

note?

 

<Duplicate_File> </Duplicate_File>

 

* blank, null value or empty string - original file, not a duplicate
* "Primary" - possibly indicates Primary file to keep/store
* "Secondary" - indicates a duplicate file to be ignored
--> ignore for now

<export_path>files\BU3A5.wp</export_path>

 

take filename for fully qualified object filename from name shown after "files\"

(1) Location/container information

<location>Stephen J. Gould Papers - Series 6: Born Digital Materials - CM006 (5.25 inch Floppy Disks)

  • No labels