The Stanford directory output for the Gould collection contains a locally transformed version of the FTK output:

* M1437 Gould
** Computer Media Photo
*** CM001.jpg
*** (etc)
** Disk Image
*** CM001.001
*** CM001.001.csv
*** CM001.001.txt
*** (etc)
** Display Derivatives
*** {filename}.htm
** EAD
** FTK xml
*** files
**** {filename}.extension
*** Report.fo
*** Report.xml
*** Report_transformed.xml


Sample of transformed output:

<?xml version="1.0" encoding="UTF-8"?>


Sample of the starting lines of the .txt file describing the media object.


{panel}
Created By AccessData® FTK® Imager 3.0.1.1467 110406

Case Information:
Acquired using: ADI3.0.1.1467
Case Number: M1437
Evidence Number: CM004
Unique Description:
Examiner: Peter Chan
Notes: 5.25 inch Floppy Disk
{panel}
Sample of transformed FTK file available as input:

{panel}
<ftk_report xmlns:fo="http://www.w3.org/1999/XSL/Format">
&nbsp;&nbsp;&nbsp; <series>Series 6: Born Digital Materials</series>
&nbsp;&nbsp;&nbsp; <collection_title>Stephen Jay Gould papers</collection_title>
&nbsp;&nbsp;&nbsp; <callnumber>M1437</callnumber>
&nbsp;&nbsp;&nbsp; <file>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <filename>BU3A5</filename>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <item_number>1004</item_number>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <filepath>CM006.001/NONAME \[FAT12\]/\[root\]/BU3A5</filepath>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <disk_image_no>CM006</disk_image_no>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <filesize>35654</filesize>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <filesize_unit>B</filesize_unit>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <file_creation_date>n/a</file_creation_date>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <file_accessed_date>n/a</file_accessed_date>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <md5_hash>976EDB782AE48FE0A84761BB608B1880</md5_hash>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <restricted>False</restricted>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <access_rights>Public</access_rights>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <medium>5.25 inch Floppy Disks</medium>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <title>The Burgess Shale and the Nature of History </title>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <filetype>WordPerfect 4.2</filetype>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <duplicate_File> </duplicate_File>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <export_path>files\BU3A5</export_path>
&nbsp;&nbsp;&nbsp; </file>

{panel}
Each <file> segment will be the basis of a separate Hypatia Digital Object.

This is an example where the atomistic model adds overhead (separate metadata and content objects); an integrated object combining commonMetadata and genericContent could be considered instead.
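The per-<file> split described above can be sketched as follows. This is only an illustration using Python's standard `xml.etree.ElementTree`; the function name `read_file_records`, the dict layout, and the abbreviated `SAMPLE` string are assumptions made for the example, not part of the Hypatia code.

```python
import xml.etree.ElementTree as ET

# Abbreviated, hypothetical copy of the transformed report shown above.
SAMPLE = """<ftk_report xmlns:fo="http://www.w3.org/1999/XSL/Format">
  <series>Series 6: Born Digital Materials</series>
  <collection_title>Stephen Jay Gould papers</collection_title>
  <callnumber>M1437</callnumber>
  <file>
    <filename>BU3A5</filename>
    <item_number>1004</item_number>
    <filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath>
    <filesize>35654</filesize>
  </file>
</ftk_report>"""


def read_file_records(xml_text):
    """Return (collection, records): collection-level values and one dict per <file>."""
    root = ET.fromstring(xml_text)
    # Top-level elements other than <file> describe the collection as a whole.
    collection = {child.tag: (child.text or "").strip()
                  for child in root if child.tag != "file"}
    # Each <file> element becomes one record; a repeated child tag would
    # overwrite the earlier value in this simple dict layout.
    records = [{child.tag: (child.text or "").strip() for child in item}
               for item in root.findall("file")]
    return collection, records


collection, records = read_file_records(SAMPLE)
for record in records:
    print(record["item_number"], record["filename"])
```

Each record could then feed the creation of one digital object, with the collection-level values shared across all of them.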


|| Information from: Disk Image // CMnnn.001.txt || maps to || notes ||
| <collection_title>Stephen J. Gould Papers | Collection object \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:title> \\
Series & item objects \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:location> (1) | Equivalent to EAD <archdesc><title> |
| <series>Series 6: Born Digital Materials | Series set object \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:title> \\
Item objects \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:location> (1) | Equivalent to EAD series <c><unittitle> |
| <note>5.25 inch Floppy Disks</note> | Series object \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:physicalDescription> \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:extent> | Corresponds to EAD <physdesc> \\
Value will be the same as "Medium" at the file level. |
| <callnumber>M1437</callnumber> | Collection object \\
&nbsp;&nbsp; descMetadata \\
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <mods:identifier type="unitid" displayLabel="Call Number:"> | Corresponds to EAD <archdesc><did><unitid> |

|| Information from: FTK xml // Report_transformed.xml || maps to (all within item objects) || notes ||
| <filename>BU3A5</filename> | n/a | This is just the file name prefix, minus the file extension, if any. <export_path> (below) has the full name of the file. |
| <Item_Number>1004</Item_Number> | n/a | Internal reference only, to disambiguate references in the FTK report. |
| <filepath>CM006.001/NONAME \[FAT12\]/\[root\]/BU3A5</filepath> | | The original file is in FTK xml // files. \\
Display derivatives are in Display Derivatives, named using <item_number>. \\
\\
Take the object filepath for the fully qualified object filename from the portion after \[root\], up to but not including the final filename token. |
| <disk_image_no>CM006</disk_image_no> | descMetadata \\
&nbsp;&nbsp; <mods:location> (1) | This token, taken from the head of the <filepath>, is the only data link between the FTK output for a file object and the corresponding media object. We want a data link in descriptive metadata as well as an RDF link to the corresponding object. |
| <filesize>35654</filesize> | | Could be used by conversion to compare against the file size as computed locally, as a quick check prior to checksum validation? |
| <filesize_unit>B</filesize_unit> | | Needed to correctly interpret <filesize>, if used. |
| <file_creation_date>n/a</file_creation_date> | note? | |
| <file_accessed_date>n/a</file_accessed_date> | note? | |
| <file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date> | note? | |
| <MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash> | | Used for checksum validation of a file during processing. This value will eventually be part of contentMetadata. |
| <restricted>False</restricted> | | true = visible to staff only, not discoverable .... Hypatia only |
| <label name="Medium">5.25 inch Floppy Disks</label> | | Part of <location> (1) |
| <label name="Type">Books</label> | | tag |
| <label name="Title">The Burgess Shale and the Nature of History</label> | descMetadata \\
&nbsp;&nbsp; <mods:title> | |
| <filetype>WordPerfect 4.2</filetype> | note? | |
| <Duplicate_File> </Duplicate_File> | | \* blank, null value or empty string - original file, not a duplicate \\
\* "Primary" - possibly indicates Primary file to keep/store \\
\* "Secondary" - indicates a duplicate file to be ignored \\
\--> ignore for now |
| <export_path>files\BU3A5.wp</export_path> | | Take the filename for the fully qualified object filename from the name shown after "files\". This is the file as available for the DOR object; note that it may have a file extension added by FTK. |
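The <filepath>, <export_path>, <filesize> and MD5 notes in the table above can be sketched as follows. This is a hedged illustration only: the helper names `split_filepath`, `object_filename` and `verify_file` are hypothetical, the sketch assumes that `[root]` always appears in the path, that the disk image number is the part of the head token before the first dot, and that <filesize> is given in bytes.

```python
import hashlib
import os


def split_filepath(filepath):
    """Split an FTK <filepath> value into (disk_image_no, object_dir)."""
    tokens = filepath.split("/")
    # Head token looks like "CM006.001"; assume the number precedes the dot.
    disk_image_no = tokens[0].split(".")[0]
    # Directory portion: tokens after "[root]", excluding the final filename
    # token. tokens.index raises ValueError if "[root]" is missing.
    root_at = tokens.index("[root]")
    object_dir = "/".join(tokens[root_at + 1:-1])
    return disk_image_no, object_dir


def object_filename(filepath, export_path):
    """Join the directory from <filepath> with the name that follows the files prefix."""
    _, object_dir = split_filepath(filepath)
    # Everything after "files\" in <export_path>; empty if the prefix is absent.
    name = export_path.partition("files\\")[2]
    return f"{object_dir}/{name}" if object_dir else name


def verify_file(path, expected_size, expected_md5):
    """Cheap size comparison first, then the MD5 checksum."""
    if os.path.getsize(path) != int(expected_size):
        return False
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest().upper() == expected_md5.strip().upper()


print(split_filepath("CM006.001/NONAME [FAT12]/[root]/BU3A5"))
print(object_filename("CM006.001/NONAME [FAT12]/[root]/BU3A5", "files\\BU3A5.wp"))
```

Checking the size before hashing keeps the common failure case (a truncated or wrong file) cheap, which is the "quick check" the <filesize> note suggests.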
(1) Location/container information \-\- for every file object created, create a <mods:location> description that places the resource in the context of the collection by combining the collection name, the intermediate series/group/etc. name(s), and the ID and description of the media on which the file resides, e.g.,


&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <location>Stephen J. Gould Papers - Series 6: Born Digital Materials - CM006 (5.25 inch Floppy Disks)</location>
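The location string in note (1) could be assembled along these lines. The function name `build_location` and its parameters are hypothetical, and the " - " separator and parenthesised medium are simply taken from the example above.

```python
def build_location(collection_title, group_names, disk_image_no, medium):
    """Combine collection, intermediate group names and media ID + description."""
    # group_names is a list so that any number of intermediate
    # series/group levels can sit between the collection and the media.
    parts = [collection_title, *group_names, f"{disk_image_no} ({medium})"]
    return " - ".join(parts)


location = build_location(
    "Stephen J. Gould Papers",
    ["Series 6: Born Digital Materials"],
    "CM006",
    "5.25 inch Floppy Disks",
)
print(location)
```

With the values shown, this reproduces the example string given in note (1).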