...
Section | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
|
Each <file> segment will be a the basis of a separate DOR /Hypatia Digital Object.or Hypatia Digital Object. The differences are:
- Stanford DOR (Digital Object Registry) objects are metadata-only, with content externally managed. Objects at Stanford will not have "content" datastreams.
- Stanford objects have an identityMetadata datastream that may or may not be present in Hypatia demo objects. Regardless, it is not a standard part of Hydra-compliant objects.
Collection and Series objects
...
File objects are the node objects representing individual files. The atomistic model has would have these objects constructed as a parent (metadata) object and a child (content) object. Do we want to consider an integrated object combining For simplicity, we will create these File objects as a single object, combining the Hydra commonMetadata and genericContent models instead?.
Sample of transformed FTK file available as input:
...
Information from: FTK xml // Report_transformed.xml | maps to (within item objects) | notes | ||
---|---|---|---|---|
<filename>BU3A5</filename> | n/a | this is the original file name as it appeared on the original media. | ||
<Item_Number>1004</Item_Number> | n/a | internal FTK reference only, to disambiguate references in the FTK report | ||
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="4a249227256ac3a2-d73ba442-429a4a16-8db29f48-a5d00aba13b57c3bb72ccdad"><ac:plain-text-body><![CDATA[ | <filepath>CM006.001/NONAME [FAT12]/[root]/BU3A5</filepath> |
| location of file on original media | ]]></ac:plain-text-body></ac:structured-macro> |
<disk_image_no>CM006</disk_image_no> | descMetadata | This token, taken from the head of the <filepath>, is the only data link between the FTK output for a file object and the corresponding media object. We want a data link in descriptive metadata as well as an RDF link to the corresponding object. | ||
<filesize>35654</filesize> |
| Could be used by conversion to compare against the file size as computed locally, a quick check prior to checksum validation? | ||
<filesize_unit>B</filesize_unit> |
| Needed to correctly interpret <filesize>, if used | ||
<file_creation_date>n/a</file_creation_date> | note? |
| ||
<file_accessed_date>n/a</file_accessed_date> | note? |
| ||
<file_modified_date>12/8/1988 6:48:48 AM (1988-12-08 14:48:48 UTC)</file_modified_date> | note? |
| ||
<MD5_Hash>976EDB782AE48FE0A84761BB608B1880</MD5_Hash> |
| Used for checksum validation of a file during processing. This value will eventually be part of contentMetadata, but probably not as a value transferred from here. | ||
<restricted>False</Restricted> |
| true=visible staff only, not discoverable .... Hypatia only | ||
<type>Books</type> | descMetadata | <typeOfResource>? <topic? or <genre>? authority? | ||
<title>The Burgess Shale and the Nature of History</title> | descMetadata |
| ||
<filetype>WordPerfect 4.2</filetype> | descMetadata |
| ||
<Duplicate_File> </Duplicate_File> |
| * blank, null value or empty string - file is unique in collection, no duplicates | ||
<export_path>files\BU3A5.wp</export_path> |
| The file as saved by FTK for further processing. | ||
(implied) | RELS-EXT | A link to the Media object |
...