Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Site

Collection

EAD structure / location of born digital materials [type of hydra object]

notes

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="a5460e16fdcecfbb-90ae38b2-43574fcb-92d9b67f-f60ce6c165f6310601995505"><ac:plain-text-body><![CDATA[

Hull

Gallagher

collection [set]
]]></ac:plain-text-body></ac:structured-macro>

 

Hull

Socialist Health Assoc.

 

 

Stanford

Xanadu

collection

Wiki Markup
*\[set\]*

- <c> file (10 of 10) -- unittitle "Born Digital"
Wiki Markup
*\[set\]*

- - <c> item -- unitid: CM01
Wiki Markup
*\[item\]*

- - <c> item -- unitid: CM02
Wiki Markup
*\[item\]*

- - <c> item -- unitid: CM03
Wiki Markup
*\[item\]*

1. Target "born digital" sub-level identified by <unittitle>
2. Collection only described to the container level (hard drives).
3. EAD "item" level  corresponds to target Fedora "item".
4. Item <unitid> is used as a filename stem to bind content files to Hypatia objects.

Stanford

Gould

collection -- unittitle "Stephen Jay Gould papers" unitid: M1437

Wiki Markup
*\[set\]*

- <c> series -- unittitle "Series 6: Born Digital Materials"
Wiki Markup
*\[set\]*

EAD only goes down to the single "Born digital" series description, with no details expressed at lower levels. A rationalized directory structure and FTK output are intended to support a direct translation into Hypatia objects for both unprocessed and processed views without an intermediary EAD.

Virginia

Cheuse

 

 

Yale

Conn. Oral Histories

 

 

Yale

Love Makes a Family

 

 

Yale

Pelli

 

 

Yale

Tobin

collection

Wiki Markup
*\[set\]*

- <c01> series (3 of 7) -- unittitle "Accession 2004-M-088"
Wiki Markup
*\[set\]*

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="cd47245529243a02-2ea8c070-46584137-a497983d-50f14a1c079874d9698f4810"><ac:plain-text-body><![CDATA[- - <c02> file (28 of 29) -- unittitle "Computer diskettes [3.5 inch]"
Wiki Markup
*\[set\]*

]]></ac:plain-text-body></ac:structured-macro>
- - - <c03 file -- unitid: 2004-M-088.0001
Wiki Markup
*\[item\]*

           :
- - - <c03 file -- unitid: 2004-M-088.0027"
Wiki Markup
*\[item\]*

1. Target sub-level identified by <unittitle>
2. Collection only described to the container level (diskettes).
3. EAD "item" level  corresponds to target Fedora "item".
4. Item <unitid> is used as a filename stem to bind content files to Hypatia objects.

Yale

Turner

 

 

Yale

Welch

 

 

...

In general, "item" level entries in an EAD will map to individual Hypatia objects. The "file" level" may also be considered an item node for a specific EAD. Unless otherwise indicated for a specific conversion, other levels will translate to set objects in Hypatia, even if they are empty sets because the EAD had no item-level description.

EAD-to-MODS - general information

There is a scarcity, dare I say dearth, of tools available to do this mapping or to offer an existing implemented conversion.  So the mappings here are based on data encountered in the Hypatia sample EADs and can be augmented as we go along. They are informed by two sources:

Assumption: Working assumption is that all descriptive metadata, for collections, intermediate sets (levels) and digital objects will be MODS.

Issue: the mapping from EAD to MODS is not perfect and is not fully reversible:

Stanford FTK-backed Digital Object creation

Stanford uses the Forensic Toolkit (FTK) software to analyze and characterize the contents of computer media. Starting with the Gould collection, we will only provide a single series node in the EAD to represent "Born Digital Materials". Conversion routines will be able to auto-generate objects representing the unprocessed collection (the media artifacts themselves, e.g., hard drives and floppy discs) as well as detailed file content objects from a modified form of the FTK output.

EAD-to-MODS - general information

There is a scarcity, dare I say dearth, of tools available to do this mapping or to offer an existing implemented conversion.  So the mappings here are based on data encountered in the Hypatia sample EADs and can be augmented as we go along. They are informed by two sources:

Assumption: Working assumption is that all descriptive metadata, for collections, intermediate sets (levels) and digital objects will be MODS.

Issue: the mapping from EAD to MODS is not perfect and is not fully reversible:

  • The original EAD header, while preserved along with the original EAD, is not The original EAD header, while preserved along with the original EAD, is not part of the makeup of the resulting Digital Objects and would not change.
  • While no metadata information should be lost, it may not be mappable to original form of expression if multiple styles or patterns are reconciled into one, e.g., implicit vs explicit labels based on <head> elements.
  • There are artifacts of EAD markup, such as qualifying element attributes, that have no corresponding place in MODS and will not be brought over unless further accommodation is made.

...

Panel

<repository encodinganalog="3.1.2">Hull University Archives</repository>

<unitid encodinganalog="3.1.1" label="Reference" countrycode="GB" repositorycode="50">U DGA/1/2/5/a</unitid>

<unitid label="Call Number:" countrycode="US" repositorycode="US-CtY">MS 1746</unitid>

<physdesc encodinganalog="3.1.5" label="Extent">

<accessrestrict id="ref5"> ...
<userestrict id="ref4"> ...
<prefercite id="ref6"> ...

<origination label="creator">
     <persname rules="aacr" source="ingest">Gould, Stephen Jay</persname>
</origination>

<persname rules="aacr" source="naf">Shearer, Rhonda Roland, 1954- </persname>

<unitdate normal="1951/1996" type="inclusive" calendar="gregorian" era="ce">1951-1996</unitdate>

<note type="bpg">
     <p>This encoded finding aid is compliant with the Yale EAD Best Practice Guidelines, Version 1.0.</p>
</note>

Conversion rule: Attributes not specifically targeted for conversion will be ignored/lost.

Issue: retaining ref and level information , do these map to appropriate container descriptions?

Issue: "otherlevel" levels -- <c level="otherlevel" otherlevel="SubSeries"> (Hull)

Issue: Stanford <container> conventions

 <c id="ref432" level="file">

...

CtY">MS 1746</unitid>

<physdesc encodinganalog="3.1.5" label="Extent">

<accessrestrict id="ref5"> ...
<userestrict id="ref4"> ...
<prefercite id="ref6"> ...

<origination label="creator">
     <persname rules="aacr" source="ingest">Gould, Stephen Jay</persname>
</origination>

<persname rules="aacr" source="naf">Shearer, Rhonda Roland, 1954- </persname>

<unitdate normal="1951/1996" type="inclusive" calendar="gregorian" era="ce">1951-1996</unitdate>

<note type="bpg">
     <p>This encoded finding aid is compliant with the Yale EAD Best Practice Guidelines, Version 1.0.</p>
</note>

Conversion rule: Attributes not specifically targeted for conversion will be ignored/lost.

Issue: retaining ref and level information, do these map to appropriate container descriptions?

Issue: "otherlevel" levels -- <c level="otherlevel" otherlevel="SubSeries"> (Hull)

Issue: Stanford <container> conventions and mapping into a MODS "Located in" note

We will create a concise representation of the physical/logical location (as appropriate) of the materials in the context of the collection and its hierarchy. It will be a single string concatenating

  • Collection name
  • Intermediate series, subseries names etc if present
  • The container label
  • The container type + value

Is this generalizable, across Stanford collections? across institutions?

Examples:

Collection

EAD

MODS

Gould

<c id="ref432" level="file">
    <did>
       <unittitle>Gardner, Howard</unittitle>
       <container id="cid57883022" type="Box" label="Mixed materials">4</container>
       <container parent="cid57883022" type="Folder">27</container>

<mods:location>
   <physicalLocation displayLabel="Located in">Stephen Jay Gould papers - Series 1: Correspondence - Mixed materials - Box 4</physicalLocation>
</mods:location>

Hensen

<c id="ref50" level="item">
     <did>
         <unittitle>CM01</unittitle>
         <unitid>CM01</unitid>
         <container id="cid59523001" type="Carton" label="Computer disks / tapes">11</container>

<mods:location>
   <physicalLocation displayLabel="Located in">Keith Henson. Papers relating to Project Xanadu, XOC and Eric Drexler - Born-Digital Materials - Computer disks / tapes - Carton 11</physicalLocation>
</mods:location>

Issue: Derived <mods:location> information

Where all items objects are derived from FTK information about files in a directory, how is this logial_physical locaiton information assembled and presented?

Collection

FTK

MODS

Gould

 

<mods:location>
<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="d046a2cb-21a7-4fea-8fec-819e367ae049"><ac:plain-text-body><![CDATA[    <physicalLocation displayLabel="Located in">Stephen Jay Gould papers - Series 6: Born Digital Materials - [directoryname]</physicalLocation>
]]></ac:plain-text-body></ac:structured-macro>
</mods:location>

...

Issue: Recursively nested <descgrp>

...