Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Anchor
Ingest Books
Ingest Books

Batch Ingest Books

When batch ingesting books, each folder/directory represents one book, and nested folders/directories represent the pages. 

Sample Directory Structure

ZIP File

  • Book One (Folder) - contains a single metadata file (MODS.xml)
    • Page 1 (Folder) - contains a single .tiff file (OBJ.tiff)
    • Page 2 (Folder) - contains a single .tiff file (OBJ.tiff)
    • Page 3 (Folder) - contains a single .tiff file (OBJ.tiff)
  • Book Two (Folder) - contains a single metadata file (MODS.xml)
    • Page 1 (Folder) - contains a single .tiff file (OBJ.tiff)
    • Page 2 (Folder) - contains a single .tiff file (OBJ.tiff)
    • Page 3 (Folder) - contains a single .tiff file (OBJ.tiff)

Books must be broken up into separate directories, such that each directory at the "top" level (in the target directory or Zip file) represents a book. Book pages are their own directories inside of each book directory.

Files are assigned to object datastreams based on their basename, so a folder structure like:

  • my_cool_book/
    • MODS.xml
    • 1/
      • OBJ.tiff
    • 2/
      • OBJ.tiff

The above would result in a two-page book.

Each page directory name will be used as the sequence number of the page created.

A file named --METADATA--.xml can contain either MODS, DC or MARCXML which is used to fill in the MODS or DC streams (if not provided explicitly). Similarly, Within each book, an XML file named "MODS.xml" provides the metadata record. Inside each folder, a .tiff file, named "OBJ.tiff" become the page. If you have downloaded MARC XML, you will also be able to upload a file called --METADATA--.mrc (containing binary MARC) in lieu of the MODS.xml file.will be transformed to MODS and then possibly to DC, if neither are provided explicitly.

If no MODS is provided at the book level - either directly as MODS.xml, or transformed from either a DC.xml or the "--METADATA--" file discussed above - the directory name will be used as the title.