Note

Before you can batch ingest objects, you need to download and install the Islandora Batch Importer module. To batch ingest books, you need the Islandora Book Batch module; to batch ingest newspaper issues, you need the Islandora Newspaper Batch module. If you plan to ingest MODS metadata, you are strongly encouraged to review the mods_to_dc.xsl within the Islandora Book Batch module so that you understand what kind of Dublin Core it will produce. For example, you may notice that the mods_to_dc.xsl does not produce clean Dublin Core subject tags: all individual MODS subject tags are expressed as one Dublin Core subject tag. It also does not map name tags if no roleTerm with a type attribute has been specified. The mods_to_dc.xsl is a Library of Congress XSLT, and the Islandora community does not make modifications to this file. If you need different output, you are encouraged to make your own edits to the mods_to_dc.xsl.

For larger collections, Islandora is able to pull multiple files out of a zipped archive and ingest them into Fedora as a batch. There are a few ways that this can be done. You can upload .zip archives full of:

This page will run through the specifics of each one. In these examples, we will be batch-ingesting PDF files into a collection with the 'PDF Solution Pack' content model applied to its collection policy.

...

Sample Directory Structure

ZIP File

  • book_01 (book 1 folder)
    • MODS.xml (descriptive metadata record for book 1)
    • 001 (page 1 folder)
      • OBJ.tiff - single tiff representing page 1
    • 002 (page 2 folder)
      • OBJ.tiff - single tiff representing page 2
    • 003 (page 3 folder)
      • OBJ.tiff - single tiff representing page 3
  • book_02 (book 2 folder)
    • MODS.xml (descriptive metadata record for book 2)
    • 001 (page 1 folder)
      • OBJ.tiff - single tiff representing page 1
    • 002 (page 2 folder)
      • OBJ.tiff - single tiff representing page 2
    • 003 (page 3 folder)
      • OBJ.tiff - single tiff representing page 3
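
As a rough illustration of the layout above, the following Python sketch assembles a ZIP archive with one folder per book, a MODS.xml in each book folder, and one zero-padded page folder per page holding a single OBJ.tiff. The function name build_book_batch_zip and its input format are hypothetical and are not part of any Islandora module; the folder names simply follow the sample structure.

```python
import io
import zipfile


def build_book_batch_zip(books):
    """Return the bytes of a ZIP archive laid out like the sample structure.

    `books` maps a book folder name (for example "book_01") to a tuple of
    (mods_xml_bytes, [page_1_tiff_bytes, page_2_tiff_bytes, ...]).
    This helper is a hypothetical illustration, not an Islandora API.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for book_name, (mods_xml, pages) in books.items():
            # One descriptive metadata record per book folder.
            archive.writestr(f"{book_name}/MODS.xml", mods_xml)
            # One folder per page (001, 002, ...), each with a single OBJ.tiff.
            for number, tiff_bytes in enumerate(pages, start=1):
                archive.writestr(f"{book_name}/{number:03d}/OBJ.tiff", tiff_bytes)
    return buffer.getvalue()
```

In practice the page bytes would be read from your scanned TIFF files, and the resulting archive would be uploaded through the batch ingest form.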

Within each book folder, an XML file named "MODS.xml" provides the metadata record. Inside each page folder, a .tiff file named "OBJ.tiff" becomes the page. If you have the Islandora MARCXML module installed and enabled, you can also upload a file called --METADATA--.mrc (containing binary MARC) instead of the MODS.xml file (the MARC file will be converted to MODS and DC during ingest).
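
Before uploading, it can help to confirm that an archive follows these naming rules. The sketch below is a minimal pre-flight check, assuming only the layout described on this page: each book folder holds MODS.xml or --METADATA--.mrc, and each page folder holds OBJ.tiff. The function check_book_batch_layout is hypothetical and does not reflect how the Islandora batch modules validate input; it is deliberately stricter than the modules may be, and it accepts --METADATA--.mrc without checking whether the MARCXML module is enabled.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath

# Metadata file names mentioned on this page.
METADATA_NAMES = {"MODS.xml", "--METADATA--.mrc"}


def check_book_batch_layout(zip_source):
    """Return a list of human-readable problems found in the archive.

    `zip_source` is a path or file-like object accepted by zipfile.ZipFile.
    Hypothetical helper: it only checks the layout described on this page.
    """
    problems = []
    books = defaultdict(lambda: {"files": set(), "pages": defaultdict(set)})
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip explicit directory entries
            parts = PurePosixPath(name).parts
            if len(parts) == 2:      # book/FILE
                books[parts[0]]["files"].add(parts[1])
            elif len(parts) == 3:    # book/page/FILE
                books[parts[0]]["pages"][parts[1]].add(parts[2])
            else:
                problems.append(f"unexpected path depth: {name}")
    for book, info in sorted(books.items()):
        if not info["files"] & METADATA_NAMES:
            problems.append(f"{book}: no MODS.xml or --METADATA--.mrc file")
        if not info["pages"]:
            problems.append(f"{book}: no page folders")
        for page, files in sorted(info["pages"].items()):
            if "OBJ.tiff" not in files:
                problems.append(f"{book}/{page}: missing OBJ.tiff")
    return problems
```

An empty list means the archive matches the layout described here; it does not guarantee that the ingest itself will succeed.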

Batch Ingest Newspapers

When batch ingesting newspapers, you must already have an existing newspaper-level object. Each ingest folder contains folders that represent issues of the newspaper, and each issue directory contains folders that represent separate page images. 

For sample directory structures and configuration options, see the Newspaper Batch Ingest instructions.