Page History

Note

Before you can batch ingest objects, you will need to have downloaded and installed the Islandora Batch Importer module. If you want to batch ingest books, you will need to have downloaded and installed the Islandora Book Batch module; if you want to batch ingest newspaper issues, you will need to have downloaded and installed the Islandora Newspaper Batch. It is also strongly encouraged that you review the mods_to_dc.xsl within the Islandora Book Batch module if you plan to ingest MODS metadata. Reviewing the mods_to_dc.xsl will help you to understand what type of Dublin Core will be produced by the mods_to_dc.xsl. For example, you may notice that the mods_to_dc.xsl will not produce clean Dublin Core subject tags - all individual MODS subject tags will be expressed as one Dublin Core subject tab. The mods_to_dc.xsl will also not map names tags if no roleTerm with a type attribute has been specified. The mods_to_dc.xsl is a Library of Congress XSLT and the Islandora community does not make modifications to this file. You are encouraged to make your own edits to the mods_to_dc.xsl if you need to modify the XSLT.

For larger collections, Islandora is able to pull multiple files out of a zipped archive and ingest them into Fedora as a batch. There are a few ways that this can be done. You can upload .zip archives full of:

This page will run through the specifics of each one. In these examples, we will be batch-ingesting PDF files into a collection with the 'PDF Solution Pack' content model applied to its collection policy.

...

Sample Directory Structure

ZIP File

book_01 (book 1 folder)
Book One (Folder) - contains a single metadata file (
- MODS.xml (descriptive metadata record for book 1)
- Page 1 (Folder) - contains a single .tiff file (OBJ.tiff)
- Page 2 (Folder) - contains a single .tiff file (OBJ.tiff)
- Page 3 (Folder) - contains a single .tiff file (OBJ.tiff)
Book Two (Folder) - contains a single metadata file (MODS.xml)
- 001 (page 1 folder)
  - OBJ.tiff - single tiff representing page 1
- 002 (page 2 folder)
  - OBJ.tiff - single tiff representing page 2
- 003 (page 3 folder)
  - OBJ.tiff - single tiff representing page 3
book_02 (book 2 folder)
- MODS.xml (descriptive metadata record for book 2)
- 001 (page 1 folder)
  - OBJ.tiff - single tiff representing page 1
- 002 (page 2 folder)
  - OBJ.tiff - single tiff representing page 2
- 003 (page 3 folder)
  - OBJ.tiff - single tiff representing page 3
- Page 1 (Folder) - contains a single .tiff file (OBJ.tiff)
- Page 2 (Folder) - contains a single .tiff file (OBJ.tiff)
- Page 3 (Folder) - contains a single .tiff file (OBJ.tiff)

Within each book, an XML file named "MODS.xml" provides the metadata record. Inside each folder, a .tiff file, named "OBJ.tiff" become the page. If you have downloaded MARC XMLthe Islandora MARCXML module installed and enabled, you will also be able to upload a file called --METADATA--.mrc (containing binary MARC) in lieu instead of the MODS.xml file (the MARC file will be converted to MODS and DC during ingest).

Anchor
Newspaper batch ingest
Newspaper batch ingest

Batch Ingest Newspapers

When batch ingesting newspapers, you must already have an existing newspaper-level object. Each ingest folder contains folders that represent issues of the newspaper, and each issue directory contains folders that represent separate page images.

For sample directory structures and configuration options, see the Newspaper Batch Ingest instructions.

Page tree

Versions Compared

Old Version 1

New Version Current

Key

Anchor
Newspaper batch ingest
Newspaper batch ingest

Batch Ingest Newspapers

Page tree

Page History

Versions Compared

Old Version 1

New Version Current

Key

AnchorNewspaper batch ingestNewspaper batch ingest

Batch Ingest Newspapers

Anchor
Newspaper batch ingest
Newspaper batch ingest