This documentation refers to an earlier version of Islandora. https://wiki.duraspace.org/display/ISLANDORA/Start is current.

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Overview

The Islandora Paged Content module is shared by the Book Solution Pack and Newspaper Solution Pack modules to provide numbered, individual pages as objects within each type of collection. It takes files in TIFF format, and is able to create several kinds of derivatives depending on the type of collection they are being ingested into.

Dependencies

Configuration

Very few configuration options exist for the paged content module out-of-the-box; most of the configuration should occur with the solution pack the pages are being ingested into. However, a configuration page does exist at http://path.to.your.site/admin/islandora/paged_content, and includes the following options:

PDF Derivative Settings

The Paged Content module requires the Ghostscript executable to be installed on your server, and the path to the executable to be entered here, on the configuration page, in order for multi-page PDFs to be compiled using each page in the book or newspaper. More information about installing Ghostscript on your server can be found at the official website, http://www.ghostscript.com/.

Content Models, Prescribed Datastreams and Forms

The Paged Content Solution Pack comes with the following objects in http://path.to.your.site/admin/islandora/solution_packs:

  • Islandora Page Content Model (islandora:pageCModel)

An image ingested using the Paged Content Solution Pack's content model using ImageMagick, the Large Image Solution Pack and the Islandora OCR modules will have the following datastreams:

OBJ

Original TIFF file uploaded

DC

Dublin Core record

PDF

PDF derivative created by Ghostscript

JP2

JPEG 2000 derivative created by ImageMagick

JPGSmaller JPEG derivative created by ImageMagick
TNThumbnail icon created from the image during the ingest process
RELS-INTInternal Fedora relationship metadata defining the dimensions of the JP2 datastream
OCRThe raw output from Tesseract
HOCRA converted version of the OCR datastream, intended to be more human-readable
RELS-EXTDefault Fedora relationship metadata

The Paged Content Solution Pack does not come with any forms.

  • No labels