Islandora Paged Content

Overview

The Islandora Paged Content module is shared by the Book Solution Pack and Newspaper Solution Pack modules to provide numbered, individual pages as objects within each type of collection. It takes files in TIFF format, and is able to create several kinds of derivatives depending on the type of collection they are being ingested into.

Dependencies

The Book Solution Pack or Newspaper Solution Pack is used to create collections of paged content. It is advisable to install one of those solution packs, and check their pages for additional dependencies.
Ghostscript is used to compile PDF derivatives into a single document
Core Collection Solution Pack

Configuration

Very few configuration options exist for the paged content module out-of-the-box; most of the configuration should occur with the solution pack the pages are being ingested into. However, a configuration page does exist at http://path.to.your.site/admin/islandora/paged_content, and includes the following options:

PDF Derivative Settings

The Paged Content module requires the Ghostscript executable to be installed on your server, and the path to the executable to be entered here, on the configuration page, in order for multi-page PDFs to be compiled using each page in the book or newspaper. More information about installing Ghostscript on your server can be found at the official website, http://www.ghostscript.com/.

Content Models, Prescribed Datastreams and Forms

The Paged Content Solution Pack comes with the following objects in http://path.to.your.site/admin/islandora/solution_packs:

Islandora Page Content Model (islandora:pageCModel)

An image ingested using the Paged Content Solution Pack's content model using ImageMagick, the Large Image Solution Pack and the Islandora OCR modules will have the following datastreams:

OBJ	Original TIFF file uploaded
DC	Dublin Core record
PDF	PDF derivative created by Ghostscript
JP2	JPEG 2000 derivative created by ImageMagick
JPG	Smaller JPEG derivative created by ImageMagick
TN	Thumbnail icon created from the image during the ingest process
RELS-INT	Internal Fedora relationship metadata defining the dimensions of the JP2 datastream
OCR	The raw output from Tesseract
HOCR	A converted version of the OCR datastream, intended to be more human-readable
RELS-EXT	Default Fedora relationship metadata

The Paged Content Solution Pack does not come with any forms.

Child pages