Overview
The Islandora Paged Content module is shared by the Book Solution Pack and Newspaper Solution Pack modules to provide numbered, individual pages as objects within each type of collection. It accepts files in TIFF format and can create several kinds of derivatives, depending on the type of collection the pages are ingested into.
Dependencies
Configuration
The Paged Content module has very few configuration options of its own; most configuration is done in the solution pack that the pages are ingested into. It does have a configuration page at http://path.to.your.site/admin/islandora/paged_content, which includes the following options:
PDF Derivative Settings
To compile the pages of a book or newspaper into a multi-page PDF, the Paged Content module requires the Ghostscript executable to be installed on your server. Enter the path to the executable here, on the configuration page. More information about installing Ghostscript can be found at the official website, http://www.ghostscript.com/.
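As a rough illustration of what combining per-page PDFs with Ghostscript involves, the sketch below locates the executable and builds a command line that merges several page PDFs into one file. It is not the module's own code: the function names `find_ghostscript` and `build_combine_command`, the file names, and the fallback command name `gs` are assumptions made for this example. The Ghostscript switches used (`-dBATCH`, `-dNOPAUSE`, `-sDEVICE=pdfwrite`, `-sOutputFile=`) are standard ones.

```python
import shutil
from typing import List, Optional


def find_ghostscript(configured_path: Optional[str] = None) -> Optional[str]:
    """Return a usable Ghostscript path, or None if none is found.

    configured_path stands in for the value entered on the configuration
    page (hypothetical parameter); when it is empty we assume the
    executable is called "gs" and look for it on the PATH.
    """
    # shutil.which accepts a bare command name or a full path and
    # returns None when the target is missing or not executable.
    return shutil.which(configured_path or "gs")


def build_combine_command(gs_path: str, page_pdfs: List[str],
                          output_pdf: str) -> List[str]:
    """Build a Ghostscript command that merges page PDFs in the given order."""
    if not page_pdfs:
        raise ValueError("at least one page PDF is required")
    return [
        gs_path,
        "-dBATCH",            # exit after processing the input files
        "-dNOPAUSE",          # do not prompt between pages
        "-sDEVICE=pdfwrite",  # write PDF output
        f"-sOutputFile={output_pdf}",
        *page_pdfs,           # input files, in page order
    ]
```

The resulting list can be passed to `subprocess.run(command, check=True)` once `find_ghostscript` has returned a path. Building the command separately from running it makes it easy to inspect or log before anything is executed.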
Content Models, Prescribed Datastreams and Forms
The Paged Content Solution Pack comes with the following object, listed at http://path.to.your.site/admin/islandora/solution_packs:
- Islandora Page Content Model (islandora:pageCModel)
When ImageMagick, the Large Image Solution Pack, and the Islandora OCR module are in use, an image ingested with the Paged Content Solution Pack's content model will have the following datastreams:
OBJ | Original TIFF file uploaded |
DC | Dublin Core record |
PDF | PDF derivative created by Ghostscript |
JP2 | JPEG 2000 derivative created by ImageMagick |
JPG | Smaller JPEG derivative created by ImageMagick |
TN | Thumbnail icon created from the image during the ingest process |
RELS-INT | Internal Fedora relationship metadata defining the dimensions of the JP2 datastream |
OCR | The raw output from Tesseract |
HOCR | The Tesseract output in hOCR format: HTML markup that records the position of the recognized text on the page |
RELS-EXT | Default Fedora relationship metadata |
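The datastream list above can serve as a checklist when verifying that a page was fully processed. The sketch below is a minimal example of that idea. It assumes you have already obtained the list of datastream IDs present on a page object by some other means, and the names `EXPECTED_DATASTREAMS` and `missing_datastreams` are hypothetical, not part of Islandora.

```python
from typing import Iterable, List

# Datastream IDs from the table above, in the same order.
EXPECTED_DATASTREAMS = [
    "OBJ", "DC", "PDF", "JP2", "JPG",
    "TN", "RELS-INT", "OCR", "HOCR", "RELS-EXT",
]


def missing_datastreams(present: Iterable[str]) -> List[str]:
    """Return the expected datastream IDs that are absent from `present`."""
    present_set = set(present)
    return [dsid for dsid in EXPECTED_DATASTREAMS if dsid not in present_set]
```

For example, a page that has only `OBJ`, `DC`, `TN`, and `RELS-EXT` would be reported as missing its PDF, image, and OCR derivatives, which may point to a missing dependency such as Ghostscript, ImageMagick, or Tesseract.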
The Paged Content Solution Pack does not come with any forms.