Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The Book Solution Pack module allows for the creation of a book collectionobject, and the ingesting of pages page objects into that book. Essentially, a 'book' object is a special type of collection into which page objects can be ingested and organized. 

A Book's PDF can be generated from each page. Pages are based on an uploaded pdf of the entire book or individual jpegs, or tiffs of every page. From the uploaded pages its possible to generate images for use in the "Islandora Internet Archive Bookreader". It is also possible to generate PDF files per page. OCR and OCR coordinate data can also be generated from the uploaded pages.

Book Structure

...

  • Collection
    • Book object
      • Page object 01
      • Page object 02

Dependencies

...

Release Notes and Downloads

Usage

The Book Solution Pack module functions slightly different than most other solution packs, in that it is a collection that acts similarly to a regular data object. Books exists as a collection, but are able to perform tasks like derivative creation and viewing similar to how other content models function.

There are a few steps to creating a book using the Book Solution Pack module:

Creating a Book

  1. In a new or existing collection, go to the Manage tab, click the Collection tab, and select Create a new collection, or go to an existing one, and add the 'islandora:bookCModel' content model to its collection policy. Click "Update Collection Policy."
  2. In the Manage tab, click "Add an object to this collection" and select "Book Content Model."Go to that collection's 'Manage' tab, and add an object to the collection (from the 'Collection' section)
  3. Fill out the Book MODS form and submit itwith descriptive metadata for the book.
  4. On the next page, there is an option to upload a PDF of a book. Use this option if you don't have separate page images to upload. The Book content model can create separate page objects from a multi-page PDF.

    1. To skip this step and create an empty book object, do not upload a PDF.

  5. Click "Ingest."

Adding Pages to a Book

  1. Go to a Book object's Manage tab and click on the Book tab.

  2. To add a single page, click "Add Page." To add multiple pages, click "Add zipped pages."
    1. Zipped pages will be added in the order they appear in the zipped directory.
    2. Pages can be either JPG or TIFF format.
  3. Under "Language", select the book language for OCR or select "Do not perform OCR."
  4. Browse to the page or pages to upload.
  5. Click "Ingest." The pages will be added and derivatives created; this may take awhile for many pages.

Managing Pages

Once pages have been added, you can view them on the book's Pages tab. To manage pages, go to the book's Manage tab and click the Book tab. From this tab, you can create a PDF of the pages, perform OCR on the pages, create image derivatives and set the book thumbnail, change the reading order (page progression), reorder the pages, or delete pages.

The Usage section of the Islandora Paged Content documentation has You will now be brought to the Book's object page. From here, you can use the Paged Content module to manipulate the book. Check the Usage section of Islandora Paged Content for more information on how to add and manipulate pages within a book.

...

The Book Solution Pack configuration page can be found at http://path.to.your.site/admin/islandora/solution_pack_config/book, and includes the following configuration options:

...

This section allows you to configure the Book Solution Pack module to create derivative datastreams for pages. The following derivative datastreams can be set, with their accompanying dependencies:

PDFImageMagick
TN, JPEG, JP2Large Image Solution Pack
OCR, HOCR

Islandora OCR

Parent Solr

...

Field

This field defines the field in Solr that associates pages with their parent book objects. The default is `RELS_EXT_isMemberOf_uri_ms`.

Metadata Display

Check "Display object metadata" if you would like the book's descriptive metadata, "In collections" statement, and description to display below the book viewer. If unchecked, only the book viewer will appear on the object pageThese two fields define how Solr is able to find the PID of a page's parent book, and that page's number, respectively. Changing these fields requires a working knowledge of Solr queries and RDF, and for the most part, they should remain untouched.

Book Viewers and Page Viewers

These two sections change how Islandora handles a request in the the 'View' tab of a Book object and Page object, respectively. Islandora includes out-of-the-box support for the Internet Archive Bookreader Book Reader and OpenSeadragon as book and page viewers; check . Check the links in the Dependencies section above for more information on installing these components. Alternatively, if you have PDF versions of books and pages, you can display those in the PDF.js reader.

Image RemovedImage Added

Content Models, Prescribed Datastreams and Forms

...

A book ingested with all derivative creation options checked will have the following datastreams:

RELS-EXT

Default Fedora relationship metadata

MODSMODS metadata form

DC

Dublin Core record

TNThumbnail image

PDF

PDF derivative created by ImageMagick

Check Islandora Paged Content for information on an individual page's datastreams.

The Book Solution Pack comes with the Islandora Book MODS form.

Islandora 6 Compatibility

Note

While books for Islandora 6 and 7 can exist in the same repository, books ingested through Islandora 6 will not display properly in Islandora 7. A migration script is required.

Batch Processing

  An optional module to ingest multiple book and page objects is available.