Overview

The Islandora OAI module (based on the oai2forcck Drupal module) provides support for a site to be visible via the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). In short, a site properly configured using this module has its Solr index - and accompanying metadata - visible to other sites that harvest OAI-compatible metadata. These harvesters make various types of requests at a URL that you can specify, and your site responds with metadata information that they in turn can add to massive archival indices. This makes it much easier for researchers to find objects on your site.

For more information on the OAI-PMH, you may consult the official documentation at http://www.openarchives.org/OAI/openarchivesprotocol.html.

Dependencies

Note

Besides installing the Islandora Solr modules, you will also need to correctly configure Solr and GSearch in order for Islandora OAI to work. The OAI module passes information to metadata harvesters based on results it finds from your Solr index; if Solr is not properly configured, OAI won't function either.

Downloads

Release Notes and Downloads

Usage

Islandora OAI works mostly autonomously. It receives requests from metadata harvesters as key=value arguments in the query string that follows your OAI URL (the OAI-PMH also allows the same arguments to be sent in an HTTP POST body). Your site then sends back information, in XML format, based on the values of those arguments. You can check that your configuration is correct by manually entering these arguments in your browser's address bar and seeing what comes back.

A simple check you can run involves asking your OAI URL for a list of information about your repository. To do this, you will need to know a few of your site's OAI configuration options. More information on this can be found in the next section of this page.

To run this check, use your browser to access the following URL:

http://path.to.your.site/repository?verb=Identify

Where:

  • path.to.your.site/repository is the URL found on the Islandora OAI configuration page, in the Configuration section, under 'The path of the Repository'
  • Identify is a verb that is designated by the OAI-PMH to return basic configuration information about your OAI metadata repository.

If your Solr index is set up correctly and you entered the URL properly, you should see an XML document containing information about your OAI setup.
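The same check can be scripted. The sketch below is a minimal Python example, not part of the module: it builds the Identify URL, fetches it, and reads a few of the fields that the OAI-PMH defines for an Identify response. The function names and the example.org address are hypothetical; substitute the path configured for your own repository.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# XML namespace used by all OAI-PMH 2.0 responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def build_identify_url(base_url):
    """Append the Identify verb to the repository's OAI URL."""
    return base_url + "?" + urllib.parse.urlencode({"verb": "Identify"})


def parse_identify(xml_text):
    """Extract basic repository details from an Identify response."""
    root = ET.fromstring(xml_text)
    error = root.find("oai:error", OAI_NS)
    if error is not None:
        raise ValueError(f"OAI error {error.get('code')}: {error.text}")
    identify = root.find("oai:Identify", OAI_NS)
    if identify is None:
        raise ValueError("response contains no Identify element")
    fields = ("repositoryName", "baseURL", "protocolVersion", "adminEmail")
    return {
        name: identify.findtext(f"oai:{name}", namespaces=OAI_NS)
        for name in fields
    }


def check_repository(base_url):
    """Fetch and parse the Identify response (requires network access)."""
    url = build_identify_url(base_url)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_identify(response.read())


if __name__ == "__main__":
    # Hypothetical address; use 'The path of the Repository' from your site.
    print(check_repository("http://example.org/oai2"))
```

If the repository name or admin email in the output does not match what you entered on the configuration page, the request is probably reaching a different path than the one you configured.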

Configuration

The Islandora OAI module is configured at http://path.to.your.site/admin/islandora/islandora-oai, which offers the following options:

Configuration

  • Repository Name - The name that harvesters will attach to metadata pulled from your repository.
  • The path of the Repository - The URL that harvesters will make requests at.
  • Repository unique identifier - The middle section of the identifier used when metadata harvesters pass the identifier= key. With this in place, an identifier for each of your objects' metadata will be generated as oai:unique_identifier:namespace_pid.
  • Admin Email - An optional email address to be attached to harvested metadata.
  • Maximum Response Size - The maximum number of records that will be issued per response. If the number of records requested exceeds this number, Islandora OAI will also issue a 'resumption token', which the harvester can use to issue another request from the point where the previous response stopped. This method is used to control flow and prevent servers from diverting too many resources to metadata harvesters.
  • Expiration Time - The amount of time, in seconds, before a resumption token should expire.
  • Solr date field - The Solr field that supplies the datestamp attached to each record's metadata.
  • Solr RELS-EXT collection field - Fields entered here establish the object relationship of metadata to be passed on to the harvester.
  • Solr XACML role field - The site's Solr fields defining viewing permissions.
  • Solr hasModel field - The site's Solr field defining an object's content model.
  • Exclude Content Models - A list of content models, defined by their PID, to exclude from harvests.
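To illustrate how the identifier format and the resumption token look from the harvester's side, here is a minimal Python sketch. It is not code from the module. The helper names are hypothetical, the fetch function is injected so the loop can be tried without a live site, and the identifier helper assumes that the colon in a PID such as islandora:123 is replaced with an underscore to produce the namespace_pid part.

```python
import urllib.parse
import xml.etree.ElementTree as ET

# XML namespace used by all OAI-PMH 2.0 responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def build_oai_identifier(unique_identifier, pid):
    """Format an identifier as oai:unique_identifier:namespace_pid.

    Assumption: the PID's colon becomes an underscore.
    """
    namespace, _, local_id = pid.partition(":")
    return f"oai:{unique_identifier}:{namespace}_{local_id}"


def harvest_identifiers(base_url, fetch, metadata_prefix="oai_dc"):
    """Collect every record identifier, following resumption tokens.

    `fetch` is any callable that takes a URL and returns response XML.
    """
    params = {"verb": "ListIdentifiers", "metadataPrefix": metadata_prefix}
    identifiers = []
    while True:
        url = base_url + "?" + urllib.parse.urlencode(params)
        root = ET.fromstring(fetch(url))
        error = root.find("oai:error", OAI_NS)
        if error is not None:
            # An expired token is reported as badResumptionToken.
            raise ValueError(f"OAI error {error.get('code')}: {error.text}")
        for header in root.iterfind(".//oai:header", OAI_NS):
            identifiers.append(
                header.findtext("oai:identifier", namespaces=OAI_NS)
            )
        token = root.findtext(".//oai:resumptionToken", namespaces=OAI_NS)
        if not token:
            # A missing or empty token means the list is complete.
            return identifiers
        # Follow-up requests carry only the verb and the token.
        params = {"verb": "ListIdentifiers", "resumptionToken": token}
```

A harvester that waits longer than the Expiration Time between two requests should expect an error on the follow-up request and restart the harvest from the beginning.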

Metadata Format

This section allows you to configure the settings for the OAI-PMH's metadataPrefix argument. Islandora uses XSL files to transform your site's metadata datastreams into a format compatible with the OAI-PMH. Islandora OAI comes with two XSL files; they convert the MODS datastream of an object to either Electronic Thesis and Dissertation Metadata Standard (ETDMS) format or Dublin Core format, which can then be served to a harvester.

  • Metadata Format - The metadata format you would like to use. This will change the next three fields.
  • Metadata Prefix - The default value for the metadataPrefix argument.
  • Metadata Namespace - The URL that contains XSD files defining the Metadata Format.
  • Schema Location - The actual XSD file in the Metadata Namespace that defines the Metadata Format.
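These three values are what a harvester sees when it asks the repository which formats it offers. As a rough illustration, the Python sketch below reads the prefix, schema, and namespace out of a ListMetadataFormats response and builds a ListRecords request for one prefix. The function names and the example.org address are hypothetical, and oai_dc is used only because it is the Dublin Core prefix defined by the OAI-PMH; check your own configuration page for the prefixes your site actually serves.

```python
import urllib.parse
import xml.etree.ElementTree as ET

# XML namespace used by all OAI-PMH 2.0 responses.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def parse_metadata_formats(xml_text):
    """List the formats advertised in a ListMetadataFormats response."""
    root = ET.fromstring(xml_text)
    formats = []
    for fmt in root.iterfind(".//oai:metadataFormat", OAI_NS):
        formats.append({
            # Corresponds to the Metadata Prefix setting.
            "prefix": fmt.findtext("oai:metadataPrefix", namespaces=OAI_NS),
            # Corresponds to the Schema Location setting.
            "schema": fmt.findtext("oai:schema", namespaces=OAI_NS),
            # Corresponds to the Metadata Namespace setting.
            "namespace": fmt.findtext(
                "oai:metadataNamespace", namespaces=OAI_NS
            ),
        })
    return formats


def list_records_url(base_url, metadata_prefix):
    """Build a ListRecords request for a single metadata format."""
    query = urllib.parse.urlencode(
        {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    )
    return base_url + "?" + query


if __name__ == "__main__":
    # Hypothetical address; use 'The path of the Repository' from your site.
    print(list_records_url("http://example.org/oai2", "oai_dc"))
```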

Transformations

This section allows you to configure the way Islandora converts your metadata datastreams into a format compatible with the OAI-PMH.

  • Metadata Datastream ID - The datastream ID where object metadata is stored (MODS by default).
  • File to use for transforming ______ - The XSL file used to convert that datastream into a metadata format OAI will recognize and use.
  • Upload a file - If you want to run custom conversions from a different datastream or to a different Metadata Format, you can upload your own XSL file here.