...
If you have not already done so, provision a local Islandora 2.0 environment to use for the ingests. We recommend using ISLE with the default install profilecodebase/sandbox option. Running the command: starter_dev will bring up the site in the preferred theme. More complete ISLE documentation for starter instances can be found on the ISLE wiki here.
Determine File Location
Workbench can be configured to retrieve source files either by URL or from a locally available directory. In the former case, file URLs must be included in a column in the CSV, while for the later case the location of the directory is specified in the config file and the file names are listed in a column in the CSV.
...
Islandora Workbench works best when ingesting one collection at a time. To begin, login to Islandora 2.0 in your web browser and create a new collection.
Get the CSV File
Islandora Workbench requires a csv in either Google Sheets or on your local disk. The AG_Photos spreadsheet is provided as a sample input_csv and can be upload to your Google Drive
Prepare Config File
Islandora Workbench uses YAML files to configure its operations. These files are documented in detail. Here is an example config file, including a link to a sample CSV:CSV. You must download the CSV and open in Google Sheets to be able to correctly run the example.
task: create
host: "https://islandora.traefik.me/"
username: xxxx
password: xxxx
media_type: file input_csv: 'https://wiki.lyrasis.org/download/attachments/273351517/AG%20Photos.xlsx?version=1&modificationDate=1674544809225&api=v2: 'xxx'
id_field: PID
csv_field_templates:
- field_rights: "http://rightsstatements.org/vocab/CNE/1.0/"
- field_member_of: 103 xxxx
- field_model: 13 xxxx
- field_resource_type: 25 xxxx
- field_display_hints: 21xxxx
default_file_mimetype: 'image/tiff'
default_file_extension: ".tif"
use_node_title_for_media: 1
allow_adding_terms: true
...
The csv_field_templates are fields that will apply to every resource in the collection. The numbers referenced in these fields are Drupal Node IDs; you will need to update these numbers in your config file based on the Node IDs in your Drupal instance.
input_csv
The public link to your spreadsheet in Google Sheets
Note: If the gid of your spreadsheet does not automatically set to 0, you may need to set google_sheets_gid with the value from your spreadsheet. More information is available in the relevant workbench documentation .
field_member_of
This is the Node ID of the collection you created in step 2. You can find the ID by hovering over any of the tabs when you view the collection - it will be in the URL as “/node/id”.
...
title | id | parent_id | field_member_of |
Easthampton Town Hall | 1 | 100 | |
Nehemiah Strong House | 2 | 100 | |
Amherst College, Lawrence Observatory | 3 | 100 |
...
Configuring Complex Objects in the CSV
...