...

Note: Recommended Metadata

At a minimum, provide "dc.title" and, where applicable, "dc.date.issued". You can (and should) provide much more detailed metadata about the Item. For more information, see Metadata Recommendations.

 


The contents file simply enumerates, one file per line, the bitstream file names. See the following example:

...
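
As a rough illustration of the one-name-per-line layout, the sketch below generates a contents file for an item directory. The function name `write_contents_file` and the `RESERVED` set are hypothetical; in particular, treating "dublin_core.xml" as the metadata file to skip is an assumption not stated in this section.

```python
from pathlib import Path

# Files that describe the item rather than being bitstreams themselves.
# "contents" and "collections" are named in the text; "dublin_core.xml"
# is an assumption about the metadata file name.
RESERVED = {"contents", "collections", "dublin_core.xml"}


def write_contents_file(item_dir):
    """Write one bitstream file name per line into <item_dir>/contents."""
    item_dir = Path(item_dir)
    names = sorted(
        p.name
        for p in item_dir.iterdir()
        if p.is_file() and p.name not in RESERVED
    )
    # One file name per line, each terminated by a newline.
    (item_dir / "contents").write_text(
        "".join(name + "\n" for name in names), encoding="utf-8"
    )
    return names
```

Running it on a directory that holds `a.png`, `b.pdf`, and the metadata file would produce a contents file with two lines, one for each bitstream.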

  • Resume. If an error aborts the import, fix the error and then use the --resume (-R) flag to continue the import from where it stopped.

  • Specifying the owning collection on a per-item basis from the command line administration tool

    If you omit the -c flag, which is otherwise mandatory, the ItemImporter looks for a file named "collections" in each item directory. This file should list collections, one per line, identified either by handle or by internal database ID. The ItemImporter then puts the item in each of the listed collections. The owning collection is the one on the first line of the collections file.

    If the -c flag is specified and a collections file also exists in the item directory, the ItemImporter ignores the collections file and puts the item in the collection specified on the command line.

    Since the collections file can differ between item directories, this gives you more fine-grained control of the process of batch adding items to collections.

  • Importing with BTE metadata records

    The DSpaceOutputGenerator, which writes the metadata into the DSpace Simple Archive Format, has been updated to produce the collections file, if a metadata field named "collections" (reserved word) exists in the original metadata. This is mainly applicable to the CSV input format which is more flexible, but could also be implemented with a Modifier that adds the "collections" field to each Record in the BTE pipeline.

    Important note: an entry with the "collections" key should be in the output map that is used by the DSpaceOutputGenerator.
    More info in Importing Items via basic bibliographic formats (Endnote, BibTex, RIS, TSV, CSV) and online services (OAI, arXiv, PubMed, CrossRef, CiNii).
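
The precedence between the -c flag and the per-item collections file described above can be sketched as follows. This is only an illustration of the rule, not the ItemImporter's actual code: the function name `resolve_collections` is hypothetical, and raising an error when neither source is available is an assumption.

```python
from pathlib import Path


def resolve_collections(item_dir, cli_collection=None):
    """Return (owning_collection, other_collections) for one item directory."""
    if cli_collection is not None:
        # A collection given on the command line wins;
        # any collections file in the item directory is ignored.
        return cli_collection, []

    collections_file = Path(item_dir) / "collections"
    if not collections_file.is_file():
        # Assumption: treat a missing collections file as an error.
        raise ValueError(f"no -c value and no collections file in {item_dir}")

    # One collection (handle or internal ID) per line; blank lines are skipped.
    entries = [
        line.strip()
        for line in collections_file.read_text(encoding="utf-8").splitlines()
        if line.strip()
    ]
    if not entries:
        raise ValueError(f"collections file in {item_dir} is empty")

    # The first line names the owning collection.
    return entries[0], entries[1:]
```

Because the file is read per item directory, two items in the same batch can resolve to different owning collections, which is the fine-grained control mentioned above.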

UI Batch Import

...

Warning

Batch Import via the UI is not available in DSpace 7.0. It is scheduled to be restored in a later 7.x release (currently 7.1); see DSpace Release 7.0 Status. The screenshots and process below are from the old JSPUI and will need updating once this feature is rebuilt in 7.x.

Batch import can also take place via the Administrator's UI. The steps to follow are:

A. Prepare the data

  1. Items, i.e. the metadata and their bitstreams, must be in the Simple Archive Format described earlier in this chapter. Thus, for each item there must be a separate directory that contains the corresponding files of the specific item.
  2. In addition, each item directory may contain an optional file named "collections" that lists the collection or collections the item will be added to. Each line of this file contains the handle of one collection. The collection on the first line is the owning collection; the remaining lines are the other collections the item should belong to.
  3. Compress the item directories into a zip file. Please note that you need to zip the actual item directories and not just the directory that contains the item directories. Thus, the final zip file must directly contain the item directories.
  4. Place the zip file at a publicly accessible URL, such as Dropbox, Google Drive, or any other location you can publish to. Because the zip file can be very large, the batch import UI downloads it from that public location instead of accepting a direct upload, which could end in a timeout exception.
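
The zip layout required in step 3 can be sketched with Python's standard `zipfile` module. The function name `zip_item_directories` and the example directory names are hypothetical; the point is that archive paths are stored relative to the parent directory, so the item directories sit at the top level of the zip.

```python
import zipfile
from pathlib import Path


def zip_item_directories(saf_root, zip_path):
    """Zip each item directory under saf_root at the top level of the archive."""
    saf_root = Path(saf_root)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for item_dir in sorted(p for p in saf_root.iterdir() if p.is_dir()):
            for file_path in sorted(p for p in item_dir.rglob("*") if p.is_file()):
                # Store paths relative to saf_root, e.g. "item_000/contents",
                # so that saf_root itself does not appear in the archive.
                arcname = file_path.relative_to(saf_root).as_posix()
                archive.write(file_path, arcname)
    return zip_path
```

Listing the resulting archive should show entries such as `item_000/contents` rather than `my_batch/item_000/contents`.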

...

2) If you do not specify an owning collection and an item directory contains no "collections" file, that item will not be imported into DSpace.
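
Before submitting a zip without an owning collection, it may help to check which items would be skipped under the rule above. The following is a minimal sketch, assuming the item directories sit at the top level of the zip; the function name `items_without_collections_file` is hypothetical.

```python
import zipfile


def items_without_collections_file(zip_path):
    """Return top-level item directories in the zip that lack a "collections" file."""
    with zipfile.ZipFile(zip_path) as archive:
        # Only entries inside a directory are relevant.
        names = [name for name in archive.namelist() if "/" in name]

    item_dirs = {name.split("/", 1)[0] for name in names}
    with_collections = {
        name.split("/", 1)[0]
        for name in names
        if name.split("/", 1)[1] == "collections"
    }
    # Items listed here would not be imported unless an owning
    # collection is chosen in the form.
    return sorted(item_dirs - with_collections)
```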


Finally, when you submit the form, you will receive a message informing you that the import is running in the background (since it may take a long time). When it finishes, you will receive an email, sent to the address of your DSpace account, reporting whether the import succeeded or failed.

...

  1. Login.
  2. Visit "My DSpace" page.

  3. On the next page, you can see the history of batch imports. For each import, the following information is available:

    The status of the batch import (success or failure)
    The number of items that the user tried to import
    The number of items that were actually imported

     


    Moreover, the user can take the following actions:

    Download the map file that was produced during the import. This file contains a list of items that were imported with the corresponding handle assigned to them by DSpace.

    Delete the imported items. Everything that was imported will be deleted (including the history directory in the "[dspace]/import" directory).

    In case of failure, the user can "Resume" the import. The user is taken to the upload form again, but the system recognizes the initial import (and its map file) and resumes it. A red label on the form indicates that it is a "Resume" form.
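
A downloaded map file can be turned into a lookup table with a few lines of Python. The text only says that the file lists imported items with their assigned handles; the exact layout used below (one item per line, directory name and handle separated by whitespace) is an assumption, and `parse_map_file` is a hypothetical helper.

```python
def parse_map_file(text):
    """Parse map file text into {item_directory_name: handle}."""
    mapping = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            # Skip blank or unexpected lines instead of guessing their meaning.
            continue
        item_dir, handle = parts
        mapping[item_dir] = handle
    return mapping
```

The number of entries in the result can be compared with the "items actually imported" count shown in the import history.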

UI Batch Import (XMLUI)

A SimpleArchiveFormat package can be imported by an administrator in XMLUI. The package must be compressed into a ZIP file, which is then uploaded to XMLUI through the browser. DSpace processes the ZIP and ingests the items it contains. A stable network connection is recommended, because your browser must upload a potentially large ZIP file and then wait while DSpace processes it.

While logged in as an administrator, click on Batch Import (ZIP).

Then, choose the owning collection from the collection dropdown, and browse to the SimpleArchiveFormat ZIP file on your computer.

If the import succeeds, you will get a green message with a list of the handles that were imported. This list serves as the "map file".

If an error occurs, you will get a red error message describing the issue.

...


Exporting Items

The item exporter can export a single item or a collection of items, and it creates a DSpace simple archive in the aforementioned format for each exported item. Items are exported sequentially, in the order in which they are retrieved from the database. As a consequence, the sequence numbers of the item subdirectories (item_000, item_001) are not related to DSpace handles or item IDs.
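
Because the directory names carry no identity, a script that post-processes an export needs another way to tell items apart. The sketch below assumes that each exported item directory contains a small text file named "handle" holding the item's handle; that file name is an assumption not stated in this section, and `handles_by_export_dir` is a hypothetical helper.

```python
from pathlib import Path


def handles_by_export_dir(export_root):
    """Map each exported item subdirectory name to the handle recorded inside it."""
    result = {}
    for item_dir in sorted(p for p in Path(export_root).iterdir() if p.is_dir()):
        # Assumption: the export writes the item's handle to a file named "handle".
        handle_file = item_dir / "handle"
        if handle_file.is_file():
            result[item_dir.name] = handle_file.read_text(encoding="utf-8").strip()
    return result
```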

...