You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 19 Next »

Summary

Introduction

This page presents the procedure for migrating data from DSpace to VIVO. It answers the use case of a VIVO instance in read-only mode used to present the metadata contained in DSpace

Goals

  • The scenario to be realized by this procedure consists in developing the necessary steps in order to migrate the metadata of two DSpace instances (the DSpace-6 Demo instance and the DSpace-7 Demo instance) to a local VIVO instance
  • At the end of this procedure, the experimenter should have a fully operational VIVO instance containing the metadata harvested from DSpace-6-Demo DSpace-7-Demo, both of which are available from the web.
  • The experimenter will also have in his possession, the necessary information to harvest in VIVO the metadata of a DSpace instance that he will have chosen and that it is possible to harvest from an OAI-PMH endpoint

Useful addresses

TitleURLDescription
DSpace-6 Demo Home Pagehttps://demo.dspace.org/This entry page links to other links concerning the DSpace-6 demo
DSpace-6 Demo UIhttps://demo.dspace.org/xmlui/This page is the DSpace-6 api allowing to manipulate metadata
DSpace-6 OAI Apihttps://demo.dspace.org/oai/requestOAI API used to harvest data
DSpace-7 Demo Home Pagehttps://demo7.dspace.org/homeThis entry page links to other links concerning the DSpace-7 demo
VIVO Project GitHub Home Pagehttps://github.com/vivo-projectSource code location needed to install VIVO
DSpace-VIVO Integration Project (DV-IP)https://github.com/vivo-community/DSpace-VIVOSource code location for the migration of DSpace metadata to VIVO
DSpace-VIVO ETL Examplehttps://github.com/vivo-community/DSpace-VIVO/tree/main/test/org.vivoweb.dspacevivo.etlexampleSource code location for extract-transform-load (ETL) metadata processing from DSpace to VIVO  

Useful variable and constant names

TitleVar NameVar ValueDescription
Project root directoryDVIP_HOME_PRJ~/dspace-vivo-prjThe value content is a suggestion
Git root directoryGIT_REPO$DVIP_HOME_PRJ/00-GITDirectory containing extracted GIT projects
Default VIVO login (username - password)admin@vivo.orgVivo1234.To be used to log-in as a VIVO administrator
local server URLs SOLRhttp://localhost:8983/solr/#/

VIVOhttp://localhost:8080/vivo-dspace/

Software requirements

  • jdk 11
  • maven 3.6.3
  • Linux Ubuntu
  • No solr or tomcat instance should be running on the computer
  • Linux bash



Setting up

Setting up the necessary resources for running VIVO

Step name and descriptionCommands
Setting up project
mkdir -p ~/dspace-vivo-prj/00-GIT 
Retrieve the DV-IP source code
 git clone https://github.com/vivo-community/DSpace-VIVO
Install Solr + Tomcat
./DSpace-VIVO/releng/org.vivoweb.dspacevivo.installer/00-INIT/install-tomcat-solr-app.sh
Installing/compiling VIVO
./DSpace-VIVO/releng/org.vivoweb.dspacevivo.installer/01-VIVO/vivo-git-clone.sh 
./DSpace-VIVO/bundles/org.vivoweb.dspacevivo/script/vivo-compile-and-deploy-for-tomcat.sh

Run - Start/Stop VIVO


Starting VIVO
source ./DSpace-VIVO/bundles/org.vivoweb.dspacevivo/script/00-env.sh 
solr-start.sh
tomcat-start.sh
To show VIVO in a Web Browser
browse-vivo.sh
For stopping VIVO
tomcat-stop.sh
solr-stop.sh

Installing the migration utilities

Step name and descriptionCommands
Install Apache Jena and its other associated tools
./DSpace-VIVO/releng/org.vivoweb.dspacevivo.installer/99-OTHER_TOOLS/jena-git-clone-and-deploy.sh
Compiling/Installing  DSpace-VIVO-EXEMPLE and its code libraries
./DSpace-VIVO/test/org.vivoweb.dspacevivo.etlexample/script/mvn_install_example.sh

Confirm the installation

The purpose of this step is to validate the correct installation of the components necessary for the scenario to proceed. Here is a series of command that can be executed along with their execution result allowing you to compare them with the result of your own installation

Step name and descriptionCommands
Validate that all necessary GIT projects are cloned and properly deployed
Excute 'ls' command from $GIT_REPO
ls -l
total 24
drwxrwxr-x  6 heon heon 4096 mai 20 14:04 data-format-translator
drwxrwxr-x  7 heon heon 4096 mai 20 11:02 DSpace-VIVO
drwxrwxr-x  9 heon heon 4096 mai 20 11:08 Vitro
drwxrwxr-x 11 heon heon 4096 mai 20 11:08 Vitro-languages
drwxrwxr-x 10 heon heon 4096 mai 20 11:08 VIVO
drwxrwxr-x 11 heon heon 4096 mai 20 11:08 VIVO-languages
Execute 'ls' from $GIT_REPO in deploy directory
ls -dl ./DSpace-VIVO/deploy/*/
drwxrwxr-x 9 heon heon 4096 mai 20 11:07 ./DSpace-VIVO/deploy/app-solr/
drwxrwxr-x 9 heon heon 4096 mai 20 11:07 ./DSpace-VIVO/deploy/app-tomcat/
drwxrwxr-x 2 heon heon 4096 mai 20 14:05 ./DSpace-VIVO/deploy/lib/
drwxrwxr-x 7 heon heon 4096 mai 20 14:04 ./DSpace-VIVO/deploy/translator/
drwxrwxr-x 9 heon heon 4096 mai 20 11:13 ./DSpace-VIVO/deploy/vivo-home/
Test the utilities to make sure they are working
Setting up environment variables in your session (From $GIT_REPO)
$ source DSpace-VIVO/bundles/org.vivoweb.dspacevivo/script/00-env.sh
Validate Solr
$ solr-start.sh 
Waiting up to 180 seconds to see Solr running on port 8983 [|]  
Started Solr server on port 8983 (pid=1741315). Happy searching!

$ solr-status.sh 
Found 1 Solr nodes: 
Solr process 56366 running on port 8983
{
  "solr_home":"xxxxxxx/00-GIT/DSpace-VIVO/deploy/app-solr/server/solr",
  "version":"8.11.1 0b002b11819df70783e83ef36b42ed1223c14b50 - janhoy - 2021-12-14 13:50:55",
  "startTime":"2022-05-19T15:15:10.534Z",
  "uptime":"0 days, 17 hours, 25 minutes, 10 seconds",
  "memory":"151 MB (%29.5) of 512 MB"}
Validate Tomcat
$ tomcat-start.sh
Using CATALINA_BASE:   xxxxxxx/00-GIT/DSpace-VIVO/deploy/app-tomcat
Using CATALINA_HOME:   xxxxxxx/00-GIT/DSpace-VIVO/deploy/app-tomcat
Using CATALINA_TMPDIR: xxxxxxx/00-GIT/DSpace-VIVO/deploy/app-tomcat/temp
Using JRE_HOME:        /opt/jdk-11.0.9
Using CLASSPATH:       xxxxxxx/00-GIT/DSpace-VIVO/deploy/app-tomcat/bin/tomcat-juli.jar
Using CATALINA_OPTS:   
Tomcat started.
Test Apache-Jena
$ sparql -version 2>/dev/null
Jena:       VERSION: 3.17.0
Jena:       BUILD_DATE: 2020-11-25T19:40:23+0000

Visual confirmation in your web browser

Visual of SolrVisual for VIVO-DSpace
URL = http://localhost:8983/solr/#/URL = http://localhost:8080/vivo-dspace/

Migrate data from DSpace6-Demo and DSpace-7-Demo into VIVO with default migration values

  • No labels