...

  • Change directory to example-scripts/example-jdbc
  • Load the demodb.sql file into your MySQL database. First, create the database and a user that can connect to it:

    No Format
    mysql -u root -p    # if you're running on the VM, the default root password is vitro123
    create database if not exists demodb;
    GRANT ALL PRIVILEGES ON demodb.* TO demodb@localhost IDENTIFIED BY 'demodb';
    exit;
  • The database now exists, along with a user called demodb that can connect to it. Next, load demodb.sql into the new database.

    No Format
    mysql -u demodb -p demodb < demodb.sql
    
  • When prompted, enter the demodb user's password, which was set to demodb above.
  • Edit the vivo.model.xml file
  • Edit the changenamespace-departments.config.xml, changenamespace-people.config.xml, and changenamespace-positions.config.xml files and set the namespace parameters in each one to your VIVO namespace
    • For more information on these parameters and their use, please see ChangeNamespace
  • Edit the run-jdbc.sh and remove-last-jdbc-harvest.sh files and set HARVESTER_INSTALL_DIR= to the directory where you unpacked the Harvester
  • Please refer to http://issues.library.cornell.edu/browse/VIVOHARV-124 regarding a documented issue in databaseclone.config.xml
  • Run bash run-jdbc.sh
  • Restart Tomcat and Apache2. You may also need to force the search index to rebuild before the new data appears. To rebuild the index, open the following URL in a browser: http://your.vivo.address/vivo/SearchIndex. This requires site admin permission and will prompt you to log in if you're not already logged in.
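
The HARVESTER_INSTALL_DIR step above has to be repeated in two scripts, so it can be scripted. The following is only a minimal sketch: set_install_dir is a hypothetical helper, not part of the Harvester, and it assumes each script contains a line that starts with HARVESTER_INSTALL_DIR= and that the directory path contains no | or & characters.

```shell
#!/usr/bin/env bash
# Sketch: point the example scripts at the Harvester install directory.
# Assumption: each script holds a line starting with "HARVESTER_INSTALL_DIR=".

# set_install_dir is a hypothetical helper, not part of the Harvester.
# Usage: set_install_dir /path/to/harvester run-jdbc.sh remove-last-jdbc-harvest.sh
set_install_dir() {
    local dir="$1"
    shift
    local f
    for f in "$@"; do
        # Fail early if the script has no such line to rewrite.
        if ! grep -q '^HARVESTER_INSTALL_DIR=' "$f"; then
            echo "no HARVESTER_INSTALL_DIR= line in $f" >&2
            return 1
        fi
        # Rewrite through a temporary file so the sed call stays portable;
        # writing back with cat keeps the original file's permissions.
        sed "s|^HARVESTER_INSTALL_DIR=.*|HARVESTER_INSTALL_DIR=$dir|" "$f" > "$f.tmp" &&
            cat "$f.tmp" > "$f" &&
            rm -f "$f.tmp" || return 1
    done
}
```

The helper stops at the first file that lacks the expected line rather than silently leaving it unchanged, so a layout different from the one assumed here is noticed before run-jdbc.sh is started.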

...

Once you're ready to run a large dataset, it is advisable to switch the record storage from files to a database. Although this makes it harder to find individual records, it improves performance during the fetch and translate stages. To do so:

  • Edit raw-records.config.xml (for example, with nano) to use TDB, which is a semantic data store

    No Format
    <RecordHandler>
            <Param name="rhClass">org.vivoweb.harvester.util.repo.JenaRecordHandler</Param>
            <Param name="type">tdb</Param>
            <Param name="dbDir">data/raw-records</Param>
    </RecordHandler>
    
  • Edit the translated-records.config.xml to use TDB, which is a semantic data store

    No Format
    
    <RecordHandler>
            <Param name="rhClass">org.vivoweb.harvester.util.repo.JenaRecordHandler</Param>
            <Param name="type">tdb</Param>
            <Param name="dbDir">data/translated-records</Param>
    </RecordHandler>
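
After editing both files, a quick check can confirm that each one now selects TDB. This is a rough sketch only: uses_tdb is a hypothetical helper that looks for the type parameter shown above and does not validate the rest of the XML, and the file paths in the example loop are assumptions to adjust to wherever your config files live.

```shell
#!/usr/bin/env bash
# Sketch: confirm a record handler config selects the TDB store.
# uses_tdb is a hypothetical helper; it only searches for the "type"
# parameter and does not parse or validate the XML.
uses_tdb() {
    grep -q '<Param name="type">tdb</Param>' "$1"
}

# Example (paths are assumptions; adjust to your layout):
#   for f in raw-records.config.xml translated-records.config.xml; do
#       if uses_tdb "$f"; then echo "$f: TDB"; else echo "$f: not TDB"; fi
#   done
```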