...
- These instructions assume that you are performing a clean install. Product functionality may not be as expected if you install over an existing installation of an earlier version.
- VIVO Developers: These instructions are for installing, configuring, and running the VIVO Harvester. Instructions for developing on the VIVO Harvester library can be found in the Harvester Development Toolkit.
The VIVO Harvester is a library of tools written in Java for exporting data from external systems and importing it into VIVO. The code is released in two forms: as a Debian package and as a tarball. Linux machines that use apt-get style repositories (Debian, Ubuntu) can install the Debian package by downloading it and running dpkg -i filename.deb. The tarball can be unpacked and run on any OS where Java is supported.
...
Requirements
- Required Software
- Sun Java 6
- Maven (maven2 or maven3): This is required to compile the code. It automatically retrieves any third-party dependencies, compiles the code, and runs the unit tests.
- Subversion: This is required to check out the code.
- Deb Version
- The Debian package installs itself into the /usr/share/vivo/harvester/ directory. It should prompt before updating config files when installing a new version of the harvester over an old version.
- Tarball
- Unpacks into any directory you wish and can be run with no additional installation. This is a great option if you don't have root access on the machine you're working on.
...
- National Data Source Examples
- Local Data Source Examples
- JDBC
- D2RMap
- CSV
Other things
- RecordHandlers
- Backup segments of the harvest process
- Temporary Models as used by the harvester
- XPathTool use in the Environment Config File
...
Using the harvester as a library works well when you want to embed its functionality into an existing Java application. Using the tools is straightforward. See our Javadoc [1] documentation for specifics on each tool. A working example harvest is also shipped, starting with the harvester 1.0 release. See the DemoPSMerge.java file included in the demos folder.
...
- External Data Source - This is the foreign source
- Fetch - Retrieves data from foreign source
- Raw Data - Stored in a simple database or as simple XML
- Translate - Turns the raw data into RDF
- RDF - RDF models which can be dumped into RDF/XML.
- Score - First finds similarities and rates them, then determines and applies matches based on a threshold of difference.
- Qualify - Changes any unmatched data
- Transfer (Update) - Moves the data into a VIVO model (through an update process if possible).
- VIVO - Final model in RDF, visible from the webapp.
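The two-phase Score step in the list above (rate similarities, then apply matches against a threshold) can be illustrated with a small standalone sketch. This is not the Harvester's Score tool or its API: the class name ScoreSketch, the methods rate and match, and the position-by-position comparison are all hypothetical and chosen only for illustration. The sketch also uses a similarity threshold (higher is closer) rather than a threshold of difference.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration of a two-phase score/match step; not the Harvester API. */
public class ScoreSketch {

    /** Phase 1: rate two labels from 0.0 (nothing in common) to 1.0 (identical, ignoring case). */
    public static double rate(String a, String b) {
        String x = a.trim().toLowerCase();
        String y = b.trim().toLowerCase();
        int longest = Math.max(x.length(), y.length());
        if (longest == 0) {
            return 1.0; // two empty labels are treated as identical
        }
        int same = 0;
        int shortest = Math.min(x.length(), y.length());
        for (int i = 0; i < shortest; i++) {
            if (x.charAt(i) == y.charAt(i)) {
                same++; // count positions where both labels agree
            }
        }
        return (double) same / longest;
    }

    /** Phase 2: pair each incoming label with its best-rated existing label, if it meets the threshold. */
    public static Map<String, String> match(List<String> incoming, List<String> existing, double threshold) {
        Map<String, String> matches = new LinkedHashMap<>();
        for (String in : incoming) {
            String best = null;
            double bestScore = threshold; // candidates below the threshold are never accepted
            for (String ex : existing) {
                double score = rate(in, ex);
                if (score >= bestScore) {
                    best = ex;
                    bestScore = score;
                }
            }
            if (best != null) {
                matches.put(in, best);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Made-up sample labels standing in for harvested and existing records.
        List<String> incoming = Arrays.asList("Smith, John", "Doe, Jane");
        List<String> existing = Arrays.asList("smith, john", "Roe, Richard");
        System.out.println(match(incoming, existing, 0.9));
    }
}
```

Labels left out of the returned map are the unmatched data that a later step, such as Qualify in the list above, would then deal with.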
...
- Fetch
- Translate
- GlozeTranslator
- SPARQLTranslator
- XSLTranslator
- Score/Match
- Transfer
- Diff
- Qualify
- Utilities
...