History
The Harvester began life as a specialized ETL tool meant to ease the process of data ingest into VIVO. It has transformed into a general semantic ETL tool.
Introduction
The VIVO Harvester is a library of tools designed to take data from external data sources and ingest it into VIVO. The library was originally developed at the University of Florida during the 2009-2011 NIH Grant. Development of the Harvester follows a monthly release cycle. New features are built in the first 2-3 weeks of the cycle, with testing and releasing occurring during the 3rd and 4th week of the cycle. Use the links below to learn more about individual tools the harvester is comprised of, or read the Harvester User Guide to learn more about using the harvester.
Harvester Overview
Harvester Instructions
Walkthrus for example scripts
Video Walkthrus
Screencasts of example harvester runs: https://sourceforge.net/projects/vivo/files/VIVO%20Harvester/Demonstration/
Case Studies/Examples
University of Florida PubMed Harvest
University of Florida PeopleSoft Harvest
University of Florida Department of Sponsored Research Harvest
Class Documentation
- Fetch
- Translate
- GlozeTranslator
- SPARQLTranslator
- XSLTranslator
- Score/Match
- Transfer
- Diff
- Qualify
- Utilities
Development Documentation
- Harvester Development Toolkit
- Harvester Documentation Procedures
- Harvester Planned Features
- Advanced PubMed name matching diagram
- New Harvest Workflow Proposal
- RecordHandler Tool Specification
- RecordCompare Tool Specification
- SPARQLFetch Tool Specification
- Update Tool Specification
- Arbitrary Sparql Structure Scoring