Introduction
The VIVO Harvester is a library of tools designed to read and transform data from external data sources and ingest it into VIVO or potentially any other triplestore or semantic platform. The library was originally developed by a team at the University of Florida during the 2009-2011 NIH Grant.
The VIVO Harvester is currently maintained on GitHub by John Fereira from Cornell as part of VIVO-related projects including AgriVIVO and USDA VIVO. Other contributions to ongoing Harvester enhancements have been made by Alex Viggio through Symplectic, Ltd.
Development of the Harvester at first followed a monthly release cycle. Now a new version of the VIVO Harvester is released alongside each never major version of VIVO. Use the links below to learn more about individual tools the harvester is comprised of, or read the Harvester User Guide to learn more about using the harvester.
Harvester Instructions
Walkthrus for example scripts
Deployment
Video Walkthrus
Screencasts of example harvester runs: https://sourceforge.net/projects/vivo/files/VIVO%20Harvester/Demonstration/
Case Studies/Examples
University of Florida PubMed Harvest
University of Florida PeopleSoft Harvest
University of Florida Department of Sponsored Research Harvest
Class Documentation
- Fetch
- Translate
- GlozeTranslator
- SPARQLTranslator
- XSLTranslator
- Score/Match
- Transfer
- Diff
- Qualify
- Utilities
Development Documentation
- Harvester Development Toolkit
- Harvester Documentation Procedures
- Harvester Planned Features
- Advanced PubMed name matching diagram
- New Harvest Workflow Proposal
- RecordHandler Tool Specification
- RecordCompare Tool Specification
- SPARQLFetch Tool Specification
- Update Tool Specification
- Arbitrary Sparql Structure Scoring