Page History

...

Change directory to example-scripts/example-wos
Edit the vivo.model.xml file
- Set the dbURL, dbUser, dbPass, and Namespace
- For more information on these parameters and their use, please see Harvester _ vivo _ configuration _ file
Edit changenamespace-authors.config.xml, changenamespace-authorship.config.xml, changenamespace-publication.config.xml, changenamespace-subjectarea.config.xml, changenamespace-subjectarea.config.xml and changenamespace-journal.config.xml files and set the namespace parameters in each one to be your vivo namespace
- For more information on these parameters and their use, please see ChangeNamespace
Edit the run-wos.sh file and set the HARVESTER_INSTALL_DIR= to be the directory you unpacked the harvester in
Run bash run-wos.sh

...

Once your ready to run a large dataset, it is advisable to the record storage from files to a database. Although this will make it harder to find individual records, speed and performance will be increased during the fetch and translate stage. To do so:

Edit the nano raw-records.config.xml to use TDB, which is a semantic data store

No Format
<RecordHandler> <Param name="rhClass">org.vivoweb.harvester.util.repo.JenaRecordHandler</Param> <Param name="type">tdb</Param> <Param name="dbDir">data/raw-records</Param> </RecordHandler>

Edit the translated-records.config.xml to use TDB, which is a semantic data store

No Format
<RecordHandler> <Param name="rhClass">org.vivoweb.harvester.util.repo.JenaRecordHandler</Param> <Param name="type">tdb</Param> <Param name="dbDir">data/translated-records</Param> </RecordHandler>

...

Space shortcuts

Page tree

Versions Compared

Old Version 1

New Version 2

Key