Harvester . Utilities

Merge

Combines multiple related rdf records into a single rdf record

This tool Merge takes a set of records and a regular expression to find related records and combines them in another set of records

Merge Parameters

wordiness - (optional) sets the lowest level of log messages to be displayed to the console. The lower the log level, the more detailed the messages.

Possible Values:

baseRegex - Regex for finding primary records (with a grouping for the subsection to use to find sub-records)

Example:
<Param name="baseRegex">tableName_(id_-_.?)</Param> - A regular expression to match record IDs and grouping capture to isolate the id.

input - (optional - at least one of this and/or inputOverride) the configuration file that describes the input record set. The parameters for this config file are described in the Record Sets section below.

Example:

inputOverride - (optional - at least one of this and/or input) specify the parameters for the record set without a config file and/or override specific parameters from the given config file. The parameters that can be set/overridden are described in the Record Sets section below.

Example:

output - (optional - at least one of this and/or outputOverride) the configuration file that describes the output record set. The parameters for this config file are described in the Record Sets section below.

Example:

outputOverride - (optional - at least one of this and/or output) specify the parameters for the record set without a config file and/or override specific parameters from the given config file. The parameters that can be set/overridden are described in the Record Sets section below.

Example:

Configuration Example

<Config>
	<Param name="wordiness">INFO</Param>
	<Param name="baseRegex">tableName_(id_-_.*?)</Param>
	<Param name="input">record-set.conf.xml</Param>
	<Param name="outputOverride">rhClass=org.vivoweb.harvester.util.repo.JenaRecordHandler</Param>
	<Param name="outputOverride">dataFieldType=http://yourDomain.com/propbase#myProp</Param>
	<Param name="outputOverride">jenaConfig=jena-model.conf.xml</Param>
</Config>