Page History
...
This bean declares the data loader to be used to load publications from. It has one property "dataloadersMap", a map that declares key-value pairs, thas that is a unique key and the corresponding data loader to be used. Here is the point where a new data loader can be added, in case the ones that are already supported do not meet your needs.
...
a) fieldMap : it is a map that specifies the mapping between the keys that hold the metadata in the input format and the ones that we want to have internal in the BTE. At the end of this article there is a table that summarises the fields that are used from the three online services (pubmed, arXiv and crossRef) - which are the ones that the submission lookup step is capable of reading from the online services - and the keys used internally in the BTE.
Some loaders have more properties:
...
Code Block | ||
---|---|---|
| ||
<bean id="phase2linearWorkflow" /> |
This bean specifies the processing steps to be applied to the records metadata before they proceed to the output generator of the transformation engine. Currenty, two steps are supported, but you can add yours as well.
Code Block | ||
---|---|---|
| ||
<bean id="fieldMergeModifier" />
<bean id="valueConcatenationModifier" />
<bean id="languageCodeModifier" /> |
These beans are the processing steps that are supported by the 2nd phase of transformation engine. The first mergest he merges the values of multiple keys to a new key. The second one concatenates the values of a specific key to a unique value. The third one translated the three-letters language code to two-letters one (ie: eng to en)
Code Block | ||
---|---|---|
| ||
<bean id="org.dspace.submit.lookup.DSpaceWorkspaceItemOutputGenerator" /> |
This bean declares the output generator to be used which is, in this case, a DSpaceWorkspaceItem generator. It accepts two properties:
a) outputMap: A map from the intermediate keys to the DSpace metadata schema fields. The table below displays the default output mapping. As you can see, some fields, while the are read from the input source, are not output in DSpace since there are no default metadata schema fields to host them. However, if you create the corresponding metadata field registry, you can come back in this configuration to add a map between the input field key and the DSpace metadata field.
b) extraMetadataToKeep: A list b) extraMetadataToKeep: A list of DSpace metadata schema fields to keep in the output
The following table presents the available keys from the online services, the keys that BTE uses in phase1 and the final output map to DSpace metadata fields.
Arxiv | PubMed | CrossRef | BTE Key (phase 1) | Extra Keys created by BTE (phase 2) | DSpace Metadata Field |
---|---|---|---|---|---|
title | articleTitle | articleTitle | title | dc.title | |
published | pubDate | year | issued | dc.date.issued | |
id | url | ||||
summary | abstractText | abstract | dc.description.abstract | ||
comment | note | ||||
pdfUrl | fulltextUrl | ||||
doi | doi | doi | doi | dc.identifier | |
journalRef | journalTitle | journalTitle | journal | dc.source | |
author | author | authors | authors | dc.contributor.author | |
authorWithAffiliation | authorsWithAffiliation | ||||
primaryCategory | arxivCategory | dc.subject | |||
category | arxivCategory | dc.subject | |||
pubmedID | pubmedID | ||||
publicationStatus | publicationStatus | ||||
pubModel | |||||
printISSN | printISSN | jissn | dc.identifier.issn | ||
electronicISSN | electronicISSN | jeissn | |||
journalVolume | volume | volume | |||
journalIssue | issue | issue | |||
language | language | dc.language.iso | |||
publicationType | doiType | subtype | dc.type | ||
primaryKeyword | keywords | allkeywords | dc.subject | ||
secondaryKeyword | keywords | allkeywords | dc.subject | ||
primaryMeshHeading | mesh | allkeywords | dc.subject | ||
secondaryMeshHeading | mesh | allkeywords | dc.subject | ||
startPage | firstPage | firstpage | |||
endPage | lastPage | lastpage | |||
printISBN | pisbn | dc.identifier.isbn | |||
electronicISBN | eisbn | ||||
editionNumber | editionnumber | ||||
seriesTitle | seriestitle | ||||
volumeTitle | volumetitle | ||||
publicationType | |||||
editors | editors | dc.contributor.editor | |||
translators | translators | dc.contributor.other | |||
chairs | chairs | dc.contributor.other | |||
naid | |||||
ncid | |||||
keywords | |||||
publisher |
Info | ||
---|---|---|
| ||
The configuration file hosts options for two services. BatchImport service and SubmissionLookup service. Thus, some beans that are not used in the first service, are not mentioned in this documentation. However, since both services are based on the BTE, some beans are used by both services. |
...