
...

This bean declares the data loaders from which publications can be loaded. It has one property, "dataloadersMap", a map of key-value pairs in which each entry pairs a unique key with the data loader to be used for it. This is the place to add a new data loader if the ones already supported do not meet your needs.
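As a rough sketch of what registering loaders might look like, the fragment below adds three entries to "dataloadersMap". Only the property name comes from the text above. The bean id `exampleDataLoader` and the referenced loader bean ids are placeholders, and the `class` attribute is left out because it depends on your installation.

```xml
<!-- Sketch only: the bean id and all referenced bean ids are hypothetical -->
<bean id="exampleDataLoader">
    <!-- class attribute intentionally omitted; copy it from your own configuration file -->
    <property name="dataloadersMap">
        <map>
            <!-- key: a unique name for the loader; value-ref: the bean that implements it -->
            <entry key="pubmed" value-ref="examplePubmedLoader" />
            <entry key="arxiv" value-ref="exampleArxivLoader" />
            <!-- a custom loader would be registered the same way, under its own key -->
            <entry key="mysource" value-ref="exampleCustomLoader" />
        </map>
    </property>
</bean>
```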

...

a) fieldMap: a map that specifies the mapping between the keys that hold the metadata in the input format and the keys used internally in the BTE. The table at the end of this article summarises the fields read from the three online services (PubMed, arXiv and CrossRef), which are the services the submission lookup step can read from, together with the corresponding keys used internally in the BTE.
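To make the mapping concrete, here is a minimal sketch of a "fieldMap" property for an arXiv loader, using three rows from the table at the end of this article. It assumes that the `key` attribute holds the input field name and the `value` attribute holds the BTE key; check the existing entries in your configuration file to confirm the direction.

```xml
<!-- Sketch only: assumes key = field name in the input format, value = internal BTE key -->
<property name="fieldMap">
    <map>
        <entry key="published" value="issued" />
        <entry key="summary" value="abstract" />
        <entry key="journalRef" value="journal" />
    </map>
</property>
```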

Some loaders have more properties:

...

Code Block

language	html/xml

<bean id="phase2linearWorkflow" />

This bean specifies the processing steps to be applied to the records' metadata before they proceed to the output generator of the transformation engine. Currently, three steps are supported, but you can add your own as well.
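Adding your own step could look roughly like the sketch below. The property name `process` is an assumption about how the workflow bean receives its steps, and `myCustomModifier` is a hypothetical bean that you would have to define yourself; verify both against your configuration file before relying on them.

```xml
<bean id="phase2linearWorkflow">
    <!-- class attribute omitted here; keep the one already present in your file -->
    <!-- "process" is an assumed property name for the ordered list of steps -->
    <property name="process">
        <list>
            <ref bean="fieldMergeModifier" />
            <ref bean="valueConcatenationModifier" />
            <ref bean="languageCodeModifier" />
            <!-- hypothetical custom step, appended so that it runs last -->
            <ref bean="myCustomModifier" />
        </list>
    </property>
</bean>
```

The steps are listed in the order they should run, so a custom step that depends on a merged key would go after the merge step.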

Code Block

language	html/xml

<bean id="fieldMergeModifier" />
<bean id="valueConcatenationModifier" />
<bean id="languageCodeModifier" />

These beans are the processing steps supported by the second phase of the transformation engine. The first one merges the values of multiple keys into a new key. The second one concatenates the values of a specific key into a single value. The third one translates a three-letter language code into a two-letter one (e.g. eng to en).
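The sketch below shows how the first two steps might be configured to produce the "allkeywords" key that appears in the table at the end of this article. The property names (`mergeFieldMap`, `field`, `separator`) are assumptions and are not confirmed by the text above, and the `class` attributes are omitted; treat the fragment as an illustration of the idea rather than a copy of the shipped configuration.

```xml
<!-- Sketch only: property names are assumed, class attributes omitted -->
<bean id="fieldMergeModifier">
    <property name="mergeFieldMap">
        <map>
            <!-- new key "allkeywords" collects the values of "keywords" and "mesh" -->
            <entry key="allkeywords">
                <list>
                    <value>keywords</value>
                    <value>mesh</value>
                </list>
            </entry>
        </map>
    </property>
</bean>

<bean id="valueConcatenationModifier">
    <!-- join all values of "allkeywords" into a single value -->
    <property name="field" value="allkeywords" />
    <property name="separator" value=";" />
</bean>
```

Under these assumptions, a record with keywords "a" and "b" and a MeSH heading "c" would first gain an "allkeywords" key holding the three values, which the second step would then join into one string.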

Code Block

language	html/xml

<bean id="org.dspace.submit.lookup.DSpaceWorkspaceItemOutputGenerator" />

This bean declares the output generator to be used which is, in this case, a DSpaceWorkspaceItem generator. It accepts two properties:

a) outputMap: A map from the intermediate keys to the DSpace metadata schema fields. The table below displays the default output mapping. As you can see, some fields, while they are read from the input source, are not output to DSpace since there are no default metadata schema fields to host them. However, if you create the corresponding field in the metadata registry, you can come back to this configuration and add a mapping between the input field key and the DSpace metadata field.

b) extraMetadataToKeep: A list of DSpace metadata schema fields to keep in the output.
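The two properties could be filled in along the lines of the sketch below. The attribute direction in "outputMap" (BTE key in `key`, DSpace field in `value`) follows the wording above but is an assumption, as is the `dc.description.example` field, which is hypothetical and would first have to exist in your metadata registry. The `class` attribute is omitted.

```xml
<bean id="org.dspace.submit.lookup.DSpaceWorkspaceItemOutputGenerator">
    <!-- class attribute omitted; keep the one already present in your file -->
    <property name="outputMap">
        <map>
            <!-- assumed direction: key = intermediate BTE key, value = DSpace field -->
            <entry key="title" value="dc.title" />
            <entry key="issued" value="dc.date.issued" />
            <!-- a key with no default mapping, sent to a hypothetical field -->
            <entry key="note" value="dc.description.example" />
        </map>
    </property>
    <property name="extraMetadataToKeep">
        <list>
            <!-- hypothetical field to carry through to the output unchanged -->
            <value>dc.description.example</value>
        </list>
    </property>
</bean>
```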

The following table presents the available keys from the online services, the keys that BTE uses in phase1 and the final output map to DSpace metadata fields.

Arxiv	PubMed	CrossRef	BTE Key (phase 1)	Extra Keys created by BTE (phase 2)	DSpace Metadata Field
title	articleTitle	articleTitle	title		dc.title
published	pubDate	year	issued		dc.date.issued
id			url
summary	abstractText		abstract		dc.description.abstract
comment			note
pdfUrl			fulltextUrl
doi	doi	doi	doi		dc.identifier
journalRef	journalTitle	journalTitle	journal		dc.source
author	author	authors	authors		dc.contributor.author
authorWithAffiliation			authorsWithAffiliation
primaryCategory			arxivCategory		dc.subject
category			arxivCategory		dc.subject
	pubmedID		pubmedID
	publicationStatus		publicationStatus
	pubModel
	printISSN	printISSN	jissn		dc.identifier.issn
	electronicISSN	electronicISSN	jeissn
	journalVolume	volume	volume
	journalIssue	issue	issue
	language		language		dc.language.iso
	publicationType	doiType	subtype		dc.type
	primaryKeyword		keywords	allkeywords	dc.subject
	secondaryKeyword		keywords	allkeywords	dc.subject
	primaryMeshHeading		mesh	allkeywords	dc.subject
	secondaryMeshHeading		mesh	allkeywords	dc.subject
	startPage	firstPage	firstpage
	endPage	lastPage	lastpage
		printISBN	pisbn		dc.identifier.isbn
		electronicISBN	eisbn
		editionNumber	editionnumber
		seriesTitle	seriestitle
		volumeTitle	volumetitle
		publicationType
		editors	editors		dc.contributor.editor
		translators	translators		dc.contributor.other
		chairs	chairs		dc.contributor.other
			naid
			ncid
			keywords
			publisher

Info

title	I can see more beans in the configuration file that are not explained above. Why is this?

The configuration file hosts options for two services: the BatchImport service and the SubmissionLookup service. Thus, some beans that are not used by the first service are not mentioned in this documentation. However, since both services are based on the BTE, some beans are used by both.

...
