Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This bean declares the data loader to be used to load publications from. It has one property "dataloadersMap", a map that declares key-value pairs, thas that is a unique key and the corresponding data loader to be used. Here is the point where a new data loader can be added, in case the ones that are already supported do not meet your needs.

...

a)  fieldMap : it is a map that specifies the mapping between the keys that hold the metadata in the input format and the ones that we want to have internal in the BTE. At the end of this article there is a table that summarises the fields that are used from the three online services (pubmed, arXiv and crossRef) - which are the ones that the submission lookup step is capable of reading from the online services - and the keys used internally in the BTE. 

Some loaders have more properties:

...

Code Block
languagehtml/xml
<bean id="phase2linearWorkflow" />

 This bean specifies the processing steps to be applied to the records metadata before they proceed to the output generator of the transformation engine. Currenty, two steps are supported, but you can add yours as well.

 

Code Block
languagehtml/xml
<bean id="fieldMergeModifier" />
<bean id="valueConcatenationModifier" />
<bean id="languageCodeModifier" />

 These beans are the processing steps that are supported by the 2nd phase of transformation engine. The first mergest he merges the values of multiple keys to a new key. The second one concatenates the values of a specific key to a unique value. The third one translated the three-letters language code to two-letters one (ie: eng to en)

   

Code Block
languagehtml/xml
<bean id="org.dspace.submit.lookup.DSpaceWorkspaceItemOutputGenerator" />

This bean declares the output generator to be used which is, in this case, a DSpaceWorkspaceItem generator. It accepts two properties:

a) outputMap: A map from the intermediate keys to the DSpace metadata schema fields. The table below displays the default output mapping. As you can see, some fields, while the are read from the input source, are not output in DSpace since there are no default metadata schema fields to host them. However, if you create the corresponding metadata field registry, you can come back in this configuration to add a map between the input field key and the DSpace metadata field.

b) extraMetadataToKeep: A list b) extraMetadataToKeep: A list of DSpace metadata schema fields to keep in the output


The following table presents the available keys from the online services, the keys that BTE uses in phase1 and the final output map to DSpace metadata fields.

ArxivPubMedCrossRefBTE Key (phase 1)Extra Keys created by BTE (phase 2)DSpace Metadata Field
titlearticleTitlearticleTitletitle dc.title
publishedpubDateyearissued dc.date.issued
id  url  
summaryabstractText abstract dc.description.abstract
comment  note  
pdfUrl  fulltextUrl  
doidoidoidoi dc.identifier
journalRefjournalTitlejournalTitlejournal dc.source
authorauthorauthorsauthors dc.contributor.author
authorWithAffiliation  authorsWithAffiliation  
primaryCategory  arxivCategory dc.subject
category  arxivCategory dc.subject
 pubmedID pubmedID  
 publicationStatus publicationStatus  
 pubModel    
 printISSNprintISSNjissn dc.identifier.issn
 electronicISSNelectronicISSNjeissn  
 journalVolumevolumevolume  
 journalIssueissueissue  
 language language dc.language.iso
 publicationTypedoiTypesubtype dc.type
 primaryKeyword keywordsallkeywords dc.subject
 secondaryKeyword keywordsallkeywords dc.subject
 primaryMeshHeading meshallkeywords dc.subject
 secondaryMeshHeading meshallkeywords dc.subject
 startPagefirstPagefirstpage  
 endPagelastPagelastpage  
  printISBNpisbn dc.identifier.isbn
  electronicISBNeisbn  
  editionNumbereditionnumber  
  seriesTitleseriestitle  
  volumeTitlevolumetitle  
  publicationType   
  editorseditors dc.contributor.editor
  translatorstranslators dc.contributor.other
  chairschairs dc.contributor.other
   naid  
   ncid  
   keywords  
   publisher  

 

 

Info
titleI can see more beans in the configuration file that are not explained above. Why is this?

The configuration file hosts options for two services. BatchImport service and SubmissionLookup service. Thus, some beans that are not used in the first service, are not mentioned in this documentation. However, since both services are based on the BTE, some beans are used by both services.

...