Method used to ingest data from PubMed SOAP interface. Brings in data as XML selected by either queries or record ranges and returns a stream of raw RDF/XML. Method can call a variety of fetch methods that allow selecting records based on a range of different attributes such as date added, date modified, number range, affiliation, etc.
To successfully harvest from PubMed:
Runs, sanitizes, and outputs the results of a EFetch request to the xmlWriter
Sanitizes the XML in preparation for the output stream
<?xml version="1.0" encoding="UTF-8"?> <Task type="org.vivoweb.harvester.fetch.PubmedSOAPFetch"> <Param name="email">swilliams@ichp.ufl.edu</Param> <Param name="output">config/recordHandlers/Pubmed-XML-h2RH.xml</Param> <Param name="termSearch">ufl AND edu[ad]</Param> <Param name="numRecords">100</Param> <Param name="batchSize">1000</Param> </Task> |