This is the detailed design document for the harvesting of National Library of Medicine publications.
Overview
Method used to ingest data from NIH EUtils interface. It is a subclass of NIHFetch which has a description of the command line arguments.
Useage
NLMJournalFetch="java $OPTS Xms$MIN_MEM -Xmx$MAX_MEM -Dharvester-task=$HARVESTER_TASK -Dprocess-task=NLMJournalFetch -cp bin/harvester$VERSION.jar:bin/dependency/* org.vivoweb.harvester.fetch.NLMJournalFetch"
Methods
serializeFetchRequest
- create EFetchJournalsServiceStub
- create EFetchResult from EFetchJournalsServiceStub with EFetchRequest which was passed in.
- Using an XMLStreamWriter to create a MTOMAwareXMLSerializer
sanitizeXML
- take in String
- replace invalid characters
- write out sanatized xml