Leveraged MARC -> BIBFRAME Converter Framework
FIXME Randy Stern, David Neiman
HFA
HFA (Harvard Film Archive) data was originally stored in FileMaker Pro format. A one-off Java program, using FileMaker Pro database drivers, was written to extract the data from the two relevant tables. The extracted data was written out as XML to a single, very large file.
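The extraction step can be illustrated with a minimal sketch. This is not the original one-off program: the class name TableToXml, the "record" element, and the sample column names are hypothetical, and the rows are passed in as maps so the example runs without a database. In a real extraction they would presumably be read from a JDBC ResultSet. The sketch also assumes that column names are valid XML element names.

```java
import java.io.StringWriter;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical sketch: serializes the rows of one table as XML records.
class TableToXml {

    // Each row maps column name -> value. The table name becomes the root
    // element and every row becomes a <record> element (assumed layout).
    static String toXml(String tableName, List<Map<String, String>> rows) {
        StringWriter out = new StringWriter();
        try {
            XMLStreamWriter xml = XMLOutputFactory.newInstance().createXMLStreamWriter(out);
            xml.writeStartDocument();
            xml.writeStartElement(tableName);
            for (Map<String, String> row : rows) {
                xml.writeStartElement("record");
                for (Map.Entry<String, String> column : row.entrySet()) {
                    xml.writeStartElement(column.getKey());
                    // writeCharacters escapes &, < and > in the value.
                    String value = column.getValue();
                    xml.writeCharacters(value == null ? "" : value);
                    xml.writeEndElement();
                }
                xml.writeEndElement();
            }
            xml.writeEndElement();
            xml.writeEndDocument();
            xml.close();
        } catch (XMLStreamException e) {
            throw new IllegalStateException("Could not write XML", e);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("id", "1");
        row.put("title", "Sample & Title");
        System.out.println(toXml("films", List.of(row)));
    }
}
```

A streaming writer is used here because the output is described as a single very large file; for real data the StringWriter would be replaced by a file-backed writer so the whole document never sits in memory.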
FGDC
FGDC data originated in XML format, so no format conversion was necessary.
BIBFRAME Conversion
The bib2lod project was used as the base code for converting both the HFA and FGDC XML data. The base code was extended separately for each of these input formats. Custom code was necessary for each because the data points available in the two formats differ significantly. The XML input for each format was converted to RDF output, and this RDF output was imported into a Vitrolib web application, one for each format. During development, extensive test cases were written for each format and vetted by a domain expert.
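The XML-to-RDF step can be sketched in a few lines of plain Java. This is only an illustration and does not use or reproduce bib2lod's classes: the class name XmlToTriples, the "record", "id" and "title" element names, the http://example.org/film/ base URI, and the choice to map a title to rdfs:label are all assumptions. It emits N-Triples and assumes that record ids are safe to embed in a URI.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical sketch: turns <record> elements into N-Triples statements.
class XmlToTriples {
    static final String RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type";
    static final String RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label";
    static final String BF_WORK = "http://id.loc.gov/ontologies/bibframe/Work";
    static final String BASE = "http://example.org/film/"; // assumed base URI

    // Emits two triples per record: a type statement and a label.
    static List<String> convert(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            List<String> triples = new ArrayList<>();
            NodeList records = doc.getElementsByTagName("record");
            for (int i = 0; i < records.getLength(); i++) {
                Element record = (Element) records.item(i);
                String subject = "<" + BASE + text(record, "id") + ">";
                triples.add(subject + " <" + RDF_TYPE + "> <" + BF_WORK + "> .");
                triples.add(subject + " <" + RDFS_LABEL + "> \""
                        + escape(text(record, "title")) + "\" .");
            }
            return triples;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse input XML", e);
        }
    }

    // Returns the trimmed text of the first matching child, or "" if absent.
    static String text(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? "" : nodes.item(0).getTextContent().trim();
    }

    // Escapes backslashes, quotes and line breaks for an N-Triples literal.
    static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"")
                .replace("\n", "\\n").replace("\r", "\\r");
    }

    public static void main(String[] args) {
        String xml = "<films><record><id>1</id><title>Sample Title</title></record></films>";
        convert(xml).forEach(System.out::println);
    }
}
```

Because HFA and FGDC expose different data points, a per-format extension would mainly differ in which elements it reads and which predicates it emits. Small, deterministic converters of this shape are also easy to cover with test cases that compare the emitted triples against an expected set.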