Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

UNEDITED NOTES 

facilitator: Steven Folsom

Themes Identified:

  • Requires a mix of automated and manual methods
  • Need tools to do this, e.g. present user with automated matches and allow them to make changes (this could then be used to tune the algorithm)
  • There's a potential to open this up to communities beyond library professionals (crowd-sourcing/niche-sourcing)

 

UNEDITED NOTES

Table 1

DPLA: placename resolution

...

  • use entire record as context for resolution
  • points vs. shapes in geo entity resolution
  • crowdsourcing opportunity?
  • OCLC - several passes through data, information from multiple sources (ISNI, VIAF, etc.)
  • need public feedback for last 20%
  • refine algorithms based on crowdsourcing feedback
  • machine transformation and confidence rating – mark that is machine-generated, with date

...

Table 2

strings --> things

  • need string info in perpetuity
  • accuracy, testability of ambiguity
  • places ... think maps ...
  • people
  • dates ... map interface
  • subjects

...

what if we had no metadata and started only with full text?

...

Table 3

challenges

  • solutions – would be awesome

...

music parsing

image identity

...

Table 4

UCSD – mix of auto & manual review

...