UNEDITED NOTES
facilitator: Steven Folsom
Themes Identified:
- Requires a mix of automated and manual methods
- Need tools to do this, e.g. present user with automated matches and allow them to make changes (this could then be used to tune the algorithm)
- There's a potential to open this up to communities beyond library professionals (crowd-sourcing/niche-sourcing)
UNEDITED NOTES
Table 1
DPLA: placename resolution
...
- use entire record as context for resolution
- points vs. shapes in geo entity resolution
- crowdsourcing opportunity?
- OCLC - several passes through data, information from multiple sources (ISNI, VIAF, etc.)
- need public feedback for last 20%
- refine algorithms based on crowdsourcing feedback
- machine transformation and confidence rating – mark that is machine-generated, with date
...
Table 2
strings --> things
- need string info in perpetuity
- accuracy, testability of ambiguity
- places ... think maps ...
- people
- dates ... map interface
- subjects
...
what if we had no metadata and started only with full text?
...
Table 3
challenges
- solutions – would be awesome
...
music parsing
image identity
...
Table 4
UCSD – mix of auto & manual review
...