...
- use entire record as context for resolution
- points vs. shapes in geo entity resolution
- crowdsourcing opportunity?
- OCLC - several passes through data, information from multiple sources (ISNI, VIAF, etc.)
- need public feedback for last 20%
- refine algorithms based on crowdsourcing feedback
- machine transformation and confidence rating – mark that it is machine-generated, with date
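A minimal sketch of the last few points above (confidence rating, marking output as machine-generated with a date, routing the uncertain remainder to public feedback). All names here (`Resolution`, `REVIEW_THRESHOLD`, `resolve_by_machine`, the `example:` identifier) are hypothetical, and the 0.8 cutoff is an assumption for illustration, not a figure from the discussion.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

# Assumed cutoff: anything below this goes to human / crowd review.
REVIEW_THRESHOLD = 0.8


@dataclass
class Resolution:
    source_string: str          # the string as found in the record
    resolved_id: Optional[str]  # hypothetical identifier for the matched entity
    confidence: float           # 0.0 to 1.0, assigned by the matching step
    machine_generated: bool     # flag so later users know a machine made this
    generated_on: date          # when the machine assertion was made

    def needs_public_review(self) -> bool:
        """Low-confidence machine assertions are candidates for crowd feedback."""
        return self.machine_generated and self.confidence < REVIEW_THRESHOLD


def resolve_by_machine(source_string: str, candidate_id: str,
                       confidence: float, today: date) -> Resolution:
    """Wrap a machine match so its provenance travels with it."""
    return Resolution(source_string, candidate_id, confidence, True, today)
```

Usage, under the same assumptions: `resolve_by_machine("Smith, John", "example:123", 0.55, date.today()).needs_public_review()` is true, so that match would be queued for feedback rather than accepted silently.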
(new group)
strings --> things
- need string info in perpetuity
- accuracy, testability of ambiguity
- places ... think maps ...
- people
- dates ... map interface
- subjects
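One way to picture strings --> things while keeping the string in perpetuity: the original string is never overwritten, and the link to a "thing" sits alongside it. This is only a sketch; `Thing`, `MetadataValue`, and the `ex:` identifiers are made-up names for illustration, not an existing schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class Thing:
    kind: str        # e.g. "place", "person", "date", "subject"
    identifier: str  # hypothetical identifier for the entity


@dataclass
class MetadataValue:
    original_string: str                 # kept as-is, never replaced
    thing: Optional[Thing] = None        # the chosen entity, once resolved
    candidates: List[Thing] = field(default_factory=list)

    def link(self, thing: Thing) -> None:
        """Attach a resolved entity; the original string is left untouched."""
        self.thing = thing

    def is_ambiguous(self) -> bool:
        """Unresolved with more than one candidate: a testable notion of ambiguity."""
        return self.thing is None and len(self.candidates) > 1
```

For example, a value built from the string "Paris" with two place candidates reports `is_ambiguous()` as true until `link()` is called, and `original_string` still reads "Paris" afterwards.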
...
what if we had no metadata and started only with full text?
(new group)
challenges
- solutions – would be awesome
...
music parsing
image identity
(new group)
UCSD – mix of auto & manual review
...