...

Meeting Materials

...

Working Group Charter and Logistics

Reference: Working Group Charter

  • What does it mean for linked data to be an "approach" rather than a part of a system?
    • For example, having a REST API return linked open data (LOD)
  • Sanderson: Are we concerned with update, delete, write, and similar operations?
    • Rayle: Those are part of the larger picture, but for the purposes of this group's deliverables, we are focused on search and return of data. Everything is fair game in brainstorming, and a subsequent working group is possible. 
  • Sanderson: Previous attempts at standardizing search APIs have failed. We need to be very specific about details.
    • Rayle: Add links to outside documentation under "References".

...

Terminology (inclusion) - Search API Best Practices for Authoritative Data Working Group

...

Instructions: List all topics related to accessing authoritative data, including topics that are not directly related to search APIs. Topics of interest to the group that are not directly related to search APIs will be considered at the tail end of the working group if there is time, or for a new working group if there is enough interest. So let all your thoughts flow.

  • Caching
    • NOTE: There is some overlap in the ideas listed under the two areas of caching. For example, both types of caching raise questions about versioning of authorities.
    • Caching 1 (Storing local copies of entire dataset)
      • Downloads - cache management synchronization
      • Versioning of authorities and impact on updates to cached dataset
    • Caching 2 (Caching of a single label or small pieces of data about a single term)
      • Notification of updates when entity descriptions change, or at least ability to search by dates/types of changes
      • Deprecations - mechanism to state that a term has been replaced by another term - how do end users know that the term is no longer valid?
      • Versioning of authorities and impact on labels that have been deprecated, changed, or deleted
      • Learning how something has changed and knowing what to do in response
  • Reconciliation
    • entity reconciliation
    • reconciliation - reconcile more than just adjunct works and references to outside data and across languages
    • concerns with sameAs connections between data that may not be accurate
    • reconciliation - why isn't this just the OpenRefine API, which is implemented and well understood?
  • Accuracy 1 (The ability of an API to return relevant data), Accuracy 2 (The ability of a user to select a term from multiple similar terms given a set of search results), and Entity Resolution (Strings-to-Things - The process of identifying that a String Label is the label for a thing identified by a URI)
    • NOTE: I put these all together because they are highly interrelated. What the API returns, and how relevant it is, greatly affects the user's ability to select the correct term from the results.
    • NOTE: Authorities bear the primary weight of entity resolution by assigning a primary label to a URI.  Once a term is selected, the label comes from the authority.  It is in this section since the end user has a string in mind when they type a query, and the search/selection process turns the search query string into a selected URI with a label.
    • How do humans choose between two similar but distinct entities?
    • identifying that you have the authority you want - e.g., you got the right John Smith
    • Extended Context
      • right information to search and display - need an easy way to define
      • moving to linked data - which attributes to include for each entity
      • May need more context sometimes
      • Can users personalize how the data comes back?
      • being able to manage and choose the context of your query where context is provenance, community recommendations, relationships, etc
      • Presence or absence of connections to other entities; making choices based on interconnectedness and fullness of data
    • Order of Presentation 
      • Rank ordering so that the best results are displayed first
      • Listed alphabetically
      • Option for left anchored search
      • Pagination
        • expanding set of results
        • turning off pagination to get all results
        • server side pagination vs client side
    • Other 
      • Which labels to display when multiple labels - across languages and scripts and kinds of name
  • API approach
    • browsing with context when you know what you are looking for and have a good amount of time; catalogers know what they are looking for
    • searching to discover when you don't know what you are looking for
    • One service doesn't fit all needs.  Suggest, Search, Browse - each serves different needs
    • How to surface versioning of authorities through API
    • what is the impact on APIs with respect to deprecations
    • to enable local/specialized authorities to participate by implementing the API - allow local authorities to define a narrower term to a broader term
    • API - focus first on retrieval by REST before search and browse
    • more consistent and granular error reporting to determine the source of the error
    • handling different data models (the organization of the returned data)
    • handling different syntaxes
  • Data Related
    • Which data elements are unique or intended to be unique?
    • Which data elements are intended for end users (such as library users browsing a collection) and which are intended primarily for internal use
    • Suggest SKOS ontology (avoid talking about modeling for years)
    • Suggest JSON-LD for format
    • handling different data models (the organization of the returned data)
    • handling different syntaxes
  • Scalability
    • responsiveness for autocomplete (< 10 ms)
  • Other
    • indexing - connections of relationships between entities
    • suggest 2.0 document
    • Learning from users
    • Drive recommendations from Use cases
    • discovery of (and enrollment in) new authorities
    • standards-based
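
Illustration (not a group decision): several of the topics above (left-anchored search, rank ordering, server-side pagination, deprecations, SKOS with JSON-LD) can be sketched together in a few lines. Everything in this sketch is an assumption made for discussion: the `Term` record, the sample data, the `example.org` URIs, the `search` and `to_jsonld` function names, and the choice of `dcterms:isReplacedBy` and `owl:deprecated` to mark a replaced term. The ranking rule (earlier match position first, then alphabetical) is only one possible ordering.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Term:
    """Hypothetical authority record: a URI, a primary label, and an
    optional pointer to the term that replaces it when deprecated."""
    uri: str
    label: str
    replaced_by: Optional[str] = None


# Made-up sample data for illustration only.
TERMS = [
    Term("http://example.org/auth/1", "Smith, John, 1900-1980"),
    Term("http://example.org/auth/2", "Smith, John, 1950-"),
    Term("http://example.org/auth/3", "Smithson, Ann"),
    Term("http://example.org/auth/4", "Blacksmith, Joe"),
    Term("http://example.org/auth/5", "Smyth, Jon",
         replaced_by="http://example.org/auth/2"),
]


def search(query, terms, left_anchored=False, offset=0, limit=2):
    """Substring search with optional left anchoring, a simple rank
    (earlier match position first, then alphabetical), and
    server-side pagination via offset/limit."""
    q = query.casefold()
    hits = []
    for term in terms:
        label = term.label.casefold()
        pos = label.find(q)
        if pos < 0 or (left_anchored and pos != 0):
            continue
        hits.append((pos, label, term))
    hits.sort(key=lambda h: (h[0], h[1]))
    page = [h[2] for h in hits[offset:offset + limit]]
    return {"total": len(hits), "offset": offset, "results": page}


def to_jsonld(term):
    """Serialize a term as a SKOS concept in JSON-LD; a deprecated term
    carries a link to its replacement (one assumed convention)."""
    doc = {
        "@context": {
            "skos": "http://www.w3.org/2004/02/skos/core#",
            "dcterms": "http://purl.org/dc/terms/",
            "owl": "http://www.w3.org/2002/07/owl#",
        },
        "@id": term.uri,
        "@type": "skos:Concept",
        "skos:prefLabel": term.label,
    }
    if term.replaced_by:
        doc["owl:deprecated"] = True
        doc["dcterms:isReplacedBy"] = {"@id": term.replaced_by}
    return doc
```

With this sample data, `search("smith", TERMS)` reports four matches in total and returns the first page of two; passing `left_anchored=True` drops "Blacksmith, Joe", and `offset=2` fetches the next page. A client caching a single label could inspect `dcterms:isReplacedBy` in the `to_jsonld` output to learn that a term is no longer valid and which term to use instead.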

...