Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Attendees: Greg, Tim, Steven, Lynette, Jason, Simeon

Regrets:  Huda

Next week: Simeon out, Jason will run

Discovery (WP3)

  • https://github.com/LD4P/discovery/projects/2 for issues etc. 
  • Draft of a discovery plan: https://docs.google.com/document/d/1zKYW7FQVVNvyd0XjjW0qWznX9PC3jbmOE6Kz_yygPjs/edit?usp=sharing
  • Research: how to go from knowledge graph to an index
  • DASH! (Displaying Authorities Seamlessly Here)
    • Dashboard design meeting kickoff notes
    • User reps D&A meeting: Expect next follow-up in August (Slides: from user reps meeting 2021-04-09 and result was "not no")
    • https://docs.google.com/document/d/1PgQi3xobsPhr9DUHU_YGeimL1OjNiiTdkiNWb36r3Gg/edit
    • Usability testing and followup for DASH: Usability results
      • Usability results, a few little things to finish up
      • GitHub issues
      • 2021-0809-2303
        • Positive meeting with D&A user reps. General attitude is that some form should go into production at some stage. Left reps with access to site and Lenora will collect feedback by end September
        • Some concerns about performance, don't want to slow the item view page
        • Like design philosophy of clicking to see more rather than overloading page
        • Tim did some work this week to indicate source and to better fill out the publication timel 
        • CORS error on wikidata, solved by adding a user-agent string via proxy request. They have strategies to throttle requests which might mean we could not use live calls for map coordinated in production
        • Ready for Monday D&A user reps meeting
  • BANG! (Bibliographic Aspects Newly GUI'd)
    • Jamboard link
    • Expect to include Works. Need to do something beyond what we already have live from the OCLC concordance data.
    • 2021-0809-2003
      • Huda via Slack - I had to unzip that pcc data file. Ran a few small queries directly using curl and command line initiated fuseki server. Will run larger queries soon.
      • Huda working on item page using ISBN. Having to do a number of LC queries in sequence; ISBN->work->hub->translation_hub→works→ISBNs; will also try Steven's wikidata query
      • Experimenting with SVDE dataset using Jena API. So far looks like have to store dataset unzipped
  • DAG Calls
    • 2021-08-20 August slow... last call had discussion of creating a list of linked data discovery systems and KP whitepaper. Have noted that Google have changed how they display KP for people. Huda following up with KP co-authors to decide path forward

...

  • Qa Sinopia Collaboration – Support and evolve QA+cache instance for use with Sinopia
    • 2021-0809-2003
      • Spent a good bit of time discussion Issue #126 which looks at adding
      • No meeting with Stanford this week.  I provided Michelle a report in Slack of the first year activities and expected work for the final year.
      • Dave and I went over the Authorities project board.  There are several issues that require input from Steven for prioritization.  There are two issues that Dave is pursuing. Issue #126 add additional subauthorities for geonames to limit results to common `person populated areas` (e.g. city, state, province, country)Issue #133 allow exact match to be case-insensitive, but if there are multiple matches and one is a case match, then rank case match 1.  There is not an agreement within the group that this is a priority.  There was discussion about other approaches like using machine learning to create an authority of publisher cities.  Seemed to be a general opinion that this would be nice, but not if it causes other scheduled work to drop off.  Dave plans to explore the new subauth for PPA.
  • Best Practices for Authoritative Data working group (focus on Change Management)
    • 2021-0809-20 03 Reviewed Activity Streams min extensions document. (2021-08-30 version)  Mostly acceptable.   A major decision is to make the primary stream a simple notification of the changes and add an `instrument` property that holds the RDF_Patch or other encoding that expresses the changes that occurred.  We are also looking at having an OrderedCollection that has all changes since the last baseline complete dump.  The process for end users for incremental updates is go to the `last` entry in the stream, walk backward until you find your last processed date, and then walk forward processing the updates in order.  I still need to update the document to reflect these changes. Changes since last time:
      • adds usage examples of the entry point Activity Stream (OrderedCollection) including links to first and last page
      • adds usage examples of pages of Activities (OrderedCollectionPage) including links to previous and next page
      • uses core activity types Create, Delete, Update
      • changes name of extended activity types to Deprecate, Merge, Split
      • extracts out rdf_patch and sparql from the Activity by using the instrument property to point to a simple Object that has the patch/query defined in the content property of the object
  • Cache Containerization Plan - Develop a sustainable solution that others can deploy
    • 2021-0809-2003
      • Lynette has auto-build/deploy of the qa_server_container image going to a test repo.  Issue #38 Adding an auto-build/deploy that goes to a public repo keyed off a github release of the main branch will be trivial once the public repo is created in ECR.
      • Lynette going to try to follow Greg's instructions for AWS install
      • Lynette checked in with Justin Littman about our application for open-source license on dockerhub.  He hasn't heard anything, so he reapplied last Friday.
      • - nothing new this week
      • Greg Greg looking at where to store docker images. Don't have permission for dockerhub but Greg looking at public AWS repo

Other Topics

  • Sinolio - Sinopia-FOLIO
    • 2021-08-20 Jason has shared use-cases with Michelle, Jason to follow up on scope
  • OCLC Linked Data / Entities Advisory Group
    • 2021-08-06 Huda has tried out UI and got API access. UI takeaway is that they aren't relying on the wikibase UI but still have same data at the moment. The search includes facet by type (e.g. work, person) and now have number of  results and pagination. Individual person page shows both data and provenance (which adds clutter to page) but doesn't have links to work. Lynette notes that search results are not RDF, will require custom module in order to incorporate in QA, also a lack of context
    • 2021-08-06 Question of what the OCLC business model of for-fee API access might mean. Would lookup via QA be allowed without a fee?
  • PCC 
    • 2021-06-25 Task Group on Non-RDA Entities headed by standing committee on standards will formally propose a list of non-RDA entity types. Steven will join and work picking up again
    • 2021-06-25 Planning group for data exchange met with meeting planned for 9/10 September: day 1 - foundational agreements including vendors (actors/profiles); day 2 - identify partnerships and tests (extending what is already happening as part of grant); also profiles group will pick up in July
    • 2021-08-06 Working on non-RDA types and data-exchange meeting going ahead
  • Authorities in FOLIO
    • Hope to include URIs as part of Cornell FOLIO migration, possible LD4P work
    • 2021-06-11: Devs in FOLIO are working on MARC authority storage and basic features for maintaining authorities. mock-ups provided and have asked for feedback and test cases (positive and negative)
    • 2021-06-25 Waiting on CUL-IT capacity, Simeon had suggested August perhaps

...