Date:

Attendees: Tim, Jason, Greg, Huda, Lynette, Simeon

Regrets:    Steven

Discovery (WP3)

  • https://github.com/LD4P/discovery/projects/2 for issues etc. 
  • Draft of a discovery plan: https://docs.google.com/document/d/1zKYW7FQVVNvyd0XjjW0qWznX9PC3jbmOE6Kz_yygPjs/edit?usp=sharing
  • Research: how to go from knowledge graph to an index
  • DASH! (Displaying Authorities Seamlessly Here)
    • Dashboard design meeting kickoff notes
    • User reps D&A meeting: Expect next follow-up in August (Slides: from user reps meeting 2021-04-09 and result was "not no")
    • https://docs.google.com/document/d/1PgQi3xobsPhr9DUHU_YGeimL1OjNiiTdkiNWb36r3Gg/edit
    • Usability testing and followup for DASH: Usability results
      • Usability results, a few little things to finish up
      • GitHub issues
      • 2021-08-1323
        • CORS error on Wikidata, solved by adding a user-agent string via proxy request. Wikidata has strategies to throttle requests, which might mean we cannot use live calls for map coordinates in production
        • Ready for Monday D&A user reps meeting
        • Worked with Greg and Tim to get the latest code into the "dashExperiment" GitHub branch and deployed onto http://ld4p3-web.library.cornell.edu/ . It now uses the same indexes as FOLIO D&A and the recent post-FOLIO transition code from upstream D&A Blacklight
          • Tim finished mockups for author pages that more closely follow the design of the original subject pages. 
          • Huda will review any remaining bugs in DASH work
        • User reps meeting now scheduled for Aug 23 for better attendance
  • BANG! (Bibliographic Aspects Newly GUI'd)
    • Jamboard link
    • Expect to include Works. Need to do something beyond what we already have live from the OCLC concordance data.
    • 2021-08-0620
      • Steven has been working on Wikidata queries based on data such as LCCN from MARC record, focusing on motion pictures
      • Huda getting compressed data (12GB) from Dave for the PCC ShareVDE dataset
      • Thread for BANG! work - possibility of adding Wikidata info on one side and Hubs on the other. Not yet sure about the SVDE data - they typically show the superwork and we are not sure how this translates to our catalog environment. Would like to be able to link to both physical and digital items. Open question of whether we should focus on particular formats or subsets of items; will need to query the data to better understand
      2021-08-13
      • Huda working on item page using ISBN. Having to do a number of LC queries in sequence: ISBN → work → hub → translation_hub → works → ISBNs; will also try Steven's Wikidata query
      • Experimenting with SVDE dataset using Jena API. So far it looks like we have to store the dataset unzipped
      • Huda has 33GB PCC ShareVDE dataset from Dave (Fuseki triplestore dump), will need Java to query
      • Expect to set up fork for BANG! to start experimenting with UI
      • Greg working to set up Huda with an AWS dev machine
  • DAG Calls
    • 2021-08-13 Next meeting for Aug 17, plan to work on spreadsheet and KP whitepaper. Have noted that Google have changed how they display KP for people. Huda following up with KP co-authors to decide path forward
    • 2021-08-20 August slow... last call had discussion of creating a list of linked data discovery systems for discovery. Build on some past work from OCLC that had linked data systems of all sorts. Also looking at knowledge panel white paper. For future meetings looking to get presentations from folks who have done relevant UX research.

Linked-Data Authority Support (WP2)

  • Qa Sinopia Collaboration – Support and evolve QA+cache instance for use with Sinopia
    • 2021-08-20
      • No meeting with Stanford this week
      • Michelle has requested info for the report including info on working groups, QA improvements for search accuracy and performance, and containerization.  Hoping to do most of this offline in Slack/email before next meeting.
      • I provided Michelle a report in Slack of the first year activities and expected work for the final year.
      • Dave and I went over the Authorities project board.  There are several issues that require input from Steven for prioritization.  There are two issues that Dave is pursuing. 
        • Issue #126 add additional subauthorities for geonames to limit results to common `person populated areas` (e.g. city, state, province, country)
        • Issue #133 allow exact match to be case-insensitive, but if there are multiple matches and one is a case match, then rank the case match first
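The ranking rule described in Issue #133 can be illustrated with a minimal sketch. This is not the QA implementation; the function name `rank_exact_matches` and the plain list of label strings are assumptions made only for illustration.

```python
def rank_exact_matches(query, labels):
    """Return labels equal to `query` ignoring case, case-exact matches first.

    Hypothetical helper: `labels` is assumed to be a plain list of strings.
    """
    folded = query.casefold()
    # Keep only labels that match the query when case is ignored.
    matches = [label for label in labels if label.casefold() == folded]
    # sorted() is stable and False sorts before True, so labels that match
    # the query's case exactly move to the front; the rest keep their order.
    return sorted(matches, key=lambda label: label != query)


if __name__ == "__main__":
    print(rank_exact_matches("Paris", ["paris", "PARIS", "Paris", "Parish"]))
```

Using a stable sort keeps the original relative order of the remaining case-insensitive matches, so only the case match changes position.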
  • Best Practices for Authoritative Data working group (focus on Change Management)
    • 2021-08-13 No meeting this week. Meeting next Monday to discuss Activity Streams min extensions document.
    • 2021-08-20 Reviewed Activity Streams min extensions document.  Mostly acceptable.  A major decision is to make the primary stream a simple notification of the changes and add an `instrument` property that holds the RDF_Patch or other encoding that expresses the changes that occurred.  We are also looking at having an OrderedCollection that has all changes since the last complete baseline dump.  The process for end users doing incremental updates is to go to the `last` entry in the stream, walk backward until you find your last processed date, and then walk forward processing the updates in order.  I still need to update the document to reflect these changes.
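The incremental update process described above (start at the `last` entry, walk backward to the last processed date, then walk forward in order) could look roughly like the following sketch. It is only an illustration under stated assumptions: pages are modelled as an in-memory dict rather than fetched over HTTP, each page has a `prev` link and chronologically ordered `orderedItems`, each item carries a `published` timestamp in one consistent UTC ISO 8601 format (so string comparison matches time order), and the function name `pending_updates` is hypothetical.

```python
def pending_updates(pages, last_page_id, last_processed):
    """Collect stream items newer than `last_processed`, oldest first.

    Assumed shape (illustrative only):
      pages[page_id] = {"prev": page_id or None,
                        "orderedItems": [{"published": "...", "instrument": ...}]}
    """
    newer_by_page = []
    page_id = last_page_id
    # Walk backward from the `last` page until we reach already-processed items.
    while page_id is not None:
        page = pages[page_id]
        newer = [item for item in page["orderedItems"]
                 if item["published"] > last_processed]
        newer_by_page.append(newer)
        if len(newer) < len(page["orderedItems"]):
            break  # this page contains an item we have already processed
        page_id = page["prev"]
    # Walk forward: oldest page first, items already in order within a page.
    updates = []
    for items in reversed(newer_by_page):
        updates.extend(items)
    return updates


if __name__ == "__main__":
    pages = {
        "p1": {"prev": None, "orderedItems": [
            {"published": "2021-08-01T00:00:00Z", "instrument": "patch-1"},
            {"published": "2021-08-05T00:00:00Z", "instrument": "patch-2"}]},
        "p2": {"prev": "p1", "orderedItems": [
            {"published": "2021-08-10T00:00:00Z", "instrument": "patch-3"}]},
    }
    for item in pending_updates(pages, "p2", "2021-08-01T00:00:00Z"):
        print(item["instrument"])
```

A consumer would apply each item's `instrument` payload in the returned order and then record the newest `published` value as its new last processed date.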
  • Cache Containerization Plan - Develop a sustainable solution that others can deploy
    • 2021-08-20
      • Lynette has auto-build/deploy of image going to a test repo.  Issue #38 Adding an auto-build/deploy that goes to a public repo keyed off a GitHub release of the main branch will be trivial once the public repo is created in ECR.
      • Lynette adding Circle CI for running tests. Using this to debug why GitHub Actions fails to deploy the qa_server_container automatically to ECR.  Did some review of instructions.  Need to update once we have a public ECR image.
      • Lynette going to try to follow Greg's instructions for AWS install
      • Lynette checked in with Justin Littman about our application for open-source license on dockerhub.  He hasn't heard anything, so he reapplied last Friday.
      • Greg looking at where to store docker images. Don't have permission for dockerhub but Greg looking at public AWS repo

Other Topics

  • Sinolio - Sinopia-FOLIO
    • 2021-08-20 Jason has shared use-cases with Michelle, Jason to follow up on scope
  • OCLC Linked Data / Entities Advisory Group
    • 2021-08-06 Huda has tried out UI and got API access. UI takeaway is that they aren't relying on the wikibase UI but still have same data at the moment. The search includes facet by type (e.g. work, person) and now have number of  results and pagination. Individual person page shows both data and provenance (which adds clutter to page) but doesn't have links to work. Lynette notes that search results are not RDF, will require custom module in order to incorporate in QA, also a lack of context
    • 2021-08-06 Question of what the OCLC business model of for-fee API access might mean. Would lookup via QA be allowed without a fee?
  • PCC 
    • 2021-06-25 Task Group on Non-RDA Entities headed by standing committee on standards will formally propose a list of non-RDA entity types. Steven will join and work picking up again
    • 2021-06-25 Planning group for data exchange met with meeting planned for 9/10 September: day 1 - foundational agreements including vendors (actors/profiles); day 2 - identify partnerships and tests (extending what is already happening as part of grant); also profiles group will pick up in July
    • 2021-08-06 Working on non-RDA types and data-exchange meeting going ahead
  • Default branch name - Working through repositories in Renaming of LD4P Repositories
  • Authorities in FOLIO
    • Hope to include URIs as part of Cornell FOLIO migration, possible LD4P work
    • 2021-06-11: Devs in FOLIO are working on MARC authority storage and basic features for maintaining authorities. Mock-ups have been provided, and they have asked for feedback and test cases (positive and negative)
    • 2021-06-25 Waiting on CUL-IT capacity, Simeon had suggested August perhaps

...

Next Meeting(s), anyone out?:

...

  • 2021-08-27 - Jason will lead, Simeon out, Lynette may be out, Huda out. Cancel