Date: 

Attendees:  Lynette, Tim, Steven, Jason, John, Huda, Simeon

Regrets: 

Agenda & Notes

Review actions from 2020-05-01 Cornell LD4P2 Meeting notes

  • E. Lynette Rayle QA performance
    • 2020-04-17 Did an analysis sorting response time by complexity and size, but it didn't show a clear picture; will try a little more analysis next week. Question raised when Ligatus and CERL were added: why is Ligatus fast and CERL slow? Dave expects to have more time soon.
    • 2020-04-24 Lynette did an analysis of performance to try to understand whether speed is clearly related to data size or complexity of extended context. The result is that there is no clear correlation. Tried to parallelize parts of QA: in some places saw a slowdown, and in one place, where the complexity is high, found a speed improvement. However, in the complex cases the times are often still rather long (0.5–2s), though not markedly longer than for somewhat less complex queries. The worst cases are still due to the retrieve time from Dave's cache; he is looking at why CERL is slow when we might not expect it to be. Unfortunately there is no clear path to improving everything from the QA side. Lynette will try to understand why the OCLCFAST graph load is so slow.
    • 2020-05-08 No progress (Lynette working on exhibits about half the work, but has worked on accuracy); no updates from Dave.
  • E. Lynette Rayle to set up best practices working group around linked data APIs for authorities → documenting on Linked Data API Charter 1 - Best Practices for Authoritative Data Working Group
    • 2020-05-08 Now starting first Monday in June and then every other week for 4 months, getting folks to do some work up-front. Think that a later WG might look at change management
  • Huda Khan reflections on Knowledge Graph Conference – see Slack comments. Meeting overall was very industry/enterprise focused, including real estate etc. Good presentation by Oracle with information about a DB where they can do both SQL and SPARQL (e.g. start with relational DB, create RDF dataset from it, query either); many other presentations just assumed SPARQL or GraphQL. There was discussion of what a knowledge graph is, with consensus that it is just a bunch of RDF or property graphs for knowledge/semantic representation. Not much discussion of performance, perhaps because much of the work is about offline analysis and then machine learning etc. Discussion from Yahoo! about rich cards (essentially knowledge panels).
  • Blacklight Summit happening May 7 and 8 (virtual). Huda Khan presented demo video yesterday; expecting Jenn, Melissa, Frances to attend. Mostly demos yesterday. HathiTrust ETAS is mentioned a lot.

...