You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Current »

Date: 

Attendees: Tim, Steven, Lynette, Simeon, Huda, John, Jason

Regrets:

Agenda & Notes

Review actions from 2020-02-14 Cornell LD4P2 Meeting notes

  • Huda Khan to discuss with Astrid and David possible collaboration with U Chicago over usability (and maybe others in DOG team)
  • E. Lynette Rayle QA performance
    • 2020-02-14 Dave has made progress this week, is moving to have all data in index tailored to search in order to avoid SPARQL queries at search time, results are in a blob of RDF. Working on CERL first, expect to get this our soon. Will then try MeSH and OCLC FAST, then LC.
    • 2020-02-21 CERL was deployed with the new index strategy but no before and after to compare. However, this is small so we need to wait for LC or such to get a sense of possible improvement
    • 2020-02-25 New authorities have been brought online all the way through to Sinopia. These include CERL (searching person, corporate, imprint, or all of these) and Ligatus. Additionally, MeSH has been updated to include extended context and support for searching by subject or publication type.
    • 2020-03-06: Dave in process of converting everything over. Unsure of status for any authority, including MeSH. When LC done, we'll know whether this has impact since there are considerable usage data for LC. Dave and Lynette each working on other projects at the moment.
  • Simeon Warner to ask Adam Smith to investigate cost and any issues with setting up a D&A Beta system to allow broader testing of some discovery ideas from this work.
    • 2020-03-06: NOT DONE
  • John Skiles Skinner to continue discussion with Hathi trust about an API or access to their index
    • 2020-02-28 HathiTrust have allowed institutional accounts to add a query parameter to get XML output, may also provide IP based access for prototypes. Have already made demo with a mock-up of access
    • 2020-03-06: Huda sent them IP Address... and then confirmed that it was indeed ours. Follow-up needed. John Skiles Skinner will do that before next meeting.
  • Huda Khan to copy lessons learned from BAM! into the main wiki and check scripts into github
    • BAM! lessons learned in wiki (DONE)
    • Scripts making way into GitHub (in progress)
    • 2020-03-06: NOT DONE. URLs need replacement; Huda will replace with comment + dummy URL. Will be completed 3/13
  • E. Lynette Rayle  to ask Tiziana about SHARE-VDE APIs for real-time up-to-date search and for possible engagement in linked data best practices for authoritative data working group
    • 2020-03-06: NOT DONE. Will email by 2020-03-13!
  • Huda Khan will submit proposal for Knowledge Graph Conference
    • 2020-03-06: DONE. 

Status updates and planning

  • Discovery presentation 3/3 debrief
    • positive feedback from many; high engagement from attendees
    • open syllabus data had positive review
    • timeline: visuals! there was at least one person who really liked this
    • knowledge panel – critique was wrt: info overload but not that this was not worth-while
    • auto-suggest and no-search-result both well-received
    • discogs metadata was well-received - method of bringing in trusted data. there are use cases where we may wish to index discogs data for search
    • recording is in Drive. notes will be there. is it alright to send out follow-up email thanking people for attending with a link to the video? Questions raised about privacy, value for viewers and whether this should be public v. CUL-only.  Notes summarizing can go on wiki. DECISION: put video in LD4P-Internal. Can share internally for those who request.
    • Follow-up: summary of what we think we've learned. Goal is to prioritize work based on strongest feedback. Wait until next Friday to share broadly, assuming we've made decisions at that point.
      • this affords us 3.5 months to work on moving 1-3 items toward production... but not making it production-ready. includes analyzing existing infrastructure and consider whether formal usability testing is possible/advisable (using usability working group)
      • we are not looking at new work... this is to take current work forward
  • Cataloging Sinatra and other 45's (Discogs data, https://github.com/ld4p/qa_server/issues?q=is%3Aissue+is%3Aopen+label%3ADiscogs)
    • Lookups for place not usable and hence places are not being recorded, relies on work from Dave to fix: https://github.com/LD4P/qa_server/issues/248 & https://github.com/LD4P/qa_server/issues/240
    • Have a currently insurmountable issue with nested profiles. When create Work profile with nested Instance profile there isn't a URI for the Instance (it just gets hung from a bnode). Without a URI the title of the Instance doesn't get indexed. The Sinopia team are unable to fix this in the near term.
    • Cataloging work continues with the above limitations
    • 2020-02-28 Steven update – I did a bunch of PCC profile and LOC policy related writing/correspondence; met with Huda, Tim, and John to discuss the Discovery Event (happy to help facilitate/notetake/rove on the day of the event); worked with Sinopia team to understand title search and display bugs that have been affecting Sinatra work (Jeremy has created https://github.com/LD4P/sinopia_editor/issues/2090 which looks at part of the problem); I still need to clean up the QA/Sinopia priority list to reflect the work completed by Lynette and Dave.
  • Enhanced Discovery (see also https://wiki.duraspace.org/x/sJI7Bg and https://github.com/LD4P/discovery/projects/1)
    • SMASH! (dev to run through 7 Feb, then user testing, video and write-up) – dev complete, video done, Hitchcock homage and cameos still under consideration, lessons learned document in process and also annif use summary
    • Open meeting March 3, 2-3:30pm in Mann 102 and should Zoom it too
    • Will continue on Hathi work...
    • How will we decide what to take forward from KAPOW!, BAM! and SMASH!? (or as Tim put it, "what happens in late February?")
      • Do discovery session... get feedback
      • Knowledge panels – could we make a component that is easily reusable in any Blacklight? How much are local customizations key?
      • Semantic stuff .. annif ... relationships in data to get relevant semantic links and use of hierarchy in data 
      • Call number browse and other virtual browse notions, with semantics/facets?
      • Use of linked-data descriptions from Sinopia - what can we do in discovery that is different?
  • Authority Lookups for Sinopia (Lookup infrastructure: https://github.com/LD4P/qa_server/projects/2, Authority requests: https://github.com/LD4P/qa_server/projects/1)
    • When deployment issue solved... will then put out CERL and ligatus and extended context for MeSH along with sub-authorities. Also some refactoring associated with monitoring status page (including fixing a memory leak due to a long-used hash - ruby doesn't reclaim space from deleted entries)
    • Not sure whether Dave has redone the index to avoid SPARQL for MeSH – if it is done then we will have a comparison
  • Travel and meetings (see LD4P2 Cornell Meeting Attendances)
  • Next meetings:
    • ...
  • No labels