2021-03-19 Steven has looked at SVDE works and is thinking about how to find works via ISBN to then find other Instances. Is looking via SPARQL on Dave's ingested data
2021-02-26 Have to develop SPARQL queries to pull out certain sorts of connected Work. Don't expect data to be very dense but do expect that we would get useful connections between print and electronic for example. We already have a link based on the OCLC concordance file from several years ago.
ACTION - Steven Folsom and Huda Khan to work on building an equivalent of the OCLC concordance file based on SVDE data and then do a comparison to see how they are similar and different
Discussion with small group regarding call number classifications on e-resources, where there is/isn't sufficient metadata compared to equivalent/related physical objects. Possibility of gathering some useful examples around needs/wishes around e-resource discovery.
2021-03-26 - Other CUL staff met a couple of weeks ago. Examples where print and e-resource versions have differing metadata quality. Can we connect the resources better and/or copy metadata from one to the other? Has connection to our work to identify different versions. Nothing for LD4P3 to do until they have come up with appropriate examples
See Steven's comments on ShareVDE data. Dave wants to look at making a direct connection to ShareVDE GraphQL API and translate it on-the-fly to something that QA can work with. There are going to be some complexities with how to structure queries and extended context based on the variability of data shapes.
Created the charter for the next working group describing the expected outputs for change management. I plan to announce the charter on Monday and begin reaching out to individuals to get folks on board.
2021-03-26 - Lynette will speak about results of first group at Discovery Affinity group next week
Lynette began documenting the deployment to AWS adding in an overview, background knowledge, and architecture sections. Next is to start looking at how to use the templates.
Greg is going to follow up with Dave about how to work with him on his containerization efforts
Expect to work on demo screencast after refining documentation
Greg and Lynette have been working together on documentation and this has been very useful in uncovering missing things. Greg found that he needs to make sure AWS actions can be performed by a less powerful user
Greg has discussion containerization of cache infrastructure. Greg has some tasks to help with
Developing Cornell's functional requirements in order to move toward linked data
Purpose? Vision for mid-term (3-5 years) transition to support linked-data at Cornell. May include things we don't yet have or cannot yet do, but not long-term vision of post-MARC environment
Important to understand sources of truth (primary data) and where there is derivative data
Imagine landscape with items described in multiple formats including at least MARC, BF, DC (eCommons), JSTOR
Imagine all items indexed and discoverable via D&A
Functions of "Aggregated index, allowing pivoting & ETL"
Includes current functionality of Frances' indexing
Does it include any editing?
Is there interaction with CULAR?
Includes indexing associated with DCP
What interfaces or functionality do we expect for the connecting lines?
Do we need a diagram for now (or at least July 1, 2021 with Voyager gone)?
2021-03-19 Jason/Steven/Simeon created separate diagrams and Jason is working on a combined picture
PCC/Sinopia and SVDE shape analysis
2021-03-19 Steven has been working through a spreadsheet of 400+ lines to compare the shape of SVDE data with the PCC/Sinopia profile. He is finding that there are many many differences which will severely limit how well Sinopia will be able to consume and edit SVDE data. For the purposes of QA/Sinopia cloning, Steven could come up with some ldpaths but not sure whether the amount of data will be useful. Steven expects to be able to share the spreadsheet at the next Sinopia/SVDE meeting. Going forward we need to consider the role of versioning/documenting shape changes and validation at both scale and single descriptions. Justin's validation scripts: https://github.com/LD4P/dctap. Tom Baker's csv2shex: https://github.com/tombaker/csv2shex
2021-03-26 Steven finished working through the spreadsheet comparing SVDE data with the PCC profile. Notes that he is looking only from the side of the PCC profile and would thus miss other things in SVDE data. Patterns around different types of work in SVDE data (e.g. Opus and other higher level works have very different shapes). Difficult pattern of double-reified relationships between works. Steven will let SVDE/QA folks know about completion of the work. Need to find a way toward alignment.
ACTION - Steven Folsom to write up state of current analysis and store a snapshot of the spreadsheet on the LD4P3 wiki
OCLC Linked Data / Entities Advisory Group
2021-03-26 No updates, some emails
PCC Task Group on Non-RDA Entities
2021-03-19 Group headed by standing committee on standards will formally propose a list of non-RDA entity types. Steven will join. Deliverables by June
Hope to include URIs as part of Cornell FOLIO migration, possible LD4P work
2021-03-19 In LTS there is a task group that has a proposal for authority management in FOLIO (absent new features). Being reviewed with request for scripting work to create reports etc.. Includes insertion of URIs into MARC
https://kula.uvic.ca/index.php/kula/announcement/view/1. Call for Proposals - Special Issue: "The Metadata Issue: Metadata as Knowledge". Due January 31, 2021 (abstract 300-500 words). Includes "The use of linked open data to facilitate the interaction between metadata and bodies of knowledge" and "Cultural heritage organization (libraries, archives, galleries, and museums) and academic projects that contribute to or leverage open knowledge platforms such as Wikidata"
Steven thinking something around shapes and compatibility and round trips
Lynette/Greg/Dave - containerization, should have documented product by then
code4lib - Expecting to attend: Huda, Steven, Lynette, Greg
Great opening keynote about capturing of indigenous information connected to mapping, discussion of what information is shared or not. https://terrastories.io/
Steven notes everyone dealing with questions of sovereignty and a agency in building collections
Steven's Discogs poster at code4lib went well
Steven notes session on technologies to fix problematic terminology in upstream data sources and/or as an overlay/replacement strategy. There are at least 3 or 4 Blacklight implementation to replace "Illegal Alien" in discovery systems
Lynette doing a QA presentation at Samvera partner call in June