2020-02-14 Dave has made progress this week and is moving to have all data in an index tailored to search, in order to avoid SPARQL queries at search time; results are in a blob of RDF. Working on CERL first, expect to get this out soon. Will then try MeSH and OCLC FAST, then LC.
2020-02-21 CERL was deployed with the new index strategy, but there are no before and after measurements to compare. In any case, CERL is small, so we need to wait for LC or such to get a sense of the possible improvement.
2020-02-25 New authorities have been brought online all the way through to Sinopia. These include CERL (searching person, corporate, imprint, or all of these) and Ligatus. Additionally, MeSH has been updated to include extended context and support for searching by subject or publication type.
Adam Smith to investigate cost and any issues with setting up a D&A Beta system to allow broader testing of some discovery ideas from this work
John Skiles Skinner to continue discussion with HathiTrust about an API or access to their index
2020-01-31 There is an investigation under way, but it is not clear whether it will result in something we can use
2020-02-14 Some more discussions with HathiTrust. Their suggestion was to use the current search with a debug facility that includes things like facet values in machine-readable form (requires either 1) a user account for testing, or 2) IP access for our dev machine, but there is some issue of fixed external IP for our dev VMs)
2020-02-21 HathiTrust sent over the XML version for one of the zero-result queries that John had tried. This would be the same XML they may be able to open up for us by allowing the IP address of my dev VM to access the URL that returns XML. Huda set up a controller that parses the XML and returns JSON with the list of subject heading strings and the set of search results (this is the XML version of the search results page, which includes subject facet values). John said he should be able to incorporate the subjects and perhaps the results into the zero search results page.
2020-02-28 HathiTrust have allowed institutional accounts to add a query parameter to get XML output, and may also provide IP-based access for prototypes. Have already made a demo with a mock-up of access
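As a rough illustration of the XML-to-JSON step described in the 2020-02-21 entry, here is a minimal sketch in Python. It is not the actual controller. The element names (`SearchResults`, `Facet`, `Value`, `Result`, `Id`, `Title`), the `subject` facet name, and the sample data are all assumptions made for this example, since the real HathiTrust XML layout is not recorded in these notes.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample document: the real HathiTrust XML structure is not
# described in these notes, so every element and attribute name is assumed.
SAMPLE_XML = """<SearchResults>
  <Facets>
    <Facet name="subject">
      <Value count="12">Birds</Value>
      <Value count="3">Birds -- Migration</Value>
    </Facet>
    <Facet name="language">
      <Value count="15">English</Value>
    </Facet>
  </Facets>
  <Results>
    <Result><Id>rec1</Id><Title>A first title</Title></Result>
    <Result><Id>rec2</Id><Title>A second title</Title></Result>
  </Results>
</SearchResults>"""


def summarize_results(xml_text, subject_facet="subject"):
    """Extract subject heading strings and search results from the XML."""
    root = ET.fromstring(xml_text)
    # Collect the values of the (assumed) subject facet only.
    subjects = [
        value.text.strip()
        for facet in root.iter("Facet")
        if facet.get("name") == subject_facet
        for value in facet.iter("Value")
        if value.text and value.text.strip()
    ]
    # Keep a small, flat record per search result.
    results = [
        {
            "id": result.findtext("Id", default=""),
            "title": result.findtext("Title", default=""),
        }
        for result in root.iter("Result")
    ]
    return {"subjects": subjects, "results": results}


def summarize_as_json(xml_text):
    """Serialize the summary as JSON, as a controller response might."""
    return json.dumps(summarize_results(xml_text))
```

Calling `summarize_as_json(SAMPLE_XML)` yields a JSON object with a `subjects` list and a `results` list, which is the shape a zero-results page could consume to suggest subjects and alternative hits.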
Have a currently insurmountable issue with nested profiles. When creating a Work profile with a nested Instance profile, there isn't a URI for the Instance (it just gets hung from a bnode). Without a URI the title of the Instance doesn't get indexed. The Sinopia team is unable to fix this in the near term.
Cataloging work continues with the above limitations
2020-02-28 Steven update – I did a bunch of PCC profile and LOC policy related writing/correspondence; met with Huda, Tim, and John to discuss the Discovery Event (happy to help facilitate/notetake/rove on the day of the event); worked with Sinopia team to understand title search and display bugs that have been affecting Sinatra work (Jeremy has created https://github.com/LD4P/sinopia_editor/issues/2090 which looks at part of the problem); I still need to clean up the QA/Sinopia priority list to reflect the work completed by Lynette and Dave.
SMASH! (dev to run through 7 Feb, then user testing, video and write-up) – dev complete, video done, Hitchcock homage and cameos still under consideration, lessons learned document in process, and also set up a document for the Annif use summary
Open meeting March 3, 2-3:30pm in Mann 102 and should Zoom it too
Will continue on Hathi work...
How will we decide what to take forward from KAPOW!, BAM! and SMASH!? (or as Tim put it, "what happens in late February?")
Do discovery session... get feedback
Knowledge panels – could we make a component that is easily reusable in any Blacklight? How much are local customizations key?
Semantic stuff... Annif... relationships in data to get relevant semantic links and use of hierarchy in data
Call number browse and other virtual browse notions, with semantics/facets?
Use of linked-data descriptions from Sinopia - what can we do in discovery that is different?