Link to Google doc

Questions asked:

What existing work on library catalog discovery (including but not limited to Blacklight) can we review to better understand what catalog users want to do with the catalog?  

What existing work on discovery and linked data systems can yield insight into search with linked data?

High level takeaways:

  • Information tasks can be categorized into targeted and known-item search as well as exploratory.  With exploratory tasks, formulating what is to be searched can be difficult since the user may not have enough information about the search space itself.

  • Earlier library interface design work as well as current review highlights the usefulness of facets as a way to browse the search space. (Trapido, Marie, Johnston et al.).  That said, improvement is still possible as users may enter queries that could not intersect well with facets or not realize a query may be better suited as a facet.

  • Desired features include fault-tolerant systems that provide support for different kinds of information gathering tasks: known-item and targeted search as well as browse and exploration. (Trapido)

  • Search expectations and behavior are influenced by what users see with Google and search engine behaviors to which they have become accustomed: search as primary access point, most relevant results on first page, more comprehensive search of content itself.  (Trapido, Pekala 2017). The distinction between sources of information (i.e. between the catalog and other sources such as articles) may not always be clear.

  • The mismatch between vocabularies and taxonomies used to generate catalog search and what users may type in means users don’t get the most relevant or expected results. (Trapido)

  • Schema.org inclusion may help in increasing Google search result ranking as well as referrals to items: (context = DigitalGeorgetown Collections, Pekala 2018)

  • Various linked data browsers and interfaces are mentioned (e.g. in Cervane et al.) but it is hard to consistently find functional URLs or understand whether the project existed beyond prototype stage.  These interfaces tend to lend themselves to more exploratory search tasks rather than specific queries, although query interfaces do exist (Thakker et al., Marie). With queries, the issue is navigating the gap between semantic queries and natural language or visual interaction paradigms. (Hasnain et al.)


Specific items of interest

  • The Primo evaluation looked at “Did you mean” suggestions, showing low usage in general, lower usage than facets, but with open-ended searches seeing higher percentages than known-item tasks. As we consider suggestions and recommendations, the type of task which can be supported by this feature should be kept in mind. (Hanrath and Kottman 2015)

  • Google has a “Talk to Books” semantic search feature which allows plain-English questions to discover relevant information from 100,000 books.  This work is defined (with link to technology) in an arXiv paper at https://arxiv.org/abs/1803.11175 (kurzweil blog).

  • Marie’s dissertation explores linked data exploratory search techniques including facets, set-based exploration, and recommendations based on “spreading activation”  (taking a model of how memory operates and using that to find related concepts to what is being queried) as well as the design of a system that integrates data from SPARQL query endpoints (specifically DBPedia) and provides recommendations for additional topics for search.


Resources reviewed/referenced (to be alphabetized later):

  • No labels