- Created by Bill Branan, last modified by Michelle Futornick on Feb 08, 2017
You are viewing an old version of this page. View the current version.
Compare with Current View Page History
« Previous Version 63 Next »
Welcome to the website of Linked Data for Production (LD4P). With support from the Andrew W. Mellon Foundation, six partners (Columbia, Cornell, Harvard, Library of Congress, Princeton, and Stanford University) are piloting the transition of technical services workflows to a linked data environment. Over a two-year period, the partners will focus on developing the ability to produce metadata as linked open data communally, extending the BIBFRAME ontology to encompass the many resource formats managed by libraries, and engaging the broader library community to ensure a sustainable and extensible environment. LD4P works closely with the Mellon-funded Linked Data for Libraries-Labs project.
Here is the latest news from LD4P. What we're working on now. What's coming next. This will be compiled from the partners' individual updates.
In collaboration with the Art Libraries Society of North America's Cataloging Advisory Committee (ARLIS CAC), the Library of Congress Prints and Photographs Division, The Clark Library, and the Morgan Library & Museum, Columbia University Libraries led an effort to develop ArtFrame, a BIBFRAME ontology extension for the description of two-and three-dimensional artworks. For questions or comments, please contact Melanie Wacker, Metadata Coordinator, Columbia University Libraries: mw2064 [ at ] columbia dot edu
NOTE: As described in the next section, this effort developed into the Art & Rare Materials BIBFRAME Ontology Extension.
NOTE: As of June 2018, the Art Libraries Society of North America's Cataloging Advisory Committee has begun to explore continued development and maintenance of ARM, in partnership with RBMS' Bibliographic Standards Committee and other community partners.
The Art & Rare Materials BIBFRAME Ontology Extension |
---|
The Art & Rare Materials BIBFRAME Ontology Extension (ARM), and the ontologies housed in the ARM GitHub repository have been developed as extensions of the BIBFRAME ontology for generalized bibliographic description to provide specialized modeling in the art and rare materials domains. These were originally conceived of as two separate projects: Columbia University led ArtFrame, an ontology extension for the description of two-and three-dimensional artworks, in collaboration with the Art Libraries Society of North America's Cataloging Advisory Committee (ARLIS CAC), the Library of Congress Prints and Photographs Division, The Clark Library, and the Morgan Library & Museum. Meanwhile, Cornell University led the Rare Materials Ontology Extension (RareMat) in collaboration with the ACRL Rare Books and Manuscript Section's Bibliographic Standards Committee (RBMS-BSC); RareMat was intended to provide modeling for the complexity of rare materials, particularly item-level description not addressed in BIBFRAME. As work proceeded, it became apparent that many of the modeling needs of the two groups overlapped: physical description, physical condition and conservation, custodial history, measurements, awards and exhibitions, and so on. It was thus decided to merge the projects and jointly develop a single set of models. Meanwhile, select models were considered separable from the core models, amenable to independent implementation by users within and outside the bibliographic domain. These models have been pulled out of the core ontology into three modularized ontologies: awards, custodial history, and measurements. In addition to the ontologies, the group has generated several other outputs:
|
Use Case Development
- Defined use cases for description of art
- Classified use cases into entity types
Prioritized use cases
Analyzed use cases and developed ontology requirements
- Compared ArtFrame use cases to Rare Material use cases and identified commonalities
Analysis/Modeling
- Accession Number
- Attributions
- Awards
- Bibliographic Citation
- Bindings
- Carriers & Bound-withs
- Custodial History
- Exhibitions
- Fonts, Handwriting Types and Notations
- Limitation Statement
- Markings
- Materials
- Measurements of Part, Whole and Arrangement
- Notes in Art
- Pagination and Foliation
- Physical Condition
- Signature Statements
- Style/Period
- Titles in Art
OWL file development
- Core Ontology
- Core ontology used in the description of art and rare materials.
- Ontology specification
- Human-readable documentation
- Award Ontology
- Specialized ontology for the description of awards received by bibliographic resources or agents; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Custodial History Ontology
- Specialized ontology for the description of an object's custodial history; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Measurement Ontology
- Specialized ontology for the description of an object's measurements; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Activity Ontology
- The issue of whether to adopt the BIBFRAME Contribution model or the bibliotek-o Activity model has not yet been resolved. As a temporary accommodation, the Activity model is used, but the terms are stored in a separate ontology file with a distinct namespace for easy separation. The expectation is that either (1) the Activity model will be adopted, and the terms merged into the ARM Core Ontology, or (2) the BIBFRAME Contribution model will be adopted, the Activity ontology deprecated, and other associated revisions to the ARM ontologies and vocabularies be implemented.
- Ontology specification
- Human-readable documentation
Application Profiles (SHACL)
- Art: https://github.com/LD4P/arm/tree/master/application_profiles/art/shacl
- Recommendation for SHACL maintenance and development
In-Person Meetings
- In-Person Meeting of ARTframe + Rare Materials Ontology Extension (January 2018)
- In-Person Meeting of ARTframe + Rare Materials Ontology Extension (March 2017)
- Virtual meetings occurred every two weeks between July 2016 and July 2018
Presentations
- ALCTS Webinar: ArtFrame: Extending Library Ontologies for Works of Art (April 4, 2018,Melanie Wacker and Amber Billey)
- Presentation: The Outcome of the ArtFrame Project (ARLIS/NA Annual Conference, Feb. 27, 2018, Amber Billey, Marie-Chantal L’Ecuyer-Coelho, Jason Kovari and Melanie Wacker)
- Presentation: Artframe–Connecting two Worlds (Metropolitan Museum of Art, November 14, 2016, Roberto C. Ferrari and Melanie Wacker)
- Presentation: Artframe, LD4P@Columbia (METRO (Metropolitan New York Library Council), June 2, 2016, Melanie Wacker)
- Presentation: Linked data in libraries: another fad or a paradigm shift? (Eastern New York ACRL spring conference, Saratoga, NY, May 23, 2016, Amber Billey)
- Presentation: LD4P@Columbia: BIBFRAME and Art Properties (BIBFRAME Update Forum, ALA Midwinter, Boston, MA, Jan. 10, 2016, Amber Billey and Melanie Wacker)
Arden Alexander (Library of Congress, Prints & Photographs)
Penny Baker (Clark Art Institute)
Roberto Ferrari (Columbia/Art Properties)
Paul Frank (Library of Congress)
Kate Harcourt (Columbia/OSMC)
Robert Rendall (Columbia/OSMC)
Margaret Smithglass (Columbia/Avery)
Melanie Wacker (Columbia/OSMC)
Amber Billey (Bard College)
Timothy Ryan Mendenhall (Columbia/OSMC)
Members of ARLIS/NA Cataloging Advisory Committee:
William Blueher (Metropolitan Museum of Art) (November 2017- )
Marie-Chantal L'Écuyer-Coelho (Bibliotheque et Archives nationales Québec)
John A. Mair (Pratt Institute)
Elizabeth O'Keefe (November 2017- )
Andrea Puccio (Clark Art Institute)
Sherman Clarke (freelance art cataloger)
Bronwen Bitetti (Bard College)
Tamara Fultz (Metropolitan Museum of Art) (2016-November 2017)
Maria Oldal (Morgan Library & Museum)
Karen Stafford (Art Institute of Chicago)
Samantha Deutch (Frick Collection)
- Ontology extension to BIBFRAME 2.0 for describing art objects
- Application profile for the linked data description of art objects
- Data models for handling materiality, provenance, titles, and other concepts relevant to describing art objects
- Ontology documentation, including modeling illustrations
Harvard University is leading the development of an extension to BIBFRAME to describe cartographic resources including printed maps, atlases, and digital geospatial datasets.
Note: Documentation and ontology files being developed to support the Cartographic Materials extension project are available at the LD4P Cartographic Materials GitHub repository
For questions or comments, please contact Marc McGee, Geospatial Metadata Librarian, mmcgee (at) fas (dot) harvard (dot) edu, or submit issues via https://github.com/LD4P/Cartographic_Materials
- BIBFRAME extension profile for cartographic resources
- Set of published BIBFRAME descriptions for a variety of cartographic resource types
- Written evaluation of the project and set of recommendations for future research and development
- Presentation of project findings to appropriate library cartographic materials and linked data communities.
Last updated: 7/17/2018
Analysis/Modeling
Finalize supporting documentation for the Geospatial and Cartographic Resource Ontology, including recommendation papers and graphic models (July 2018)
Create a Geospatial and Cartographic Resources Ontology (GCRO) SHACL application profile(deferred to future date)
Production Cataloging
Test cataloging of a selection of geospatial and cartographic resources using the GCRO application profile in the Harvard geospatial metadata instance of VitroLib(deferred to future date)
Community Engagement
Biweekly standing meeting of the Linked Data for Libraries Cartographic Materials Working Group,Thursdays 2-3p ET.(Working Group on indefinite hiatus as of 7/1/2018)
LD4P Cartographic Materials Working Group
Marc McGee (coordinator) (Harvard)
Craig Thomas (Harvard)
Iris Taylor (Library of Congress)
Kathy Weimer (Rice)
Kim Durante (Stanford)
Louise Ratliff (UCLA)
Megan Rush (East View Geospatial)
Min Zhang (Library of Congress)
Seanna Tsung (Library of Congress)
Steven Folsom (Cornell, former member)
Tammy Wong (Library of Congress)
Tim Kiser (Michigan State)
Last updated: 7/13/2018
Analysis/Modeling
- Surveyed existing linked open data efforts in the library and geospatial domains (Aug. 2016).
- Held a 2-day in-person Cartographic Materials Working Group ontology sprint modeling meeting, Harvard Library, Sept. 19-20, 2016.
- Compiled initial set of target use cases for cartographic metadata (Nov. 2016).
- Finished draft models for prioritized Geospatial and Cartographic Resources Ontology concept extension areas:
Projection (draft rdf examples)
Spatial Extent - Bounding Box Coordinates (draft rdf examples)
Scale (draft rdf examples)
Relief (draft rdf examples)
Types of Cartographic Resources (graphic model)
Prime Meridians (graphic model)
- Created Geospatial and Cartographic Resources Ontology (GCRO) OWL file
- GCRO OWL file (Beta version, April 2018)
- GCRO in Biblioportal
- GCRO and LD4L/P Cartographic Extension project files
Community Engagement
Presentations
Linked Open Data Modeling for Library Cartographic Resources (Kimberly Durante, Katherine Hart Weimer and Marc McGee; poster for American Association of Geographers Annual Meeting, April 2016)
Geoinformation in Digital Libraries and Linked Data (Katherine Hart Weimer; presentation for Aplace4places, DH2016 pre-conference workshop, July 2016)
Linked Data Description of Cartographic Resources (Marc McGee, ALCTS Webinar, May 2, 2018) [slides]
LD4P + LD4L Labs projects for geospatial and cartographic resources + moving image materials at Harvard (Marc McGee and Christine Fernsebner Eslao; presentation for IGELU-ELUNA Linked Open Data Working Group Show & Tell, July 10, 2018) [slides]
Community updates
ALCTS-CaMMS MAGIRT Cartographic Resources Cataloging Interest Group meeting
Project updates ALA Annual 2016, 2017, 2018
Project updates ALA Midwinter 2017, 2018
Papers
Toward a Linked Data Model for Describing Cartographic Resources [article] (Marc McGee, Harvard Library, Kim Durante, Stanford University, and Katherine Hart Weimer, Rice University, Journal of Map and Geography Libraries, May 11, 2017)
The Performed Music Ontology project aims to develop a BIBFRAME-based ontology for performed music in all formats, with a particular emphasis on clarifying and expanding on the modelling of works, events, and their contributors. The work is a collaborative effort of Stanford University, the Music Library Association (MLA), the Association for Recorded Sound Collections (ARSC), the Library of Congress, and the PCC, with participation of LD4P partner institutions.
Timeline (initial)
- Written evaluation of the BIBFRAME ontology in describing performed music, including a set of use cases to justify extending and/or expanding the ontology.
- Linked data ontology with a BIBFRAME core with for describing performed music.
- RDA profile to complement the ontology for shared use by the library community, including list of preferred vocabularies.
- Representative selection of resource descriptions created using the Performed Music Ontology.
- Best practices for augmenting and/or manipulating performed music MARC bibliographic records to improve conversion to the Performed Music Ontology.
- Evaluation of the project findings with a set of recommendations for further research and development.
- Presentation of project findings to appropriate library and linked data communities.
- Finalizing modeling of performers, medium of performance, and events in relation to works; basic model for dealing with aggregate works.
Community Engagement
- Analysis & critique of modeling papers by Music Library Association Linked Data Working Group
- Working with Library of Congress to have LC host PMO
Tool Exploration / Requirements Definition
- PMO in Protégé
- PMO in github
- PMO vocabularies in github
Collaboration
- gathering of use cases from members of MLA and ARSC
- exploration of event models by Linked Data Working Group of MLA (LuDWig)
- medium of performance modeling shared with RSC Music Working Group
- sharing of data models for events and medium of performance with Library of Congress
- addition of medium of performance model to LC Editor templates
Nancy Lorimer (lead) (Stanford)
Kirk-Evan Billet (MLA; Peabody Institute-Johns Hopkins)
Chew Chiat Naun (PCC; Cornell)
Greta de Groat (Stanford)
Arcadia Falcone (Stanford)
Caitlin Hunter (ARSC; LC)
Kevin Kishimoto (Stanford)
Wendy Sistrunk (ARSC; U Missouri, Kansas City)
Jim Soe-Nyun (MLA; UC San Diego)
Hilary Thorsen (Stanford)
Valerie Weinberg (LC)
Link to working documents (for partners only)
Analysis/Modeling
- Gathered existing use cases for performed music metadata and developed further use cases.
- Surveyed existing linked data efforts related to performed music.
- Preliminary position paper on titles (submitted to all ontology group)
- Analyzed BIBFRAME and added classes and properties relevant to performed music.
- Submitted added classes of bf:Identifier, definition changes to music-related bf:Identifier subclasses, and bf:Identifier subclass changes to LC
- Modeled performed-music-specific concepts including thematic catalog and opus numbers and musical key and mode.
- List of all classes, properties, individuals in PMO (updated 2018-07-03)
- Analysis and modelling papers:
- Use cases
- Events (updated 2018-06-27)
- Medium of performance. Part 1(updated 2017-09-10)
- Medium of performance. Part 2 (updated 2017-09-10)
- Medium of performance. Part 3 (updated 2017-09-27)
- Works. Part 1 (Draft–added 2017-12-05; corrected 2017-12-09)
- Works. Part 2, Compilations (Updated Draft--2018-07-03)
- Mappings:
- MARC 382 field to PMO (Drafty)
Presentations
- Performed Music & BIBFRAME: Enriching the Ontology. ALCTS series: From MARC to BIBFRAME–Enhanced.
ALCTS webinar by Nancy Lorimer, March 28, 2018. - Love from Afar: Modeling music audio & video recordings in BIBFRAME, the Performed Music Ontology, and Beyond.
presentation by Nancy Lorimer, OLAC general meeting, ALA Annual, Chicago IL, June 2017 - Modeling Metadata for Sound Archives: extending BIBFRAME 2.0 for archival sound resources
presentation by Caitlin Hunter & Nancy Lorimer, ARSC, San Antonio, TX, May 2017 - Linked Data for Production: Stanford Projects
presentation by Nancy Lorimer, Bibliothèque Nationale de France, February 2017 - Linked Data for Performed Music: a performed music ontology extension to BIBFRAME 2.0
presentation by Kevin Kishimoto & Nancy Lorimer, Music Library Association Annual Meeting, Orlando, Fla., February 2017 - Linked Data for Performed Music: an extension to BIBFRAME 2.0 for production & archival cataloging of sound recordings & music videos
presentation by Nancy Lorimer, BIBFRAME Update Forum, ALA Midwinter, Atlanta, Ga., January 2017 - BIBFRAME beyond books: explorations in extending BIBFRAME 2.0 to improve discovery of performed music
presentation by Nancy Lorimer & Michelle Futornick, PCC Participants' Meeting, ALA Midwinter, Atlanta, Ga., January 2017 - Linked Data for Performed Music: a Linked Data for Production project
presentation by Nancy Lorimer, Heads of Cataloging Interest Group, ALA Annual, Orlando, 2016
In conjunction with the ACRL Rare Books and Manuscript Section's Bibliographic Standards Committee (RBMS-BSC), Cornell University Library led an effort to build an ontology extension for the description of rare materials. This collaborative effort was intended to provide RBMS-BSC with a model for handling the complexity of rare materials, particularly item-level description not addressed in BIBFRAME 2.0. For questions or comments, please contact Jason Kovari, Director of Cataloging & Metadata Services, Cornell University Library: jak473 [ at ] cornell dot edu
NOTE: As described in the next section, this effort developed into the Art & Rare Materials BIBFRAME Ontology Extension.
NOTE: As of June 2018, RBMS' Bibliographic Standards Committee committed to development and maintenance of ARM, in partnership with community partners.
The Art & Rare Materials BIBFRAME Ontology Extension (ARM), and the ontologies housed in the ARM GitHub repository have been developed as extensions of the BIBFRAME ontology for generalized bibliographic description to provide specialized modeling in the art and rare materials domains. These were originally conceived of as two separate projects: Columbia University led ArtFrame, an ontology extension for the description of two-and three-dimensional artworks, in collaboration with the Art Libraries Society of North America's Cataloging Advisory Committee (ARLIS CAC), the Library of Congress Prints and Photographs Division, The Clark Library, and the Morgan Library & Museum. Meanwhile, Cornell University led the Rare Materials Ontology Extension (RareMat) in collaboration with the ACRL Rare Books and Manuscript Section's Bibliographic Standards Committee (RBMS-BSC); RareMat was intended to provide modeling for the complexity of rare materials, particularly item-level description not addressed in BIBFRAME.
As work proceeded, it became apparent that many of the modeling needs of the two groups overlapped: physical description, physical condition and conservation, custodial history, measurements, awards and exhibitions, and so on. It was thus decided to merge the projects and jointly develop a single set of models.
Meanwhile, select models were considered separable from the core models, amenable to independent implementation by users within and outside the bibliographic domain. These models have been pulled out of the core ontology into three modularized ontologies: awards, custodial history, and measurements.
In addition to the ontologies, the group has generated several other outputs:
- A set of controlled vocabularies for arrangement of physical objects (e.g., rolled and unrolled), origin (e.g., of titles), status (e.g., of identifiers), typefaces, and handwriting types.
- Detailed documentation of each model. These include use cases; model overviews, with motivation and rationale in reference to the use cases and legacy data; diagrams; RDF samples; term specifications; recommendations for future modifications to BIBFRAME, where applicable; and lists of related topics out-of-scope of the current project but of interest for future research.
- Application profiles, formalized in SHACL and accompanied by external ontologies and vocabularies, to support form and display interfaces in VitroLib, an RDF cataloging tool under development in the associated Linked Data for Libraries Labs project. Two application profiles are under development: art and rare monographs. Also included are fragments of source ontologies and vocabularies used in the application profiles and modeling recommendations.
- Experimental: SHACL validation profiles stored with the associated ontologies, which can be used as a basis for application profiles but whose primary purpose is to define expected implementation of the ontologies independent of any application. Some proof-of-concept application profiles are built on the validation SHACL in this way; when the profiles are complete we will evaluate the feasibility and usefulness of refactoring all the application profiles according to this model. Currently available are core validation and custodial_history SHACL shapes.
Use Case Development
- Defined use cases for description of rare materials
- Classified use cases into entity types
- Identified areas for modeling based on DCRM and other standards
- Compared Rare Material use cases to ArtFrame use cases and identified commonalities
Analysis/Modeling
- Accession Number
- Attributions
- Awards
- Bibliographic Citation
- Bindings
- Carriers & Bound-withs
- Custodial History
- Exhibitions
- Fonts, Handwriting Types and Notations
- Limitation Statement
- Markings
- Materials
- Measurements of Part, Whole and Arrangement
- Notes in Art
- Pagination and Foliation
- Physical Condition
- Signature Statements
- Style/Period
- Titles in Art
OWL file development
- Core Ontology
- Core ontology used in the description of art and rare materials.
- Ontology specification
- Human-readable documentation
- Award Ontology
- Specialized ontology for the description of awards received by bibliographic resources or agents; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Custodial History Ontology
- Specialized ontology for the description of an object's custodial history or provenance; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Measurement Ontology
- Specialized ontology for the description of an object's measurements; reusable in contexts beyond the bibliographic domain.
- Ontology specification
- Human-readable documentation
- Activity Ontology
- The issue of whether to adopt the BIBFRAME Contribution model or the bibliotek-o Activity model has not yet been resolved. As a temporary accommodation, the Activity model is used, but the terms are stored in a separate ontology file with a distinct namespace for easy separation. The expectation is that either (1) the Activity model will be adopted, and the terms merged into the ARM Core Ontology, or (2) the BIBFRAME Contribution model will be adopted, the Activity ontology deprecated, and other associated revisions to the ARM ontologies and vocabularies be implemented.
- Ontology specification
- Human-readable documentation
Application Profiles (SHACL)
- Rare Monograph: https://github.com/LD4P/arm/tree/master/application_profiles/raremat_monograph/shacl
- Recommendation for SHACL maintenance and development
In-Person Meetings
- In-Person Meeting of ARTframe + Rare Materials Ontology Extension (January 2018)
- In-Person Meeting of ARTframe + Rare Materials Ontology Extension (March 2017)
- Virtual meetings occurred weekly between June 2016 and July 2018
Amber Billey (Bard College)
Amy Brown (Boston College)
Todd Fell (Yale University)
Steven Folsom (Cornell University)
Peter Green (Princeton University)
Linda Isaac (University of Miami)
Jason Kovari (Team Lead. Cornell University)
Margaret Nichols (Cornell University)
Michelle Paquette (Stanford University)
Audrey Pearson (Yale University)
Timothy Thompson (Yale University)
Rebecca Younes (Cornell University)
- (accepted) 2018.09.10 - Kovari, Wacker, Khan and Folsom. "Modeling and application profiles in the Art and Rare Materials BIBFRAME Ontology Extension". DCMI, Porto, Portugal
- 2018.06.23 - Kovari and Lapka. "Art & Rare Materials BIBFRAME Ontology Extension : from Modeling to Implementation". ALCTS Metadata Interest Group: Implementing Linked Open Data in the Real World, ALA Annual Conference, New Orleans, LA
- 2018.06.19 - Kovari, Isaac and Pearson. "Art & Rare Materials BIBFRAME Ontology Extension : from Modeling to Implementation". RBMS Conference, New Orleans, LA
- 2018.06.08 - Kovari, Isaac and Pearson. "Art & Rare Materials BIBFRAME Ontology Extension : from Modeling to Implementation". Yale University, New Haven, CT
Harvard will assess BIBFRAME's effectiveness as a data model for describing moving image materials for research needs, and identify domain-specific extensions to BIBFRAME, as well as vocabularies for description of these materials in a linked data environment.
For questions or comments, please contact Christine Fernsebner Eslao, Metadata Management, Harvard Library: eslao [ at ] fas [ dot ] harvard [ dot ] edu or submit issues via https://github.com/HLITS/LD4L_Film_Ontology
- Extension to BIBFRAME for moving image materials
- BIBFRAME profile for moving images descriptions
- Set of published BIBFRAME descriptions for a variety of moving image resources
- Written evaluation of the project and set of recommendations for future research and development
- Presentation of project findings to appropriate communities.
Analysis/Modeling
- Discussion papers and recommendations
- Ontology extension files
- Application profile and conversion mappings
- Survey of related ontologies and linked data efforts
- Pattern documents for types of moving image resources, extending shared activities model, modeling physical details (durations, defects, sound, color, relationships between works and expressions, item history)
- Ontology file with extension terms
See Harvard LD4L Labs wiki for documents
Linked Data Creation
- Creation of ISNI identifiers for named entities, beginning with directors
Tool Exploration / Requirements Definition
- Vitrolib custom forms
- Lookups/caching specs
- Reconciliation tools
Community Engagement
- Discussions with Harvard Film Archive and Northeast Historic Film staff
Link to working documents (for partners only)
Analysis/Modeling
- Core ontology contributions
- Moving image use cases
- Analysis of Harvard Film Archive data sources
- Initial moving image modeling sprint
See Harvard LD4L Labs wiki for documents
Tool Exploration / Requirements Definition
- Vitrolib custom forms
Collaboration
- Discussions with Library of Congress BIBFRAME pilot participants
- Pattern documents for LD4P/LD4L Labs BIBFRAME extension group
- LD4P/LD4L ontology extension meeting
Community Engagement
- Interviews with Harvard Film Archive and Northeast Historic Film staff
This sub-project of Linked Data for Production (LD4P) focused on the production of linked data descriptions for Columbia University’s art objects, which are overseen by Art Properties at the Avery Architectural & Fine Arts Library. The collection in total numbers about 10,000 objects, including public outdoor sculpture, paintings, photography, works on paper and decorative works. At present, data describing the art objects is captured in a spreadsheet according to locally developed guidelines following conventions developed by the art community and using both Library of Congress and Getty vocabularies. The data is also mapped to MARC and available through CLIO, Columbia University Libraries' catalog. This project developed into the Art & Rare Materials BIBFRAME Ontology Extension (ARM), a collaborative effort by the ArtFrame Extension Group and the Rare Materials Group.
- Converted Art Properties Sample data to ARM ontology
- Evaluation and testing of the VitroLib RDF cataloging tool for original descriptions
- Develop Art Properties ArtFrame Application profile
- Map ArtFrame ontology to Art Properties data
Months 20-27 (November-June 2018)
- Converted Art Properties Sample data to ARM ontology using Karma
- Created GitHub Repository to host Art Properties LOD documentation
- Develop Art Properties ArtFrame Application profile
- Map arm ontology to Art Properties data
- Participate in Tools calls (Melanie Wacker)
- Participate in extension lead calls (Melanie Wacker)
Months 17-19 (August-October 2017)
- Began work on Art Properties ArtFrame application profile (Melanie Wacker)
- Began work to map ArtFrame ontology to Art Properties data (Melanie Wacker)
- Participate in VitroLib Planning Group (Kate Harcourt, Robert Rendall, Melanie Wacker)
- Finalized MARC BSR to bibliotek-o mapping (Amber Billey (facilitator), Melanie Wacker)
- Participate in Tools calls (Amber Billey, Melanie Wacker)
- Participate in extension lead calls (Melanie Wacker)
Months 13-16 (April-July 2017)
- Attend LD4P/LD4L-Labs Community and Participants Meetings (April 2017, Melanie Wacker)
- Attend LD4L/LD4P Ontology Meeting at Princeton University, April 03-04, 2017 (Amber Billey, Robert Rendall, Melanie Wacker)
- See also ArtFrame page for further completed work
Months 10-12 (January-March 2017)
- Host ArtFrame/Rare Materials Ontology Development Sprint at Columbia (March 2-3, 2017). Topics: Notes & Annotations, Citations, Physical Description and Provenance.
- Participate in (all) Ontology Extension Group planning (Amber Billey, Melanie Wacker)
- Additional Art Properties MARC records loaded into CLIO (Roberto Ferrari)
- Participate in VitroLib Custom Form configuration group (Robert Rendall)
- Participate in MARC to bibliotek-o mapping (Amber Billey (facilitator), Melanie Wacker)
- Participate in LD4L-Labs Converter call (Melanie Wacker)
- Attend LD4P Meeting at ALA Annual, Jan. 23, 2017 in Atlanta (Amber Billey, Kate Harcourt, Robert Rendall, Melanie Wacker)
Month 9 (December 2016)
- Planning for combined ArtFrame/Rare Materials Ontology Development Sprint
- Completed participation in LD4L/BIBFRAME Alignement Group (Amber Billey, Melanie Wacker)
- Continue participation in collaborative LD4* ontology calls (Amber Billey, Melanie Wacker)
Month 8 (November 2016)
- Compare Artframe and Rare Material use cases and identify commonalities (Amber Billey, Jason Kovari, Melanie Wacker)
- Presentation on Artframe project at the Metropolitan Museum of Art, New York, Nov. 14, 2016 (Roberto Ferrari, Melanie Wacker)
- Test VitroLib Sandbox and provide feedback (Columbia Group)
- Continue participation in collaborative LD4* ontology calls and LD4L/BIBFRAME Alignment Group (Amber Billey, Melanie Wacker)
Months 4-7 (July-October 2016)
Create larger data set representing more than 2000 objects from the Art Properties collection (Columbia Group)
Reconcile new data set to external ontologies (Columbia Group)
Finalize literature review (Columbia Group)
Months 1-3 (April-June 2016)
Create public wiki page on the Columbia Wiki: https://wiki.library.columbia.edu/display/CBT/Columbia+BIBFRAME+Test+Home
Participation in collaborative LD4* ontology calls, LD4L/BIBFRAME Alignment Group, calls, ontology alignment sprint calls, and the identity subgroup calls
Follow wider linked data-related activities
Test open source tools for ontology development and data processing
Mapping of Art Properties data to MARC 21 resulting in the initial release of over 1,000 Art Properties MARC records through CLIO, Columbia University Libraries’ online catalog.
Map Art Properties data to BIBFRAME 2.0
Map BIBFRAME 2.0 to VRA RDF (shared with members of the VRA RDF Ontology Group)
Begin work to analyze CIDOC-CRM and FRBRoo data models and relate them to BIBFRAME 2.0
Outreach to linked data projects in the art domain
Presentation:
"Artframe, LD4P@Columbia" at METRO Workshop “The Bibliographic Framework Initiative (BIBFRAME): Development, Vocabulary and Use” (June 2 2016)
Pre-project activities (prior to April 2016)
Form Columbia core group consisting of members of the Libraries’ Original and Special Materials Cataloging Division, Avery Library, and the Art Properties Department.
Create private wiki for Columbia internal documentation
Create test dataset in spreadsheet form representing a variety of art formats
Reconcile spreadsheet data to external controlled vocabularies
Characterize objects in the Art Properties collection based on sample data
Survey existing linked data developments in the art domain
Map Art Properties data to Bibframe 1.0
Document issues and omissions in BIBFRAME 1.0 relating to art descriptions
Map Art Properties data to VRA RDF
Presentation:
"LD4P@Columbia: BIBFRAME and Art Properties", BIBFRAME Update Forum, ALA Midwinter Meeting, Jan. 10, 2016
Cornell will produce descriptions natively in RDF for hundreds of uncataloged noncommercial LPs in its Hip Hop collection. This collection supports a growing body of international scholars and educators who study Hip Hop not only for its global influence on popular music, art and style, but also for its role in articulating social and political issues. As such, it offers a promising test bed for exploring the use of linked data techniques to extend the reach of descriptive metadata. This work will complement Stanford's LD4P work, in association with the Music Library Association (MLA) and other partners, to develop a Performed Music Ontology based on BIBFRAME.
- Application profile for use in the description of Hip Hop LPs
- Natively created RDF descriptions of select items in Cornell's Hip Hop collection: CONTENT.nq (in nquads format)
- Evaluation of integration of natively created RDF descriptions with descriptions generated through conversion from MARC
- Evaluation and in-the-field testing of the VitroLib RDF cataloging tool
Analysis/Modeling
- Shared use cases with the Performed Music Ontology group.
- Created workflow diagram showing infrastructure and data flows.
- Actively participated in Rare Materials Ontology Extension group modeling efforts.
- Defined cataloging and modeling requirements in consultation with collection curators, including how to model and describe related events (tours, performances); complex attributions (people working under multiple names); annotations; provenance; and other item-specific details such as signed and numbered copies.
Tool Exploration / Requirements Definition
- Held initial meetings with catalogers and VitroLib developers to come to shared understanding of cataloging functional requirements.
- Created SHACL metadata application profile (MAP) to define implementation of relevant ontologies in the VitroLib editor and facilitate customization of alternative editors, enabling testing of different editing environments.
- Experimented with iterative updates to the VitroLib editing tool as they were released.
Collaboration
- Provided input, chiefly through representation in the MLA, to the development of the Performed Music Ontology, which Cornell will eventually incorporate into its VitroLib editing tool.
Linked Data Creation
- Creation of original RDF metadata for Hip Hop LPs in the VitroLib metadata editor.
Harvard will create native Linked Data descriptions for a selection of library cartographic resources including printed maps, atlases, digital geospatial datasets, and other cartographic information resources.
Together with LD4L-Labs partners, Harvard will convert a set of Harvard Geospatial Library metadata records into linked data descriptions.
Note: Documentation, ontology files, and data being created to support the Cartographic Materials extension project are available at the LD4P Cartographic Materials GitHub repository
For questions or comments, please contact Marc McGee, Geospatial Metadata Librarian, mmcgee (at) fas (dot) harvard (dot) edu, or submit issues via https://github.com/LD4P/Cartographic_Materials
- Set of natively cataloged BIBFRAME descriptions for a variety of cartographic resource types using the cartographic materials extension ontology.
- Develop a set of mapping rules for Federal Geographic Data Committee (FGDC) geospatial metadata standards to linked data.
- A set of Harvard Geospatial Library metadata records converted to linked open data using the cartographic materials extension.
Last updated: 7/17/2018
Analysis/Modeling
Finalize supporting documentation for the Geospatial and Cartographic Resource Ontology (GCRO), including recommendation papers and graphic models (July 2018)
Create a Geospatial and Cartographic Resources Ontology (GCRO) SHACL application profile(deferred to future date)
Production Cataloging
Test cataloging of a selection of geospatial and cartographic resources using the GCRO application profile in the Harvard geospatial data instance of VitroLib(deferred to future date)
Visualization work
- Completing demo visualization tool for converted geospatial metadata collections (January-June 2018)
Last updated: 4/27/2018
Analysis/Modeling
- Completed field mappings for geospatial metadata conversion: FGDC to bibliotek-o + Geospatial and Cartographic Resources Ontology (GCRO) extensions (3/2017)
Data conversion
- Converted a set of 8,800 Harvard Geospatial Library and 5,100 Stanford Earthworks geospatial metadata records to the target geospatial LOD ontology
- Reconciled agent names, topic keywords, places of publication, and place keywords to linked data entities in the converted data sets
- Data loaded into Harvard geospatial data instance of VitroLib
The HFA project will create linked data descriptions for a set of moving image materials by women directors--work that has previously been underexposed and in many cases is unique to the HFA.
For questions or comments, please contact Christine Fernsebner Eslao, Metadata Management, Harvard Library: eslao [ at ] fas [ dot ] harvard [ dot ] edu or submit issues via https://github.com/HLITS/LD4L_Film_Ontology
- Linked data descriptions of moving image resources
- Mapping existing metadata to BIBFRAME and extensions
See Harvard LD4L Labs wiki for documents
Linked Data Creation
- Automated & manual reconciliation of film director names to ISNI entities
- Manual creation of target output for converter development
- Pre-conversion data cleanup
- Ontology extension files
- Mapping and reconciliation files
Tool Exploration / Requirements Definition
- Vitrolib custom forms
- Vitrolib lookup specs
- Application profile
Collaboration
Community Engagement
Analysis/Modeling
See Harvard LD4L Labs wiki for documents
Linked Data Creation
HFA subjects & genre mapping to LCSH, Getty AAT, and FAST URIs
Tool Exploration / Requirements Definition
Vitrolib custom form for annotations
Vitrolib lookup specs for ISNI
Collaboration
Discussions with Library of Congress BIBFRAME pilot participants
Pattern documents for LD4P/LD4L Labs BIBFRAME extension group
LD4P/LD4L ontology extension meeting
Community Engagement
Interviews with Harvard Film Archive and Northeast Historic Film staff
Stanford’s linked data production project focuses on technical services workflows. For each of four key production pathways we will examine each step in the workflow, from acquisition to discovery, to determine how best to transition to a linked data production environment. Our emphasis is on following each workflow from start to finish to show an end-to-end linked data production process, and to highlight areas for future work. The four pathways are: copy cataloging through the Acquisitions Department, original cataloging, deposit of a single item into the Stanford Digital Repository, and deposit of a collection of resources into the Stanford Digital Repository.
- Technical Services Workflows: Overview of Tracer Bullet project, with details on Tracer Bullet 1: Conversion of vendor-supplied MARC records (PowerPoint)
- Screenshots of tools and data used in Tracer Bullets 2-4: local instance of Library of Congress BIBFRAME Editor, Blazegraph triple store (PowerPoint)
Existing MARC workflows
Overview of existing MARC workflows
Annotations for metadata flowchart
A. Order placed
- Data created (from BIB 9XX) -- For orders placed in vendor databases, the vendor sends order data in the bib record 9XXs, which are used to automatically generate the order record. For other kinds of orders, ACQ staff creates the order record manually in Symphony.
- Provisional data created (BIB) -- For orders placed in vendor databases, the vendor sends the provisional bib records to be automatically loaded into Symphony. For other kinds of orders, ACQ staff creates provisional bib records in Symphony or imports OCLC copy.
- Special handling instructions noted (BIB 9XX) -- For example, rush handling on receipt.
- Data published -- Searchworks indexes and displays the provisional bibliographic data.
- BIBFRAME transform -- Possible first BIBFRAME transform to allow discovery of on-order items.
B. Records received
- Data updated (from BIB 9XX) -- N/A
- Provisional data replaced with full data (BIB) -- Matches on title control number in Symphony; preprocessing required to match on order ID first for Casalini.
- Tracking initialized with status "ON ORDER" (ITEM) -- Item record is automatically created during the full record load.
C. Items received
- Tracking updated: "IN PROCESS" (ITEM) -- ACQ receiver wands item to pull up associated bib record, and changes the location to register that the item has arrived at SUL.
D. Manual updates
- Data updated to match item & local standards (BIB) -- Depending on the state of the metadata, this work may be done by ACQ or CLASS. See the receiving workflow for more detail.
- Tracking updated (iterative): routing directions and history (BIB 9XX) -- Each time the item changes hands, the 910 field is updated. This is used to track the location of the item, to troubleshoot any issues that arise during processing or later, and to direct where the item should go next.
- Tracking updated: manual metadata work marked complete (BIB 9XX) -- The cataloging date in the 916 field is updated from "NEVER" to the current date. This registers that manual work on the metadata is done, and that the item will proceed to Binding and Finishing.
- Tracking updated: shelving location (ITEM) -- This shows that processing is complete and that the item is ready for use.
E. Automatic updates
- Backstage: data replaced with authorities updated (BIB) -- MARC records in Symphony are overlaid with the updated records returned from Backstage.
- Nielsen, digital bookplates: additional data inserted (BIB 9XX) -- Nielsen data is inserted as 905, 920, and 986 fields; these are indexed in SearchWorks as the corresponding 5XX fields. For digital bookplates, a 979 field is inserted with a link to the bookplate object in the SDR.
- Casalini, OCLC: additional data inserted (BIB) -- Casalini TOCs are downloaded as PDFs, and a link to the local file is added as an 856. OCLC master record numbers are inserted as an 079.
- BIBFRAME transform -- Possible later BIBFRAME transform to minimize the work required to keep both datasets in sync.
F. External updates
- OCLC: SUL holdings updated -- N/A
Firm orders
Approval orders
Receiving
Automatic record enhancements
MARC-to-BIBFRAME conversion workflow implemented by LD4P
Conversion workflow overview
Described in Technical Services Workflows (PowerPoint)
Dataflow for conversion workflow
Exploratory MARC-to-BIBFRAME conversion workflows
Option 1: parallel processing in MARC and BIBFRAME, MARC record primary
Option 2: operational record in MARC, discovery data in BIBFRAME, BIBFRAME data primary
Conversion workflows by process
Reactive pipeline for larger-scale conversion of MARC records
Pipeline overview
Pipeline with implementation details
- Reactive pipeline demo (YouTube video)
- Scala/Kafka/Spark Linked Data Pipeline (github repository)
Supporting material
- User stories for converting MARC records to BIBFRAME (pdf)
- Plans for a Minimum MARC Bibliographic Record & Attachments (pdf)
- BIBFRAME-to-Solr mapping (Google doc)
- SPARQL queries for Solr mapping (pdf)
Workflow for original cataloging
Existing item-deposit workflows
Metadata created in self-deposit interface (non-SUL users)
This diagram represents metadata only (not file management or other admin tasks) for objects described entirely through the self-deposit tool Hydrus. The boxes under "Self-deposit interface" represent the metadata-related tasks a user can perform through that interface. The leftmost column of boxes are metadata tasks contained within the self-deposit tool. The right column of boxes involves writing data to DOR. Except where otherwise specified, these tasks apply to description of both collections and items. Currently this model is more commonly used for deposits originating from non-SUL users.
Metadata created in Symphony/MARC (SUL users)
This diagram represents metadata only (not file management or other admin tasks) for objects that are accessioned through the self-deposit tool Hydrus by SUL staff and cataloged by the MDU in MARC. The boxes under "Self-deposit interface" represent the metadata-related tasks an internal user performs through that interface in this workflow. Currently this model is commonly used for digital files received by Acquisitions or acquired by curators. The self-deposit tool serves as a convenient way to get a file into the SDR, notify MDU staff that cataloging is needed, and pass information such as the catkey and purl for the object to MDU. New items are deposited to existing collections that have been set up with the appropriate rights, permissions, etc.
RDF-based item-deposit workflow
The tracer bullet focuses on the metadata flow rather than the file management portion of this scenario, as the upcoming adoption of Hyku will have a significant impact. For the purposes of the tracer bullet, we are working with digital objects already deposited into the SDR and described via Hydrus. The "depositor" (actually metadata staff) will describe the objects in CEDAR based on the records in SearchWorks, but will reformulate the metadata in CEDAR independently rather than doing a simple mapping from MODS. One possible step is to generate an operational MODS record for SDR use from the RDF description, but for the purposes of the tracer bullet this operational record will not actually be written to the repository.
- CEDAR template for self-deposit item metadata (create CEDAR account to view)
- App to fetch folder contents from CEDAR and post them to a triplestore (github repository)
Existing bulk-deposit workflow
MODS-to-BIBFRAME bulk-deposit workflow
- Description of automated workflow to convert tabular MODS-based metadata for a digital collection to BIBFRAME (pdf)
- Sample tabular MODS-based metadata for digital collection (tsv)
- Mapping from tabular MODS-based metadata to BIBFRAME for sample collection (txt)
- Ruby script to convert tabular MODS-based metadata to BIBFRAME
- Resulting BIBFRAME metadata for digital collection (ttl)
Project Co-Managers
- Philip Schreur, Assistant University Librarian for Technical and Access Services
- Tom Cramer, Assistant University Librarian, Chief Technology Strategist, and Director of Digital Library Systems and Services
Acquisitions Department
- Alexis Manheim, Head of Aquisitions Department
- Linh Chang, Receiving and Access Librarian
Metadata Department
- Nancy Lorimer, Head of Metadata Department
- Joanna Dyla, Head of Medata Development Unit
- Vitus Tang, Head of Data Control and E-resources Unit
- Arcadia Falcone, Metadata Coordinator
Digital Library Systems and Services
- Darsi Rueda, Head of Library Systems Department
- Naomi Dushay, Digital Library Software Engineer
- Joshua Greben, Library Systems Programmer / Analyst
- Darren Weber, Digital Library Software Engineer
Analysis/Modeling
- Mapped Stanford's vendor-supplied copy cataloging and original cataloging workflows
- Mapped workflow for converting vendor-supplied records to linked data
- Generated requirements for work-based discovery environment, to take advantage of RDF
- Evaluated BIBFRAME profiles for original cataloging
Linked Data Creation
- Worked with vendor on improvements to supplied MARC data to enhance conversion to BIBFRAME
- Tracer Bullet 1: Converted set of 38,000 MARC records from Symphony to BIBFRAME using Library of Congress converter, loaded to Blazegraph triplestore, and indexed to Blacklight Solr environment via automated scripts
- Tracer Bullet 2: Created original descriptions of 50 items with local instance of BIBFRAME 2.0 Editor
- Tracer Bullet 3: Created original descriptions of about 30 digital assets using CEDAR RDF editor
- Tracer Bullet 4: Converted tabular MODS-based metadata for a digital collection to BIBFRAME using Ruby
- Piloted automated pipeline approach for conversion of MARC records to BIBFRAME, loading to triplestore, and indexing to Solr
Discovery Environment Creation
- Created Blacklight/Solr instance-based discovery environment with source data a mix of linked data and MARC data
- Developed a mapping from BIBFRAME 2.0 to Solr document for book materials
- Developed a mapping from RDF to Solr for digital assets
Tool Exploration / Requirements Definition
- Gathered requirements for conversion and editing tools
- Set up Registry of Tools
- Evaluated CEDAR template creation and metadata editing tool
- Developed a validation suite for MARC-to-RDF converters
- Created local instance of Library of Congress BIBFRAME 2.0 Editor
Presentations
- Technical Services Workflow Pipeline, Arcadia Falcone, LD4P Community Input Meeting 2017, Stanford, CA
- Linked Data for Production (LD4P): Technical services workflow evolution through tracer bullets (Stanford projects), Arcadia Falcone, Josh Greben, Nancy Lorimer, DCMI 2017, Washington, DC
- LD4P Tracer Bullet 1: an RDF-Based Copy-Cataloging Pipeline, Philip Schreur, ALA Annual 2017: LC BIBFRAME Update Chicago, IL
- The Shot Heard Round the World: Linked Data for Production's Tracer Bullet 1, Practical Copy-Cataloging in RDF, Philip Schreur ALA Annual 2017: Library Linked Data Interest Group Chicago, IL
LD4P partners
Columbia University ArtFrame | Columbia Art Properties
Cornell University Rare Materials Ontology | Cornell Hip Hop Archive
Harvard University Cartographic Extension | Harvard Geospatial Data | Harvard Film Archive | Moving Image Extension
Library of Congress Library of Congress BIBFRAME
Princeton University Princeton Derrida Archive
Stanford University Performed Music Ontology | Stanford Workflows
Learn more about LD4P
Upcoming Conferences Related to Linked Data | Includes planned attendance by LD4P and LD4L-Labs members |
LD4P Presentations and Publications | Presentations given at conferences and workshops by LD4P team members, and publications about our work |
Linked Data for Production Report, April 2016 (Partner Meeting #1) | Meeting report |
LD4P Grant Proposal | Text of the grant proposal |
Stanford Libraries leads collaborative grant to leverage linked data for enhanced access to information | Press release from launch of grant (April 2016) |
Learn more about Linked Data
PCC Report on Available Linked Data Training Resources | Document from the Program for Cooperative Cataloging |
Questions and comments: contact ld4p-program at stanford dot edu
- PCC. The Program for Cooperative Cataloging is partnering with LD4P and the work of its committees will complement the work of LD4P projects.
- The Importance of Identifiers in the New Web Environment and Using the Uniform Resource Identifier (URI) in Subfield Zero ($0): A Small Step That Is Actually a Big Step by Jackie Shieh and Terry Reese
- Steven Folsom's Presentation on the Work of the PCC Task Group on URIs in MARC (download to see speaker notes)
- No labels