Page tree
Skip to end of metadata
Go to start of metadata

This meeting is open to anyone who would like to attend; it will provide an opportunity for members of the Fedora community to connect, share information, and provide updates on local projects and initiatives. This meeting will be based around European timezones, but anyone is welcome to attend. 

When

September 9, 2020

Where

Online via Zoom

Agenda/Presentations

All times are CEST.

Time

Topic

Presenter

13:00 - 13:10Welcome and Introductions


13:10 - 13:30Fedora Program and Community UpdateDavid Wilcox, LYRASIS
13:30 - 13:50Islandora UpdatesMelissa Anez, Islandora Foundation
13:50 - 14:10Phaidra - University of ViennaRaman Ganguly, University of Vienna
14:10 - 14:30Break
14:30 - 14:50Memobase 2020: Using Fedora as central repository to store linked data and digital objectsThomas Bernhart, Docuteam
14:50 - 15:10A field report from the relaunch of an infrastructure: A FEDORA based framework for long-term storage and dissemination of research dataJohannes Stigler, Centre for Information Modelling, University Graz
15:10 - 15:30Break
15:30 - 16:20Lightning Talks:
  1. Fedora Commons in the CLARIN-D infrastructure
  2. eHumanities
  3. Past problems - Fine Future
  4. Unversity library "Svetozar Marković"


  1. Thomas Eckhart, Leipzig University
  2. Jaime Penagos, University Library Ludwig-Maximilian University Munich
  3. Oliver Schöner, Berlin State Library
  4. Dragana Stolić, Unversity library "Svetozar Marković"
16:20 - 16:50Wrap-up and Discussion

Notes

13:10 - 13:30 Fedora Program and Community Update David Wilcox, LYRASIS

Questions:

13:30 - 13:50 Islandora Updates Melissa Anez, Islandora Foundation

  1. Islandora: Drupal + Fedora
    1. Islandora 7: Vertical stack (cheeseburger)
    2. Islandora 8: Distributed architecture (bento box)
  2. Using Fedora for smart storage, fixity, versioning (memento), and OCFL (in version 6)
    1. Only putting preservation copies in Fedora; access copies are stored in Drupal
  3. Flysystem
    1. Fedora is treated as the Drupal filesystem from the user perspective
  4. Islandora community
    1. 320+ installations
    2. Backed by the Islandora Foundation, which is member supported
  5. Examples:
    1. Archives Central → probably the first digital archive using the Records in Context ontology in production
  6. Questions
    1. Does Islandora make assumptions about data model used to describe the collection → default form is based on MODS, but Islandora is very flexible regarding the data model (use Drupal forms and map those to Fedora properties)

13:50 - 14:10 Phaidra - University of Vienna

  1. Phaidra - repository system based on Fedora
  2. Started in 2007
  3. Both the repository itself and the services around the repository
  4. Community: Austria, Italy, Western Balkans
  5. Architecture
    1. Changed from monolithic to modular
    2. Fedora at the centre, surrounded by Phaidra Core
    3. System of plug-ins and a rich API
    4. Multiple user interfaces and workflows
  6. Workflows
    1. 4 phased workflow model
    2. Pre-ingest, ingest, management, re-use
  7. Focused on long tail of research data

14:30 - 14:50 Memobase 2020 - Thomas Bernhart, Docuteam

  1. Using Fedora to store linked data and digital objects
  2. Memobase: audio/video collection of Swiss heritage
    1. http://memobase.ch
    2. Collaboration between University of Basel, Outermedia, and Docuteam
    3. Built on Fedora 5
  3. Data model
    1. Transition from XML to RDF
  4. Architecture
    1. SFTP server where users can upload data
    2. Transformed in import pipeline and stored in Fedora
    3. Processed via messages (Kafka cluster)
  5. Fedora integration
    1. Fedora ingest service
    2. Messages (Activity Streams) to event handler
    3. Metadata extractor
  6. Challenges
    1. Mapping external URIs to internal URIs
    2. Handling references between resources (cannot create relationships to resources that don’t exist yet - this is no longer an issue in Fedora 6)
    3. When to use different LDP container types


14:50 - 15:10 A field report from the relaunch of an infrastructure: A FEDORA based framework for long-term storage and dissemination of research data - Johannes Stigler, Centre for Information Modelling, University Graz

  1. Migrated Fedora 3 to 4.7
  2. Using Fedora since version 2
  3. New versions of Fedora represent a paradigm shift
  4. Infrastructure
    1. Fedora is a storage layer in a complex system that stores research data generated by a variety of digital humanities projects
    2. Based on XML structure
    3. 114,000 digital objects in Fedora 3, many of which are complex
    4. Services are triggered by user uploads
  5. Content models
    1. Structural definitions for object types
    2. Manuscript transcripts based on TEI
    3. Content models trigger workflows
    4. Content models support specific XML schemas
  6. Migration project
    1. Need to migrate Fedora 3 REST API functionality to Fedora 6
  7. Java based thin desktop client to handle ingest workflows
  8. Using Docker and Kubernetes
    1. Advantage: isolation
    2. Disadvantages: complexity, difficulty identifying errors, need to learn new practices
    3. Using Kubernetes for clustering and orchestration

Questions:

  1. From Gunter Vasold: What software is used for the Media Server?
    Response: I think it is a custom implementation from the University Basel

15:30 - 16:20 Lightning Talks:

Fedora Commons in the CLARIN-D infrastructure - Thomas Eckhart, Leipzig University

  1. CLARIN - Common Language Resources and Technology Infrastructure
    1. CLARIN-D - National consortium in Germany
  2. Distributed infrastructure with standardized interfaces
  3. Using OAI-PMH for metadata exchange - most important endpoint
  4. CLARIN-D (10 centres): majority Fedora 3.x + ProAI
  5. Migration
    1. Issues migrating to Fedora 4.x
    2. Majority still using Fedora 3.x
    3. Created a requirements document

eHumanities - Jaime Penagos, University Library Ludwig-Maximilian University Munich

  1. Research data from the digital humanities
  2. Lexical information from the Alpine region
    1. Approximately 500,000 datasets
    2. Many relations between datasets
  3. Infrastructure
    1. Used to transform XML to specific file formats
    1. Fedora 5, Solr, Blacklight
    2. Apache Camel
    3. Storing CSVs
  4. Challenges
    1. Scalability: issues with messaging queue
    2. Many concurrent operations can lead to performance degradation
  5. Questions
    1. Cluster vs. standalone?
    2. Impacts of eventual migration to Fedora 6?

Past problems - Fine Future - Oliver Schöner, Berlin State Library

  1. Many Fedora 4.x repositories at the Berlin State Library
    1. ITR (East Asian department): Many millions of objects (mostly pages)
    2. Stabirep: Born digital materials
    3. RGZ: Prussian judgments
    4. DC: Bibliographic materials (no binaries)
  2. Fedora 4.5.1 server shutdown due to power outage
    1. One file in the Infinispan layer had a bad OS owner
    1. Fedora restarted but Tomcat did not
    2. Problem was difficult to discover and solve
    3. If migrations were easier the data could have been transferred to a new instance with an upgraded Fedora
  3. Believe that OCFL is the future

Phaidra / Islandora - Dragana Stolić, University library "Svetozar Marković"

  1. Using Phaidra for almost a decade
  2. Started using Islandora 2 years ago
  3. Website: https://phaidrabg.bg.ac.rs
    1. Started in 2011
    2. PhD theses
    3. Permanent archiving
    4. Looking for a more adjustable tool
  4. Serbian Literary Criticism
    1. Using Islandora 7
    2. Self-archiving of literary criticism texts
  5. Phaidra is very user-friendly and simple
  6. Islandora is more adjustable
  7. Archiving ethnographic data
    1. Ethnographic data is very compatible with Fedora / Phaidra
    1. Which repository to choose?
    2. Decided to return to Phaidra when the time comes (probably next year)

Wrap-up and Discussion

  1. Nice to have an event scheduled around European timezones to hear from the European Fedora community
  2. Issue with Fedora: too many major version upgrades, hard to absorb
  3. Many smaller institutions have a hard time installing and setting up Fedora. Not out-of-the-box enough
    1. ISLE provides a Docker deployment of Islandora
    2. Phaidra also looking into Docker deployment
    3. Some institutions choose Fedora over an out-of-the-box solution in order to achieve a lot of flexibility
  4. How often to have users group meetings?
    1. An open forum to ask questions
    1. Meetings should occur at least once per year, or perhaps once every 6 months
    2. More discussion on problems and open issues, difficulties
    3. Another call after some initial experiences with migrations to discuss

Resources