Date

Call-in Information

To join the online meeting:

Slack

Attendees

(star) indicates the note-taker

  1. Michel Héon
  2. Ralph O'Flinn
  3. Andrew Woods (star)
  4. Mike Conlon
  5. Sandra Mierz
  6. Benjamin Gross
  7. Benjamin Kampe
  8. Brian Lowe
  9. Bruce Herbert
  10. Christian Hauschke
  11. Huda Khan
  12. Maxime Belanger
  13. Nicolas Dickner
  14. Paul Albert
  15. Sarbajit Dutta
  16. Tatiana Walther
  17. William Welling
  18. Rachid Belkouch

Context

  1. The UQAM and TIB teams completed a first investigative sprint aimed at learning about Apache Kafka. We would like to start the work of the data ingest task force as soon as possible. We would also like to present the outcome of this first investigation while it is still fresh, check whether we are aligned with others, and gather feedback.

Agenda

  1. Introduction: Apache Kafka as a central component for data ingest in VIVO? (10 minutes by Michel Héon)
    Presentation entry point: 2020-12-16 VIVO-DataConnect ORCID Demo. Code: https://github.com/vivo-community/vivo-data-connect/tree/POC-extract-orcid
  2. Work at UQAM (5-10 min)
  3. Work at TIB (5-10 min)
  4. General discussion

Recording

Notes 

Draft notes in Google Doc

VIVO - DataConnect - ORCID - UQAM Demo

  1. Walking through context and use case
  2. Goal of using Kafka with VIVO
  3. Main idea of Kafka
  4. Recent sprint
  5. Walkthrough of flow
  6. Demo
  7. Summary
  8. Future plans

TIB

  1. Using Kafka as a consumer of VIVO messages
  2. Tasks
  3. VIVO Kafka-Module
  4. VIVO producer
  5. Code will be in GitHub soon

Discussion

  1. Interest in the architecture presented
  2. This initiative allows for outputs from VIVO
  3. Can past initiatives be used in this context?
  4. Could this support large-scale ingest?
  5. Next steps


Actions