Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The fly-in took place over the course of two days, starting at 9am and concluding at 5:30pm, followed by team dinners.

Opening considerations

...

Topics

Ingest

Requirements

  1. Ingest must support varied and currently unknown data sources
  2. Ingest must be scalable: the use of different backend triplestores or datastores must be allowed to support site-specific scaling requirements
  3. Content must be validated before being ingested. Example validation mechanisms: SHACL, ShEx, JSON Schema
  4. Ingest tooling must support two modes of operation: hands-free (automated) and curated
  5. Ingest tooling must support curation of data prior to ingest: disambiguation and reconciliation of entities
  6. Ingest will be entity-centric vs triple-centric. Example entities: Person, Grant, Publication, Authorship
  7. Ingest tooling must not require the use of a specific programming language

Out of scope

  1. Extraction of data from data sources
  2. Transformation of data from data sources

...and more