...
The fly-in took place over the course of two days, starting at 9am and concluding at 5:30pm, followed by team dinners.
Opening considerations
...
Topics
Ingest
Requirements
- Ingest must support varied and currently unknown data sources
- Ingest must be scalable: the use of different backend triplestores or datastores must be allowed to support site-specific scaling requirements
- Content must be validated before being ingested. Example validation mechanisms: SHACL, ShEx, JSON Schema
- Ingest tooling must support two modes of operation: hands-free (automated) and curated
- Ingest tooling must support curation of data prior to ingest: disambiguation and reconciliation of entities
- Ingest will be entity-centric vs triple-centric. Example entities: Person, Grant, Publication, Authorship
- Ingest tooling must not require the use of a specific programming language
Out of scope
- Extraction of data from data sources
- Transformation of data from data sources
...and more