...
- In Islandora, I am using forms to create one or more items, and I am editing relationships incrementally
- In the National Science Digital Library (defunct), I have a constant stream of third party annotations being added
- In RepoMMan, I need the send a paper through an approval process where the approvers may want to make editorial changes
- In Hydra (I don't have one but I bet its there)
Repository Operations
- TBD - Map to API
Simple Ingest
Simple Ingest consist of upload of single or small amounts of content and metadata. It can be accomplished with a single atomic operation or a short series of operations, usually RESTful, without required intermediate reads prior to completion. The duration of the small ingest is expected to be approximately the time it takes to upload the content and metadata starting with the beginning of the connection, where the connection is terminated after the operation. It is expected that the ability to read (access) the uploaded content and metadata should happen fairly soon after the upload is complete.
...
- I want upload an image through Hydra
- I want to upload a paper in NIST
- I am using a sync tools to upload a slow flow of new items
...
- TBD - Map to API
Bulk Ingest
In bulk ingest, a large quantity of content and metadata is ingested as a logical unit or is continuous. This may be accomplished using number of repository operations or may utilize methods that are optimized for bulk ingest. It is characterized by the expectation that there may be a defined delay between when the ingest is started and part or all of the content and metadata becomes available for read.
...
- I am the Bodleian Library and I want to create a duplicate (backup) of my digitised texts
- I am SIdora and I want to ingest the gene sequence for a Manakin (bird) coming from my in house gene sequencers
- I am using a sync tool to upload a new collection via I2
- I want to upload a 2000 graphs each consisting of 10000 items, and I want to be sure that each graph is complete, and the whole set is complete
Repository Operations
- TBD - Map to API
Simple Access
Simple access (a.k.a simple read or download) is the download of content and metadata (a.k.a representation of a resource) as a single user operation and one or a small number of repository operations. It usually RESTful, and usually contained a single request. Simple Access must not require any concurrent writes to accomplish the single user operation. The content and metadata stays fixed from the beginning to the end of the access.
...
- I want to use the Exhibition module in Islandora to present a static website
- I want to present a dynamic website through Hydra
- I am using a sync tool to download a slow flow of new items
Repository Operations
- TBD - Map to API
Conditioned Access
When streaming media, dropouts present a significant problem. The user expects to be able to access the contents without interruption. This may require a front end tool for buffering so the stream need not be perfect but good enough for the buffering tool.
...
- I am using Hydra to show a class lecture
Repository Operations
- TBD - Map to API
Mediated Access
Not all of the content is managed by Fedora but some resources are is provided by reference from a remote web service. Fedora would retrieve the representation (content and metadata) from the web service and present it as if it was a resource in Fedora.
...
- I am using the Data Conservancy Service but I want so show Glacier images kept by the NSIDC
- I have papers stored in Islandora but I want to get the supporting datasets from SIdora
...
- TBD - Map to API
Bulk Access
Download of large amount of content as single user operation. This may require any number of repository operations to accomplish. Whether content and metadata stays fixed from the beginning to the end of the operation is to be defined. This is needs consideration a whole intellectual entity, graph or DIP is considered the unit. Also we need to consider what this means for continuous access operations.
...
- I am the Bodleian library and an EMP device went off. I need to use a sync tool to download a major set of digitized texts
- I am SIdora and I need to send a whole set of genome fragments to be assembled a ORNL
- I am Hydra and I need to send a SIP to APTrust and DPN
Repository Operations
- TBD - Map to API
Preliminary Testing Matrix
Category | UsesUser Operation | APIsRepository Operation | Test Metric | Test | Priority | Notes |
---|---|---|---|---|---|---|
Authoring | Islandora Authoring | |||||
Authoring | Authoring with Workflow | |||||
Simple Ingest | Duration | |||||
Simple Ingest | Large Files - Media | |||||
Simple Ingest | Rate | |||||
Simple Ingest | Large Files - Media | Rate | ||||
Simple Ingest | Large File Count | Rate - Normalized | ||||
Bulk Ingest | Large Files - Mixed | |||||
Bulk Ingest | Large File Count | Rate - Normalized | ||||
Simple Access | Static Web Site | Random Access |
Testing Considerations
...
Step up rates X2 until flat line
Then proceed to declining performance and failure or non-response
Fedora Configuration
- Not Clustered
- Clustered
Resources
- https://github.com/pinterest/bender/blob/master/http/TUTORIAL.md
- https://github.com/fcrepo4-labs/fcrepo-test-grinder
...