Title (Goal)Quality Control Reports
Primary ActorAdmin
Scope 
Level 
Story (A paragraph or two describing what happens)

Administrator or collection administrator can launch quality control reports that identify instances of a specific use case on a collection by collection basis.

Collection administrators can use these reports to perform quality control checks. Software developers can use these reports to identify interesting use cases for system testing.

Possible reports

  • Number of items without a bitstream
  • Number of items with multiple original bitstreams
  • Number of items without a license
  • Number of items containing document bitstreams (pdf, word)
  • Number of items containing image bitstreams (jpg, jp2)
  • Number of items containing audio bitstreams
  • Number of items containing video bitstreams
  • Number of items containing compound bitstreams (zip)
  • Number of items referencing streamable media
  • Number of items with non-standard bitstream formats
  • Number of items with very large bitstreams
  • Number of items with very small (possibly corrupt) bitstreams
  • Number of items without thumbnails
  • Number of items with multiple thumbnails
  • Number of items with custom (non-generated) thumbnails
  • Number of document items without a full text extract bitstream
  • Number of items with a specific metadata field
  • Number of items with multiple instances of a metadata field
  • Number of items missing a required metadata field
  • Number of items with an improperly formatted metadata field (date field)
  • Number of items visible to anonymous user
  • Number of items not visible to an anonymous user
  • Number of items with bitstreams accessible to an anonymous user
  • Number of items with bitstreams not accessible to an anonymous user
  • Number of items with a finite embargo end date
  • Number of items with an indefinite embargo end date
  • Number of items with an embargo end date in the past
  • Number of items with metadata containing a URL
  • Number of items with metadata containing non-ASCII characters
  • Number of items containing full text (per the provenance field)
  • Number of items modified in the last X days
  • Number of items containing provenance data or annotation data within a non-public bitstream