DuraSpace Notes on SHARE Proposal from ARL/AAU/APLU
This page is just general notes on the OSTP-related SHARE proposal from ARL/AAU/APLU . It is not a formal statement from DuraSpace. But, members of the community are welcome to comment on these notes or help to enhance them.
General comments/questions from DuraSpace/others are made throughout the below summary and are marked as such.
The full text (PDF) of the proposal is available at: http://www.arl.org/publications-resources/2772-shared-access-research-ecosystem-share-proposal
High Level Summary
The SHARE proposal suggests a number of functions and metadata fields that would need to be captured by repositories. We've attempted to briefly summarize them below. But, the full text of the Proposal has additional details.
Minimum SHARE metadata fields
These are the listed minimum SHARE metadata fields as noted near the beginning of the "How SHARE Works" section of the proposal:
- article title
- journal title
- award number
- Principal Investigator ID (ORCID or ISNI)
- designated repository number
In Support of Principal Investigators
As described in the paragraph about requirements of Principal Investigators (PIs), repositories may need to be able to "capture" or log the following:
- "Sufficient copyright licenses to enable permanent archiving, access, and reuse of publications"
General Repository Functions
As described in the "SHARE workflow" paragraphs, a repository would need to support the following functions:
- Be able to accept XML versions of manuscripts from Journal publishers
- "Journal submits XML version of final peer reviewed manuscript to the PI's designated repository"
- Make article available to search engines
- Google, Google Scholar, Yahoo, Bing, etc
- Must be able to link to publisher's website
- Support embargo
- link to publisher's website until embargo period expires
- make full-text of article available post-embargo
- Certify compliance with agencies
- Automatically notify "both the funding agency and the PI's institutional research office that a deposit has occurred"
As noted in the proposal, the "following precursors are required immediately to implement SHARE as a solution to the OSTP memorandum.":
- Principal Investigator (PI) Identifier (recommended to use either ORCID or ISNI)
- Award Identification Number - assigned by Federal agencies
- Copyright License Terms - "requires a standardized and coded expression ... for machine processing"
- Repository Designation ID Number - "to identify the repository access location"
- Preservation Rights - "required to be coded into the metadata residing with the record"
Phase ONE (12-18 months)
Additional requirements for Phase One, after which "the SHARE system will be available for both deposit and access".
- PI Identifier (Also mentioned in "Requisite Conditions")
- Award Number (Also mentioned in "Requisite Conditions")
- Publication ID - "unique, persistent identifier to reference the journal article of the publication"
- Data Set ID - "resolvable, persistent identifier to location of stored data or data sets that are linked to the published article"
- Copyright License Conditions - includes embargo information
- Sponsoring/Funding Agency Name - "Link to agency providing funding so that reports can be automatically returned"
- Reporting - "Creates a feedback loop to the federal agency and the PI's research office providing tracking of publications resulting from awards funded by the agency"
- Core Usage Statistics - "Reports to authors (and agencies, if desired) include statistical data on usage activity and downloads of their publications."
- Metadata Exposed to Search Engines
- Some connections to Digital Preservation Network (DPN)? - "All phases connect with and take advantage of the Digital Preservation Network (DPN)"
Phase TWO (6-12 months after phase one)
Required in support of phase two. Begun "concurrently with Phase One activities".
- Submission Workflow - "Development of software to automate and optimize article submission from author through repository and to publisher"
- Requires publishers to comply with single, standardized submission mechanism
- Usage Metrics
- Incorporate OAI-ORE
- Adoption of Best Practices
Phase Three envisions "more complex interactions with SHARE", and includes:
- Text and Data Mining
- Bulk Harvesting
- Semantic Data
- Relationships among publications
- API Specifications
- In support of interation with repositories
- Open Annotation
- Web-centric annotation framework
Phase Four involves "development of infrastructure relationships to support data requirements of federal agencies"
- Data Curation and Associated Software
- Linked Data
- Shared Distributed Resources in Repositories