VIVO Harvester
Fetch
JDBC Fetch
- Review Database Structure
- Identify Keys
- Identify Foreign Keys
- Dump the whole table
- Result Sets (Query or Point to Stored Procedure)
- Black and White List for tables to ingest
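The JDBC Fetch items above (identifying keys, dumping whole tables, and filtering tables through black and white lists) can be sketched as follows. This is a minimal illustration only: it uses Python's built-in `sqlite3` module as a stand-in for a JDBC source, and the function names `select_tables`, `primary_keys`, `foreign_keys`, and `dump_table` are hypothetical, not part of the Harvester.

```python
import sqlite3


def select_tables(conn, whitelist=None, blacklist=None):
    """Return table names to ingest, applying optional white and black lists."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    names = [row[0] for row in rows]
    if whitelist:
        names = [n for n in names if n in whitelist]
    if blacklist:
        names = [n for n in names if n not in blacklist]
    return names


def primary_keys(conn, table):
    """Return the primary-key column names of a table, in key order."""
    info = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
    # Row layout: (cid, name, type, notnull, dflt_value, pk); pk > 0 marks key columns.
    return [row[1] for row in sorted(info, key=lambda r: r[5]) if row[5] > 0]


def foreign_keys(conn, table):
    """Return (column, referenced_table, referenced_column) triples."""
    fks = conn.execute(f'PRAGMA foreign_key_list("{table}")').fetchall()
    # Row layout: (id, seq, table, from, to, on_update, on_delete, match).
    return [(row[3], row[2], row[4]) for row in fks]


def dump_table(conn, table):
    """Dump every row of a table as a list of column-name -> value dicts."""
    cur = conn.execute(f'SELECT * FROM "{table}"')
    columns = [d[0] for d in cur.description]
    return [dict(zip(columns, row)) for row in cur]
```

A real JDBC fetch would read the same information from the driver's metadata interface; the white and black list filtering logic would stay the same.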
OAI Fetch
- Working for CiteSeer
- Double-checked: not working for CiteSeerX; working for standard CiteSeer using URL cs1.ist.psu.edu/cgi-bin/oai.cgi. Successful harvest in XMLVault/OAI/CiteSeer.xml. -DRS
- Working for UF IR
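For reference, an OAI fetch boils down to issuing OAI-PMH `ListRecords` requests and following resumption tokens. The sketch below builds the request URL and parses one response page. The helper names are hypothetical, the `http://` scheme on the CiteSeer URL is an assumption (the note above gives only the host and path), and no network call is made here.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace used by OAI-PMH 2.0 response documents.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, next resumption token or None) for one page."""
    root = ET.fromstring(xml_text)
    identifiers = [e.text for e in root.iter(OAI_NS + "identifier")]
    token = root.find(f".//{OAI_NS}resumptionToken")
    next_token = token.text if token is not None and token.text else None
    return identifiers, next_token


# Assumed scheme; host and path are taken from the CiteSeer note above.
first_url = list_records_url("http://cs1.ist.psu.edu/cgi-bin/oai.cgi")
```

A harvest loop would fetch `first_url`, call `parse_page` on the body, and repeat with the returned token until it is `None`.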
HTTP Fetch
- Clean and make Generic or Extensible
- Configuration for PubMed
- Configuration for UF IR
PubMed SOAP
- Method cleanup
Translate
- New node/attribute detection in xsl
Translate Standard Schemas
- Microformats
- hCard
- hGrant
- hResume
- hCalendar
- vCard
- vCalendar
- iCal
- eduPerson
- eduOrg
RDF Workflow
- Work with Brian Lowe to pull RDF Workflow from VIVO into a library
- Add RDFWorkflow to Jena Library
- Utilize new library in translate methods for harvester
Score
- Remove record handler ingest and instead only work with models
- Utilize transfer for loading input models
- Disambiguate authors
- Add generic field-matching function
- Need to allow for progressive match on name, zip, affiliation, possibly co-author, etc.
- Fix jena create Model performance
- Explore/Implement Jena SDB
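One possible shape for the generic, progressive field-matching function mentioned above is sketched here. It is an assumption about how such a function could work, not the Harvester's Score implementation: the field names, weights, and threshold are illustrative values, and `progressive_match` is a hypothetical name.

```python
def normalize(value):
    """Lower-case and collapse whitespace so trivial differences do not block a match."""
    if value is None:
        return None
    return " ".join(str(value).lower().split())


def progressive_match(candidate, existing, fields, threshold):
    """Compare fields in priority order, accumulating weight for each agreeing field.

    fields is a list of (field_name, weight) pairs, most trusted first.
    Returns (matched, score, names_of_fields_that_agreed) and stops early
    as soon as the accumulated score reaches the threshold.
    """
    score = 0
    agreed = []
    for name, weight in fields:
        a = normalize(candidate.get(name))
        b = normalize(existing.get(name))
        if a is None or b is None:
            continue  # missing data neither helps nor hurts
        if a == b:
            score += weight
            agreed.append(name)
        if score >= threshold:
            return True, score, agreed
    return False, score, agreed


# Illustrative weights only; real values would need tuning against known authors.
FIELDS = [("name", 50), ("zip", 20), ("affiliation", 20), ("coauthor", 10)]
```

With these example weights and a threshold of 70, a name match alone is not enough; at least one further field must agree before two records are treated as the same author.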
Algorithms
- Pair Wise
- Neural
- Regex
Configuration
- Must allow for parameters to be passed in from command line for any algorithm
- Must allow for process flow and order dependency (possible solution is to pipe output and run score multiple times)
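The two configuration requirements above (per-algorithm command-line parameters, and ordered passes where each pass consumes the previous pass's output) could look roughly like the sketch below. The `--algorithm` and `--param` flags and the function names are hypothetical and do not describe the Harvester's actual command line.

```python
import argparse


def parse_param(text):
    """Parse one KEY=VALUE command-line parameter into a (key, value) pair."""
    key, sep, value = text.partition("=")
    if not sep or not key:
        raise argparse.ArgumentTypeError(f"expected KEY=VALUE, got {text!r}")
    return key, value


def build_parser():
    parser = argparse.ArgumentParser(prog="score")
    parser.add_argument("--algorithm", required=True,
                        choices=["pairwise", "neural", "regex"])
    # Repeatable flag, so any algorithm can take any number of parameters.
    parser.add_argument("--param", action="append", type=parse_param,
                        default=[], metavar="KEY=VALUE")
    return parser


def parse_pass(argv):
    """Return (algorithm_name, parameter_dict) for one scoring pass."""
    args = build_parser().parse_args(argv)
    return args.algorithm, dict(args.param)


def run_passes(passes, records, algorithms):
    """Run scoring passes in the given order, piping each output into the next pass."""
    for argv in passes:
        name, params = parse_pass(argv)
        records = algorithms[name](records, params)
    return records
```

Here `algorithms` maps an algorithm name to a callable taking `(records, params)`. Keeping each pass as a separate argument list mirrors the idea of running score several times and piping the output.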
Transfer
- Query for Ontology Version
- Translate up to current Ontology Version
- Update/Overwrite/Append to Models
- Integrate with Data Provenance
- separate graphs for each harvested data source
- Include MetaData about the source of the data
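As a rough illustration of keeping a separate graph per harvested source, with metadata about that source, the sketch below serializes triples as N-Quads, using one graph IRI per data source. The graph base IRI `http://example.org/harvest/` and the choice of the Dublin Core `source` and `title` terms are assumptions made for the example.

```python
DCTERMS = "http://purl.org/dc/terms/"


def literal(text):
    """Serialize a plain string as an N-Quads literal with basic escaping."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def to_nquads(source_name, source_url, triples,
              graph_base="http://example.org/harvest/"):
    """Place triples in a per-source named graph and append source metadata.

    triples holds (subject_iri, predicate_iri, object) tuples where the object
    is already serialized: an IRI in angle brackets or a literal() result.
    """
    graph = f"<{graph_base}{source_name}>"
    lines = [f"<{s}> <{p}> {o} {graph} ." for s, p, o in triples]
    # Metadata about the source, stored in the same named graph.
    lines.append(f"{graph} <{DCTERMS}source> <{source_url}> {graph} .")
    lines.append(f"{graph} <{DCTERMS}title> {literal(source_name)} {graph} .")
    return lines
```

Because every statement carries its graph IRI, updating or overwriting one source's data only touches that source's graph.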
Qualify
- Configurations examples
Utilities
- SDB for the VIVO Harvester (Jena Connect)
- Fix argument parsing for model overrides (i.e., in transfer "input-model", "model name for input (overrides config file)")
GUI
- Point to VIVO Harvesters (on external systems) by entering in their information and finding the configuration settings
- View Config Files
- View Logs
- Tabulate data from logs
- graphical elements to display statistical data
- Provide configuration wizards
Additional Libraries
DataSources
- NIH
- Grants
- PubMed Publications
- NSF
- Generic Government Sites
- Grants.gov
- ClinicalTrials.gov
- Scopus
- ISI (depending on agreements)
- Sakai
Testing
- Implement JUnit testing
- Ensure JUnit testing works
- Create development, staging, and production servers for harvester testing
Process
- Build complete release script
- Implement release process for Harvester
- Create set of rules for staging
- Create set of rules for commits
- Create set of rules for packaging
- Implement release process for Virtual Appliances
- Create set of rules for updating
- Create set of rules for releasing
Building
- Implement local Maven repository to deal with 303 bug in Maven.
- Add RPM to Maven build
Packaging
- Fix issues with virtual machines
- Ensure virtual machine creation is done as part of release (find a way to automate/semi-automate?)
Documentation & Demoing
- How-tos with pictures
- Example of a running People Harvest on a public server (vivo.ctrip.ufl.edu)
- Example of a running PubMed Harvest on a public server (vivo.ctrip.ufl.edu)
- Create tutorials on Sourceforge usage and development integration.
- How to FTP upload
- How to SSH in
- Maven/Javadoc integration
External Interfacing
- Installing Joseki How-To
- Installing Sesame How-To
- Drupal How-To
- SPARQL Endpoint
- Linked Data
- WordPress How-To
- SPARQL Endpoint
- Linked Data
- Sakai
- SPARQL Endpoint
- Linked Data
- Example of a SPARQL endpoint (vivo.ctrip.ufl.edu)
- Example of using a SPARQL endpoint (ctrip.ufl.edu?)
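A minimal sketch of using a SPARQL endpoint, as called for in the examples above, is to send the query as the `query` parameter of an HTTP GET (as defined by the SPARQL protocol) and read the results in the SPARQL JSON results format. The endpoint URL below is a placeholder, not a confirmed address of any VIVO server, and the helper names are hypothetical.

```python
import json
from urllib.parse import urlencode


def sparql_get_url(endpoint, query):
    """Build a SPARQL protocol GET request URL for a query."""
    return endpoint + "?" + urlencode({"query": query})


def bindings_to_rows(results_json):
    """Flatten a SPARQL JSON results document into a list of plain dicts."""
    data = json.loads(results_json)
    variables = data["head"]["vars"]
    return [
        {v: binding[v]["value"] for v in variables if v in binding}
        for binding in data["results"]["bindings"]
    ]


# Placeholder endpoint; substitute the address of a real SPARQL endpoint.
ENDPOINT = "http://vivo.example.org/sparql"
QUERY = "SELECT ?s WHERE { ?s ?p ?o } LIMIT 1"
url = sparql_get_url(ENDPOINT, QUERY)
```

When fetching `url`, a client would typically send an `Accept: application/sparql-results+json` header and pass the response body to `bindings_to_rows`.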
VIVO Authentication
- Establish Framework for Plug-and-Play Authentication Modules
- Kerberos
- Shibboleth
- AD
- LDAP
- Ensure integration of authentication systems with authorization system of VIVO (VIVO group levels)
- GUI
- Security Record Viewer (view security logins from VIVO Application)
- Security Set-up
- Link authentication groups to security levels in VIVO
- Specify the type of authentication (Shibboleth, Kerberos, whatever is installed)
- Install necessary tools on server from app (such as Shibboleth)
- Modify security files such as shibboleth.xml
VIVO Packaging
- Amazon Cloud
- VMWare
- Debian Package
- RPM Package
- War File
Processes
- Automated Release Process
- Work with Jim so that a single button press against the release code builds all VMs and other packages