Attendees
Jonathan Corson-Rikert
Brian Caruso
Brian Lowe
Jim Blake
Jonathan Markow
Andrew Woods
Agenda
- Any updates?
- review questions and any answers from last time to understand the activities we need, that are both organizational and development, to get a better oerall picture
- Further definition of technical requirements
- Where is our knowledge of Hadoop?
- How applicable to the VIVO linked data index builder is the code for indexing RDF versions of MARC records?
- Further definition of business
...
- requirements – straw man proposals for discussion
- Stage 1 – by VIVO conference mid-August 2013
- reproduce the current index and update the current vivosearch.org site
- establish a workable frequency of update – monthly or weekly at most
- then expand 5-10 more institutions (Melbourne, Colorado, Duke, Brown, Eindhoven or VU Amsterdam, Cambridge might be candidates)
- Stage 2 – for October, 2013 CTSA PI meetings
- work with ~5 CTSAs to demonstrate an index including only designated CTSA investigators
- UF, Weill Cornell, Washington University, Indiana, Harvard
- work with CTSA researcher networking group
- Stage 3 – by the end of 2013
- work with Colorado to help them set up an independent Colorado search across Boulder, Colorado Springs, and Denver campuses
- prepare more detailed ongoing business plan as part of marketing campaign for 2014
- Other topics
...
Discussion
How much technical work is required to updating Distributed Indexer?
Review of questions
- Worthwhile to review questions and responses
- What will this project consist of, soup to nuts
- What will the total costs look like?
- The idea of an uber-project with CTSAs
- CTSAs may not currently be in a position to leverage a cross-institutional VIVO Search: disambiguation
- Which institions do we target in which phases?
- Pilot group: friend-institutions and few CTSAs
- Post-pilot: additional CTSAs
- Defining phases/stages
- Pilot phase
- CTSA phase? General open phase?
- Defining sequence of tasks
- Defining roles
- Project roles
- Production service roles
- How to define level of effort for various roles
- Work backwards from the required tasks
- May want to consider additional roles
- Business liason
- Fund-raising manager
- Technical lead
- Division of labor
- DuraSpace/VIVO relationship is that DuraSpace provides advice/mentoring
- It may or may not make sense for DuraSpace to participate in implementation
- DuraSpace can help with marketing and cloud-service support
- Keys to success
- Additionally, needs to be as easy as possible on client-side
- Partner specialist will also be required
- Deciding on frontend technology
- What in-house skills are available?
- What are the application needs
- Dynamic/scalable servers useful in Hadoop context, less so for frontend app
- Indexing frequency: could be a business model around higher frequency
- Sites would have to be able to support hammering of linked-open-data requests
- Improve UI to support increased institutions and facets
- Need ability to adjust relevancy rankings
- Analysis for disambiguation of source data
Next steps
- Technical analysis
- Move towards technology choices
- Business model?