You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »

Hiring for VIVO projects

Current VIVO-related job postings

One big aspect of starting a VIVO project is to find the necessary staff to do the work, and the VIVO team receives many questions about the skills and level of effort required.  There is no one-size-fits-all answer, since the IT and application support environments vary so widely from large research universities and government agencies to small schools and non-profits.

The recent VIVO survey (written by Paul Albert at Weill Cornell Medical College) includes a question getting at both the amount and division of labor that participating sites have needed.  It's telling how many different job tasks can be associated with a VIVO, and also how much variation there is in the range of answers.

RoleAverage FTEMinMaxComment
Project executive or champion0.1300.1 - 0.5 

Project manager or coordinator

0.3001.1 - 2.0 
Business analyst/information architect0.0900.1 - 0.5 
Ontologist0.1500.6 - 1.0e.g., for local ontology extensions
Marketing/outreach person0.1200.6 - 1.0 
IT/systems engineer0.1900.6 - 1.0e.g., for installation, configuration, and tuning
Programmer/ETL specialist0.5202.1 - 3.0e.g., for writing scripts to ingest and update data or re-purpose data from VIVO
Database administrator0.0900.1 - 0.5 
Web developer0.1300.6 - 1.0e.g., for branding and customization

Content coordinator

0.1600.6 - 1.0 

Manual entry clerk

0.0700.1 - 0.5 

For the respondents, the total average FTE for the 11 roles described is 1.95 FTE, with a maximum per responding site of 6.5 FTE and a minimum of 0.2 FTE (from an institution in the exploratory stage).

It's also likely that some roles such as project champion, marketer, or even project coordinator may not be officially accounted for but added into the jobs of people already on staff.

The programmer/ETL specialist has the highest average commitment of effort at 0.5 FTE, which certainly makes sense, followed by the project manager/coordinator at 0.3 FTE.  IT/systems engineer at 0.19, content coordinator at 0.16, ontologist at 0.15, and marketer/outreach at 0.12 follow – but this brings up another frequently cited issue.

How many sites will have specialists in each of these areas with suitable fractions of FTE available to work on VIVO?  More likely a small group of staff will have to be assembled to cover these 11 roles plus any other locally-specific tasks, and these people will need to be multi-talented.  How the tasks break down depends on the individuals involved.

Three major skill domains – policy, content, and technical

Policy

Most VIVO projects need a leader who may not be the executive sponsor but who has the campus relationships and can translate big picture goals into achievable steps suitable for the timeline and resources available. Most often this person becomes the project manager to coordinate the work of a content specialist and a programmer/web developer.

Very often this person will do most of the outreach to build the case for VIVO and promote early prototypes or case studies.  Talent at writing and presentation is important; having the skills to develop marketing materials would be wonderful but may often be brought in on an as-needed basis – look for a talented visual designer who can establish a clear branding that can be applied both to the VIVO application but also to related outreach materials.

Content

The content role requires a detail-oriented person who can cope with data that will almost inevitably be found to be in worse shape than anticipated. Ontology skills are useful if a site plans to introduce new types of data, but otherwise the focus can be more on data analysis, on building communication and trust to have access to all the right information from available sources, and on quality control as first steps expand into production quality, repeatable processes.  The content coordinator needs to work well with the available technical support but usually doesn't need to understand the nuts and bolts in favor of deciding what content is appropriate and when it is correctly represented.

Technical

The IT support environment varies widely from institution to institution.  In an environment where there's excellent "devops" support and systems engineers are in close communication with application owners and understand their preferred technologies, VIVO has not typically been a difficult application to install and support operationally, although as the data scales there can be some more challenging configuration and resourcing issues to address in optimizing the use of processing, database, and memory.  The VIVO technicians can typically focus more on data ingest, customization of branding, ways to re-use the data when a strong IT support environment already exists. In a small organization without a history of supporting enterprise web applications the skill set of the VIVO technician may need to encompass cover aspects of server and database configuration outside the scope of typical web developers.

Very often the primary technical challenges in building up a VIVO revolve around acquiring, transforming, and updating data from institutional systems of record.  A good ETL (extract, transform, load) person will find the tasks familiar but may find the transition from working with row/column data tables or strictly hierarchical XML structures to RDF and the Semantic Web to have a steep learning curve.  We recommend allowing time for a person to train in the basics of RDF, the principles of ontologies, the different mindset of a triple store, and the SPARQL query languages before trying to immerse themselves in complex data transformations. It's essential to know what the target is – what should RDF conforming to the VIVO-ISF ontology look like – before jumping into writing mapping tables or XSLT transformations.  Once the task is clear, the programming is not hard, so look for clear thinking and a willingness to learn.

Do you need a Java programmer? Note in the chart above the distinction in roles and required FTE between the IT/systems engineer and the programmer/ETL specialist – while the VIVO application is written in Java, the skills required to implement a VIVO focus more on scripting and data ingest, especially after initial installation. And while a Java programmer may be more familiar with writing and deploying applications similar to VIVO, it's familiarity with the application stack (principally Apache, Tomcat, and MySQL) more than Java programming that will be required.  Some implementing sites have extended the VIVO software itself for extended functionality by modifying the Java source code (notably the Deep Carbon Observatory project at RPI), but most may never have to change a line of Java.

Sample job descriptions

 

 

 

 

 

 

 

 

  • No labels