Short Tour: Starting a VIVO project

up to Tour start | part 1: what's VIVO? | part 2: what's different?

This page is part 3 of a short, self-paced tour introducing VIVO for use in an interactive workshop or online.

Starting a VIVO Project

There is perhaps no "typical" VIVO, since every organization from the smallest to the largest is different. Certain themes recur, however, in planning and executing an evaluation, pilot, and eventual full rollout of VIVO.

Finding data.

In a very small organization such as a department, a single research center, or small institute, it may be possible to populate a VIVO instance interactively, either by training a few people to enter data about everyone or asking people to fill in their own profile information. This approach does not scale to large organizations, where much of the information desired for inclusion in VIVO already exists in institutional systems of record such as human resource databases, grants management systems, and course catalogs. Manual entry is time consuming, and if the data exist in a machine-readable form it makes little sense to re-key it.

When data systems already exist, one of the first challenges will be to find common identifiers to link them. Most organizations have evolved an identifier for people that can be shared publicly, whether the institutional email address or another number other than a social security number in the U.S. or other confidential government ID. It's remarkably hard to identify who should be in a VIVO, and surprisingly few large organizations even have an accurate or consistent scheme for identifying departments and organizations, especially somewhat transient units such as research centers.

Access to data.

The mere fact that data exist does not assure that they will be made available to you, however. It will be important to make contact with the owners or stewards of the data desired for VIVO, and it may not be trivial for them to make data available. It's common for public and private data to be intermingled in systems of record that have been built for functions such as payroll, performance evaluation, or financial accountability. Gaining access to data and permission to display it on the Web often requires building bridges to other units and catching the ear of people with the authority to support your requests. Some of the most successful VIVOs involve closely-coordinated collaborations among central administrators, IT staff, the library, and the research administration office.

Data evolves.

It's easy to focus on a one-time data load, especially in a proof of concept or pilot project. Putting up any kind of information system, especially one with as many different kinds of data as VIVO, requires making a plan for handling updates – both full and partial data removals as well as additions. Many data sources don't maintain the date of last modification so a complete scan is needed to detect change. In some cases data layers can be replaced wholesale in VIVO because they interact or interleave very little with other datasets, which simplifies updating.

Users will quickly point out incorrect or outdated information, and once the confidence of a user has been lost in the accuracy or completeness of an online database it's not easy to gain that confidence back. Some VIVO installations have augmented display pages with information icons that clarify the source of each category of information and even identify who to contact to change it. Where policies permit users to edit their own profiles, change is straightforward, but when data have come from a system of record it's more sustainable to make changes back at the source so that errors don't propagate back to VIVO or to other systems.

Be realistic.

If you are fortunate enough to have buy-in from top management and a commitment for the staffing and IT resources to embark on a multi-faceted exploration and implementation of VIVO, more power to you. In many cases people interested in VIVO need to start small and build a case for VIVO based on a targeted need identified for a small segment of the organization, whether a campus, a government department, or a distributed organization.

Starting small and not taking on too many different kinds of data allows you to familiarize yourself with what are often new concepts and new tools. Data are often dirtier than advertised to be, and more will be left to you to clean up in the process of importing into VIVO. If you need to hire technical help, it may take longer than anticipated to find a qualified person familiar with Semantic Web technologies than for routine web development.

For more information

on to part 4 of the Tour: VIVO in an information ecosystem

Space shortcuts

Page tree