DSpace 3.0 added an optional statistics engine using Elastic Search, which may be enabled as an alternative to the default DSpace Statistics engine (based on Apache SOLR, added in DSpace 1.6). Elastic Search Statistics is independent of SOLR Statistics. The motivation for adding Elastic Search was to find an alternative statistics processing engine that could handle the workload of a large amount of statistics data. Additionally, the Elastic Search statistics display offers another method for creating statistical queries against your data. Elastic Search Usage Statistics was contributed by Peter Dietz of Ohio State University's Knowledge Bank. The data source for Elastic Search Statistics is DSpace Usage Events, where a Usage Event is a view or download of a DSpace Object (Bitstream, Item Page, Collection Page, Community Page). Elastic Search Statistics is bundled with DSpace and requires no additional software installation; it only needs to be enabled. Elastic Search Statistics is only available for use with XMLUI.

...

Command used:

[dspace]/bin/dspace stats-log-converter

Java class:

org.dspace.statistics.util.ClassicDSpaceLogConverter

Arguments (short and long forms):

Description

-i or --in

Input file

-o or --out

Output file

-m or --multiple

Adds a wildcard at the end of the input and output file names, so if -i dspace.log -m was specified, dspace.log* would be converted (i.e. all of the following: dspace.log, dspace.log.1, dspace.log.2, dspace.log.3, etc.)

-n or --newformat

Use this option if the log files were created with DSpace 1.6 or newer

-v or --verbose

Display verbose output (helpful for debugging)

-h or --help

Help

An example form of this command would be [dspace]/bin/dspace stats-log-converter -i dspace.log -o statistics.log -m -n
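Before running the converter with -m, it can help to preview which log files the trailing wildcard would cover. The following is a minimal sketch that approximates the dspace.log* match with an ordinary shell glob; it is not the converter's own matching code. LOG_DIR and list_matching_logs are hypothetical names introduced for this illustration, and LOG_DIR should point at whichever directory holds your log files.

```shell
#!/bin/sh
# Sketch: list the files that "-i dspace.log -m" would cover (dspace.log*).
# LOG_DIR is an assumption; set it to the directory holding your logs.
LOG_DIR="${LOG_DIR:-.}"

list_matching_logs() {
    # $1 is the base name passed to -i, e.g. dspace.log
    for f in "$LOG_DIR"/"$1"*; do
        # When nothing matches, the shell leaves the pattern unexpanded; skip it.
        if [ -e "$f" ]; then
            printf '%s\n' "$f"
        fi
    done
}

list_matching_logs dspace.log
```

If the list contains files you do not want converted, move them elsewhere first or run the converter without -m on a single file.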


The Log Importer program takes the intermediate format data produced in the previous step, and imports it into Elastic Search Statistics.

Command used:

[dspace]/bin/dspace stats-log-importer-elasticsearch

Java class:

org.dspace.statistics.util.StatisticsImporterElasticSearch

Arguments (short and long forms):

Description

-i or --in

Input file

-m or --multiple

Adds a wildcard at the end of the input file name, so if -i statistics.log -m was specified, statistics.log* would be imported (i.e. all of the following: statistics.log, statistics.log.1, statistics.log.2, statistics.log.3, etc.)

-s or --skipdns

Skips the reverse DNS lookups that work out where a user is from. (The DNS lookup finds information about the host from its IP address, such as geographical location. This can be slow, and does not work on a server that is not connected to the internet.)

-v or --verbose

Display verbose output (helpful for debugging)

-h or --help

Help

An example form of this command would be [dspace]/bin/dspace stats-log-importer-elasticsearch -i statistics.log -m
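The two steps described above (convert, then import) can be chained in a small wrapper script. This is only a sketch: DSPACE_BIN, DRY_RUN, run, and import_legacy_logs are hypothetical names introduced here, and the default path is a placeholder for your real [dspace]/bin/dspace location. The flags are the ones from the example commands above; drop -n if your logs predate DSpace 1.6.

```shell
#!/bin/sh
# Sketch: run the converter and the Elastic Search importer in sequence.
# DSPACE_BIN is an assumption; replace it with your [dspace]/bin/dspace path.
DSPACE_BIN="${DSPACE_BIN:-/dspace/bin/dspace}"

# When DRY_RUN=1, print each command instead of executing it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

import_legacy_logs() {
    # Step 1: convert dspace.log* into the intermediate format (statistics.log*).
    run "$DSPACE_BIN" stats-log-converter -i dspace.log -o statistics.log -m -n || return 1
    # Step 2: import statistics.log* into Elastic Search Statistics.
    run "$DSPACE_BIN" stats-log-importer-elasticsearch -i statistics.log -m
}

# Preview the commands without executing them.
DRY_RUN=1
import_legacy_logs
```

Setting DRY_RUN=0 executes the commands for real; the importer step only runs if the converter exits successfully.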

Viewing Data in Elastic Search Statistics

...