Old Release

This documentation covers an old version of Fedora. Looking for another version? See all documentation.

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Existing Test Results

The most recent benchmark test results I see are from June:

These tests ingested a very small number of objects (10), and files ranging from 1 byte to 100MB.

New Test Parameters

  • Number of Objects: In some initial testing on my laptop, I noticed degraded ingest performance as the repository size grew, particularly after 20K objects.
  • Datastream Size: In my experience, the larger files in those tests (1MB, 8MB, 100MB) are fairly typical for high-res images and audio/video derivatives, and audio/video masters are much larger.

Testing full-scale ingest would take more disk space and time than we have available.  So it probably makes more sense to do two separate tests:

  • Many Objects: 25K objects with small datastreams (100KB): 2.5GB total data size
  • Large Datastreams: 100 objects with larger datastreams (50MB): 5GB total data size

In addition to varying the number of objects and datastream size, it is particularly important to perform some tests on the same hardware in order to have directly-comparable data:

  • Thread Test: Run the same test on the same hardware with increasing numbers of client threads
  • Fedora 3/4 Comparison: Run the same tests on the same hardware with both Fedora 3 and 4

Test Matrix

Fedora 3/4 Comparison

Objects

Datastream
Size

Fedora 3
Time (ms)

Fedora 4
Time (ms)

Notes
10050MB389,034281,301Performance evaluation on the SCC Cluster #A & #C
10050MB 881,883Performance evaluation on AWS #C
25,000100KB   

Thread Test

Objects

Datastream
Size

1
Thread
2
Threads
3
Threads
4
Threads
10050MB    
25,000100KB    

Using Benchtool

The ingest benchmarking tool is available on github: https://github.com/futures/benchtool

To build and run benchtool (creating 1000 objects with 1KB datastreams using 5 client threads):

git clone git@github.com:futures/benchtool.git
cd benchtool
mvn install
java -cp target/bench-tool-0.0.1-SNAPSHOT-jar-with-dependencies.jar org.fcrepo.bench.BenchToolFC4 http://localhost:8080 1000 1024 5

While running benchtool will output a running progress percentage, and then output the time to perform the ingest:

Complete ingest of 1000 files took 23301 ms
  • No labels