Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Introduction

The Fedora Community has raised significant concerns about retrieval times for resources with many members.   By "members" we are talking about at least three scenarios

...

Using a set of tests originally devised by Esmé Cowles I modified them slightly and added a couple of new tests.  The latest versions can be found here : test scripts.   The test results, in most cases, reflect that fastest retrieval speeds I was able to get. Initial retrieval times tended to be slower in most cases due to the fact that modeshape needs to warm up the cache.  The scripts now make an initial GET call to warm the cache before timing a second call to the same resource.

...

  1.  Install postgresql 
  2. Install mysql-server 
  3. Build  the fcrepo4 project from the specified commit. 
  4. Run fcrepo-webapp using mvn clean jetty:run and the appropriate MAVEN_OPTS such as fcrepo.modeshape.configuration, fcrepo.mysql.username, fcrepo.mysql.password,  fcrepo.postgres.username, fcrepo.postgres.password.
  5. git clone https://github.com/dbernsteinfcrepo4-labs/fcrepo-performance-test-testsscripts.git
  6. For each test, I ran the scripts with 1000 and 10000 items. 

Instance Definition

  1. AWS / Ubuntu 16 / Oracle Java 8 / m3.medium  (3.7 GiB memory  / Intel Xeon E5-2670 (Sandy Bridge) Processor @  2.6 GHz x 1)
  2. Lenovo / Ubuntu 16.10 / Java HotSpot 1.8.0_111 / 11.6 GiB memory / Intel i7-4600U CPU @ 2.10GHz x 4
  3. AWS / Ubuntu 16 / Oracle Java 8 / m3.xlarge (14 GiB memory  / Intel Xeon E5-2670 (Sandy Bridge) Processor @  2.6 GHz x 4)
  4. AWS / Ubuntu 16 / Oracle Java 8 / c4.xlarge (7.5 GiB memory  / Intel Xeon E5-2666 v3 (Haswell)  @  2.9 GHz x 4)
  5. AWS / Ubuntu 16 / Oracle Java 8 / c4.2xlarge (15 GiB memory  /Intel Xeon E5-2666 v3 (Haswell) @  2.9 GHz x 8)
  6. AWS / Ubuntu 16 / Oracle Java 8 / c4.4xlarge (30 GiB memory  /Intel Xeon E5-2666 v3 (Haswell) @  2.9 GHz x 16)


n-children.sh

FCREPO Version
RepoBranch
Commit
Environment
Modeshape Config

# of relations

Test Duration

(seconds)

Tester

4.8.0-SNAPSHOTdbernstein
b60d4e5file-simple100005.222
4.8.0-SNAPSHOTdbernstein
daa11f35file-simple100005.567
4.8.0-SNAPSHOTdbernstein
b60d4e6file-simple100004.859
4.8.0-SNAPSHOTdbernstein
daa11f36file-simple100004.841

n-members.sh

...

Number of relations: 1000

FCREPO VersionRepobranchCommitmodeshape configEnvironment# of relations

Test Duration

(seconds)

TesterNotes4.8.0-SNAPSHOTfcrepo4master2df32file-simple110006.5704.7.1fcrepo4master
FCREPO VersionRepoBranchCommitmodeshapeEnvironment

# of relations

Test Duration

(seconds)

Tester
4.8.0-SNAPSHOTfcrepo4master2df32file-simple110001.408
4.7.1fcrepo4master4.7.1546f5a5file-simple210001.45
4.8.0-SNAPSHOTfcrepo4master2df32file-simple110,00028.583
4.7.1fcrepo4master4.7.1546f5a5file-simple210,00024.79
4.8.0-SNAPSHOTfcrepo4masterb60d4efile-simple310,0009.58
4.8.0-SNAPSHOTdbernstein
fcrepo-2105-v4-parallelization
daa11f3file-simple310,00012.16Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple410,0009.71Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple510,0008.51Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOT

b60d4efile-simple510,0008.815
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple610,0008.29Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOT

b60d4efile-simple610,0008.661
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql110001.543
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql11000010,00061.381perhaps postgres needs caching configured?
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3jdbc-postgresql310000.592
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3jdbc-postgresql31000010,00039.671
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql310004.435
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql31000010,00039.486
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cachef453ajdbc-postgresql310000.5611 million item cache
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cachef453ajdbc-postgresql31000010,0005.3341 million item cache
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cache5138b4jdbc-postgresql310000.6331 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cache5138b4jdbc-postgresql31000010,0005.3981 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cache5138b4mysql-postgresql310000.8201 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cache5138b4mysql-postgresql31000010,0007.7261 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-postgresql-s3310000.7011 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-postgresql-s331000010,0005.4851 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-mysql-s3310000.8641 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-mysql-s331000010,0007.3951 million item cache + parallelized

...

4.7.1fcrepo44.7.1546f5a5file-simple210,00013.07cacheSize = 50,000
4.7.1fcrepo44.7.1
546f5a5file-simple2
1000
10,000
2
10.
80
30cacheSize = 1,000,000


n-memberof.sh

Number of relations: 1000

4.8.0000
FCREPO VersionRepobranchCommitmodeshape configEnvironment# of relations

Test Duration

(seconds)

TesterNotes
4.8.0-SNAPSHOTfcrepo4master2df32file-simple110,0001000866.570
4.7.1fcrepo4master4.7.1546f5a5file-simple210,0001000572.3580
4.8.0-SNAPSHOTfcrepo4masterb60d4e2df32file-simple3110,0003086.02000Unlike the n-member example, results begin streaming right away - so the response begins streaming within 2 seconds.
4.7.1fcrepo44.7.1546f5a5file-simple210,00057.35
4.8.0-SNAPSHOTfcrepo4masterb60d4e4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple310,0002930.97502parallel streams enabledUnlike the n-member example, results begin streaming right away - so the response begins streaming within 2 seconds.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple4310,00029.975Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple410,00024.790Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple510,00020.992Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTfcrepo4masterb60d4efile-simple510,00026.357
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelizationdaa11f3file-simple610,00020.337Danny Bernsteinparallel streams enabled.
4.8.0-SNAPSHOTfcrepo4masterb60d4efile-simple610,00025.782
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql1100011.414
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql11000010,000194
4.8.0-SNAPSHOTdbernsteincrepo-2105-v4-parallelizationdaa11f3jdbc-postgresql310003.961parallel streams enabled.
4.8.0-SNAPSHOTdbernsteincrepo-2105-v4-parallelizationdaa11f3jdbc-postgresql31000010,00053.452parallel streams enabled.
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql3100010.833
4.8.0-SNAPSHOTbbrananfcrepo-2402f0a51ejdbc-postgresql31000010,000109.530
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-large-cachef453ajdbc-postgresql31000022.9631 million item cache.
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelization5138b4jdbc-postgresql31000010,00011.5591 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelization5138b4mysql-postgresql310002.3371 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-2105-v4-parallelization5138b4mysql-postgresql31000010,00014.9981 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-mysqlpostgresql-s33100021.3748831 million item cache + parallelized
4.8.0-SNAPSHOTdbernsteinfcrepo-240275dd1jdbc-mysqlpostgresql-s331000014.95610,000
1 million item cache + parallelized

...

FCREPO VersionCommitEnvironment# of relations

Test Duration

(seconds)

 
4.8.0-SNAPSHOT
2df32
dbernstein
1
fcrepo-2402
10000.039
75dd1jdbc-mysql-s3310002.374
4.7.1546f5a5210000.14
1 million item cache + parallelized
4.8.0-SNAPSHOT
2df321
dbernsteinfcrepo-240275dd1jdbc-mysql-s3310,000
0
14.
063
956
4
1 million item cache + parallelized
4.7.1fcrepo44.7.1546f5a5file-simple210,000
0
31.
17
91

n-properties.sh

cacheSize = 50,000
4.7.1fcrepo44.7.1546f5a5file-simple210,00023.07cacheSize = 1,000,000

n-uris.sh

Tester 1000
FCREPO VersionCommitEnvironment# of relations

Test Duration

(seconds)

FCREPO VersionCommitEnvironment# of relations

Test Duration

(seconds)


4.8.0-SNAPSHOT2df32110000.060039
4.7.1546f5a5210000.08914
4.8.0-SNAPSHOT2df32110,0000.119063
4.7.1546f5a5210,0000.28117
      

n-binaries.sh  (time for loading binary files)

n-properties.sh

FCREPO VersionCommitEnvironment# of relations

Test Duration

(seconds)

Tester
4.8.0-SNAPSHOT2df321 10000.060
4.7.1546f5a5210000.089
4.8.0-SNAPSHOT2df32110,0000.119
4.7.1546f5a5210,0000.281







n-binaries.sh  (time for loading binary files)

repobranchCommitEnvironmentModeshape Config# of binariessize in KB

Test Duration

(seconds)

Tester
https://github.com/bbranan/fcrepo4.gitfcrepo-2402f0a51e1file-s3 1000100000:06:02
https://github.com/bbranan/fcrepo4.git fcrepo-2402f0a51e1file-simple1000100000:02:16
repobranchCommitEnvironmentModeshape Config# of binariessize in KB

Test Duration

(seconds)

Tester

https://github.com/bbranan/fcrepo4.gitfcrepo-2402f0a51e1
file
jdbc-
s3
postgresql
 1000
1000100000:
06
02:
02
13
https://github.com/bbranan/fcrepo4.gitfcrepo-2402f0a51e
1
3
file
jdbc-postgresql-
simple
s31000100000:
02
06:
16
36
https://github.com/bbranan/fcrepo4.gitfcrepo-2402f0a51e
1
3jdbc-mysql-
postgresql
s31000100000:
02
06:
13
32



Conclusions