...

  1. Storage environment:  for the purposes of this test (and for our real migration), we are migrating from one CIFS-mounted remote filesystem to another CIFS-mounted remote filesystem.
  2. Datastream index:  takes about 1h10m to build, and occupies 327MB of disk space.
  3. Source layout:  Akubra hash storage, using the pattern "#/##/##" for both datastreams and objects.
  4. Average seconds per object:  the elapsed time between processing the first object (after the datastream index has been generated) and processing the last object, divided by the number of objects.
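
As a rough illustration of the calculation in item 4, the sketch below divides the elapsed time between the first and last processed object by the object count. The function name `avg_seconds_per_object` is hypothetical, and it assumes the two timestamps are already available as Unix epoch seconds (for example, taken from the migration log); it is not part of migration-utils.

```bash
# Hypothetical helper, not part of migration-utils.
# Usage: avg_seconds_per_object FIRST_EPOCH LAST_EPOCH OBJECT_COUNT
# FIRST_EPOCH / LAST_EPOCH: Unix timestamps (seconds) of the first and
# last processed object; OBJECT_COUNT: number of objects migrated.
avg_seconds_per_object() {
  awk -v first="$1" -v last="$2" -v count="$3" \
    'BEGIN { printf "%.1f\n", (last - first) / count }'
}

# Example: 332160 seconds elapsed over 100000 objects.
avg_seconds_per_object 1700000000 1700332160 100000   # prints 3.3
```

The datastream index build time is deliberately excluded, since the first timestamp is taken only after the index exists.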

Issues

Migration Tests

UW Digital Collections Center Production Repository

...

UW Madison migration-util command line:

$ java -jar target/migration-utils-4.4.1-SNAPSHOT-driver.jar \
    --migration-type=FEDORA_OCFL \
    --source-type=akubra \
    --datastreams-dir=/fedora3-prod/fedora/datastreams \
    --objects-dir=/fedora3-prod/fedora/objects \
    --target-dir=/fedora-migration-test \
    --index-dir=/var/tmp/datastream-index


| Number of objects | Execution time | Average seconds per object | OCFL repository size | Source layout | Migration tool version | Notes |
|---|---|---|---|---|---|---|
| 1000 | Datastream index: 1h17m; OCFL repo: 4h36m | 16 2. 3 9 sec | 184GB / 133GB | Akubra | (81586bf) | with param --pid-file=1000pids.txt; datastream index cleared after run |
| 10,000 | Datastream index: 1h5m; OCFL repo: 11h48m | 4.3 sec | 688GB / 147GB | Akubra | (81586bf) | with param --pid-file=10000pids.txt; datastream index cleared after run; most objects are XML docs in this batch |
| 100,000 | Datastream index: 1h9m; OCFL repo: 3d20h16m | 3.3 sec | 1.6TB | Akubra | (4a9f19c) | with param --pid-file=100000pids.txt; datastream index cleared after run |
| All 561,000 | Datastream index: 1h10m; OCFL repo: 20d21h12m | 3.2 sec | 9TB | Akubra | (43b7bae) | all pids |
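
The reported averages can be roughly cross-checked against the "OCFL repo" execution times. The sketch below converts a duration written in the page's `NdNhNm` style into seconds and divides by the object count. The function name `duration_to_seconds` is hypothetical, and the check assumes the "OCFL repo" time approximates the first-to-last-object interval used for the average, so small rounding differences are to be expected.

```bash
# Hypothetical helper: convert durations such as "3d20h16m" or "11h48m"
# into seconds. Days, hours, and minutes are each optional, but must
# appear in that order. Returns non-zero for anything else.
duration_to_seconds() {
  printf '%s\n' "$1" | awk '
    /^([0-9]+d)?([0-9]+h)?([0-9]+m)?$/ && length($0) > 0 {
      s = $0; total = 0
      if (match(s, /[0-9]+d/)) total += substr(s, RSTART, RLENGTH - 1) * 86400
      if (match(s, /[0-9]+h/)) total += substr(s, RSTART, RLENGTH - 1) * 3600
      if (match(s, /[0-9]+m/)) total += substr(s, RSTART, RLENGTH - 1) * 60
      print total; ok = 1
    }
    END { if (!ok) exit 1 }'
}

# 100,000-object run: 3d20h16m over 100000 objects.
awk -v s="$(duration_to_seconds 3d20h16m)" 'BEGIN { printf "%.1f\n", s / 100000 }'   # prints 3.3

# Full run: 20d21h12m over 561000 objects.
awk -v s="$(duration_to_seconds 20d21h12m)" 'BEGIN { printf "%.1f\n", s / 561000 }'  # prints 3.2
```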