This is work in progress!

Currently there seems to be a bug, which creates OutOfMemoryExceptions when ingesting files that are larger than available heap space with certain infinispan configurations (e.g. LevelDB). It seems like this is an issue with the Modeshape project which has been reported at: https://issues.jboss.org/browse/MODE-2103

The following TestCase can be used to reproduce the issue: https://github.com/futures/large-files-test

Workaround

You will need a large heap size for this to work (e.g. -Xmx2048g)

Currently the only known workaround is using a _file_ configuration for infinspan caches e.g.: https://github.com/futures/fcrepo4/blob/34aab66bc26edfca3a4cbabecc4870bfd81f05da/fcrepo-http-commons/src/main/resources/config/single-file/repository.json.

This can be done by setting the following property:

-Dfcrepo.modeshape.configuration=config/single-file/repository.json

 

Large Files on a Single Node Fedora 4 Installation

Use config: CATALINA_OPTS="-Dfcrepo.modeshape.configuration=classpath:/config/single-file/repository.json" bin/catalina.sh run

 

 

Using the single-file configuration ingest and retrieval of files up to the size of 300 GB using Fedora 4's REST API were tested successfully. The files were ingested sequentially, retrieved and a bitwise comparison with the original data has been performed. Larger sizes have not been tested, due to HDD size limitations.

 

Large File Upload/Download Roundtrip Tests

File SizeUploadDownload
256GB15,488,156ms (16.9MB/sec)3,306,756ms (79.3MB/sec)
512GB  

Federated Content Large File Download Roundtrip Tests

File SizeProjection Directory Request DurationFirst Projected Node Request Duration

Download Duration

Throughput
2 GB0m35.117s0m34.572s0m8.236s248.66 mb/sec
10 GB    

100 GB

  

 

 
300 GB    
10*10 GB    

Related articles

 

Related articles

Related articles appear here based on the labels you select. Click to edit the macro and add or change labels.