Participants : Gabriel Antoniu, Luc Bougé, Bogdan Nicolae.
Bogdan Nicolae, Bogdan.Nicolae@irisa.fr
GNU Lesser General Public License (LGPL) version 3.
This software is available on Inria 's forge. Registration with APP is in progress.
BlobSeer is a data storage service specifically designed to deal with the requirements of large-scale data-intensive distributed applications, that abstract data as huge sequences of bytes which are stored as BLOBs (binary large objects). It exports a simple, yet versatile versioning interface to manipulate BLOBs that enables reading, writing and appending to them. BlobSeer offers both scalability and performance with respect to a series of issues typically associated with the data-intensive context: scalable aggregation of storage space from the participating nodes with minimal overhead, ability to store huge data objects , efficient fine-grain access to data subsets, high throughput in spite of heavy access concurrency , as well as fault-tolerance . Development has started since January 2008. The implementation is build on top of the Boost collection of C++ libraries, Berkeley DB and libconfig. Additional scripting in Perl/Python handles deployment on Grid'5000 , which is done through the OAR resource scheduler. Benchmarking so far has proved correctness and performance with up to 400 nodes from 3 different sites.