WARNING: This is the _old_ Lustre wiki, and it is in the process of being retired. The information found here is all likely to be out of date. Please search the new wiki for more up to date information.

DRBD and Lustre

From Obsolete Lustre Wiki
Revision as of 10:38, 8 September 2009 by Docadmin (talk | contribs) (Created page with 'Distributed Replicated Block Device (DRBD) is a block device that is designed for building high-availability clusters. It works by mirroring the entire block device via a dedicat...')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Distributed Replicated Block Device (DRBD) is a block device that is designed for building high-availability clusters. It works by mirroring the entire block device via a dedicated network. For more information, see the DRBD website.

Preliminary testing has been done to evaluate the use of DRBD as back-end storage for Lustre (to avoid shared storage solutions while retaining redundancy). Of particular interest is the performance impact of DRBD on a Lustre filesystem. The results of this preliminary testing appear below.

NOTE: When these test results were obtained, no fine-tuning of DRBD was done.

Tests with lmdd

These tests measured write throughput (2 OSTs on 1 OSS):

  • Without DRBD: 160 MB/s
  • With DRBD synchronous mode: 90 MB/s
  • With DRBD semi-synchronous mode: 108 MB/s
  • With DRBD asynchronous mode: 115 MB/s

In the lmdd tests, the write throughput loss due to DRBD is about 30% in asynchronous mode, and about 40% in synchronous mode.


Tests with IOzone

For this testing, IOzone was run this way:

./iozone -c -e -i 0 -s 1g -t 1 -F /mnt/lustre/iozone -w

./iozone -c -e -i 1 -s 1g -t 1 -F /mnt/lustre/iozone

These tests measured write/read throughput:

  • Without DRBD: W 125 MB/s, R 180 MB/s
  • With DRBD synchronous mode: W 90 MB/s, R 160 MB/s
  • With DRBD semi-synchronous mode: W 50 MB/s, R 140 MB/s
  • With DRBD asynchronous mode: W 115 MB/s, R 160 MB/s

With the IOzone tests, the read throughput is the same in synchronous and asynchronous modes. This is because reads are always carried out locally. In read throughput, the performance loss due to DRBD is about 10%. In write throughput, the performance loss due to DRBD is about 10% in asynchronous mode and about 30% in synchronous mode.

The performance impact of DRBD is smaller with IOzone, because IOzone proceeds to independent writes, and avoids accumulating DRBD overhead with each I/O instance.