Handling File System Errors

(Updated: Mar 2010)

Occasionally, usually after a catastrophic disk or RAID failure, it may be necessary to repair the backing file system of an OST or MDT. This is done with a Lustre-specific version of the e2fsck tool. In such cases, it may also be useful to run lfsck, a Lustre™-specific fsck tool that checks the coherency of a running Lustre file system as a whole.
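As a rough illustration of the first step, the sketch below builds a read-only e2fsck invocation and decodes its exit status. It assumes the Lustre-specific e2fsck is the one on the PATH and that it uses the exit-status bits documented in the standard e2fsck(8) man page. The helper names and the device path in the usage note are hypothetical. The procedures in the Lustre Operations Manual remain the authoritative reference.

```python
import subprocess

# Exit-status bits as documented in the e2fsck(8) man page.
# Assumption: the Lustre-specific e2fsck reports status the same way.
E2FSCK_STATUS_BITS = {
    1: "file system errors corrected",
    2: "file system errors corrected, system should be rebooted",
    4: "file system errors left uncorrected",
    8: "operational error",
    16: "usage or syntax error",
    32: "e2fsck canceled by user request",
    128: "shared library error",
}


def describe_e2fsck_status(status):
    """Translate an e2fsck exit status (a bitmask) into readable messages."""
    if status == 0:
        return ["no errors"]
    return [text for bit, text in sorted(E2FSCK_STATUS_BITS.items())
            if status & bit]


def readonly_check_command(device):
    """Build a command that checks the device without modifying it.

    -f forces a check even if the file system is marked clean;
    -n opens the file system read-only and answers "no" to every prompt.
    """
    return ["e2fsck", "-f", "-n", device]


def run_readonly_check(device):
    """Run the read-only check and return the decoded status messages.

    The device should be unmounted (the OST or MDT stopped) beforehand.
    """
    result = subprocess.run(readonly_check_command(device))
    return describe_e2fsck_status(result.returncode)
```

For example, run_readonly_check("/dev/sdX"), where /dev/sdX is a placeholder for the OST or MDT block device, reports whether a repair pass is needed without changing anything on disk.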

A Lustre-specific version of e2fsprogs can be found at http://downloads.lustre.org/public/tools/e2fsprogs/. A quilt patchset of all changes to the vanilla e2fsprogs is available in e2fsprogs-{version}-patches.tgz.

For information about:

 * Using e2fsck on a backing file system, see Section 19.5: Recovering from Errors or Corruption on a Backing File System in the Lustre Operations Manual.
 * Running e2fsck+lfsck on a corrupted Lustre file system, see Section 19.6: Recovering from Corruption in the Lustre File System in the Lustre Operations Manual.
 * Addressing orphaned objects, see Section 19.6.1: Working with Orphaned Objects in the Lustre Operations Manual.

For more information about lfsck, see Section 27.2: lfsck in the Lustre Operations Manual.