WARNING: This is the _old_ Lustre wiki, and it is in the process of being retired. The information found here is all likely to be out of date. Please search the new wiki for more up to date information.
Handling File System Errors
From Obsolete Lustre Wiki
Jump to navigationJump to search
From time to time, usually due to catastrophic disk / RAID failures, it may be necessary to repair the backing file system of an OST or MDT to correct file system errors. This is done using a special version of the e2fsck tool. It may also be useful to run lfsck in such cases. This is a Lustre™-specific tool that checks the coherency of a running Lustre file system as a whole.
A Lustre-specific version of e2fsprogs can be found at http://downloads.lustre.org/public/tools/e2fsprogs/. A quilt patchset of all changes to the vanilla e2fsprogs is available in e2fsprogs-{version}-patches.tgz.
For information about e2fsck, see:
- Using e2fsck on a backing file system in Section 19.5: Recovering from Errors or Corruption on a Backing File System in the Lustre Operations Manual.
- Running e2fsck+lfsck on a corrupted Lustre file system in Section 19.6: Recovering from Corruption in the Lustre File System in the Lustre Operations Manual.
For more information about lfsck, see Section 27.2: lfsck in the Lustre Operations Manual.
(Updated 02/10)