WARNING: This is the _old_ Lustre wiki, and it is in the process of being retired. The information found here is all likely to be out of date. Please search the new wiki for more up to date information.
Handling File System Errors: Difference between revisions
From Obsolete Lustre Wiki
				
				
				Jump to navigationJump to search
				
				| No edit summary | No edit summary | ||
| Line 9: | Line 9: | ||
| * Using e2fsck on a backing file system, see [http://wiki.lustre.org/manual/LustreManual20_HTML/TroubleShootingRecovery.html#50438225_pgfId-1292068 Section 27.1: ''Recovering from Errors or Corruption on a Backing File System''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | * Using e2fsck on a backing file system, see [http://wiki.lustre.org/manual/LustreManual20_HTML/TroubleShootingRecovery.html#50438225_pgfId-1292068 Section 27.1: ''Recovering from Errors or Corruption on a Backing File System''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | ||
| * Running e2fsck+lfsck on a corrupted Lustre file system, see [http://wiki.lustre.org/manual/LustreManual20_HTML/TroubleShootingRecovery.html#50438225_pgfId-1291230 Section 27.2: ''Recovering from Corruption in the Lustre File System''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | * Running e2fsck+lfsck on a corrupted Lustre file system, see [http://wiki.lustre.org/manual/LustreManual20_HTML/TroubleShootingRecovery.html#50438225_pgfId-1291230 Section 27.2: ''Recovering from Corruption in the Lustre File System''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | ||
| * Addressing orphaned objects, see [http://wiki.lustre.org/manual/ | * Addressing orphaned objects, see [http://wiki.lustre.org/manual/LustreManual20_HTML/TroubleShootingRecovery.html#50438225_pgfId-1290574 Section 27.2.1: ''Working with Orphaned Objects''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | ||
| For more information about ''lfsck'', see [http://wiki.lustre.org/manual/LustreManual18_HTML/UserUtilitiesMan1_HTML.html#50651189_91700 Section 28.3: ''lfsck''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | For more information about ''lfsck'', see [http://wiki.lustre.org/manual/LustreManual18_HTML/UserUtilitiesMan1_HTML.html#50651189_91700 Section 28.3: ''lfsck''] in the [http://wiki.lustre.org/manual/LustreManual20_HTML/index.html ''Lustre Operations Manual'']. | ||
Revision as of 07:05, 20 January 2011
(Updated: Mar 2010)
From time to time, usually due to catastrophic disk / RAID failures, it may be necessary to repair the backing file system of an OST or MDT to correct file system errors. This is done using a special version of the e2fsck tool. In such cases, it may also be useful to run lfsck, a Lustre™-specific fsck tool that checks the coherency of a running Lustre file system as a whole.
A Lustre-specific version of e2fsprogs can be found at http://downloads.lustre.org/public/tools/e2fsprogs/. A quilt patchset of all changes to the vanilla e2fsprogs is available in e2fsprogs-{version}-patches.tgz.
For information about:
- Using e2fsck on a backing file system, see Section 27.1: Recovering from Errors or Corruption on a Backing File System in the Lustre Operations Manual.
- Running e2fsck+lfsck on a corrupted Lustre file system, see Section 27.2: Recovering from Corruption in the Lustre File System in the Lustre Operations Manual.
- Addressing orphaned objects, see Section 27.2.1: Working with Orphaned Objects in the Lustre Operations Manual.
For more information about lfsck, see Section 28.3: lfsck in the Lustre Operations Manual.

