WARNING: This is the _old_ Lustre wiki, and it is in the process of being retired. The information found here is all likely to be out of date. Please search the new wiki for more up to date information.

Handling File System Errors

From Obsolete Lustre Wiki

Latest revision as of 07:06, 20 January 2011

(Updated: Mar 2010)

Occasionally, usually after a catastrophic disk or RAID failure, the backing file system of an OST or MDT must be repaired to correct file system errors. This is done with a Lustre-specific version of the e2fsck tool. In such cases, it may also be useful to run lfsck, a Lustre™-specific fsck tool that checks the coherency of a running Lustre file system as a whole.

A Lustre-specific version of e2fsprogs can be found at http://downloads.lustre.org/public/tools/e2fsprogs/. A quilt patchset of all changes to the vanilla e2fsprogs is available in e2fsprogs-{version}-patches.tgz.
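As a rough illustration of the first repair step, the sketch below builds an e2fsck command line for a single backing device. It only assembles and prints the command; it does not run it. The helper name and the device path /dev/sdb1 are hypothetical, and the sketch assumes the target is unmounted and that the Lustre-specific e2fsck is the one on the PATH. Only the standard e2fsck options -f, -n and -p are used; the authoritative procedure is the one in the Lustre Operations Manual.

```python
import shlex


def e2fsck_check_cmd(device, repair=False):
    """Build an e2fsck command for an unmounted OST/MDT backing device.

    `device` is a hypothetical block device path chosen by the caller.
    """
    # -f forces a check even if the file system is marked clean.
    # -n opens the file system read-only and answers "no" to all prompts,
    #    so problems are reported but nothing is changed.
    # -p ("preen") automatically fixes problems that are safe to fix.
    # -n and -p are mutually exclusive, so exactly one is chosen here.
    mode = "-p" if repair else "-n"
    return ["e2fsck", "-f", mode, device]


if __name__ == "__main__":
    # Dry run: show the commands instead of executing them.
    print(shlex.join(e2fsck_check_cmd("/dev/sdb1")))
    print(shlex.join(e2fsck_check_cmd("/dev/sdb1", repair=True)))
```

Starting with the read-only form is a cautious choice: it shows the extent of the damage before anything on the device is modified.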

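For information about:

* Using e2fsck on a backing file system, see Section 27.1: Recovering from Errors or Corruption on a Backing File System in the Lustre Operations Manual.
* Running e2fsck+lfsck on a corrupted Lustre file system, see Section 27.2: Recovering from Corruption in the Lustre File System in the Lustre Operations Manual.
* Addressing orphaned objects, see Section 27.2.1: Working with Orphaned Objects in the Lustre Operations Manual.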

For more information about lfsck, see Section 32.3: lfsck in the Lustre Operations Manual.