.. _mds-scrub:

======================
Ceph File System Scrub
======================

CephFS provides the cluster admin (operator) with a set of scrub commands to
check the consistency of a file system. Scrub can be classified into two parts:

#. Forward Scrub: the scrub operation starts at the root of the file system
   (or a subdirectory) and looks at everything that can be touched in the hierarchy
   to ensure consistency.

#. Backward Scrub: the scrub operation looks at every RADOS object in the
   file system pools and maps it back to the file system hierarchy.

This document details commands to initiate and control forward scrub (referred to as
scrub hereafter).

Initiate File System Scrub
==========================

To start a scrub operation for a directory tree, use the following command:

::

   ceph tell mds.a scrub start / recursive
   {
       "return_code": 0,
       "scrub_tag": "6f0d204c-6cfd-4300-9e02-73f382fd23c1",
       "mode": "asynchronous"
   }

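The reply to ``scrub start`` is JSON, so a script can pick out the tag for later monitoring. The following is a minimal sketch rather than an official client: the helper name ``parse_scrub_start_reply`` is hypothetical, and treating a non-zero ``return_code`` as a failure is an assumption. The sample reply is copied from the output above.

```python
import json


def parse_scrub_start_reply(reply_text):
    """Return (scrub_tag, mode) from the JSON printed by `scrub start`.

    Assumption: a non-zero "return_code" means the request was rejected.
    """
    reply = json.loads(reply_text)
    if reply.get("return_code") != 0:
        raise RuntimeError(f"scrub start failed: {reply}")
    return reply["scrub_tag"], reply.get("mode")


# Sample reply copied from the command output shown in this document.
SAMPLE_REPLY = """
{
    "return_code": 0,
    "scrub_tag": "6f0d204c-6cfd-4300-9e02-73f382fd23c1",
    "mode": "asynchronous"
}
"""

tag, mode = parse_scrub_start_reply(SAMPLE_REPLY)
print(tag, mode)
```

In practice the reply text would come from running the ``ceph tell`` command itself, for example through ``subprocess.run`` with captured output.
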
Recursive scrub is asynchronous (as hinted by ``mode`` in the output above). The scrub tag is
a random string that can be used to monitor the progress of the scrub operation (explained
further in this document).

A custom tag can also be specified when initiating the scrub operation. Custom tags are
persisted in the metadata object of every inode in the file system tree that is being
scrubbed.

::

   ceph tell mds.a scrub start /a/b/c recursive tag0
   {
       "return_code": 0,
       "scrub_tag": "tag0",
       "mode": "asynchronous"
   }

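When scrubs are started from a script, it can help to build the argument list in one place. The sketch below only models the two command forms shown in this document (recursive, with or without a custom tag); the function name ``scrub_start_argv`` is hypothetical and other scrub options are deliberately left out.

```python
def scrub_start_argv(mds_name, path, tag=None):
    """Build the argv for a recursive `scrub start`, optionally with a custom tag.

    Only the forms shown in this document are modelled:
      ceph tell mds.<name> scrub start <path> recursive [<tag>]
    """
    argv = ["ceph", "tell", f"mds.{mds_name}", "scrub", "start", path, "recursive"]
    if tag is not None:
        argv.append(tag)
    return argv


print(" ".join(scrub_start_argv("a", "/")))
print(" ".join(scrub_start_argv("a", "/a/b/c", tag="tag0")))
```

The returned list can be passed directly to ``subprocess.run`` without going through a shell, which avoids quoting problems with unusual paths or tags.
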
Monitor (ongoing) File System Scrubs
====================================

The status of ongoing scrubs can be monitored using the ``scrub status`` command. This command
lists ongoing scrubs (identified by tag) along with the path and options used to
initiate each scrub.

::

   ceph tell mds.a scrub status
   {
       "status": "scrub active (85 inodes in the stack)",
       "scrubs": {
           "6f0d204c-6cfd-4300-9e02-73f382fd23c1": {
               "path": "/",
               "options": "recursive"
           }
       }
   }

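A monitoring script could split the ``scrub status`` reply into a state, a pending-inode count, and the per-tag details. This is a rough sketch: the function name ``summarize_scrub_status`` is hypothetical, and the regular expression assumes the ``<state> (<N> inodes in the stack)`` wording seen in the sample output. Any other status string is returned unchanged with a count of ``None``.

```python
import json
import re

# Matches strings such as "scrub active (85 inodes in the stack)".
STATUS_RE = re.compile(r"^(?P<state>.+?) \((?P<count>\d+) inodes in the stack\)$")


def summarize_scrub_status(reply_text):
    """Return (state, pending_inodes, scrubs) from a `scrub status` reply."""
    reply = json.loads(reply_text)
    status = reply.get("status", "")
    match = STATUS_RE.match(status)
    if match:
        state, pending = match.group("state"), int(match.group("count"))
    else:
        # Unrecognised wording: keep the raw string, report no count.
        state, pending = status, None
    scrubs = {
        tag: (info.get("path"), info.get("options"))
        for tag, info in reply.get("scrubs", {}).items()
    }
    return state, pending, scrubs


# Sample reply copied from the command output shown in this document.
SAMPLE_STATUS = """
{
    "status": "scrub active (85 inodes in the stack)",
    "scrubs": {
        "6f0d204c-6cfd-4300-9e02-73f382fd23c1": {
            "path": "/",
            "options": "recursive"
        }
    }
}
"""

state, pending, scrubs = summarize_scrub_status(SAMPLE_STATUS)
print(state, pending)
for tag, (path, options) in scrubs.items():
    print(tag, path, options)
```
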
``status`` shows the number of inodes that are scheduled to be scrubbed at any point in time
and can therefore change on subsequent ``scrub status`` invocations. Also, a high-level summary
of the scrub operation (which includes the operation state and the paths on which scrub is
triggered) is displayed in ``ceph status``.

::

   ceph status
   [...]

     task status:
       scrub status:
           mds.0: active [paths:/]

   [...]

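The scrub summary in ``ceph status`` is plain text rather than JSON. As an illustration only, the sketch below extracts the per-MDS line with a regular expression that matches the single form shown above. The helper name ``parse_scrub_task_lines`` is hypothetical, and the text after ``paths:`` is kept as a raw string because this document does not show how multiple paths are rendered.

```python
import re

# Matches lines such as "mds.0: active [paths:/]".
LINE_RE = re.compile(
    r"^\s*(?P<daemon>mds\.\S+):\s+(?P<state>\S+)\s+\[paths:(?P<paths>[^\]]*)\]\s*$"
)


def parse_scrub_task_lines(status_text):
    """Map each MDS named in the scrub summary to (state, raw paths string)."""
    results = {}
    for line in status_text.splitlines():
        match = LINE_RE.match(line)
        if match:
            results[match.group("daemon")] = (match.group("state"), match.group("paths"))
    return results


# Excerpt copied from the `ceph status` output shown in this document.
SAMPLE_CEPH_STATUS = """
  task status:
    scrub status:
        mds.0: active [paths:/]
"""

print(parse_scrub_task_lines(SAMPLE_CEPH_STATUS))
```
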
Control (ongoing) File System Scrubs
====================================

- Pause: Pausing ongoing scrub operations results in no new or pending inodes being
  scrubbed after in-flight RADOS ops (for the inodes that are currently being scrubbed)
  finish.

::

   ceph tell mds.a scrub pause
   {
       "return_code": 0
   }

``scrub status`` after pausing reflects the paused state. At this point, initiating new scrub
operations (via ``scrub start``) just queues the inode for scrub.

::

   ceph tell mds.a scrub status
   {
       "status": "PAUSED (66 inodes in the stack)",
       "scrubs": {
           "6f0d204c-6cfd-4300-9e02-73f382fd23c1": {
               "path": "/",
               "options": "recursive"
           }
       }
   }

- Resume: Resuming kick-starts a paused scrub operation.

::

   ceph tell mds.a scrub resume
   {
       "return_code": 0
   }

- Abort: Aborting ongoing scrub operations removes pending inodes from the scrub
  queue (thereby aborting the scrub) after in-flight RADOS ops (for the inodes that
  are currently being scrubbed) finish.

::

   ceph tell mds.a scrub abort
   {
       "return_code": 0
   }

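The three control commands above share one shape, so a wrapper script could generate them from a single helper and check the reply. This is a minimal sketch under stated assumptions: the names ``scrub_control_argv`` and ``control_succeeded`` are hypothetical, and reading ``"return_code": 0`` as success is inferred from the sample replies rather than from a specification.

```python
import json

# The control actions described in this document.
CONTROL_ACTIONS = ("pause", "resume", "abort")


def scrub_control_argv(mds_name, action):
    """Build the argv for `ceph tell mds.<name> scrub <action>`."""
    if action not in CONTROL_ACTIONS:
        raise ValueError(f"unsupported scrub action: {action!r}")
    return ["ceph", "tell", f"mds.{mds_name}", "scrub", action]


def control_succeeded(reply_text):
    """Assumption: a reply with "return_code": 0 means the action was accepted."""
    return json.loads(reply_text).get("return_code") == 0


for action in CONTROL_ACTIONS:
    print(" ".join(scrub_control_argv("a", action)))

# Sample reply copied from the command output shown in this document.
print(control_succeeded('{"return_code": 0}'))
```

Rejecting unknown actions up front keeps a typo from being sent to the MDS as an arbitrary command.
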