mirror of https://github.com/ceph/ceph
486 lines
14 KiB
Groff
486 lines
14 KiB
Groff
.TH "RBD" "8" "April 29, 2013" "dev" "Ceph"
|
|
.SH NAME
|
|
rbd \- manage rados block device (RBD) images
|
|
.
|
|
.nr rst2man-indent-level 0
|
|
.
|
|
.de1 rstReportMargin
|
|
\\$1 \\n[an-margin]
|
|
level \\n[rst2man-indent-level]
|
|
level margin: \\n[rst2man-indent\\n[rst2man-indent-level]]
|
|
-
|
|
\\n[rst2man-indent0]
|
|
\\n[rst2man-indent1]
|
|
\\n[rst2man-indent2]
|
|
..
|
|
.de1 INDENT
|
|
.\" .rstReportMargin pre:
|
|
. RS \\$1
|
|
. nr rst2man-indent\\n[rst2man-indent-level] \\n[an-margin]
|
|
. nr rst2man-indent-level +1
|
|
.\" .rstReportMargin post:
|
|
..
|
|
.de UNINDENT
|
|
. RE
|
|
.\" indent \\n[an-margin]
|
|
.\" old: \\n[rst2man-indent\\n[rst2man-indent-level]]
|
|
.nr rst2man-indent-level -1
|
|
.\" new: \\n[rst2man-indent\\n[rst2man-indent-level]]
|
|
.in \\n[rst2man-indent\\n[rst2man-indent-level]]u
|
|
..
|
|
.\" Man page generated from reStructuredText.
|
|
.
|
|
.SH SYNOPSIS
|
|
.nf
|
|
\fBrbd\fP [ \-c \fIceph.conf\fP ] [ \-m \fImonaddr\fP ] [ \-p | \-\-pool \fIpool\fP ] [
|
|
\-\-size \fIsize\fP ] [ \-\-order \fIbits\fP ] [ \fIcommand\fP ... ]
|
|
.fi
|
|
.sp
|
|
.SH DESCRIPTION
|
|
.sp
|
|
\fBrbd\fP is a utility for manipulating rados block device (RBD) images,
|
|
used by the Linux rbd driver and the rbd storage driver for Qemu/KVM.
|
|
RBD images are simple block devices that are striped over objects and
|
|
stored in a RADOS object store. The size of the objects the image is
|
|
striped over must be a power of two.
|
|
.SH OPTIONS
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-c ceph.conf, \-\-conf ceph.conf
|
|
Use ceph.conf configuration file instead of the default /etc/ceph/ceph.conf to
|
|
determine monitor addresses during startup.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-m monaddress[:port]
|
|
Connect to specified monitor (instead of looking through ceph.conf).
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-p pool, \-\-pool pool
|
|
Interact with the given pool. Required by most commands.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-no\-progress
|
|
Do not output progress information (goes to standard error by
|
|
default for some commands).
|
|
.UNINDENT
|
|
.SH PARAMETERS
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-image\-format format
|
|
Specifies which object layout to use. The default is 1.
|
|
.INDENT 7.0
|
|
.IP \(bu 2
|
|
format 1 \- Use the original format for a new rbd image. This format is
|
|
understood by all versions of librbd and the kernel rbd module, but
|
|
does not support newer features like cloning.
|
|
.IP \(bu 2
|
|
format 2 \- Use the second rbd format, which is supported by
|
|
librbd (but not the kernel rbd module) at this time. This adds
|
|
support for cloning and is more easily extensible to allow more
|
|
features in the future.
|
|
.UNINDENT
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-size size\-in\-mb
|
|
Specifies the size (in megabytes) of the new rbd image.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-order bits
|
|
Specifies the object size expressed as a number of bits, such that
|
|
the object size is \fB1 << order\fP. The default is 22 (4 MB).
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-stripe\-unit size\-in\-bytes
|
|
Specifies the stripe unit size in bytes. See striping section (below) for more details.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-stripe\-count num
|
|
Specifies the number of objects to stripe over before looping back
|
|
to the first object. See striping section (below) for more details.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-snap snap
|
|
Specifies the snapshot name for the specific operation.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-id username
|
|
Specifies the username (without the \fBclient.\fP prefix) to use with the map command.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-keyfile filename
|
|
Specifies a file containing the secret to use with the map command.
|
|
If not specified, \fBclient.admin\fP will be used by default.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-keyring filename
|
|
Specifies a keyring file containing a secret for the specified user
|
|
to use with the map command. If not specified, the default keyring
|
|
locations will be searched.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-shared tag
|
|
Option for \fIlock add\fP that allows multiple clients to lock the
|
|
same image if they use the same tag. The tag is an arbitrary
|
|
string. This is useful for situations where an image must
|
|
be open from more than one client at once, like during
|
|
live migration of a virtual machine, or for use underneath
|
|
a clustered filesystem.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-format format
|
|
Specifies output formatting (default: plain, json, xml)
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \-\-pretty\-format
|
|
Make json or xml formatted output more human\-readable.
|
|
.UNINDENT
|
|
.SH COMMANDS
|
|
.INDENT 0.0
|
|
.TP
|
|
.B \fBls\fP [\-l | \-\-long] [pool\-name]
|
|
Will list all rbd images listed in the rbd_directory object. With
|
|
\-l, also show snapshots, and use longer\-format output including
|
|
size, parent (if clone), format, etc.
|
|
.TP
|
|
.B \fBinfo\fP [\fIimage\-name\fP]
|
|
Will dump information (such as size and order) about a specific rbd image.
|
|
If image is a clone, information about its parent is also displayed.
|
|
If a snapshot is specified, whether it is protected is shown as well.
|
|
.TP
|
|
.B \fBcreate\fP [\fIimage\-name\fP]
|
|
Will create a new rbd image. You must also specify the size via \-\-size. The
|
|
\-\-stripe\-unit and \-\-stripe\-count arguments are optional, but must be used together.
|
|
.TP
|
|
.B \fBclone\fP [\fIparent\-snapname\fP] [\fIimage\-name\fP]
|
|
Will create a clone (copy\-on\-write child) of the parent snapshot.
|
|
Object order will be identical to that of the parent image unless
|
|
specified. Size will be the same as the parent snapshot.
|
|
.sp
|
|
The parent snapshot must be protected (see \fIrbd snap protect\fP).
|
|
This requires format 2.
|
|
.TP
|
|
.B \fBflatten\fP [\fIimage\-name\fP]
|
|
If image is a clone, copy all shared blocks from the parent snapshot and
|
|
make the child independent of the parent, severing the link between
|
|
parent snap and child. The parent snapshot can be unprotected and
|
|
deleted if it has no further dependent clones.
|
|
.sp
|
|
This requires format 2.
|
|
.TP
|
|
.B \fBchildren\fP [\fIimage\-name\fP]
|
|
List the clones of the image at the given snapshot. This checks
|
|
every pool, and outputs the resulting poolname/imagename.
|
|
.sp
|
|
This requires format 2.
|
|
.TP
|
|
.B \fBresize\fP [\fIimage\-name\fP] [\-\-allow\-shrink]
|
|
Resizes rbd image. The size parameter also needs to be specified.
|
|
The \-\-allow\-shrink option lets the size be reduced.
|
|
.TP
|
|
.B \fBrm\fP [\fIimage\-name\fP]
|
|
Deletes an rbd image (including all data blocks). If the image has
|
|
snapshots, this fails and nothing is deleted.
|
|
.TP
|
|
.B \fBexport\fP [\fIimage\-name\fP] [\fIdest\-path\fP]
|
|
Exports image to dest path (use \- for stdout).
|
|
.TP
|
|
.B \fBimport\fP [\fIpath\fP] [\fIdest\-image\fP]
|
|
Creates a new image and imports its data from path (use \- for
|
|
stdin). The import operation will try to create sparse rbd images
|
|
if possible. For import from stdin, the sparsification unit is
|
|
the data block size of the destination image (1 << order).
|
|
.TP
|
|
.B \fBexport\-diff\fP [\fIimage\-name\fP] [\fIdest\-path\fP] [\-\-from\-snap \fIsnapname\fP]
|
|
Exports an incremental diff for an image to dest path (use \- for stdout). If
|
|
an initial snapshot is specified, only changes since that snapshot are included; otherwise,
|
|
any regions of the image that contain data are included. The end snapshot is specified
|
|
using the standard \-\-snap option or @snap syntax (see below). The image diff format includes
|
|
metadata about image size changes, and the start and end snapshots. It efficiently represents
|
|
discarded or \(aqzero\(aq regions of the image.
|
|
.TP
|
|
.B \fBimport\-diff\fP [\fIsrc\-path\fP] [\fIimage\-name\fP]
|
|
Imports an incremental diff of an image and applies it to the current image. If the diff
|
|
was generated relative to a start snapshot, we verify that snapshot already exists before
|
|
continuing. If there was an end snapshot we verify it does not already exist before
|
|
applying the changes, and create the snapshot when we are done.
|
|
.TP
|
|
.B \fBdiff\fP [\fIimage\-name\fP] [\-\-from\-snap \fIsnapname\fP]
|
|
Dump a list of byte extents in the image that have changed since the specified start
|
|
snapshot, or since the image was created. Each output line includes the starting offset
|
|
(in bytes), the length of the region (in bytes), and either \(aqzero\(aq or \(aqdata\(aq to indicate
|
|
whether the region is known to be zeros or may contain other data.
|
|
.TP
|
|
.B \fBcp\fP [\fIsrc\-image\fP] [\fIdest\-image\fP]
|
|
Copies the content of a src\-image into the newly created dest\-image.
|
|
dest\-image will have the same size, order, and format as src\-image.
|
|
.TP
|
|
.B \fBmv\fP [\fIsrc\-image\fP] [\fIdest\-image\fP]
|
|
Renames an image. Note: rename across pools is not supported.
|
|
.TP
|
|
.B \fBsnap\fP ls [\fIimage\-name\fP]
|
|
Dumps the list of snapshots inside a specific image.
|
|
.TP
|
|
.B \fBsnap\fP create [\fIimage\-name\fP]
|
|
Creates a new snapshot. Requires the snapshot name parameter specified.
|
|
.TP
|
|
.B \fBsnap\fP rollback [\fIimage\-name\fP]
|
|
Rollback image content to snapshot. This will iterate through the entire blocks
|
|
array and update the data head content to the snapshotted version.
|
|
.TP
|
|
.B \fBsnap\fP rm [\fIimage\-name\fP]
|
|
Removes the specified snapshot.
|
|
.TP
|
|
.B \fBsnap\fP purge [\fIimage\-name\fP]
|
|
Removes all snapshots from an image.
|
|
.TP
|
|
.B \fBsnap\fP protect [\fIimage\-name\fP]
|
|
Protect a snapshot from deletion, so that clones can be made of it
|
|
(see \fIrbd clone\fP). Snapshots must be protected before clones are made;
|
|
protection implies that there exist dependent cloned children that
|
|
refer to this snapshot. \fIrbd clone\fP will fail on a nonprotected
|
|
snapshot.
|
|
.sp
|
|
This requires format 2.
|
|
.TP
|
|
.B \fBsnap\fP unprotect [\fIimage\-name\fP]
|
|
Unprotect a snapshot from deletion (undo \fIsnap protect\fP). If cloned
|
|
children remain, \fIsnap unprotect\fP fails. (Note that clones may exist
|
|
in different pools than the parent snapshot.)
|
|
.sp
|
|
This requires format 2.
|
|
.TP
|
|
.B \fBmap\fP [\fIimage\-name\fP]
|
|
Maps the specified image to a block device via the rbd kernel module.
|
|
.TP
|
|
.B \fBunmap\fP [\fIdevice\-path\fP]
|
|
Unmaps the block device that was mapped via the rbd kernel module.
|
|
.TP
|
|
.B \fBshowmapped\fP
|
|
Show the rbd images that are mapped via the rbd kernel module.
|
|
.TP
|
|
.B \fBlock\fP list [\fIimage\-name\fP]
|
|
Show locks held on the image. The first column is the locker
|
|
to use with the \fIlock remove\fP command.
|
|
.TP
|
|
.B \fBlock\fP add [\fIimage\-name\fP] [\fIlock\-id\fP]
|
|
Lock an image. The lock\-id is an arbitrary name for the user\(aqs
|
|
convenience. By default, this is an exclusive lock, meaning it
|
|
will fail if the image is already locked. The \-\-shared option
|
|
changes this behavior. Note that locking does not affect
|
|
any operation other than adding a lock. It does not
|
|
protect an image from being deleted.
|
|
.TP
|
|
.B \fBlock\fP remove [\fIimage\-name\fP] [\fIlock\-id\fP] [\fIlocker\fP]
|
|
Release a lock on an image. The lock id and locker are
|
|
as output by lock ls.
|
|
.TP
|
|
.B \fBbench\-write\fP [\fIimage\-name\fP] \-\-io\-size [\fIio\-size\-in\-bytes\fP] \-\-io\-threads [\fInum\-ios\-in\-flight\fP] \-\-io\-total [\fItotal\-bytes\-to\-write\fP]
|
|
Generate a series of sequential writes to the image and measure the
|
|
write throughput and latency. Defaults are: \-\-io\-size 4096, \-\-io\-threads 16,
|
|
\-\-io\-total 1GB
|
|
.UNINDENT
|
|
.SH IMAGE NAME
|
|
.sp
|
|
In addition to using the \-\-pool and the \-\-snap options, the image name can include both
|
|
the pool name and the snapshot name. The image name format is as follows:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
[pool/]image\-name[@snap]
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
Thus an image name that contains a slash character (\(aq/\(aq) requires specifying the pool
|
|
name explicitly.
|
|
.SH STRIPING
|
|
.sp
|
|
RBD images are striped over many objects, which are then stored by the
|
|
Ceph distributed object store (RADOS). As a result, read and write
|
|
requests for the image are distributed across many nodes in the
|
|
cluster, generally preventing any single node from becoming a
|
|
bottleneck when individual images get large or busy.
|
|
.sp
|
|
The striping is controlled by three parameters:
|
|
.INDENT 0.0
|
|
.TP
|
|
.B order
|
|
.TP
|
|
.B The size of objects we stripe over is a power of two, specifially 2^[*order*] bytes. The default
|
|
.TP
|
|
.B is 22, or 4 MB.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B stripe_unit
|
|
.TP
|
|
.B Each [*stripe_unit*] contiguous bytes are stored adjacently in the same object, before we move on
|
|
.TP
|
|
.B to the next object.
|
|
.UNINDENT
|
|
.INDENT 0.0
|
|
.TP
|
|
.B stripe_count
|
|
.TP
|
|
.B After we write [*stripe_unit*] bytes to [*stripe_count*] objects, we loop back to the initial object
|
|
.TP
|
|
.B and write another stripe, until the object reaches its maximum size (as specified by [*order*]. At that
|
|
.TP
|
|
.B point, we move on to the next [*stripe_count*] objects.
|
|
.UNINDENT
|
|
.sp
|
|
By default, [\fIstripe_unit\fP] is the same as the object size and [\fIstripe_count\fP] is 1. Specifying a different
|
|
[\fIstripe_unit\fP] requires that the STRIPINGV2 feature be supported (added in Ceph v0.53) and format 2 images be
|
|
used.
|
|
.SH EXAMPLES
|
|
.sp
|
|
To create a new rbd image that is 100 GB:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd \-p mypool create myimage \-\-size 102400
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
or alternatively:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd create mypool/myimage \-\-size 102400
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To use a non\-default object size (8 MB):
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd create mypool/myimage \-\-size 102400 \-\-order 23
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To delete an rbd image (be careful!):
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd rm mypool/myimage
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To create a new snapshot:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd snap create mypool/myimage@mysnap
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To create a copy\-on\-write clone of a protected snapshot:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd clone mypool/myimage@mysnap otherpool/cloneimage
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To see which clones of a snapshot exist:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd children mypool/myimage@mysnap
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To delete a snapshot:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd snap rm mypool/myimage@mysnap
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To map an image via the kernel with cephx enabled:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd map mypool/myimage \-\-id admin \-\-keyfile secretfile
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To unmap an image:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd unmap /dev/rbd0
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To create an image and a clone from it:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd import \-\-format 2 image mypool/parent
|
|
rbd snap create \-\-snap snapname mypool/parent
|
|
rbd snap protect mypool/parent@snap
|
|
rbd clone mypool/parent@snap otherpool/child
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To create an image with a smaller stripe_unit (to better distribute small writes in some workloads):
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd \-p mypool create myimage \-\-size 102400 \-\-stripe\-unit 65536 \-\-stripe\-count 16
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To change an image from one format to another, export it and then
|
|
import it as the desired format:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd export mypool/myimage@snap /tmp/img
|
|
rbd import \-\-format 2 /tmp/img mypool/myimage2
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To lock an image for exclusive use:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd lock add mypool/myimage mylockid
|
|
.ft P
|
|
.fi
|
|
.sp
|
|
To release a lock:
|
|
.sp
|
|
.nf
|
|
.ft C
|
|
rbd lock remove mypool/myimage mylockid client.2485
|
|
.ft P
|
|
.fi
|
|
.SH AVAILABILITY
|
|
.sp
|
|
\fBrbd\fP is part of the Ceph distributed file system. Please refer to
|
|
the Ceph documentation at \fI\%http://ceph.com/docs\fP for more information.
|
|
.SH SEE ALSO
|
|
.sp
|
|
\fBceph\fP(8),
|
|
\fBrados\fP(8)
|
|
.SH COPYRIGHT
|
|
2010-2013, Inktank Storage, Inc. and contributors. Licensed under Creative Commons BY-SA
|
|
.\" Generated by docutils manpage writer.
|
|
.
|