mirror of
https://github.com/ceph/ceph
synced 2025-01-21 02:31:19 +00:00
b740ae12e5
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
530 lines
17 KiB
ReStructuredText
530 lines
17 KiB
ReStructuredText
|
||
.. _orchestrator-cli-module:
|
||
|
||
================
|
||
Orchestrator CLI
|
||
================
|
||
|
||
This module provides a command line interface (CLI) to orchestrator
|
||
modules (ceph-mgr modules which interface with external orchestration services).
|
||
|
||
As the orchestrator CLI unifies different external orchestrators, a common nomenclature
|
||
for the orchestrator module is needed.
|
||
|
||
+--------------------------------------+---------------------------------------+
|
||
| *host* | hostname (not DNS name) of the |
|
||
| | physical host. Not the podname, |
|
||
| | container name, or hostname inside |
|
||
| | the container. |
|
||
+--------------------------------------+---------------------------------------+
|
||
| *service type* | The type of the service. e.g., nfs, |
|
||
| | mds, osd, mon, rgw, mgr, iscsi |
|
||
+--------------------------------------+---------------------------------------+
|
||
| *service* | A logical service, Typically |
|
||
| | comprised of multiple service |
|
||
| | instances on multiple hosts for HA |
|
||
| | |
|
||
| | * ``fs_name`` for mds type |
|
||
| | * ``rgw_zone`` for rgw type |
|
||
| | * ``ganesha_cluster_id`` for nfs type |
|
||
+--------------------------------------+---------------------------------------+
|
||
| *daemon* | A single instance of a service. |
|
||
| | Usually a daemon, but maybe not |
|
||
| | (e.g., might be a kernel service |
|
||
| | like LIO or knfsd or whatever) |
|
||
| | |
|
||
| | This identifier should |
|
||
| | uniquely identify the instance |
|
||
+--------------------------------------+---------------------------------------+
|
||
|
||
The relation between the names is the following:
|
||
|
||
* A *service* has a specfic *service type*
|
||
* A *daemon* is a physical instance of a *service type*
|
||
|
||
|
||
.. note::
|
||
|
||
Orchestrator modules may only implement a subset of the commands listed below.
|
||
Also, the implementation of the commands are orchestrator module dependent and will
|
||
differ between implementations.
|
||
|
||
Status
|
||
======
|
||
|
||
::
|
||
|
||
ceph orch status
|
||
|
||
Show current orchestrator mode and high-level status (whether the module able
|
||
to talk to it)
|
||
|
||
Also show any in-progress actions.
|
||
|
||
Host Management
|
||
===============
|
||
|
||
List hosts associated with the cluster::
|
||
|
||
ceph orch host ls
|
||
|
||
Add and remove hosts::
|
||
|
||
ceph orch host add <host>
|
||
ceph orch host rm <host>
|
||
|
||
OSD Management
|
||
==============
|
||
|
||
List Devices
|
||
------------
|
||
|
||
Print a list of discovered devices, grouped by host and optionally
|
||
filtered to a particular host:
|
||
|
||
::
|
||
|
||
ceph orch device ls [--host=...] [--refresh]
|
||
|
||
Example::
|
||
|
||
# ceph orch device ls
|
||
Host 192.168.121.206:
|
||
Device Path Type Size Rotates Available Model
|
||
/dev/sdb hdd 50.0G True True ATA/QEMU HARDDISK
|
||
/dev/sda hdd 50.0G True False ATA/QEMU HARDDISK
|
||
|
||
Host 192.168.121.181:
|
||
Device Path Type Size Rotates Available Model
|
||
/dev/sdb hdd 50.0G True True ATA/QEMU HARDDISK
|
||
/dev/sda hdd 50.0G True False ATA/QEMU HARDDISK
|
||
|
||
.. note::
|
||
Output form Ansible orchestrator
|
||
|
||
Create OSDs
|
||
-----------
|
||
|
||
Create OSDs on a group of devices on a single host::
|
||
|
||
ceph orch osd create <host>:<drive>
|
||
ceph orch osd create -i <path-to-drive-group.json>
|
||
|
||
|
||
The output of ``osd create`` is not specified and may vary between orchestrator backends.
|
||
|
||
Where ``drive.group.json`` is a JSON file containing the fields defined in
|
||
:class:`ceph.deployment_utils.drive_group.DriveGroupSpec`
|
||
|
||
Example::
|
||
|
||
# ceph orch osd create 192.168.121.206:/dev/sdc
|
||
{"status": "OK", "msg": "", "data": {"event": "playbook_on_stats", "uuid": "7082f3ba-f5b7-4b7c-9477-e74ca918afcb", "stdout": "\r\nPLAY RECAP *********************************************************************\r\n192.168.121.206 : ok=96 changed=3 unreachable=0 failed=0 \r\n", "counter": 932, "pid": 10294, "created": "2019-05-28T22:22:58.527821", "end_line": 1170, "runner_ident": "083cad3c-8197-11e9-b07a-2016b900e38f", "start_line": 1166, "event_data": {"ignored": 0, "skipped": {"192.168.121.206": 186}, "ok": {"192.168.121.206": 96}, "artifact_data": {}, "rescued": 0, "changed": {"192.168.121.206": 3}, "pid": 10294, "dark": {}, "playbook_uuid": "409364a6-9d49-4e44-8b7b-c28e5b3adf89", "playbook": "add-osd.yml", "failures": {}, "processed": {"192.168.121.206": 1}}, "parent_uuid": "409364a6-9d49-4e44-8b7b-c28e5b3adf89"}}
|
||
|
||
.. note::
|
||
Output form Ansible orchestrator
|
||
|
||
Decommission an OSD
|
||
-------------------
|
||
::
|
||
|
||
ceph orch osd rm <osd-id> [osd-id...]
|
||
|
||
Removes one or more OSDs from the cluster and the host, if the OSDs are marked as
|
||
``destroyed``.
|
||
|
||
Example::
|
||
|
||
# ceph orch osd rm 4
|
||
{"status": "OK", "msg": "", "data": {"event": "playbook_on_stats", "uuid": "1a16e631-906d-48e0-9e24-fa7eb593cc0a", "stdout": "\r\nPLAY RECAP *********************************************************************\r\n192.168.121.158 : ok=2 changed=0 unreachable=0 failed=0 \r\n192.168.121.181 : ok=2 changed=0 unreachable=0 failed=0 \r\n192.168.121.206 : ok=2 changed=0 unreachable=0 failed=0 \r\nlocalhost : ok=31 changed=8 unreachable=0 failed=0 \r\n", "counter": 240, "pid": 10948, "created": "2019-05-28T22:26:09.264012", "end_line": 308, "runner_ident": "8c093db0-8197-11e9-b07a-2016b900e38f", "start_line": 301, "event_data": {"ignored": 0, "skipped": {"localhost": 37}, "ok": {"192.168.121.181": 2, "192.168.121.158": 2, "192.168.121.206": 2, "localhost": 31}, "artifact_data": {}, "rescued": 0, "changed": {"localhost": 8}, "pid": 10948, "dark": {}, "playbook_uuid": "a12ec40e-bce9-4bc9-b09e-2d8f76a5be02", "playbook": "shrink-osd.yml", "failures": {}, "processed": {"192.168.121.181": 1, "192.168.121.158": 1, "192.168.121.206": 1, "localhost": 1}}, "parent_uuid": "a12ec40e-bce9-4bc9-b09e-2d8f76a5be02"}}
|
||
|
||
.. note::
|
||
Output form Ansible orchestrator
|
||
|
||
..
|
||
Blink Device Lights
|
||
^^^^^^^^^^^^^^^^^^^
|
||
::
|
||
|
||
ceph orch device ident-on <dev_id>
|
||
ceph orch device ident-on <dev_name> <host>
|
||
ceph orch device fault-on <dev_id>
|
||
ceph orch device fault-on <dev_name> <host>
|
||
|
||
ceph orch device ident-off <dev_id> [--force=true]
|
||
ceph orch device ident-off <dev_id> <host> [--force=true]
|
||
ceph orch device fault-off <dev_id> [--force=true]
|
||
ceph orch device fault-off <dev_id> <host> [--force=true]
|
||
|
||
where ``dev_id`` is the device id as listed in ``osd metadata``,
|
||
``dev_name`` is the name of the device on the system and ``host`` is the host as
|
||
returned by ``orchestrator host ls``
|
||
|
||
ceph orch osd ident-on {primary,journal,db,wal,all} <osd-id>
|
||
ceph orch osd ident-off {primary,journal,db,wal,all} <osd-id>
|
||
ceph orch osd fault-on {primary,journal,db,wal,all} <osd-id>
|
||
ceph orch osd fault-off {primary,journal,db,wal,all} <osd-id>
|
||
|
||
Where ``journal`` is the filestore journal, ``wal`` is the write ahead log of
|
||
bluestore and ``all`` stands for all devices associated with the osd
|
||
|
||
|
||
Monitor and manager management
|
||
==============================
|
||
|
||
Creates or removes MONs or MGRs from the cluster. Orchestrator may return an
|
||
error if it doesn't know how to do this transition.
|
||
|
||
Update the number of monitor hosts::
|
||
|
||
ceph orch apply mon <num> [host, host:network...]
|
||
|
||
Each host can optionally specify a network for the monitor to listen on.
|
||
|
||
Update the number of manager hosts::
|
||
|
||
ceph orch apply mgr <num> [host...]
|
||
|
||
..
|
||
.. note::
|
||
|
||
The host lists are the new full list of mon/mgr hosts
|
||
|
||
.. note::
|
||
|
||
specifying hosts is optional for some orchestrator modules
|
||
and mandatory for others (e.g. Ansible).
|
||
|
||
|
||
Service Status
|
||
==============
|
||
|
||
Print a list of services known to the orchestrator. The list can be limited to
|
||
services on a particular host with the optional --host parameter and/or
|
||
services of a particular type via optional --type parameter
|
||
(mon, osd, mgr, mds, rgw):
|
||
|
||
::
|
||
|
||
ceph orch ls [--service_type type] [--service_name name] [--export] [--format f] [--refresh]
|
||
|
||
Discover the status of a particular service or daemons::
|
||
|
||
ceph orch ls --service_type type --service_name <name> [--refresh]
|
||
|
||
Export the service specs known to the orchestrator as yaml in format
|
||
that is compatible to ``ceph orch apply -i``::
|
||
|
||
ceph orch ls --export
|
||
|
||
|
||
Daemon Status
|
||
=============
|
||
|
||
Print a list of all daemons known to the orchestrator::
|
||
|
||
ceph orch ps [--hostname host] [--daemon_type type] [--service_name name] [--daemon_id id] [--format f] [--refresh]
|
||
|
||
Query the status of a particular service instance (mon, osd, mds, rgw). For OSDs
|
||
the id is the numeric OSD ID, for MDS services it is the file system name::
|
||
|
||
ceph orch ps --daemon_type osd --daemon_id 0
|
||
|
||
|
||
.. _orchestrator-cli-cephfs:
|
||
|
||
Depoying CephFS
|
||
===============
|
||
|
||
In order to set up a :term:`CephFS`, execute::
|
||
|
||
ceph fs volume create <fs_name> <placement spec>
|
||
|
||
Where ``name`` is the name of the CephFS, ``placement`` is a
|
||
:ref:`orchestrator-cli-placement-spec`.
|
||
|
||
This command will create the required Ceph pools, create the new
|
||
CephFS, and deploy mds servers.
|
||
|
||
Stateless services (MDS/RGW/NFS/rbd-mirror/iSCSI)
|
||
=================================================
|
||
|
||
The orchestrator is not responsible for configuring the services. Please look into the corresponding
|
||
documentation for details.
|
||
|
||
The ``name`` parameter is an identifier of the group of instances:
|
||
|
||
* a CephFS file system for a group of MDS daemons,
|
||
* a zone name for a group of RGWs
|
||
|
||
Sizing: the ``size`` parameter gives the number of daemons in the cluster
|
||
(e.g. the number of MDS daemons for a particular CephFS file system).
|
||
|
||
Creating/growing/shrinking/removing services::
|
||
|
||
ceph orch {mds,rgw} update <name> <size> [host…]
|
||
ceph orch {mds,rgw} add <name>
|
||
ceph orch nfs update <name> <size> [host…]
|
||
ceph orch nfs add <name> <pool> [--namespace=<namespace>]
|
||
ceph orch {mds,rgw,nfs} rm <name>
|
||
|
||
e.g., ``ceph orch mds update myfs 3 host1 host2 host3``
|
||
|
||
Start/stop/reload::
|
||
|
||
ceph orch service {stop,start,reload} <type> <name>
|
||
|
||
ceph orch daemon {start,stop,reload} <type> <daemon-id>
|
||
|
||
.. _orchestrator-cli-service-spec:
|
||
|
||
Service Specification
|
||
=====================
|
||
|
||
As *Service Specification* is a data structure often represented as YAML
|
||
to specify the deployment of services. For example:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: rgw
|
||
service_id: realm.zone
|
||
placement:
|
||
hosts:
|
||
- host1
|
||
- host2
|
||
- host3
|
||
spec: ...
|
||
|
||
Where the properties of a service specification are the following:
|
||
|
||
* ``service_type`` is the type of the service. Needs to be either a Ceph
|
||
service (``mon``, ``crash``, ``mds``, ``mgr``, ``osd`` or
|
||
``rbd-mirror``), a gateway (``nfs`` or ``rgw``), or part of the
|
||
monitoring stack (``alertmanager``, ``grafana``, ``node-exporter`` or
|
||
``prometheus``).
|
||
* ``service_id`` is the name of the service. Omit the service time
|
||
* ``placement`` is a :ref:`orchestrator-cli-placement-spec`
|
||
* ``spec``: additional specifications for a specific service.
|
||
|
||
Each service type can have different requirements for the spec.
|
||
|
||
Service specifications of type ``mon``, ``mgr``, and the monitoring
|
||
types do not require a ``service_id``
|
||
|
||
A service of type ``nfs`` requires a pool name and contain
|
||
an optional namespace:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: nfs
|
||
service_id: mynfs
|
||
placement:
|
||
hosts:
|
||
- host1
|
||
- host2
|
||
spec:
|
||
pool: mypool
|
||
namespace: mynamespace
|
||
|
||
Where ``pool`` is a RADOS pool where NFS client recovery data is stored
|
||
and ``namespace`` is a RADOS namespace where NFS client recovery
|
||
data is stored in the pool.
|
||
|
||
A service of type ``osd`` is in detail described in :ref:`drivegroups`
|
||
|
||
Many service specifications can then be applied at once using
|
||
``ceph orch apply -i`` by submitting a multi-document YAML file::
|
||
|
||
cat <<EOF | ceph orch apply -i -
|
||
service_type: mon
|
||
placement:
|
||
host_pattern: "mon*"
|
||
---
|
||
service_type: mgr
|
||
placement:
|
||
host_pattern: "mgr*"
|
||
---
|
||
service_type: osd
|
||
placement:
|
||
host_pattern: "osd*"
|
||
data_devices:
|
||
all: true
|
||
EOF
|
||
|
||
.. _orchestrator-cli-placement-spec:
|
||
|
||
Placement Specification
|
||
=======================
|
||
|
||
In order to allow the orchestrator to deploy a *service*, it needs to
|
||
know how many and where it should deploy *daemons*. The orchestrator
|
||
defines a placement specification that can either be passed as a command line argument.
|
||
|
||
Explicit placements
|
||
-------------------
|
||
|
||
Daemons can be explictly placed on hosts by simply specifying them::
|
||
|
||
orch apply prometheus "host1 host2 host3"
|
||
|
||
Or in yaml:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: prometheus
|
||
placement:
|
||
hosts:
|
||
- host1
|
||
- host2
|
||
- host3
|
||
|
||
MONs and other services may require some enhanced network specifications::
|
||
|
||
orch daemon add mon myhost:[v2:1.2.3.4:3000,v1:1.2.3.4:6789]=name
|
||
|
||
Where ``[v2:1.2.3.4:3000,v1:1.2.3.4:6789]`` is the network address of the monitor
|
||
and ``=name`` specifies the name of the new monitor.
|
||
|
||
Placement by labels
|
||
-------------------
|
||
|
||
Daemons can be explictly placed on hosts that match a specifc label::
|
||
|
||
orch apply prometheus label:mylabel
|
||
|
||
Or in yaml:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: prometheus
|
||
placement:
|
||
label: "mylabel"
|
||
|
||
|
||
Placement by pattern matching
|
||
-----------------------------
|
||
|
||
Daemons can be placed on hosts as well::
|
||
|
||
orch apply prometheus 'myhost[1-3]'
|
||
|
||
Or in yaml:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: prometheus
|
||
placement:
|
||
host_pattern: "myhost[1-3]"
|
||
|
||
To place a service on *all* hosts, use ``"*"``::
|
||
|
||
orch apply crash '*'
|
||
|
||
Or in yaml:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: node-exporter
|
||
placement:
|
||
host_pattern: "*"
|
||
|
||
|
||
Setting a limit
|
||
---------------
|
||
|
||
By specifying ``count``, only that number of daemons will be created::
|
||
|
||
orch apply prometheus 3
|
||
|
||
To deploy *daemons* on a subset of hosts, also specify the count::
|
||
|
||
orch apply prometheus "2 host1 host2 host3"
|
||
|
||
If the count is bigger than the amount of hosts, cephadm still deploys two daemons::
|
||
|
||
orch apply prometheus "3 host1 host2"
|
||
|
||
Or in yaml:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: prometheus
|
||
placement:
|
||
count: 3
|
||
|
||
Or with hosts:
|
||
|
||
.. code-block:: yaml
|
||
|
||
service_type: prometheus
|
||
placement:
|
||
count: 2
|
||
hosts:
|
||
- host1
|
||
- host2
|
||
- host3
|
||
|
||
|
||
Configuring the Orchestrator CLI
|
||
================================
|
||
|
||
To enable the orchestrator, select the orchestrator module to use
|
||
with the ``set backend`` command::
|
||
|
||
ceph orch set backend <module>
|
||
|
||
For example, to enable the Rook orchestrator module and use it with the CLI::
|
||
|
||
ceph mgr module enable rook
|
||
ceph orch set backend rook
|
||
|
||
Check the backend is properly configured::
|
||
|
||
ceph orch status
|
||
|
||
Disable the Orchestrator
|
||
------------------------
|
||
|
||
To disable the orchestrator, use the empty string ``""``::
|
||
|
||
ceph orch set backend ""
|
||
ceph mgr module disable rook
|
||
|
||
Current Implementation Status
|
||
=============================
|
||
|
||
This is an overview of the current implementation status of the orchestrators.
|
||
|
||
=================================== ====== =========
|
||
Command Rook Cephadm
|
||
=================================== ====== =========
|
||
apply iscsi ⚪ ⚪
|
||
apply mds ✔ ✔
|
||
apply mgr ⚪ ✔
|
||
apply mon ✔ ✔
|
||
apply nfs ✔ ✔
|
||
apply osd ✔ ✔
|
||
apply rbd-mirror ✔ ✔
|
||
apply rgw ⚪ ✔
|
||
host add ⚪ ✔
|
||
host ls ✔ ✔
|
||
host rm ⚪ ✔
|
||
daemon status ⚪ ✔
|
||
daemon {stop,start,...} ⚪ ✔
|
||
device {ident,fault}-(on,off} ⚪ ✔
|
||
device ls ✔ ✔
|
||
iscsi add ⚪ ⚪
|
||
mds add ✔ ✔
|
||
nfs add ✔ ✔
|
||
ps ⚪ ✔
|
||
rbd-mirror add ⚪ ✔
|
||
rgw add ✔ ✔
|
||
ps ✔ ✔
|
||
=================================== ====== =========
|
||
|
||
where
|
||
|
||
* ⚪ = not yet implemented
|
||
* ❌ = not applicable
|
||
* ✔ = implemented
|