2021-09-19 13:12:44 +00:00
|
|
|
.. _upgrade-mds-cluster:
|
|
|
|
|
2018-04-05 22:49:32 +00:00
|
|
|
Upgrading the MDS Cluster
|
|
|
|
=========================
|
|
|
|
|
|
|
|
Currently the MDS cluster does not have built-in versioning or file system
|
|
|
|
flags to support seamless upgrades of the MDSs without potentially causing
|
|
|
|
assertions or other faults due to incompatible messages or other functional
|
|
|
|
differences. For this reason, it's necessary during any cluster upgrade to
|
|
|
|
reduce the number of active MDS for a file system to one first so that two
|
2021-03-30 21:46:45 +00:00
|
|
|
active MDS do not communicate with different versions.
|
2018-04-05 22:49:32 +00:00
|
|
|
|
|
|
|
The proper sequence for upgrading the MDS cluster is:
|
|
|
|
|
2021-03-30 21:46:45 +00:00
|
|
|
1. For each file system, disable and stop standby-replay daemons.
|
2021-03-16 15:31:48 +00:00
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
ceph fs set <fs_name> allow_standby_replay false
|
|
|
|
|
|
|
|
In Pacific, the standby-replay daemons are stopped for you after running this
|
|
|
|
command. Older versions of Ceph require you to stop these daemons manually.
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
ceph fs dump # find standby-replay daemons
|
|
|
|
ceph mds fail mds.<X>
|
|
|
|
|
|
|
|
|
2021-03-30 21:46:45 +00:00
|
|
|
2. For each file system, reduce the number of ranks to 1:
|
2018-04-05 22:49:32 +00:00
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
ceph fs set <fs_name> max_mds 1
|
|
|
|
|
2021-03-16 15:31:48 +00:00
|
|
|
3. Wait for cluster to stop non-zero ranks where only rank 0 is active and the rest are standbys.
|
2018-04-05 22:49:32 +00:00
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
ceph status # wait for MDS to finish stopping
|
|
|
|
|
2021-03-30 21:46:45 +00:00
|
|
|
4. For each MDS, upgrade packages and restart. Note: to reduce failovers, it is
|
|
|
|
recommended -- but not strictly necessary -- to first upgrade standby daemons.
|
2018-04-13 23:48:36 +00:00
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
# use package manager to update cluster
|
|
|
|
systemctl restart ceph-mds.target
|
2018-04-05 22:49:32 +00:00
|
|
|
|
2021-03-30 21:46:45 +00:00
|
|
|
5. For each file system, restore the previous max_mds and allow_standby_replay settings for your cluster:
|
2018-04-05 22:49:32 +00:00
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
ceph fs set <fs_name> max_mds <old_max_mds>
|
2021-03-30 21:46:45 +00:00
|
|
|
ceph fs set <fs_name> allow_standby_replay <old_allow_standby_replay>
|
2021-04-23 18:22:56 +00:00
|
|
|
|
2016-02-29 14:19:50 +00:00
|
|
|
|
2019-09-09 19:36:04 +00:00
|
|
|
Upgrading pre-Firefly file systems past Jewel
|
|
|
|
=============================================
|
2016-02-29 14:19:50 +00:00
|
|
|
|
|
|
|
.. tip::
|
|
|
|
|
2019-09-09 19:36:04 +00:00
|
|
|
This advice only applies to users with file systems
|
2016-02-29 14:19:50 +00:00
|
|
|
created using versions of Ceph older than *Firefly* (0.80).
|
2019-09-09 19:36:04 +00:00
|
|
|
Users creating new file systems may disregard this advice.
|
2016-02-29 14:19:50 +00:00
|
|
|
|
|
|
|
Pre-firefly versions of Ceph used a now-deprecated format
|
|
|
|
for storing CephFS directory objects, called TMAPs. Support
|
|
|
|
for reading these in RADOS will be removed after the Jewel
|
|
|
|
release of Ceph, so for upgrading CephFS users it is important
|
|
|
|
to ensure that any old directory objects have been converted.
|
|
|
|
|
|
|
|
After installing Jewel on all your MDS and OSD servers, and restarting
|
|
|
|
the services, run the following command:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
cephfs-data-scan tmap_upgrade <metadata pool name>
|
|
|
|
|
|
|
|
This only needs to be run once, and it is not necessary to
|
|
|
|
stop any other services while it runs. The command may take some
|
|
|
|
time to execute, as it iterates overall objects in your metadata
|
2019-09-09 19:36:04 +00:00
|
|
|
pool. It is safe to continue using your file system as normal while
|
2016-02-29 14:19:50 +00:00
|
|
|
it executes. If the command aborts for any reason, it is safe
|
|
|
|
to simply run it again.
|
|
|
|
|
2019-09-09 19:36:04 +00:00
|
|
|
If you are upgrading a pre-Firefly CephFS file system to a newer Ceph version
|
2016-02-29 14:19:50 +00:00
|
|
|
than Jewel, you must first upgrade to Jewel and run the ``tmap_upgrade``
|
|
|
|
command before completing your upgrade to the latest version.
|
|
|
|
|