common/options: turn off bluestore_fsck_quick_fix_on_mount by default

This option enables 3 conversions:

1) pool stats, added in nautilus
2) per-pool omap, added in octopus
3) per-pg omap (replacing (2)) in pacific

Upgrading the long running cluster in sepia from octopus to pacific
resulted in conversion (3). This conversion isn't particularly useful
yet since the follow-on optimization of pg removal aren't in pacific
yet.

This took 25 minutes for the SSD-based osds with <10GB of omap. That's
a lot of disruption, and some clusters have 10x that much omap data.

Upgrades going from nautilus to pacific will miss the finer-grained
stats granularity, but isn't such an important feature it's worth
causing potential availability problems.

In the future we can orchestrate these format changes via cephadm/rook
to minimize the impact on the whole cluster, e.g. going an osd at a
time or doing it during an off-peak period, and not necessarily at the
same time as an upgrade.

Fixes: https://tracker.ceph.com/issues/45265
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
This commit is contained in:
Josh Durgin 2021-03-17 14:51:27 -04:00
parent 5d10b6995c
commit c3e7acb3ac

View File

@ -4630,7 +4630,7 @@ std::vector<Option> get_global_options() {
.set_description("Run deep fsck at mount when bluestore_fsck_on_mount is set to true"),
Option("bluestore_fsck_quick_fix_on_mount", Option::TYPE_BOOL, Option::LEVEL_DEV)
.set_default(true)
.set_default(false)
.set_description("Do quick-fix for the store at mount"),
Option("bluestore_fsck_on_umount", Option::TYPE_BOOL, Option::LEVEL_DEV)