137154 Commits

Author SHA1 Message Date
Patrick Donnelly
7ffa065a03
mds: add config to decide whether to mark dentry bad
So that an admin can restore access to files if necessary.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:56 -04:00
Patrick Donnelly
1b4d9077af
qa: add missing scan_links step for data scan recovery
Without it, the first field remains corrupt (HEAD).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:56 -04:00
Patrick Donnelly
293b90fe63
qa/tasks/cephfs: test damage to dentry's first is caught
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:56 -04:00
Patrick Donnelly
3ba17390e9
qa/tasks/cephfs: use rank_asok and allow specifying rank
This will use the more efficient:

    ceph tell mds.<fsname>:<rank> ...

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:56 -04:00
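
As a rough illustration of the command form quoted in the commit above, the following sketch assembles an argument list for `ceph tell mds.<fsname>:<rank> ...`. The helper name `mds_tell_argv` and the example file system name `cephfs` are hypothetical and are not taken from the qa code; only the `mds.<fsname>:<rank>` target syntax comes from the commit message.

```python
def mds_tell_argv(fsname, rank, *command):
    """Build an argv list for `ceph tell mds.<fsname>:<rank> <command...>`.

    Hypothetical helper for illustration only; not the qa implementation.
    """
    if rank < 0:
        raise ValueError("rank must be non-negative")
    # Address the MDS by file system name and rank rather than by daemon name.
    target = f"mds.{fsname}:{rank}"
    return ["ceph", "tell", target, *command]


if __name__ == "__main__":
    # Example: ask rank 0 of the (assumed) file system "cephfs" to list damage.
    print(" ".join(mds_tell_argv("cephfs", 0, "damage", "ls")))
```

The list form can be passed directly to a process runner, which avoids shell quoting issues.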
Patrick Donnelly
dbf1a402c9
qa/tasks: allow specifying timeout command prefix to ceph
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:55 -04:00
Patrick Donnelly
fe258dc54b
mds: provide test configs for creating first corruption
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:55 -04:00
Patrick Donnelly
03407528f9
mds: catch damage to dentry's first field
Catch the damage when possible, and abort the MDS before it can be written to the
journal/directory.

This is part of a series to address corruption first observed in [1].
How the corruption is introduced is not yet known.

[1] https://tracker.ceph.com/issues/38452#note-10

Fixes: http://tracker.ceph.com/issues/58482
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:55 -04:00
Patrick Donnelly
c9d36056c7
mds: add debugging for pre_cow_old_inode
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:33 -04:00
Patrick Donnelly
17d826551c
mds: cleanup code
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2023-03-29 13:02:33 -04:00
zdover23
714978806f
Merge pull request #50723 from zdover23/wip-doc-2023-03-28-documenting-ceph-squashing
doc/start: documenting-ceph - add squash procedure

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2023-03-29 22:54:58 +10:00
Avan
ef6b9ae3c7
Merge pull request #50718 from rhcs-dashboard/use-only-counter-dump-schema
exporter: use only counter dump/schema commands for extracting counters

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2023-03-29 14:22:51 +05:30
Xiubo Li
579510dab5
Merge pull request #48183 from lxbsz/wip-57591
qa: enable kclient test for newop test
2023-03-29 12:44:07 +08:00
Xiubo Li
898316a72c qa: enable kclient test for newop test
The kclient has already fixed this. This only enables the test for the
upstream kclient with the testing branch; the downstream ones may not
include the fix yet, so skip them for now.

Nautilus only supports mount syntax v1, and for the kclient there
is no need to do the upgrade.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-29 09:18:18 +08:00
Xiubo Li
be9c8458ff qa: add one func to distinguish the mount syntax version
Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-29 09:08:20 +08:00
Xiubo Li
ede1afe91c qa: remove the FuseMount/KernelMount check in test_newops.py
Will use the postmerge fragment to check this.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-29 09:07:58 +08:00
Xiubo Li
ef2cdfdefa qa: introduce postmerge for fuse/kclient mounts
Suggested by Patrick; this will check the mounter's type.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-29 09:07:58 +08:00
Xiubo Li
e123fcaadc qa: remove the '0-' prefix
Both nautilus and pacific will be run in parallel.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-29 09:07:58 +08:00
Zac Dover
71ee225d7b doc/start: documenting-ceph - add squash procedure
Add a procedure to doc/start/documenting-ceph.rst that explains how to
perform an interactive rebase to squash commits.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-29 09:57:48 +10:00
Avan Thakkar
51a89906df exporter: use only counter dump/schema commands for extracting counters
Fixes: https://tracker.ceph.com/issues/59191
Signed-off-by: Avan Thakkar <athakkar@redhat.com>

The Ceph exporter no longer requires the output of perf dump/schema, as the ``counter dump`` command
returns both labeled and unlabeled perf counters, which the exporter can fetch and export.
Removed the ``exporter_get_labeled_counters`` config option, as the exporter will now export
all counters, labeled or unlabeled.
The fix also adds support for renaming the rgw multi-site metrics and
adding labels to them, similar to what the prometheus module does.
2023-03-28 23:42:09 +05:30
Anthony D'Atri
cec15a5992
Merge pull request #50713 from zdover23/wip-doc-2023-03-28-glossary-cephx
doc/glossary: improve "CephX" entry
2023-03-28 08:28:49 -04:00
Aashish Sharma
9a28ba2a89
Merge pull request #50529 from rhcs-dashboard/dashboard-edit-rgw-multisite
mgr/dashboard: edit realm in rgw-multisite


Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2023-03-28 17:08:17 +05:30
Zac Dover
02e3a5cb76 doc/glossary: improve "CephX" entry
Improve the glossary entry for "CephX".

Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-28 18:54:07 +10:00
Aashish Sharma
eb56f2680c mgr/dashboard: Add unit test for realm
Fixes: https://tracker.ceph.com/issues/59171
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2023-03-28 12:26:17 +05:30
Yingxin
0793495b9d
Merge pull request #50653 from xxhdx1985126/wip-exist-clean
crimson/os/seastore/cache: consider EXIST_CLEAN extents as pending ones

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2023-03-28 14:07:43 +08:00
Yingxin Cheng
865285a53c crimson/os/seastore/cache: use CachedExtent::is_mutable() where appropriate
CachedExtent::is_mutable() should only be used to check whether
duplicate_for_write(extent) needs to be called.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2023-03-28 09:41:55 +08:00
zdover23
6a4088a9c8
Merge pull request #50697 from zdover23/wip-doc-2023-03-28-glossary-scrubbing
doc/glossary: add "Scrubbing"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2023-03-28 10:39:16 +10:00
Zac Dover
4a66819da4 doc/glossary: add "Scrubbing"
Add "Scrubbing" to the glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-28 08:07:30 +10:00
Adam King
c989b0a351
Merge pull request #48937 from adk3798/device-ls-size
mgr/orchestrator: fix device size in `orch device ls` output

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2023-03-27 13:55:31 -04:00
Xiubo Li
043d4abccf
Merge pull request #50681 from lxbsz/qa-fscrypt-020
qa: fscrypt enable xfstests-dev generic/020 test case
2023-03-27 21:31:06 +08:00
avanthakkar
f658ac2670 disable default check if already set to true for selected realm
Fixes: https://tracker.ceph.com/issues/59171
Signed-off-by: avanthakkar <avanjohn@gmail.com>
2023-03-27 17:42:08 +05:30
avanthakkar
d42ea1d5af disable create zonegroup if no master zone exist for existing master zonegroup
Fixes: https://tracker.ceph.com/issues/59171
Signed-off-by: avanthakkar <avanjohn@gmail.com>
2023-03-27 17:42:01 +05:30
avanthakkar
e804800432 mgr/dashboard: edit rgw-multisite
Fixes: https://tracker.ceph.com/issues/59171
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
2023-03-27 17:41:44 +05:30
Xuehan Xu
f34faf363e crimson/os/seastore/cache: consider EXIST_CLEAN extents as pending ones
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
2023-03-27 15:32:34 +08:00
Xiubo Li
585481f343 qa: fscrypt enable xfstests-dev generic/020 test case
Since https://git.ceph.com/xfstests-dev.git has pulled in the
corresponding fix for the long attribute in the generic/020 test case,
we can enable it now.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2023-03-27 14:25:52 +08:00
zdover23
f5c5009eab
Merge pull request #50675 from zdover23/wip-doc-2023-03-27-rados-operations-bluestore-migration-cleanup
doc/rados: clean up ops/bluestore-migration.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2023-03-27 05:06:45 +10:00
Zac Dover
b28be76d0f doc/rados: clean up ops/bluestore-migration.rst
Clean up internal links, fix the numbering of a procedure, and implement
Anthony D'Atri's suggestions in
https://github.com/ceph/ceph/pull/50487 and
https://github.com/ceph/ceph/pull/50488.

https://tracker.ceph.com/issues/58485

Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-27 02:07:30 +10:00
zdover23
9792102cb3
Merge pull request #50654 from zdover23/wip-doc-2023-03-24-glossary-user
doc/glossary: add "User"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2023-03-26 05:43:00 +10:00
Zac Dover
fd6bfaf3fe doc/glossary: add "User"
Add "User" to glossary.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-25 06:50:00 +10:00
Anthony D'Atri
68df405e53
Merge pull request #50660 from zdover23/wip-doc-2023-03-25-rados-operations-bluestore-migration-prompt-fix
doc/operations: fix prompt in bluestore-migration
2023-03-24 13:54:06 -04:00
Zac Dover
5e54641aec doc/operations: fix prompt in bluestore-migration
Fix a single prompt in bluestore-migration.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2023-03-25 03:47:10 +10:00
Ilya Dryomov
b89782a369
Merge pull request #50302 from weirdwiz/rbd-perf-counters
rbd-mirror: switch to labeled perf counters

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2023-03-24 13:00:10 +01:00
Rishabh Dave
76177ab1a9
Merge pull request #50497 from rishabh-d-dave/fs-qa-caps-helper
qa/cephfs: add more helper methods to caps_helper.py

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2023-03-24 16:28:38 +05:30
Ilya Dryomov
4431be49fc
Merge pull request #49302 from petrutlucian94/adapter_resets
rbd-wnbd: optionally handle wnbd adapter restart events

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2023-03-24 11:41:04 +01:00
Avan
5976b1f21d
Merge pull request #50369 from rhcs-dashboard/exporter-labeled-counters
exporter: add support for exposing labeled perf counters
2023-03-24 14:56:28 +05:30
Lucian Petrut
98a7aff741 rbd-wnbd: consistently use negative error codes in rbd-wnbd
The rbd-wnbd iterators return positive errors, which is why
in certain cases we may end up with both positive and negative
error codes.

This change ensures that we'll consistently use negative
error codes.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2023-03-24 09:00:21 +00:00
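
The convention described in the commit above (always report errors as negative codes, even when a callee returns positive ones) can be sketched as follows. This is a Python illustration of the idea only; rbd-wnbd itself is not written in Python, and the helper name `to_negative_errno` is hypothetical.

```python
import errno


def to_negative_errno(err):
    """Normalize an error code so that failures are always negative.

    Accepts either sign (e.g. errno.ENOENT or -errno.ENOENT) and returns
    the negative form; 0 (success) is returned unchanged.
    """
    return -abs(err)


if __name__ == "__main__":
    # A callee that reports a positive code and one that reports a negative
    # code both end up in the same negative form.
    print(to_negative_errno(errno.ENOENT), to_negative_errno(-errno.ENOENT))
```

Normalizing at the boundary where the positive-valued results enter lets the rest of the code compare against a single sign convention.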
Lucian Petrut
3d8afc0021 common, rbd-wnbd: bump Windows log level
We're increasing the log level for certain Windows operational log
messages.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2023-03-24 08:59:49 +00:00
Lucian Petrut
0c25ca6564 rbd-wnbd: optionally handle wnbd adapter restart events
The WNBD adapter may be reset in certain situations (e.g. driver
upgrade, MS WHQL tests, etc).

We're going to monitor the WNBD adapter using WMI[1] events, restarting
the rbd-wnbd disk mappings whenever necessary. Adapter monitoring can be
enabled by passing the --adapter-monitoring-enabled flag to the service.

This feature is optional for the following reasons:

* it's mainly used during development / driver certification
* we had to use a relatively small polling interval, which might imply
  additional resource usage. WMI quotas also have to be considered.

While at it, we're updating two lambdas that are submitted to thread pools,
avoiding default reference capturing and explicitly specifying the variables
that get copied.

[1] https://learn.microsoft.com/en-us/windows/win32/wmisdk/wmi-start-page

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2023-03-24 08:58:15 +00:00
Rishabh Dave
969a93d0dc qa/cephfs: add more helper methods to caps_helper.py
Add methods that accept read/write permissions, CephFS names, and a
CephFS mount point and return the string form of MON, OSD, and MDS
caps exactly as reported in Ceph keyrings.

Replace similar code in test_multifs_auth.py with calls to these helper
methods.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2023-03-24 11:32:58 +05:30
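
A minimal sketch of what such cap-string helpers might look like is given below. The function names and the exact string layouts are assumptions modeled on typical `ceph fs authorize` keyring output; they are not copied from caps_helper.py, and the way the path is rendered in particular may differ from the real helpers.

```python
def mon_cap_str(fsname):
    """MON cap string for a client limited to one file system (assumed format)."""
    return f"allow r fsname={fsname}"


def osd_cap_str(perm, fsname):
    """OSD cap string granting `perm` on the data pools of `fsname` (assumed format)."""
    return f"allow {perm} tag cephfs data={fsname}"


def mds_cap_str(perm, fsname, mntpt="/"):
    """MDS cap string; the path clause is added only for a non-root mount point.

    The `path=` rendering here is an assumption for illustration.
    """
    cap = f"allow {perm} fsname={fsname}"
    if mntpt not in ("", "/"):
        cap += f" path={mntpt}"
    return cap


if __name__ == "__main__":
    # Hypothetical file system name "cephfs" and mount point "/dir1".
    print(mon_cap_str("cephfs"))
    print(osd_cap_str("rw", "cephfs"))
    print(mds_cap_str("rw", "cephfs", "/dir1"))
```

Generating the expected strings in one place lets a test compare them directly against keyring output instead of repeating string literals in every test.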
Anthony D'Atri
b5d6ca7a8f
Merge pull request #50649 from Thingee/foundation-mem-update-20230323
doc/foundation: Update Foundation members
2023-03-23 20:11:04 -04:00
Mike Perez
73aa44aa44 doc/foundation: Update Foundation members
Removing EasyStack, Vexxhost and adding 42on

Signed-off-by: Mike Perez <thingee@gmail.com>
2023-03-23 15:52:12 -07:00