Commit Graph

133911 Commits

Author SHA1 Message Date
Sridhar Seshasayee
b6e36cb4a9
Merge pull request #47630 from sseshasa/wip-add-mclk-cpu-util-rel-note
PendingReleaseNotes: Note the fix for high CPU utilization during recovery

Reviewed-by: Neha Ojha <nojha@redhat.com>
2022-09-06 20:45:02 +05:30
Tatjana Dehler
15fa97d49d
monitoring/ceph-mixin: add RGW host to label info
Add the missing information about the RGW instance to the labels of the
"Average GET/PUT Latencies" panel on the "RGW Overview" dashboard.

Fixes: https://tracker.ceph.com/issues/57166
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2022-09-06 16:19:19 +02:00
Daniel Gryniewicz
1d08d8c124
Merge pull request #47908 from dang/wip-dang-zipper-cleanup
RGW - Sal - remove unused default constructors

Reviewed-by: Casey Bodley <cbodley@redat.com>
2022-09-06 09:48:49 -04:00
Casey Bodley
0676cabb31
Merge pull request #47907 from cbodley/wip-57195
rgw: fix d3n crash in StoreManager::get_config()

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
Reviewed-by: Mark Kogan <mkogan@redhat.com>
2022-09-06 07:52:34 -04:00
lmgdlmgd
c5b4e53e29
osd: set per_pool_stats true when OSD has no PG
If per_pool_stats is false when the OSD has no PG, osd_sum.num_osds does not equal osd_sum.num_per_pool_osds, and the ceph df command then returns STORED == USED, which is not what we expect.
 
Signed-off-by: jindengke <jindengke@inspur.com>

Signed-off-by: lmgdlmgd <jindengke@inspur.com>
2022-09-06 15:32:54 +08:00
nsedrickm
05c1b776c2 mgr/dashboard: add cypress e2e accessibility tests for navigation
* Add cypress e2e accessibility tests
* Log accessibility violations to console for quick debugging
* Disable retries to avoid duplicate logs

Signed-off-by: nsedrickm <nsedrick101@gmail.com>
2022-09-06 07:37:09 +01:00
nsedrickm
d84c827879 mgr/dashboard: Improve accessibility for navigation components
* Fix all dropdown accessibility issues by setting their roles as buttons
* Increase navlink font-size to 16px
* Add new primary color variable $primary-wcag-aa-large-text: #25828e to meet WCAG level AA color contrast ratio for active navlink backgrounds

Fixes: https://tracker.ceph.com/issues/56021
Signed-off-by: nsedrickm <nsedrick101@gmail.com>
2022-09-06 07:37:09 +01:00
Kefu Chai
bd8b7aa4da
Merge pull request #46823 from lightmelodies/librados-c-api-stat2
librados: export stat2 in C API

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
2022-09-06 08:40:46 +08:00
Kefu Chai
1217c9eb98
Merge pull request #47962 from tchaikov/wip-3rd-time-is-a-charm
test/{librbd, rgw}: retry when bind fail with port 0

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-09-06 08:37:45 +08:00
Ilya Dryomov
36f1d515ca librbd: make RefreshRequest tests compatible with clone v1
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:02:39 +02:00
Ilya Dryomov
bd885d75b2 librbd: retry ENOENT in V2_REFRESH_PARENT as well
With auto-deletion of trashed snapshots, it is relatively easy to lose
a race to "rbd flatten" as follows:

- when V2_GET_PARENT runs, the image is technically still a clone
- when V2_REFRESH_PARENT runs, the image is fully flattened and the
  snapshot in the parent image is deleted

This results in a spurious ENOENT error, mainly when trying to open the
image (e.g. for "rbd info").  This race condition has always been there
but auto-deletion of trashed snapshots makes it much worse.

Retry ENOENT in V2_REFRESH_PARENT the same way as in V2_GET_SNAPSHOTS.

Fixes: https://tracker.ceph.com/issues/52810
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:02:39 +02:00
Ilya Dryomov
8570194b13 librbd: limit the number of ENOENT retries in RefreshRequest
If the image header is corrupt, ENOENT error may be persistent.  Avoid
an infinite loop in that case.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:02:39 +02:00
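
The two commits above (bd885d75b2 and 8570194b13) describe retrying a step on ENOENT while capping the number of retries so that a persistent ENOENT cannot loop forever. A minimal sketch of that pattern follows. It is not the librbd code: `run_with_enoent_retry`, the `step` callback, and the cap of 10 are hypothetical names and values chosen for illustration, since the commit messages do not state the actual limit.

```cpp
#include <cassert>
#include <cerrno>
#include <functional>

// Hypothetical cap; the commit message does not say what limit librbd uses.
constexpr int kMaxEnoentRetries = 10;

// Runs `step` once, then re-runs it while it keeps returning -ENOENT,
// up to `max_retries` additional times. Returns the last result, so a
// persistent -ENOENT is reported to the caller instead of looping forever.
int run_with_enoent_retry(const std::function<int()>& step,
                          int max_retries = kMaxEnoentRetries) {
  assert(max_retries >= 0);
  int result = step();
  for (int attempt = 0; result == -ENOENT && attempt < max_retries; ++attempt) {
    result = step();
  }
  return result;
}
```

A transient race resolves after a retry or two, while a corrupt header exhausts the budget and surfaces -ENOENT to the caller.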
Ilya Dryomov
6bd89ea119 librbd: fix a bunch of issues with restarting RefreshRequest
Make RefreshRequest properly restartable, at least up until and including
V2_REFRESH_PARENT step:

- clear m_migration_spec when skipping GET_MIGRATION_HEADER
- don't rely on potentially stale m_incomplete_update on retry
- reset m_legacy_parent when retrying more than just V2_GET_PARENT
- don't rely on potentially stale m_parent_md.overlap and
  m_head_parent_overlap on retry
- clear m_metadata before fetching image metadata (but not before
  fetching pool metadata)
- clear m_op_features when skipping V2_GET_OP_FEATURES
- clear m_group_spec on EOPNOTSUPP error in V2_GET_GROUP
- reset m_legacy_snapshot when retrying more than just V2_GET_SNAPSHOTS
- don't rely on potentially stale m_snap_parents on retry

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:02:39 +02:00
Ilya Dryomov
ce6dff89c0 librbd: check *result consistently in RefreshRequest
Stick to *result >= 0 checks everywhere and add missing checks for
op_features_get_finish() and image_group_get_finish() errors.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:02:32 +02:00
Ilya Dryomov
ca36ffa347 librbd: reflect V2_GET_SNAPSHOTS ENOENT retry in state diagram
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2022-09-05 23:00:35 +02:00
Casey Bodley
9c828b72ad
Merge pull request #47964 from yuvalif/wip-yuval-fix-57418
rgw/lua: handle null lua manager in librgw

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2022-09-05 14:17:07 -04:00
Yuri Weinstein
99f42bfba7
Merge pull request #47456 from sseshasa/wip-fix-mon-cmd-nack
osd: Implement Context based completion for mon cmd to set a config option

Reviewed-by: Samuel Just <sjust@redhat.com>
2022-09-05 11:16:00 -07:00
Yuri Weinstein
41dd298743
Merge pull request #37767 from shun-s/add-mons-ms-tos-support
msg: add min delay packets support for mons

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
2022-09-05 11:14:33 -07:00
Kefu Chai
aa7885f7cc test/{librbd, rgw}: retry when bind fail with port 0
there is a chance that the bind() call may fail if another test happens
to pick the same free port chosen by the operating system. in this case,
we just retry up to 42 times.

in theory, this change does not fully address the race, but it should
help to alleviate the issue.

See-also: https://tracker.ceph.com/issues/57116
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
2022-09-05 21:04:34 +08:00
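
The retry described in aa7885f7cc can be pictured as a bounded loop that picks a fresh port on every attempt. The sketch below is an illustration only, not the test code from the tree: `bind_with_retry`, `pick_port`, and `try_bind` are hypothetical names, and reading "up to 42 times" as at most 42 attempts in total is an assumption.

```cpp
#include <cassert>
#include <functional>
#include <optional>

// Attempt limit taken from the commit message ("retry up to 42 times");
// treating it as the total number of attempts is an assumption.
constexpr int kMaxBindAttempts = 42;

// `pick_port` returns a candidate port (for example one the OS handed out
// for port 0) and `try_bind` reports whether binding to it succeeded.
// Both callbacks are hypothetical stand-ins for the real socket calls.
std::optional<int> bind_with_retry(const std::function<int()>& pick_port,
                                   const std::function<bool(int)>& try_bind,
                                   int max_attempts = kMaxBindAttempts) {
  assert(max_attempts > 0);
  for (int attempt = 0; attempt < max_attempts; ++attempt) {
    const int port = pick_port();
    if (try_bind(port)) {
      return port;  // bound successfully
    }
    // Another process won the race for this port; pick a new one.
  }
  return std::nullopt;  // every attempt lost the race
}
```

As the commit message notes, this narrows the race window rather than closing it: another process can still take the port between the pick and the bind.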
Pere Diaz Bou
b4957d6203
Merge pull request #47250 from nSedrickm/progress-bar-accessibility-update
mgr/dashboard: Improve level A accessibility for usagebar

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2022-09-05 13:02:00 +02:00
Yuval Lifshitz
41e1f9a482
Merge pull request #47507 from zenomri/wip-omri-fix-tracer-provider
common/tracer: fix for tracer provider destruction

reviewed-by: yuvalif
2022-09-05 12:19:56 +03:00
Yuval Lifshitz
eac3e96c50
Merge pull request #47905 from cbodley/wip-57326
rgw: avoid string_view to temporary in RGWBulkUploadOp

reviewed-by: dang, yuvalif
2022-09-05 10:33:27 +03:00
nsedrickm
78ec4addee mgr/dashboard: remove duplicate ids from table column filters
Generate random ids for column filters to avoid id conflicts on pages with multiple tables sharing the same column filters

Signed-off-by: nsedrickm <nsedrick101@gmail.com>
Fixes: https://tracker.ceph.com/issues/55869
2022-09-05 07:31:15 +01:00
Yingxin
226b6b8f36
Merge pull request #47518 from aravind-wdc/zns-wip
crimson/os/seastore: improve ZNS support for SeaStore

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 11:34:31 +08:00
Yingxin Cheng
5c425d5bfa crimson/os/seastore: rename transaction type properly
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 11:28:02 +08:00
Yingxin Cheng
76c2793229 crimson/os/seastore: introduce generic AsyncCleaner interface to EPM
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 11:28:02 +08:00
Yingxin Cheng
54d8814c2c crimson/os/seastore: introduce log subsys seastore_epm
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 11:28:02 +08:00
Yingxin Cheng
06b7b1e8e2 crimson/os/seastore: move AsyncCleaner::GCProcess to EPM::BackgroundProcess
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 11:27:56 +08:00
Xiaoliang Yang
1a66581732 doc/cephadm: Add creation method of OSD
The ceph orch daemon add osd command does not support device partitions. If you do not have a raw block device, you can create an LVM volume on a device partition and then use the ceph orch daemon add osd : command to create the OSD.
Signed-off-by: Xiaoliang Yang <yangxiaoliang07@163.com>
2022-09-05 10:32:31 +08:00
Yingxin Cheng
1e21bc5fd8 crimson/os/seastore/async_cleaner: move cleaner_state_t to GCProcess
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 09:43:25 +08:00
Yingxin Cheng
32def58910 crimson/os/seastore/async_cleaner: make trimmer optional to cleaner
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 09:43:25 +08:00
Yingxin Cheng
d98e08d665 crimson/os/seastore/async_cleaner: move the related stats to GCProcess
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 09:43:25 +08:00
Yingxin Cheng
662449077b crimson/os/seastore/async_cleaner: move trimmer into GCProcess
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2022-09-05 09:43:25 +08:00
Zac Dover
367695f5b0 doc/monitoring: add min vers of apps in mon stack
https://tracker.ceph.com/issues/45447

This PR adds recommended versions of grafana and
prometheus and alert manager.

This PR is a second attempt at getting the information
in the following PR into the docs:
https://github.com/ceph/ceph/pull/46000/files

Himadri Maheshwari deserves the credit for the work
in this commit.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
Signed-off-by: Himadri Maheshwari <himadri.maheshwari7915@gmail.com>
2022-09-05 07:36:52 +10:00
Yuval Lifshitz
5177d7d1cb rgw/lua: handle null lua manager in librgw
Fixes: https://tracker.ceph.com/issues/57418

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
2022-09-04 16:11:36 +03:00
Nizamudeen A
a873546c0d
Merge pull request #47571 from rhcs-dashboard/nfs-form-squash-norequired
mgr/dashboard: fix nfs exports form issues with squash field 

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2022-09-04 13:00:17 +05:30
Nizamudeen A
a3f601b5e7 mgr/dashboard: fix Cannot read properties of undefined (reading 'filter')
Fixes: https://tracker.ceph.com/issues/57434
Signed-off-by: Nizamudeen A <nia@redhat.com>
2022-09-04 11:22:45 +05:30
Nizamudeen A
dcbf179106 mgr/dashboard: fix Expected to find element: `cd-modal .badge` but never found it
This looks related to https://github.com/ceph/ceph/pull/46323, but I am
not sure why it started failing recently in teuthology.

The error is fixed by deselecting the row manually, so that it won't
accidentally get deselected when we actually want to select it.

Fixes: https://tracker.ceph.com/issues/57207
Signed-off-by: Nizamudeen A <nia@redhat.com>
2022-09-04 10:36:40 +05:30
Nizamudeen A
d3a03afa28
Merge pull request #47634 from rhcs-dashboard/kcli-fedora-36
mgr/dashboard: fix "can't read .ssh/known_hosts: No such file or directory"

Reviewed-by: Pegonzal <NOT@FOUND>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2022-09-04 09:56:19 +05:30
zdover23
555f0590dd
Merge pull request #47871 from zdover23/wip-doc-2022-08-30-start-documenting-ceph-branch-name-update
doc/start: update documenting-ceph branch names

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2022-09-04 12:10:37 +10:00
J. Eric Ivancich
929e96fbe9
Merge pull request #47906 from ivancich/wip-bad-move-s3-getobj
rgw: fix bug where variable referenced after data moved out

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2022-09-03 18:26:42 -04:00
Adam King
1aac56e007
Merge pull request #47941 from adk3798/tp-no-overwrite-doc2
doc/cephadm: add info about --no-overwrite to note about tuned-profiles

Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
2022-09-03 15:47:21 -04:00
Adam King
d879c5098f doc/cephadm: add info about --no-overwrite to note about tuned-profiles
Follow-up to the fix for this flag in
028cb031dd

Signed-off-by: Adam King <adking@redhat.com>
2022-09-03 12:19:54 -04:00
Adam King
648618f2c0
Merge pull request #47733 from adk3798/prometheus-retention
mgr/cephadm: allow setting prometheus retention time

Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2022-09-03 11:59:32 -04:00
Adam King
a725a7c573
Merge pull request #47815 from rkachach/fix_issue_57032
cephadm/mgr: adding logic to handle --no-overwrite for tuned profiles

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
2022-09-03 11:58:01 -04:00
Adam King
eaac7b7c80
Merge pull request #47859 from pcuzner/fix-gather-facts-disk-size
cephadm: Fix disk size calculation

Reviewed-by: Adam King <adking@redhat.com>
2022-09-03 11:55:05 -04:00
Adam King
41ac5cab4c
Merge pull request #47882 from rkachach/fix_issue_57060
mgr/cephadm: Fix how we check if a host belongs to public network

Reviewed-by: Adam King <adking@redhat.com>
2022-09-03 11:53:23 -04:00
J. Eric Ivancich
550196c104
Merge pull request #47904 from cbodley/wip-57329
rgw: avoid use-after-move in RGWDataSyncSingleEntryCR ctor

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
2022-09-03 11:21:11 -04:00
Kefu Chai
6cc31cc593 denc: support non-c++20 compliant C++ standard library
when compiling with the standard library that comes with GCC-10, we have
FTBFS like:

```
src/include/denc.h:517:49: error: 'bit_cast' is not a member of 'std';
did you mean 'bad_cast'?
  517 |   unsigned lowznib = v ?
      (std::countr_zero(std::bit_cast<uint64_t>(v)) / 4) : 0u;
      |                                                 ^~~~~~~~
      |                                                 bad_cast
```

to address this issue, an implementation of std::bit_cast<> is defined
if it is not available. in the long run, we should use a better C++
compiler for compiling the tree.

Fixes: https://tracker.ceph.com/issues/57355
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
2022-09-03 08:24:06 +08:00
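
Commit 6cc31cc593 above says that a replacement for std::bit_cast<> is defined when the standard library lacks one. One common way to write such a fallback is sketched below, under assumptions: the `sketch` namespace and the `double_bits` helper are hypothetical, and the feature-test approach shown here is not necessarily what the tree uses.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <type_traits>

#if __has_include(<version>)
#include <version>  // defines __cpp_lib_bit_cast when std::bit_cast exists
#endif

#if defined(__cpp_lib_bit_cast)
#include <bit>
namespace sketch {
using std::bit_cast;  // the library already provides it
}
#else
namespace sketch {
// Fallback: copy the object representation with memcpy, which is the
// well-defined way to reinterpret bits between trivially copyable types.
template <typename To, typename From>
To bit_cast(const From& from) noexcept {
  static_assert(sizeof(To) == sizeof(From), "sizes must match");
  static_assert(std::is_trivially_copyable<To>::value &&
                    std::is_trivially_copyable<From>::value,
                "both types must be trivially copyable");
  static_assert(std::is_default_constructible<To>::value,
                "this fallback needs a default-constructible target");
  To to;
  std::memcpy(&to, &from, sizeof(To));
  return to;
}
}  // namespace sketch
#endif

// Hypothetical helper: returns the raw 64-bit pattern of a double.
inline std::uint64_t double_bits(double v) {
  const auto bits = sketch::bit_cast<std::uint64_t>(v);
  // Converting back and forth must reproduce the same bit pattern.
  assert(sketch::bit_cast<std::uint64_t>(sketch::bit_cast<double>(bits)) == bits);
  return bits;
}
```

Unlike std::bit_cast, the memcpy fallback is not constexpr, which is acceptable for run-time use such as the expression quoted in the error message.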
Adam King
34d0977861
Merge pull request #47921 from rkachach/fix_issue_56953
doc/cephadm/services: fix example for specifying rgw placement

Reviewed-by: Adam King <adking@redhat.com>
2022-09-02 14:36:22 -04:00