Commit Graph

121400 Commits

Author SHA1 Message Date
Patrick Donnelly
10a86af09d
Merge PR #40326 into master
* refs/pull/40326/head:
	client: rebuild bl to avoid too many vector(> IOV_MAX)

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2021-03-26 15:11:28 -07:00
Sage Weil
8350cc4596 Merge PR #40394 into master
* refs/pull/40394/head:
	osd: handle MOSDPGInfo
	osd/PeeringState: init last_require_osd_release in ctor

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-03-26 16:41:31 -04:00
Neha Ojha
fc330101d9
Merge pull request #40415 from sseshasa/wip-qa-wait-for-clean-check
qa/tasks: Add wait_for_clean() check prior to initiating scrubbing.

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-03-26 11:25:16 -07:00
Kefu Chai
f61ff08737
Merge pull request #40438 from idryomov/wip-crimson-reopen-session
crimson/monc: discard active/pending connections when reopening

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-03-27 00:11:54 +08:00
Ilya Dryomov
2edf4f568d crimson/monc: discard active/pending connections when reopening
Otherwise pending_conns vector just keeps growing with redundant
connections all trying to reach the same set of monitors.  When one
of the attempts finally succeeds, _finish_auth() will pick the first
connection with a matching entity_addr_t, designate it as active and
close all others.  The match is very likely to be wrong and hence the
actual authenticated connection gets closed, leaving the OSD with
a bogus active_con and no monitor session.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2021-03-26 14:36:47 +01:00
Sage Weil
37d1a231c0 Merge PR #40404 into master
* refs/pull/40404/head:
	auth: require CEPHX_V2 by default

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-03-26 08:24:52 -04:00
Sage Weil
7283a58f3d Merge PR #40321 into master
* refs/pull/40321/head:
	cephadm: fix parsing of keepalived version (drop leading 'v')
	cephadm: keepalived needs --cap-add=NET_RAW
	cephadm: fix --cap-add=NET_ADMIN
	cephadm: fix quoting for keepalived env var
	mgr/cephadm: ha-rgw: use correct port

Reviewed-by: Michael Fritch <mfritch@suse.com>
2021-03-26 08:19:06 -04:00
Sage Weil
671ba83c0b Merge PR #40015 into master
* refs/pull/40015/head:
	cephadm: validate fsid during cephadm shell command

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
2021-03-26 08:18:35 -04:00
Sage Weil
47d00f6503 Merge PR #40341 into master
* refs/pull/40341/head:
	os/bluestore: separate omap per-pool vs per-pg alerts

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
2021-03-26 08:16:19 -04:00
Venky Shankar
325f38f065
Merge pull request #40429 from varshar16/wip-fix-fs-top-error
src/tools/cephfs-top: fix flake8 line too long error

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2021-03-26 17:25:03 +05:30
Nathan Cutler
61f267362d
Merge pull request #40401 from lnussel/master
ceph.spec: prepare openSUSE usrmerge (boo#1029961)
2021-03-26 11:23:55 +01:00
Varsha Rao
51cf6c3d14 src/tools/cephfs-top: fix flake8 line too long error
Fixes: https://tracker.ceph.com/issues/50005
Signed-off-by: Varsha Rao <varao@redhat.com>
2021-03-26 15:08:33 +05:30
Kefu Chai
e038fe24ee
Merge pull request #40170 from Rethan/fix-sync-pause-max-lat
os/FileStore: fix sync_pause_max_lat which should not be inced

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-03-26 15:08:25 +08:00
Kefu Chai
76724ad716
Merge pull request #40334 from dvanders/pick
common/ipaddr: skip loopback interfaces named 'lo' and test it

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-03-26 14:31:10 +08:00
Kefu Chai
406f2efafc
Merge pull request #40292 from tchaikov/wip-39191
debian,cmake,cephsqlite: hide non-public symbols

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-03-26 14:28:34 +08:00
Kefu Chai
a45e4b644c
Merge pull request #40171 from swegener/bluefs-assert-always-true
bluefs: fix assert always being true

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-03-26 14:25:38 +08:00
Kefu Chai
74f3e4e207
Merge pull request #40304 from tchaikov/wip-telegraph-ann
pybind/mgr/telegraf: add type annotations

Reviewed-by: Wido den Hollander <wido@widodh.nl>
2021-03-26 14:24:56 +08:00
Venky Shankar
04a46059df
Merge pull request #40403 from vshankar/wip-cephfs-top-add-metrics
cephfs-top: include additional metrics reported by `fs perf stats`

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2021-03-26 11:52:31 +05:30
Kefu Chai
6e1de9ae35
Merge pull request #40167 from singuliere/wip-49781
common/mempool: only fail tests if sharding is very bad

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-03-26 14:13:18 +08:00
Samuel Just
3cb8b3592f
Merge pull request #40331 from athanatos/sjust/wip-seatore-gc-refactor
crimson/seastore: refactor gc into concurrent task

Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
2021-03-25 21:21:28 -07:00
Kefu Chai
6152076c22
Merge pull request #40336 from tchaikov/wip-cmake-fio
cmake: pass compile options by fio interface library

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-03-26 09:28:31 +08:00
Neha Ojha
347da505ee
Merge pull request #40323 from aclamk/wip-onode-resiliant-to-split-cache
os/bluestore: Make Onode::put/get resiliant to split_cache

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
2021-03-25 18:05:28 -07:00
Samuel Just
0514aadb4e crimson/os/seastore/segment_cleaner: bump reclaim_ratio limits
GC behavior still needs a ton of work, but this adjustment should
reduce gc activity for now until we have time to refine the params
and introduce the ability to write cold extents to a segment other
than the current journal segment.

Signed-off-by: Samuel Just <sjust@redhat.com>
2021-03-25 13:16:35 -07:00
Samuel Just
0408a60997 crimson/os/seastore/.../btree_range_pin: use ceph_assert
There's a performance overhead, but seastore is quite immature and
detecting these logic errors saves a ton of debugging time.

Signed-off-by: Samuel Just <sjust@redhat.com>
2021-03-25 13:13:59 -07:00
Samuel Just
24267ea8cb crimson/os/seastore/.../block.cc: add error message on alloc error to segment_write
Signed-off-by: Samuel Just <sjust@redhat.com>
2021-03-25 13:13:59 -07:00
Sridhar Seshasayee
b5570238b6 qa/tasks: Add wait_for_clean() check prior to initiating scrubbing.
Fixes: https://tracker.ceph.com/issues/49983
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-03-25 22:01:19 +05:30
Mark Kogan
0772593632
Merge pull request #39776 from alimaredia/wip-rgw-log-req-latency
rgw: add latency to the request summary of an op
2021-03-25 18:16:39 +02:00
Daniel Pivonka
9118b08365 cephadm: validate fsid during cephadm shell command
Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
2021-03-25 10:26:40 -04:00
Sage Weil
dba20d0e07 Merge PR #40366 into master
* refs/pull/40366/head:
	mgr/cephadm: Fix dashboard gateway configuration when using IPV6

Reviewed-by: Sage Weil <sage@redhat.com>
2021-03-25 08:39:34 -04:00
Kefu Chai
ce28172dfe
Merge pull request #40395 from rhcs-dashboard/fix-49952-master
mgr/dashboard:Simplify some complex calculations in test_alerts.yml

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2021-03-25 19:28:59 +08:00
Ilya Dryomov
e5744672db auth: require CEPHX_V2 by default
It's been almost three years and support is present in all relevant
clients.

From the security perspective, roughly the same could be achieved
with "ceph osd set-require-min-compat-client nautilus", but this is
more user friendly as the client gets ENOTSUP instead of spinning on
"feature set mismatch" faults.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2021-03-25 12:17:32 +01:00
Venky Shankar
5a119dcc13 cephfs-top: include additional metrics reported by fs perf stats.
Without this, `cephfs-top` hits an exception since the additional
metrics keys were not configured.

Also, include a validation suring selftest that checks if `ceph
fs perf stats` metrics match what cephfs-top is configured to
report.

Fixes: http://tracker.ceph.com/issues/49974
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2021-03-25 07:12:59 -04:00
Kefu Chai
8d30d8a67f
Merge pull request #40400 from tchaikov/wip-ctest-jenkins
run-make-check.sh: let ctest generate XML output

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2021-03-25 18:24:22 +08:00
Venky Shankar
972d045d94
Merge pull request #40327 from racpatel/wip-cephfs-top-refresh-interval
cephfs-top: allow configurable stats refresh interval
2021-03-25 15:16:39 +05:30
Venky Shankar
93dd8821aa
Merge pull request #40210 from joscollin/wip-B49736-missing-client-metadata
cephfs-top: be resilient to missing client metadata keys
2021-03-25 15:15:01 +05:30
Ludwig Nussel
e4c4a4ce97 ceph.spec: prepare openSUSE usrmerge (boo#1029961)
The compat symlink in /sbin is no longer required and actually in
the way in the usrmerge case.

Signed-off-by: Ludwig Nussel <ludwig.nussel@suse.de>
2021-03-25 10:35:35 +01:00
Kefu Chai
48ba39987d run-make-check.sh: let ctest generate XML output
to enable XUnit plugin of jenkins to consume the ctest output and
publish it in the dashboard, we need to

* let ctest generate XML output instead of plain text output
* do not fail the test if any test case fails. this allows the publisher
  to do its job by checking the XML output.
* prevent ctest from compressing the output. see
  https://issues.jenkins.io/browse/JENKINS-21737

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-03-25 17:14:09 +08:00
Aashish Sharma
8d2f39e6c5 mgr/dashboard:Simplify some complex calculations in test_alerts.yml
run-promtool-unittests is failing with difference in floating point values in some complex calculations. This PR intends to simplify those calculations and fix this issue.

Fixes: https://tracker.ceph.com/issues/49952
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2021-03-25 12:05:07 +05:30
Kefu Chai
e4b7f4563a osd: handle MOSDPGInfo
this change partially reverts b37e959238,
which introduced a regression where we fail to handle MOSDPGInfo
messages sent by osd before aba13c7661
and/or f7130db7a9fa85e475f14e4feed0097bf2d0b964.

Fixes: https://tracker.ceph.com/issues/49963
Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-03-25 12:52:04 +08:00
Kefu Chai
a25d097271 osd/PeeringState: init last_require_osd_release in ctor
so we can use the latest osdmap to tell what is the required osd release in
*current* cluster before consuming a new osdmap which is fed to
PeeringState only if we advance to a new map.

a osd shares its pg_info using MOSDPGInfo2 or MOSDPGInfo according to
PeeringState::last_require_osd_release, this happens before a new osdmap
is fully consumed.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-03-25 12:52:03 +08:00
Kefu Chai
c420dedae4
Merge pull request #40377 from tchaikov/wip-doc-fixes
doc/man: fix formatting

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-03-25 10:39:14 +08:00
Neha Ojha
7bc9206457
Merge pull request #38044 from dvanders/dvanders_buffered
common/options: bluefs_buffered_io=true by default

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-03-24 17:45:17 -07:00
Ali Maredia
c2eeb1d9cd
Merge pull request #40380 from adamemerson/wip-num-rados-handles-die-die-die
Die, `num_rados_handles`, die.

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2021-03-24 18:39:35 -04:00
Sage Weil
f33bf0647b cephadm: fix parsing of keepalived version (drop leading 'v')
Signed-off-by: Sage Weil <sage@newdream.net>
2021-03-24 16:46:02 -05:00
Sage Weil
3368844d02 cephadm: keepalived needs --cap-add=NET_RAW
This makes

Mar 24 12:00:32 dael conmon[3969650]: Wed Mar 24 16:00:32 2021: cant open raw socket. errno=1

go away and allows it to enter the MASTER state.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-03-24 16:45:58 -05:00
Sage Weil
6a176b02b1 cephadm: fix --cap-add=NET_ADMIN
Podman wants the = sign.  This aligns us with the other --cap-add user
(SYS_PTRACE), which uses =.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-03-24 16:45:55 -05:00
Sage Weil
40e29b9786 cephadm: fix quoting for keepalived env var
This was broken by 3ea514c552

Signed-off-by: Sage Weil <sage@newdream.net>
2021-03-24 16:45:51 -05:00
Adam C. Emerson
4bc2a30c24 rgw: Remove spurious num_rados_handles option
While any code that used the option is gone, it appears the option
itself was accidentally resurrected in various options.cc refactors
and transitions.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2021-03-24 15:49:32 -04:00
Samuel Just
47d8167cc4 crimson/os/seastore/segment_cleaner: introduce byte limit on rewrite_dirty
Signed-off-by: Samuel Just <sjust@redhat.com>
2021-03-24 18:53:13 +00:00
Kefu Chai
bd0a87fb5c doc/install/windows-troubleshooting: fix typo
and tweak the format a little bit

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-03-25 01:03:34 +08:00