Commit Graph

617 Commits

Author SHA1 Message Date
Xie Xingguo
b7a959c1cf
Merge pull request #33910 from xiexingguo/wip-44532-2
osd/PeeringState: do not trim pg log past last_update_ondisk

Reviewed-by: Yan Jun <yan.jun8@zte.com.cn>
Reviewed-by: Samuel Just <sjust@redhat.com>
2020-03-26 08:10:22 +08:00
Sage Weil
de20c7bc61 Merge PR #34105 into master
* refs/pull/34105/head:
	Merge PR #34042 into octopus
	Merge PR #33959 into octopus
	Merge PR #34067 into octopus
	mgr/DaemonServer: add explicit check that acting matches for merge
	Merge pull request #34040 from dillaman/wip-44396-partial-fix
	Merge PR #34098 into octopus
	mgr/rook: list rgw services
	mgr/rook: tolerate timestamps that are None
	mgr/orch: add 'subcluster' property to RGWSpec
	mgr/rook: do not create radosgw pools
	mgr/rook: refactor apply/add for rgw
	Merge PR #34082 into octopus
	Merge PR #34068 into octopus
	cephadm: relabel /etc/ganesha mount
	Merge PR #34046 into octopus
	Merge PR #34092 into octopus
	Merge pull request #33719 from ukernel/wip-44416
	rbd-mirror: leader watcher should not cancel get locker if locker is invalid
	rbd-mirror: snapshot sync request needs to check for interruption
	librbd: request exclusive lock when moving to trash
	rbd-mirror: basic integration with sync throttling
	rbd-mirror: don't prematurely finish snapshot replay loop
	rbd-mirror: pass InstanceWatcher to snapshot Replayer
	doc/releases/octopus.rst: add note about ec recovery below min_size
	mgr/cephadm: configure rgw_frontends for rgw service
	cephadm: switch grafana image to the ceph repo
	Merge PR #34034 into octopus
	qa/suites/rados/cephadm/upgrade: update starting version
	Merge PR #33540 into octopus
	Merge PR #34023 into octopus
	Merge PR #34044 into octopus
	Merge PR #34030 into octopus
	doc/orchestrator: update rgw creation
	mgr/cephadm: clean up client.crash.* container_image settings after upgrade
	cephadm: make add-repo --release and --version independent
	cephadm: env over last used
	mgr/orch: accept port and ssl flags to 'apply rgw'
	mgr/orch: 'ceph upgrade ...' -> 'ceph orch upgrade ...'
	cephadm: fall back to default for infer_image
	cephadm: remove outdated check
	cephadm: consolidate default image logic
	remove ceph_test_rados_watch_notify
	python-common/ceph/deployment/service_spec: add ssl to RGWSpec
	cephadm: only infer image for shell, run, inspect-image, pull, ceph-volume
	mgr/test_orchestrator: fix service filtering when using dummy data
	mgr/dashboard: fix adding/removing host errors
	mgr/rook: fix 'orch ps' for osds
	qa: fix all the fsx.sh-invoking yaml files to install dependencies
	mds: pass proper MutationImpl::LockOp to Locker::wrlock_start()

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
2020-03-23 08:24:06 -05:00
Lenz Grimmer
b819847556
Merge pull request #34063 from s0nea/wip-dashboard-crush-rule-suite
mgr/dashboard: add crush rule test suite

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
2020-03-23 13:11:50 +01:00
Sage Weil
ec1b0e33d8 qa/suites/rados/cephadm/upgrade: update starting version
Need upgrade command to match!

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-20 08:11:58 -05:00
Tatjana Dehler
04ee0178a5 mgr/dashboard: add crush rule test suite
https://tracker.ceph.com/issues/44679
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-03-19 16:33:22 +01:00
Sage Weil
41dcb3c814 mgr/orch: 'ceph upgrade ...' -> 'ceph orch upgrade ...'
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-18 20:56:44 -05:00
Samuel Just
ceb8e457e7 suites/rados/thrash: add rados-high-concurrency
Signed-off-by: Samuel Just <sjust@redhat.com>
2020-03-19 08:15:02 +08:00
Sage Weil
7c19c1534b qa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 19:32:42 -05:00
Sage Weil
baeb051910 qa/suites/rados/verify/validater/valgrind: less bluestore logging
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 19:32:42 -05:00
Sage Weil
4fda9d50f0 qa/suites/rados/verify/validater: increase heartbeat grace
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 19:32:42 -05:00
Sage Weil
0bd14ab080 Revert "qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60"
This reverts commit 4f742f200df6c91db87bfee1109c37fad3c0548b.

This was in the wrong file.. see valgrind.yaml

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 19:32:42 -05:00
Sage Weil
12105ed9d7 Revert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"
This reverts commit 65e81e6eb4f136bf21b67e5de10ab49f028f9e95.

This slows things down too much with valgrind.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 19:32:42 -05:00
Sage Weil
40a7bcea70 qa/suites/rados/verify: set ALLOW_TIMEOUTS for workunits
When running under valgrind (and thrashing) things can be slow.  Tell
tests in case they need to tolerate timeouts.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-17 18:24:12 -05:00
Sage Weil
d7de2abc2f cephadm: add allow_ptrace option to enable SYS_PTRACE
In any environments it is helpful to have SYS_PTRACE so that you can
gdb attach or strace a daemon.

Leave this off by default so that the container is more secure.

Enable this in teuthology and vstart.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-16 21:04:22 -05:00
Sage Weil
7957655227 cephadm: bootstrap: deploy monitoring stack by default
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-16 12:39:44 -05:00
Sage Weil
13919b5fed qa/suites/rados/cephadm/upgrade: start from compatible build
The osd create command is changing, so upgrade from something that the
latest cephadm.py task can use.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-14 10:18:25 -05:00
xie xingguo
a9412ba25c qa/osd-recovery: pass osd_pg_log_trim_min = 0 to exercise short pg logs
we have osd_min_pg_log_entries to 2 (good) but not osd_pg_log_trim_min
which defaults to 100. Thus, even on those tests we're only rarely vulnerable.
Reset osd_min_pg_log_entries to 0 to make sure we really
would keep a minimal pg log in hand.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2020-03-13 08:49:58 +08:00
Sage Weil
281b106dc6 Merge PR #33871 into octopus
* refs/pull/33871/head:
	qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
2020-03-12 11:17:53 -05:00
Sage Weil
4f742f200d qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60
The rados api tests are failing WatchNotify because the OSDs are so
heavily lagged.. in large part due to the high debug level of debug_ms=20
and debug_osd=25.  Reduce that.

Also increase the heartbeat grace so slow valgrind-y osds don't get marked
down.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-11 06:57:52 -05:00
Sage Weil
90b52cba91 qa/suites/rados/cephadm/upgrade: upgrade start point that supports the no-spec option
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-10 15:38:52 -05:00
Sage Weil
090288fa3e Merge PR #33801 into octopus
* refs/pull/33801/head:
	qa/suites/rados/ceph: drop opensuse for now

Reviewed-by: Nathan Cutler <ncutler@suse.com>
2020-03-09 16:25:57 -05:00
Sage Weil
b6084c333d qa/suites/rados/cephadm/upgrade: new start point
The starting cephadm needs to look for default ceph.conf etc in /etc/ceph
for cephadm.py to be happy.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-07 15:19:07 -06:00
Sage Weil
af30e245dc qa/suites/rados/ceph: drop opensuse for now
Until https://tracker.ceph.com/issues/44501 is resolved and the builders
are able to keep up.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-07 08:23:20 -06:00
Sage Weil
bfb03adbe6 qa/suites/rados/cephadm/upgrade: adjust starting version
This version understands how to apply a mgr spec like '2;host=x' with a
semicolon.  This particular test build does.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-07 06:42:07 -06:00
Sage Weil
4e7ab6f2e6 qa/suites/rados/cephadm: deploy all monitoring components
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-06 15:51:45 -06:00
Kefu Chai
63f5528930
Merge pull request #33584 from liewegas/fix-39525
compressor/lz4: work around bug in liblz4 versions <1.8.2

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-03-04 12:44:38 +08:00
Sage Weil
dc4ff2b4e2 Merge PR #33648 into master
* refs/pull/33648/head:
	qa/suites/cephadm: move orchestrator_cli test into rados/cephadm

Reviewed-by: Michael Fritch <mfritch@suse.com>
2020-03-02 14:14:40 -06:00
Sage Weil
8c8fb59dcf qa/objectstore: test all bluestore compression algorithms
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-02 13:01:22 -06:00
Sage Weil
ddb8ab3dd3 qa/suites/cephadm: move orchestrator_cli test into rados/cephadm
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-02 10:52:24 -06:00
Kefu Chai
59546f6a49
Merge pull request #33645 from liewegas/fix-mon-down-cephtool-test
qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN

Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-03-02 19:11:42 +08:00
Sage Weil
4c5827241a qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN
cephtool/test.sh now includes a test that disallows mon from the quorum
for a short period.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-01 08:03:21 -06:00
Sage Weil
1400b35858 qa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS
The mon can see slow ops when thrashing.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-01 07:58:11 -06:00
Nathan Cutler
47926809f3
Merge pull request #33535 from smithfarm/wip-rados-cephadm-smoke-152
qa: run cephadm/smoke on opensuse 15.2 instead of 15.1

Reviewed-by: Thomas Bechtold <tbechtold@suse.com>
2020-02-25 18:36:11 +01:00
Nathan Cutler
465eb03930 qa: run cephadm/smoke on opensuse 15.2 instead of 15.1
Companion PR to https://github.com/ceph/ceph-build/pull/1509

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2020-02-25 14:28:35 +01:00
Sage Weil
a4a3a3c0a0 qa/suites/rados/singleton/all/thrash-eio: whitelist 'slow request'
Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-24 08:47:43 -06:00
Sridhar Seshasayee
e527067666 qa: Whitelist 'slow request' within a bunch of tests
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2020-02-24 19:59:56 +05:30
Sage Weil
3e85a09ad2 Merge PR #33328 into master
* refs/pull/33328/head:
	osd/OSD: Log slow ops/types to cluster logs

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-02-23 15:12:48 -06:00
Sage Weil
9bcdc110de Merge PR #33486 into master
* refs/pull/33486/head:
	qa/suites/rados/cephadm/smoke: run a few orch commands
	mgr/orch: fix 'host ls'

Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-02-22 12:56:26 -06:00
Sage Weil
4324bce802 Merge PR #33475 into master
* refs/pull/33475/head:
	qa/suites/rados/cephadm/upgrade: change start version

Reviewed-by: Michael Fritch <mfritch@suse.com>
2020-02-22 12:55:27 -06:00
Sage Weil
5afec0fbfb Merge PR #33091 into master
* refs/pull/33091/head:
	qa/suites/rados: disable device scraping
	qa/standalone/ceph-helpers: disable device monitoring
	qa/tasks/ceph.py: add pre-mgr-commands option for ceph task
	mgr/devicehealth: set default monitoring to 'on'

Reviewed-by: Sage Weil <sage@redhat.com>
2020-02-22 12:05:55 -06:00
Sage Weil
12f7b82544 qa/suites/rados/cephadm/smoke: run a few orch commands
Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-22 09:23:12 -06:00
Sage Weil
1a7c4fcb79 qa/suites/rados/cephadm/upgrade: change start version
Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-22 08:13:57 -06:00
Sage Weil
12fe1a1c17 qa/workunits/cephadm: separate out test_adoption.sh; fix
- fix adoption test to handle different legacy fsids
- separate it out into a different script

Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-21 07:17:47 -06:00
Nathan Cutler
9c238af442 qa/cephadm/smoke: run on opensuse_15.1
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2020-02-20 11:50:29 +01:00
Sage Weil
f8d0e3d73a qa/suites/rados: disable device scraping
We need no pools to avoid breaking some tests.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-19 15:31:26 -06:00
Sridhar Seshasayee
d20f57000b osd/OSD: Log slow ops/types to cluster logs
In addition to logging slow ops in mon and osd specific log files,
re-introduce logging the same information along with slow op type
details to cluster logs as well. The objective is to make debugging
slow ops easier.

Modify the log whitelisting string to "slow request" within qa suites in
order to make the search for the new warning log message within the
cluster log successful. This should not cause any issue as it's a
substring of the earlier string.

Fixes: https://tracker.ceph.com/issues/43975
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2020-02-19 14:31:48 +05:30
Kefu Chai
73b03097f2
Merge pull request #33396 from liewegas/fix-cephadm-upgrade-test
qa/suites/rados/cephadm/upgrade: fix initial version

Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-02-19 11:08:16 +08:00
Sage Weil
6fa842df98 qa/suites/ardos/cephadm/upgrade: fix initial version
Switch initial version to one with the renamed orchestrator module.

Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-18 13:27:26 -06:00
Sage Weil
ec9fb1aab5 qa/suites/rados/cephadm/smoke: remove rgw
Fixes: https://tracker.ceph.com/issues/44168
Signed-off-by: Sage Weil <sage@redhat.com>
2020-02-17 11:25:47 -06:00
Sage Weil
b650e75838 Merge PR #33289 into master
* refs/pull/33289/head:
	qa/tasks/cephadm: deploy rgw daemons too

Reviewed-by: Michael Fritch <mfritch@suse.com>
2020-02-15 08:42:10 -06:00