Commit Graph

447 Commits

Author SHA1 Message Date
Kefu Chai
0a8c11d172 qa/suites/rados/rest/mgr-restful: whitelist OSDMAP_FLAGS
`rest/test-restful.sh` calls `test_mgr_rest_api.py`, which in turn
calls

```
    ('patch',  '/config/osd', {'pause': True}),
```

and rest module translates it to `ceph osd set key=pause`

Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-07-27 01:29:05 +08:00
Sage Weil
0c0093e450 Merge PR #29034 into master
* refs/pull/29034/head:
	doc/mgr/crash: document missing commands, options
	qa/suites/rados/singleton/all/test-crash: whitelist RECENT_CRASH
	qa/suites/rados/mgr/tasks/insights: whitelist RECENT_CRASH
	qa/tasks/mgr/test_insights: crash module now rejects bad crash reports
	mgr/telemetry: fix remote into crash do_ls()
	mgr/crash: don't make these methods static
	mgr/BaseMgrModule: handle unicode health detail strings
	mgr/crash: verify timestamp is valid
	qa/suites/mgr: whitelist RECENT_CRASH
	mgr/crash: remove unused var
	mgr/crash: remove unused import 'six'
	qa/workunits/rados/test_crash: health check
	mgr/crash: improve validation on post
	mgr/crash: automatically prune old crashes after a year
	mgr/crash: raise RECENT_CRASH warning for recent (new) crashes
	mgr/crash: add 'crash ls-new'
	mgr/crash: add option and serve infra
	mgr/crash: keep copy of crashes in memory
	mgr/pg_autoscaler: adjust style to match built-in tables
	mgr/crash: make 'crash ls' a nice table with a NEW column
	mgr/crash: nicely format 'crash info' output
	mgr/crash: add 'crash archive <id>', 'crash archive-all' commands

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-07-24 17:20:21 -05:00
Kefu Chai
10af76ab1b
Merge pull request #28778 from myoungwon/wip-chunk-flush-op
src/osd: add tier-flush op

Reviewed-by: Sage Weil <sage@redhat.com>
2019-07-24 09:56:47 +08:00
Kefu Chai
361b7febcc
Merge pull request #28983 from myoungwon/wip-dedup-io-test
src/test, qa/suites/rados/thrash: add dedup test

Reviewed-by: Sage Weil <sage@redhat.com>
2019-07-24 09:54:54 +08:00
Sage Weil
31b5c43806 Merge PR #29114 into master
* refs/pull/29114/head:
	mgr/dashboard: qa: whitelist client eviction warning

Reviewed-by: Laura Paduano <lpaduano@suse.com>
2019-07-23 09:09:15 -05:00
Sage Weil
468cbcd3be Merge PR #29169 into master
* refs/pull/29169/head:
	qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2019-07-23 09:07:31 -05:00
Sage Weil
d325faf6a7 Merge PR #29121 into master
* refs/pull/29121/head:
	qa/suites/rados/multimon: whitelist SLOW_OPS while thrashing mons

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: João Eduardo Luís <joao@suse.de>
2019-07-22 17:16:35 -05:00
Sage Weil
9257175f08 qa/suites/rados/singleton/all/test-crash: whitelist RECENT_CRASH
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-22 17:14:01 -05:00
Sage Weil
209ce4ec68 qa/suites/rados/mgr/tasks/insights: whitelist RECENT_CRASH
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-22 13:42:06 -05:00
Sage Weil
0060f1c5b8 qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted
The mgr's libcephfs client gets evicted after the mgr fails over.
Whitelist the message.

Fixes: http://tracker.ceph.com/issues/40867
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-22 10:34:24 -05:00
Josh Durgin
234a400376
Merge pull request #29120 from neha-ojha/wip-add-write-omap
qa: add basic omap testing capability

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-07-19 14:46:22 -07:00
Neha Ojha
3a2abb94eb qa/suites/rados/perf: add op_size to radosbench_omap_write.yaml
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-07-19 13:16:50 -07:00
Sage Weil
e2ff58ba16 qa/suites/mgr: whitelist RECENT_CRASH
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-19 14:45:47 -05:00
Neha Ojha
9665aeb1e1 qa/suites/rados/perf: add objectsize to radosbench_omap_write.yaml
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-07-18 16:56:07 -07:00
Sage Weil
07badf051d qa/suites/rados/multimon: whitelist SLOW_OPS while thrashing mons
The mons may have slow ops.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-18 16:44:23 -05:00
Neha Ojha
83c3cb9c54 qa/suites/rados/perf: add radosbench_omap_write.yaml
This excercises the --write-omap option in the radosbench task.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-07-18 13:29:11 -07:00
Ricardo Dias
658bd35b96
mgr/dashboard: qa: whitelist client eviction warning
This warning is caused by the recent changes to the volumes
module that cache the CephFS handles.
Commit 5c41e949af9acabd612b0644de0603e374b4b42a

Signed-off-by: Ricardo Dias <rdias@suse.com>
2019-07-18 15:44:04 +01:00
myoungwon oh
f1acdf3f2e src/test, qa/suite: add tier_flush test
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2019-07-12 22:23:22 +09:00
Sage Weil
200d4d9c16 Merge PR #28943 into master
* refs/pull/28943/head:
	qa: stop testing on 16.04 xenial

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-07-11 09:32:59 -05:00
myoungwon oh
c1b69f48d2 src/test, qa/suites/rados/thrash: add dedup test
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2019-07-11 20:28:55 +09:00
Brad Hubbard
aeded77aa3
Merge pull request #28070 from badone/wip-lazy-omap-test
Wip lazy omap test

Reviewed-by: Greg Farnum gfarnum@redhat.com
2019-07-11 09:52:12 +10:00
Sage Weil
141ad20e73 qa: stop testing on 16.04 xenial
Note that the hammer old-client test will now *only* run on centos.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-10 10:04:30 -05:00
Brad Hubbard
88e9ca58a0 tests: Add test for lazy omap stat collection
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2019-07-10 07:53:37 +10:00
Sage Weil
0ddc2dea4c Merge PR #28658 into master
* refs/pull/28658/head:
	qa/suites/rados/thrash: force normal pg log length with cache tiering

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-06-24 13:14:42 -05:00
Kefu Chai
90022b35ab
Merge pull request #17619 from liuchang0812/wip-ec-below-min-size
osd: allow EC PGs to do recovery below min_size

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-06-22 12:58:55 +08:00
Sage Weil
afd0b508c2 qa/suites/rados/thrash: force normal pg log length with cache tiering
When we are doing cache tiering, we are more sensitive to short PG logs
because the dup op entries are not perfectly promoted from the base to
the cache.

See:
 http://tracker.ceph.com/issues/38358
 http://tracker.ceph.com/issues/24320

This works around the problem by not testing short pg logs in combination
with cache tiering.  This works because the short_pg_log.yaml fragment
sets the short log in the [global] section but the cache workloads overload
it (back to a large/default value) in the [osd] section.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-19 11:15:25 -05:00
Sage Weil
fd00136eb3 qa/suites/rados/multimon: skew clocks 2s (< paxos lease)
If the leader is the one with the accurate clock, it can still
form quorum, but if the leader has the skewed clock, all other mons appear
skewed from its perspective and no quorum is formed.  This leads to
intermittent failures, depending on the non-deterministic teuthology
deployment order and how the mon IPs sort.

Fix by reducing the skew.  This is enough skew to trigger a warning, but
not enough that it will break quorum.  This ensures that the parts of the
teuthology test that issue random mon commands won't fail (e.g., 'ceph osd
dump').

Fixes: http://tracker.ceph.com/issues/40112
Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-03 10:49:02 -05:00
Sage Weil
8d137b9345 qa/suites/rados/multimon: create_rbd_pool: false
Signed-off-by: Sage Weil <sage@redhat.com>
2019-05-30 15:43:48 -05:00
Sage Weil
1991faaafe qa/suites/rados/multimon: no osds when skewing clock
Sometimes the clock skew prevents the mon quorum from making progress and
processing the osd boot messages.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-05-30 13:05:54 -05:00
Chang Liu
b02e2f6cf2 test: update test_pool_min_size test in thrasher
Signed-off-by: Chang Liu <liuchang0812@gmail.com>
2019-05-10 10:45:25 +08:00
Greg Farnum
7950ce2488 qa: don't create rbd pool for min-size thrashing tests
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2019-05-10 10:45:25 +08:00
Greg Farnum
b701395065 qa: write a thrasher for putting PGs below min_size and watching them recover
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2019-05-10 10:45:25 +08:00
Nathan Cutler
f9f824448a qa: add version number sanity singleton to rados suite
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2019-05-08 11:31:34 +02:00
David Zafman
6c98782b76 test: Remove redundant mon-seesaw.yaml file
mon.yaml runs all scripts in the qa/standalone/mon dir

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-04-24 17:29:48 -07:00
Sage Weil
0adc99511f qa/suites: add upgrade/nautilus-x-singleton and symlink from rados/upgrade
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 16:35:25 -05:00
Sage Weil
7fbc0f6e2c qa/suites: move rados/upgrade/mimic-x-singleton to upgrade/ and symlink
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 16:33:38 -05:00
Sage Weil
e58bf34386 qa/suites/rados/thrash-old-clients: fix up v1/v2 options
- don't specify ceph.py options in the ceph.py
- instead, specify them in the per-version facet

Note that we don't currently have a way to do v2 only for the mon IPs, so
in the v2only cases, we are still binding the mons to v1.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-28 14:06:58 -05:00
Kefu Chai
1231a306ec qa/suites: do not test luminous-x upgrade path
in this change,

* suites/rados/upgrade: luminous-x-singleton => mimic-x-singleton
* suites/upgrade: luminous-x => nautilus-x

we support upgrade from n to n+2 release. otherwise monitor refuses to
do so:

mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous)
which is >2 releases older than me 15 (octopus), stopping.

Fixes: https://tracker.ceph.com/issues/38845
Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-03-22 22:39:42 +08:00
Sage Weil
cc9a9142fd msg: remove XioMessenger
There is a lot of good stuff going on here, but nobody is investing in xio
and it is not expected to be the path forward for RDMA.  If that ever
changes, we can resurrect the code.  Until then, let's clean up the tree
and reduce friction for changes going forward.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-21 00:57:33 -05:00
Sage Weil
3bab269445 qa/suites/rados/thrash-old-clients: add nautilus
v2+v1, v2, and v1 variants.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-20 07:57:41 -05:00
Sage Weil
f3b4d79214 qa/suites/rados/thrash-old-clients: add mimic v1 variant
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-20 07:57:17 -05:00
Sage Weil
73043655e5 qa/suites/rados/thrash-old-clients: add mimic
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-20 07:55:04 -05:00
Sage Weil
07f0a26770 qa/suites/rados/thrash-old-clients: collapse msgr and client choice
Hammer doesn't have async, and nautilus doesn't have simple.  Expand the
async default (v2v1) and v1-only choice and the client choice.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-20 07:54:03 -05:00
Sage Weil
28b4392a71 qa: remove simplemessenger tests
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-20 06:10:25 -05:00
Sage Weil
e79dc454db qa/suites: disable valgrind leak checks on ceph-mgr
We've disabled the "clean" shutdown in ceph-mgr due to
https://tracker.ceph.com/issues/38621

Until then, no valgrind leak checks!

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-07 13:03:28 -06:00
Lenz Grimmer
72ef9b58ec
Merge pull request #26510 from callithea/wip-add-ganesha-qa-tests
mgr/dashboard: Added NFS Ganesha suite to QA tests

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2019-02-22 10:28:41 +01:00
Sage Weil
d198831c44 Merge PR #26262 into master
* refs/pull/26262/head:
	mgr/orch/ssh: add qa test
	qa: install/exclude ceph-mgr-ssh package
	debian: package ssh orchestrator
	spec.ceph: package ssh orchestrator
	ssh/orch: add ssh orchestrator
2019-02-20 13:05:11 -06:00
Noah Watkins
06422208ec mgr/orch/ssh: add qa test
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-20 09:45:48 -08:00
Noah Watkins
a2e8960079 qa: install/exclude ceph-mgr-ssh package
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-20 09:45:48 -08:00
Sage Weil
618c0ea5da Merge PR #26417 into master
* refs/pull/26417/head:
	qa, doc: update ec fast read setting
	qa/suites: remove mon kv backend options

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-02-20 07:03:56 -06:00