Commit Graph

296 Commits

Author SHA1 Message Date
Sage Weil
2a6f1633b1
Merge pull request #22158 from liewegas/wip-24222
mon/MgrMonitor: change 'unresponsive' message to info level

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2018-05-23 06:42:48 -05:00
Sage Weil
4ddbbe561c mon/MgrMonitor: change 'unresponsive' message to info level
We generate a MGR_DOWN health warning at the appropriate points; having
this at WRN level just triggers failed teuthology runs but doesn't much
value for the user.

Clear out teuthology whitelisting for this message.

Fixes: http://tracker.ceph.com/issues/24222
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-22 16:55:03 -05:00
David Zafman
ee4acb6e1f test: Whitelist corrections
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-05-22 14:26:46 -07:00
David Zafman
4fad800043 test: Add CACHE_POOL_NO_HIT_SET to whitelist for mon/pool_ops.sh
Ignore
  cluster [WRN] Health check failed: 1 cache pools are missing hit_sets (CACHE_POOL_NO_HIT_SET)

Signed-off-by: David Zafman <dzafman@redhat.com>
2018-05-21 09:47:59 -07:00
David Zafman
918921ab2f test: Need to escape parens in log-whitelist for grep
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-05-21 09:47:59 -07:00
Sage Weil
dd46ae1006 qa/suites/rados/thrash-old-clients: only centos and 16.04
We don't have old client builds for the newer distros.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-20 09:38:02 -05:00
Kefu Chai
c237d0befb qa/suites/rados/verify: remove random-distro$
the distro specified by random-distro$ will be overwrited by the one
specfied by valgrind.yaml. and teuthology-suite will give

KeyError: '16.04 not a centos version or codename'

when scheduling a suite involving the facets above. also, i think it's
of not much value to run valgrind/lockdep with different distros.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-05-17 19:11:14 +08:00
Sage Weil
664af17b30
Merge pull request #21932 from yuriw/wip-yuriw-add-dollar-rgw
tests/qa: Adding $ distro mix - rgw

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2018-05-15 16:15:05 -05:00
Josh Durgin
e8f5038f2b
Merge pull request #21895 from yuriw/wip-yuriw-add-dollar1
tests/qa: trying $ distro mix

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-05-14 16:18:45 -07:00
Yuri Weinstein
9f2c485942 tests/qa: adding rados/.. dirs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-05-11 14:03:15 -07:00
Casey Bodley
7da0fe2832
Merge pull request #21680 from cbodley/wip-rm-replica-log
rgw: remove all traces of cls replica_log

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2018-05-10 10:26:55 -04:00
Yuri Weinstein
c79a74a33c tests/qa: adding rados/.. dirs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-05-08 16:00:05 -07:00
Kefu Chai
770dbae2ca qa/suites/rados/thrash-old-clients: ms_type=simple
hammer does not support async messenger, so set ms_type to "simple" for
hammer client.

Fixes: http://tracker.ceph.com/issues/23922
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-30 21:40:53 +08:00
Josh Durgin
aef8d96def
Merge pull request #21710 from neha-ojha/wip-cosbench-workloads
qa: add cosbench workloads and override teuthology default settings

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 18:30:11 -07:00
Neha Ojha
f01eb7adc6 qa/suites/rados/perf: add cosbench workloads
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 16:02:40 -07:00
Jason Dillaman
0cb3e64221
Merge pull request #21704 from liewegas/wip-rados-old-clients
qa/suites/rados/thrash-old-clients: add rbd tests

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-04-27 17:35:58 -04:00
Yuri Weinstein
2c98007a13
Merge pull request #21630 from neha-ojha/wip-rgw-pools
qa: modify rgw default pool names

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 14:21:52 -07:00
Sage Weil
9fbdca34bf qa/suites/rados/thrash-old-clients: add cls_rbd, rbd api tests
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-27 14:04:25 -05:00
Neha Ojha
33db6bdfdd qa/suites/rados/perf: override default teuthology settings
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 12:04:00 -07:00
Sage Weil
7cbaa62498 qa/suites/rados/thrash-old-clients: add hammer
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-27 11:40:53 -05:00
Sage Weil
41fae6e9c4
Merge pull request #21679 from liewegas/wip-rados-old-clients
qa/suites/rados/thrash-old-clients: do some thrashing with jewel and luminous clients

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 11:30:39 -05:00
Sage Weil
2332e63e33
Merge pull request #21646 from liewegas/wip-rados-api-tests
qa/suites/rados/verify/tasks/rados_api_tests: whitelist OBJECT_MISPLACED
2018-04-26 12:39:23 -05:00
Casey Bodley
f9ee48caa2 rgw: remove all traces of cls replica_log
replica log was for the old radosgw sync agent, which was replaced with
multisite v2 in jewel. no sense in continuing to maintain and test it

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-04-26 11:40:11 -04:00
Kefu Chai
966c76330b qa: reduce "mon client hunt interval max multiple" to 2 for all clients
because with high failure rate, we need to connect to mon more
frequently if the connection fails.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-26 12:04:49 +08:00
Sage Weil
e331311b87 qa/suites/rados/verify/tasks/rados_api_tests: whitelist OBJECT_MISPLACED
The api tests do some splits, which can move data.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-25 10:33:52 -05:00
Neha Ojha
8158a2c430 qa: add rgw index pool to rgw_snaps.yaml
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-24 15:22:18 -07:00
Sage Weil
27e91a99f5
Merge pull request #21273 from jdurgin/wip-23195
osd/ECBackend: only check required shards when finishing recovery reads

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-04-24 17:20:25 -05:00
Sage Weil
a43c7b3a23 qa/suites/rados/thrash-old-clients: do some thrashing with jewel and luminous clients
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-24 16:14:36 -05:00
Neha Ojha
48f4b8172a qa: correct default rgw pool name
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-24 13:45:33 -07:00
Kefu Chai
cdcbd47e1e qa/suite: whitelist PG_AVAILABILITY in rados_api_tests.yaml
pg will be created when increasing pgp-num and pg-num. so at that
moment, PG_AVAILABILITY is reported. so whitelist it in all tests which
run rados/test.sh. that script exercises ceph_test_rados_api_list.

Fixes: http://tracker.ceph.com/issues/23763
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-24 10:16:12 +08:00
Ricardo Dias
7309be145b
qa/suites/rados/mgr/tasks/dashboard: add MDS_(ALL_DOWN|UP_LESS_THAN_MAX) to whitelist
Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-04-21 15:58:23 +01:00
Josh Durgin
234d652317 qa/suites/rados: add coverage for osd_recovery_max_single_start > 1
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2018-04-20 19:42:15 -04:00
Kefu Chai
d55724ec4a
Merge pull request #21331 from liewegas/wip-slow-ops
qa/suites/rados/monthrash: whitelist SLOW_OPS

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-04-13 17:49:30 +08:00
Sage Weil
2f00b8f990 Merge PR #21302 into wip-sage-testing-20180410.153118
* refs/pull/21302/head:
	qa/suites/rados/singleton/all/random-eio: whitelist eio error message

Reviewed-by: David Zafman <dzafman@redhat.com>
2018-04-10 10:31:26 -05:00
Sage Weil
c365614292 qa/suites/rados/monthrash: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-10 08:56:39 -05:00
Kefu Chai
acc08559ce qa/suites: whitelist SLOW_OPS
Fixes: http://tracker.ceph.com/issues/23495
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-10 19:25:47 +08:00
Sage Weil
35c14a0162 qa/suites/rados/singleton/all/random-eio: whitelist eio error message
"cluster [ERR] 2.1 shard 1: soid 2:8007ad8d:::benchmark_data_smithi115_12935_object2439:head candidate had a read error"

is normal when we're injecting EIO.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-09 07:40:23 -05:00
Brad Hubbard
eeeed6497b qa/suites/rados: Disable scrub backoff
A long run of lost coin flips can lead to a timeout in
test_large_omap_detection.py.

Fixes: http://tracker.ceph.com/issues/23578

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2018-04-09 17:21:01 +10:00
Gregory Farnum
6d2e4c9b7b
Merge pull request #19973 from liewegas/wip-peering-fast-dispatch
osd: fast dispatch of peering events and pg_map + osd sharded wq refactor

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2018-04-06 11:48:11 -07:00
Sage Weil
8f8f65594b
Merge pull request #21250 from liewegas/wip-recovery-slow-ops
qa/suites/rados/singleton/all/recovery_preemption: whitelist SLOW_OPS
2018-04-06 10:59:23 -05:00
Sage Weil
b235a3f62a qa/suites/rados/singleton/all/ec-lost-unfound: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-06 10:38:45 -05:00
Joao Eduardo Luis
940dd941ef
Merge pull request #19331 from jecluis/wip-mon-osdmap-prune
mon: osdmap prune

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-04-06 15:22:28 +01:00
Joao Eduardo Luis
3997eed4db qa: enable mon osdmap pruning on 'rados/' suites
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2018-04-06 04:18:23 +01:00
Joao Eduardo Luis
65e7e4a38f qa: enable mon osdmap pruning on rados/monthrash
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2018-04-06 04:18:23 +01:00
Sage Weil
3b3c32f643 qa/suites/rados/singleton/all/recovery_preemption: whitelist SLOW_OPS
Recovery and peering can be slow enough with all the logging enabled to
trigger a slow ops warning.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 16:24:31 -05:00
Sage Weil
26f00dd67c qa/suites: mon warn on pool no app = false for api tests
Among other things, the list.cc tests set pg_num which waits for cluster
healthy.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:58 -05:00
Sage Weil
c2cce3bc88 qa/suites/rados/basic/tasks/rados_api_tests: debug ms = 1
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:58 -05:00
Sage Weil
29a885c915 qa/suites/rados/singleton/all/recovery_preemption: make test more reliable
A 30 second run did only 7000 ops, which means ~50 log entires per pg...
not enough to trigger backfill.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Sage Weil
c3589df320 qa/suites/rados/singleton/all/mon-seesaw: whitelist PG_AVAILABILITY
The seesaw might delay pg creation by more than 60s.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Ricardo Dias
7b774f1bb0
qa/suites/rados/mgr/tasks/dashboard: add fail_on_skip = false
This commit prevents dashboard API tests to be tagged as failed when
some test is decorated with @skip.

Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-03-16 08:40:42 +00:00