Sage Weil
2d260443f0
qa/suites/upgrade/*/parallel: disable POOL_APP_NOT_ENABLED
...
There is some other random workload running (that creates pools)
while we upgrade and wait for healthy. Just disable the warning
for these tests.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
f4c2863999
qa/suites/upgrade/jewel-x/parallel: whitelist OSD_DOWN
...
We restart OSDs during the upgrade.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
d9a0145f8f
Merge pull request #16824 from liewegas/wip-more-scrub-time
...
qa/tasks/ceph: wait longer for scrub
2017-08-05 13:35:55 -05:00
Sage Weil
62482ce82c
qa/tasks/ceph: debug osd setup
...
I've seen a couple rbd runs that seem to skip the next block :/
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-05 13:53:26 -04:00
Patrick Donnelly
04d8ba4b04
Merge PR #16833 into master
...
* refs/remotes/upstream/pull/16833/head:
qa: whitelist expected MDS_CLIENT_OLDEST_TID warn
qa: ignore insufficient standby during failover
qa: fix read-only whitelist
mds: MDS_DAMAGED to MDS_DAMAGE
doc: remove duplicate CephFS health check doc
2017-08-04 20:26:09 -07:00
Patrick Donnelly
29e5f0a450
qa: whitelist expected MDS_CLIENT_OLDEST_TID warn
...
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:21:43 -07:00
Patrick Donnelly
06f53e4a82
qa: ignore insufficient standby during failover
...
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:14:59 -07:00
Patrick Donnelly
42cd1c7122
qa: fix read-only whitelist
...
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:14:48 -07:00
Patrick Donnelly
8d4f3e3045
mds: MDS_DAMAGED to MDS_DAMAGE
...
We had both and MDS_DAMAGE looks to be the right/intended one.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 13:01:33 -07:00
Sage Weil
62e51661e6
Merge branch 'wip-qa-rbd-health' of git://github.com/dillaman/ceph
...
# Conflicts:
# qa/tasks/ceph.py
2017-08-04 15:07:22 -04:00
Sage Weil
ffd171fd46
Merge pull request #16820 from liewegas/wip-more-whitelist
...
qa/suites/rados: a bit more whitelisting
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-08-04 13:44:08 -05:00
Sage Weil
82cf3046de
qa/suites/rados/basic/tasks/rados_python: POOL_APP_NOT_ENABLED
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 13:39:13 -04:00
Sage Weil
d09606619f
qa/tasks/ceph: wait longer for scrub
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 12:06:27 -04:00
Sage Weil
9c7a653fee
Merge pull request #16769 from liewegas/wip-20295-b
...
os/bluestore: allow multiple DeferredBatches in flight at once
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-08-04 11:04:38 -05:00
Sage Weil
6dfae21eda
Merge pull request #16805 from liewegas/wip-crush-classes
...
crush: misc changes/fixes for device classes
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-08-04 08:48:53 -05:00
Sage Weil
c8af364699
Merge pull request #16739 from liewegas/wip-multi-backfill-reject
...
qa/suites/rados/singleton-nomsgr/all/multi-backfill-reject: sleep longer
2017-08-04 08:41:06 -05:00
Sage Weil
f681d6e794
Merge pull request #16794 from liewegas/wip-rbd-app
...
qa/tasks/ceph: enable rbd on rbd pool
2017-08-04 08:40:45 -05:00
Sage Weil
7ed9d1e8aa
qa/tasks/ceph: enable rbd on rbd pool
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:40:26 -04:00
Sage Weil
1ae9ff173b
qa/suites/rados/upgrade: ignore FS_DEGRADED from mds restart
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:34:31 -04:00
Sage Weil
27a685f626
qa/suites/rados/monthrash: ignore MGR_DOWN
...
Heavily thrashing mons + mgr reconnect backoff may make us fail
to process the beacon.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:34:15 -04:00
Nathan Cutler
d919987caa
tests: rbd: reproducer for rbd-on-EC issue
...
This introduces a new "rbd/singleton-bluestore" suite because creating an rbd
on an EC-backed datapool will fail on filestore.
References: http://tracker.ceph.com/issues/20295
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-08-03 22:54:17 -04:00
Sage Weil
47480d8a06
qa/suites/rados/thrash-erasure-code-big: add k=4 m=2
...
Get better coverage for larger codes.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 22:50:16 -04:00
Patrick Donnelly
d89af4a3e8
Merge PR #16802 into master
...
* refs/remotes/upstream/pull/16802/head:
qa: update wait_for_health for new health json syntax
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 16:20:20 -07:00
John Spray
0613d411aa
qa: update wait_for_health for new health json syntax
...
Fixes: http://tracker.ceph.com/issues/20890
Signed-off-by: John Spray <john.spray@redhat.com>
2017-08-03 23:46:41 +01:00
Kefu Chai
007095b7ae
qa/workunits/mon/crush_ops.sh: remove existing dev class before setting it
...
we cannot overwrite existing dev class, and "osd_class_update_on_start"
is true by default (see 0c885d6
). so we should remove all device classes before
setting them.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-03 17:26:26 -04:00
xie xingguo
734b5f2c60
test/osd-fast-mark-down: enable 'osd-class-update-on-start' by default
...
116cf759c8
will now hide all shadow trees(roots), so this is not applicable anymore
(actually it is misleading).
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-08-03 17:26:26 -04:00
Patrick Donnelly
9d348ad8c9
qa: add health whitelist for all fs sub-suites
...
Fixes: http://tracker.ceph.com/issues/20892
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 14:01:28 -07:00
Patrick Donnelly
60fa9714d4
Merge PR #16768 into master
...
* refs/remotes/upstream/pull/16768/head:
qa: fix log whitelist string
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-03 13:55:42 -07:00
Patrick Donnelly
66756c4f65
Merge PR #16292 into master
...
* refs/remotes/upstream/pull/16292/head:
qa: use new hex rep of inode
qa: fix whitelist error message
mds: refine "Scrub error" cluster log message
mds: polish clog messages
doc: developer logging guidance
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 13:55:21 -07:00
Sage Weil
342607f4d5
Merge pull request #16749 from tchaikov/wip-restful-delete-key
...
mgr: handle "module.set_config(.., None)" correctly
Reviewed-by: John Spray <john.spray@redhat.com>
2017-08-03 15:53:27 -05:00
Yuri Weinstein
09fd18d031
Merge pull request #16760 from cbodley/wip-rgw-disable-lifecycle-s3tests
...
qa/rgw: disable lifecycle tests because of expiration failures
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-08-03 13:25:39 -07:00
Josh Durgin
b172642124
Merge pull request #16789 from liewegas/wip-ec-m-2
...
qa: avoid map-gap tests for k=2 m=1
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-08-03 11:20:13 -07:00
Sage Weil
b8627f897a
Merge pull request #16795 from liewegas/wip-mgr-whitelist
...
qa/suites/rados/mgr/tasks/failover: whitelist
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-08-03 13:14:20 -05:00
Sage Weil
ef21c9d7df
qa/suites/rados/thrash-erasure-code: do not test map gap with m=1
...
We test EC profiles with m=1 here, and mapgap can lead to incomplete pgs
because it takes an osd down and waits for healthy.
Fixes: http://tracker.ceph.com/issues/20844
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:13:02 -04:00
Sage Weil
f74d71f708
qa/suites/rados/thrash-erasure-coe-big/clsuter: 12 osds on 3 nodes not 4
...
smithi have 4 nvme partitions available, not 3.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:11:43 -04:00
Josh Durgin
ae48c75065
Merge pull request #16797 from jdurgin/wip-upgrade-jewel-x
...
qa: timeout when waiting for mgr to be available in healthy()
Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-03 11:11:41 -07:00
Sage Weil
63221e21f5
qa/suites/rados/thrash-erasure-code-big: add k=4 m=2
...
Get better coverage for larger codes.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:10:36 -04:00
Sage Weil
e994b03335
qa/suites/rados/monthrash/worklaods/rados_api_tests: whitelist SMALLER_PGP_NUM
...
The rados/test.sh fiddles with pg_num.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 13:31:39 -04:00
Sage Weil
7c350180b1
qa/suites/rados/mgr/tasks/failover: whitelist
...
remote/smithi025/log/ceph.log.gz:2017-08-03 07:02:15.049074 mon.b mon.0 172.21.15.25:6789/0 197 : cluster [INF] Manager daemon x is unresponsive, replacing it with standby daemon y
remote/smithi025/log/ceph.log.gz:2017-08-03 07:03:10.078032 mon.b mon.0 172.21.15.25:6789/0 226 : cluster [WRN] Manager daemon x is unresponsive. No standby daemons available.
x and y may be swapped, so whitelist the rest of the string.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 12:40:01 -04:00
Jason Dillaman
c2b451e8cb
qa: fix RBD-related POOL_APP_NOT_ENABLED health warnings
...
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-08-03 09:50:41 -04:00
Patrick Donnelly
8d33cbbf5c
qa: use new hex rep of inode
...
Resolves a failure from QA:
2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:======================================================================
2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:FAIL: test_oversize (tasks.cephfs.test_fragment.TestFragmentation)
2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-pdonnell-testing-20170802/qa/tasks/cephfs/test_fragment.py", line 71, in test_oversize
2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner: self.assertEqual(frags[0]['dirfrag'], "10000000000.0*")
2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:AssertionError: u'0x10000000000.0*' != '10000000000.0*'
2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 21:39:48 -07:00
Patrick Donnelly
d4ed085238
Merge PR #16713 into master
...
* refs/remotes/upstream/pull/16713/head:
qa: ignore failed MDS message during upgrade
2017-08-02 19:41:42 -07:00
Patrick Donnelly
6cad5be68c
Merge PR #16714 into master
...
* refs/remotes/upstream/pull/16714/head:
qa: test export_pin is correct in dumped subtree
mds: print export_pin for dumped subtree
Reviewed-by: Douglas Fuller <dfuller@redhat.com>
Reviewed-by: huanwen ren <ren.huanwen@zte.com.cn>
2017-08-02 18:41:12 -07:00
Patrick Donnelly
7f04d88af8
qa: fix whitelist error message
...
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 16:52:30 -07:00
Patrick Donnelly
8e975a6347
qa: fix log whitelist string
...
Fixes: http://tracker.ceph.com/issues/20889
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 16:32:19 -07:00
Sage Weil
5085dc1164
qa/suites/powercycle: whitelist health for thrashing
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-02 11:06:43 -04:00
Casey Bodley
0debf4dc6e
qa/rgw: disable lifecycle tests because of expiration failures
...
lifecycle expiration tests are too reliant on timing, and have been
failing consistently for a long time
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-02 11:06:35 -04:00
Kefu Chai
da1a60ced1
qa: refactor suites/rados/rest/mgr-restful
...
- use "ceph restful restart" to restart the restful API server instead
of restarting the ceph-mgr
- test "ceph restful delete-key"
- test "ceph restful list-keys"
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-02 18:20:56 +08:00
Josh Durgin
63693779fc
qa: timeout when waiting for mgr to be available
...
Otherwise during upgrades we wait forever.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2017-08-02 02:18:28 -04:00
Kefu Chai
1ff1f836da
Merge pull request #16722 from tchaikov/wip-qa-fixes
...
qa/suites: escape the parenthesis of the whitelist text
Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-02 13:00:01 +08:00