Kefu Chai
a406553a79
qa/tasks/ceph_manager: add inject_args() method
...
* move Thrasher._set_config() to CephManager, make it a public
method, and rename it to inject_args()
* use this method instead of using 'tell ... injectargs ...' directly
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 18:44:16 +08:00
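A minimal sketch of the kind of command such a helper might assemble, assuming the 'tell ... injectargs ...' form mentioned above. The function name build_inject_args_cmd, its parameters, and the lowercase formatting of booleans are illustrative assumptions, not the code in ceph_manager.

```python
def build_inject_args_cmd(service_type, service_id, name, value):
    """Return an argv list for a 'tell ... injectargs ...' call.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    whom = "{0}.{1}".format(service_type, service_id)
    # Assumption: booleans are passed as lowercase true/false strings.
    if isinstance(value, bool):
        value = "true" if value else "false"
    opt_arg = "--{0}={1}".format(name, value)
    return ["ceph", "tell", whom, "injectargs", opt_arg]
```

Returning the argv instead of executing it keeps the sketch independent of any particular remote-execution wrapper.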
Kefu Chai
749bbda075
qa/tasks: prolong revive_osd() timeout to 6 min
...
see also #17902
Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-20 13:40:59 +08:00
Kefu Chai
7f549af459
qa: do not wait for down/out osd for pg convergence
...
that osd is not involved in the PG state changes.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-08 14:50:10 +08:00
Sage Weil
d21809b14e
qa/tasks/thrashosds: set min_in default to 4
...
We have EC tests with k=2,m=2, so we need a min of 4.
Fixes: http://tracker.ceph.com/issues/21997
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-01 08:32:48 -05:00
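The arithmetic behind the default can be sketched as follows: an erasure-coded PG with k data chunks and m coding chunks needs k + m OSDs to hold all of its shards. The helper name min_in_for_profiles is hypothetical and only illustrates the calculation.

```python
def min_in_for_profiles(profiles):
    """Smallest number of 'in' OSDs that still fits every EC profile.

    profiles is an iterable of (k, m) pairs; hypothetical helper.
    """
    needed = 0
    for k, m in profiles:
        if k < 1 or m < 0:
            raise ValueError("invalid EC profile: k={0}, m={1}".format(k, m))
        # Each PG places k data shards plus m coding shards on distinct OSDs.
        needed = max(needed, k + m)
    return needed
```

With a single k=2,m=2 profile this yields 4, matching the new default.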
Patrick Donnelly
c58161f25b
Merge PR #17266 into master
...
* refs/pull/17266/head:
qa: update test_ceph_argparse to test fs cmds
qa: use fs rm_data_pool
qa: fix mdsmap lookup
qa: remove usage of mds dump
PendingReleaseNotes: add obsoleted mds commands
qa: remove use of obsolete mds commands
ceph_volume_client: remove use of obsolete mds cmd
doc: update on obsolete mds commands
cephfs: obsolete deprecated mds commands
Reviewed-by: Douglas Fuller <dfuller@redhat.com>
2017-10-24 16:37:14 -07:00
Patrick Donnelly
3a5f090a1e
qa: remove usage of mds dump
...
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-24 11:32:43 -07:00
Kefu Chai
4c7df944c7
osd: add max-pg-per-osd limit
...
osd will refuse to create new pgs until its pg count is lower
than the max-pg-per-osd upper bound setting.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 23:08:40 +08:00
Kefu Chai
e21114274f
qa: s/backfill/backfilling/
...
it's renamed "backfilling" in 4015343f.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-11 11:52:43 +08:00
Sage Weil
b6a5c09dba
ceph-objectstore-tool: remove rm-past-intervals op
...
The OSD doesn't rebuild this on demand anymore.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-06 13:08:18 -05:00
Sage Weil
61799c4c8c
Merge pull request #17810 from hjwsm1989/wip-21294
...
qa/ceph_manager: check pg state again before timedout
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-25 12:33:34 -05:00
Kefu Chai
42be200c56
qa/tasks: prolong revive_osd() timeout to 6 min
...
bluestore_fsck_on_mount and bluestore_fsck_on_mount_deep are enabled by
default, and bluestore is used as the default store backend. it takes
longer to perform the deep fsck with verbose logging, so prolong
revive_osd()'s timeout from 150 sec to 360 sec.
Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-09-22 10:58:41 +08:00
huangjun
fa40add7f0
qa/ceph_manager: check pg state again before timedout
...
All PGs may already be active+clean when no recovery is going on,
so check the state once more before timing out.
Fixes: http://tracker.ceph.com/issues/21294
Signed-off-by: huangjun <huangjun@xsky.com>
2017-09-20 00:04:04 +08:00
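The idea of checking once more before giving up can be sketched as a generic polling loop. The name wait_until and its parameters are assumptions for illustration; the injectable clock and sleep exist only to make the sketch testable and are not part of ceph_manager.

```python
import time


def wait_until(check, timeout, interval=1.0,
               clock=time.monotonic, sleep=time.sleep):
    """Poll check() until it returns True or the timeout expires.

    Hypothetical helper: before raising, look at the state one last time,
    since it may have converged during the final sleep.
    """
    deadline = clock() + timeout
    while clock() < deadline:
        if check():
            return
        sleep(interval)
    # Final re-check before declaring a timeout.
    if check():
        return
    raise TimeoutError("condition not met within {0}s".format(timeout))
```

Without the final re-check, a condition that becomes true during the last sleep would be reported as a timeout.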
yonghengdexin735
fc5ac9ea69
common:fix error word
...
Signed-off-by: yonghengdexin735 <zhang.zezhu@zte.com.cn>
2017-09-13 10:22:08 +08:00
David Zafman
3bb20f6d75
ceph-objectstore-tool: Make pg removal require --force
...
Add new export-remove to combine the 2 operations
Fixes: http://tracker.ceph.com/issues/21272
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-09-08 17:56:05 -07:00
Sage Weil
21027233b2
qa/tasks/ceph_manager: revive osds before doing final rerr reset
...
We assume below that rerrosd is up, but it may not be when we exit the
loop.
Fixes: http://tracker.ceph.com/issues/21206
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-31 14:55:46 -04:00
Sage Weil
a40d94b163
qa/tasks/ceph: wait for pg stats to flush in healthy check
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:27 -04:00
Sage Weil
80978dea8a
qa/tasks/ceph_manager: wait_for_all_up -> wait_for_all_osds_up
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
Sage Weil
7648894e55
qa/tasks/ceph_manager: expose flush_all_pg_stats
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
Sage Weil
02c2e853d3
Merge pull request #16509 from liewegas/wip-rgw-wait
...
qa/suites/rados/basic/tasks/rgw_snaps: wait for pools to be created
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-24 11:55:54 -05:00
Sage Weil
29549e6834
Merge pull request #13723 from ovh/bp-forced-recovery
...
osd/PG: make prioritized recovery possible
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-24 09:01:03 -05:00
Sage Weil
ecd1193ab9
qa/suites/rados/basic/tasks/rgw_snaps: wait for pools to be created
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-22 18:54:46 -04:00
Sage Weil
583a38bca2
qa/tasks/ceph_manager: wait for osd to start after objectstore-tool sequence
...
Fixes: http://tracker.ceph.com/issues/20705
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 11:41:36 -04:00
Piotr Dałek
b0134cc7a8
qa: add force/cancel recovery/backfill to QA testing
...
This randomly issues pg force-recovery/force-backfill and
pg cancel-force-recovery/cancel-force-backfill during QA
testing. Disabled for upgrades from hammer, jewel and kraken.
Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
2017-07-20 09:35:55 +02:00
Jason Dillaman
836ab7ad95
test: skip pool application metadata tests if OSDs not at min luminous
...
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-19 13:13:01 -04:00
Sage Weil
56e2965502
qa/tasks/ceph_manager: wait longer for pg stats to flush
...
An ill-timed mgr restart could blow the current 15s wait.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-13 12:13:45 -04:00
David Zafman
33edfe3a0f
test: Add two new singleton test yamls random-eio and thrash-eio
...
New option "random_eio" to Thrasher, sets 1 osd random read percentage
New option "objectsize" to radosbench task (-o bench option)
New option "type" to radosbench to specify write, seq or rand
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-06-23 08:09:15 -07:00
Sage Weil
6a00ba0e26
qa/tasks/ceph_manager: get osds all in after thrashing
...
Otherwise we might end up with some PGs remapped, which means they won't
get scrubbed.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-20 12:07:25 -04:00
Sage Weil
f870cc5f28
qa/tasks/thrashosds: wait before wait_for_recovery
...
Make sure OSDs are up *and* they have flushed their PG stats before
waiting for recovery to ensure that we do not see a stale 'clean' state.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 12:14:24 -04:00
Kefu Chai
e8b23d6852
qa/tasks: add a blacklist for flush_pg_stats()
...
so we don't wait for marked out osds.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-06-02 13:06:50 -04:00
Sage Weil
ab1b78ae00
qa/tasks: use new reliable flush_pg_stats helper
...
The helper gets a sequence number from the osd (or osds), and then
polls the mon until that seq is reflected there.
This is overkill in some cases, since many tests only require that the
stats be reflected on the mgr (not the mon), but waiting for it to also
reach the mon is sufficient!
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-02 13:02:45 -04:00
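A minimal sketch of the flush-then-poll pattern described above, assuming two caller-supplied callables: flush_on_osd(osd) returns the sequence number reported by that OSD, and last_seq_on_mon(osd) returns the latest sequence the mon has seen for it. Both callables, the no_wait skip list, and the polling limits are hypothetical stand-ins rather than the helper's real signature.

```python
import time


def flush_pg_stats(osds, flush_on_osd, last_seq_on_mon, no_wait=(),
                   interval=1.0, max_polls=60, sleep=time.sleep):
    """Flush PG stats on each OSD, then wait until the mon reflects them."""
    # Step 1: ask every OSD to flush and remember the seq it hands back.
    wanted = {osd: flush_on_osd(osd) for osd in osds}
    # Step 2: poll the mon until each seq shows up there.
    for osd, need in wanted.items():
        if osd in no_wait:
            # Assumption: skip OSDs whose stats are not expected to arrive.
            continue
        for _ in range(max_polls):
            if last_seq_on_mon(osd) >= need:
                break
            sleep(interval)
        else:
            raise TimeoutError(
                "osd.{0} seq {1} never reached the mon".format(osd, need))
```

Comparing with >= rather than == lets the wait finish even if the OSD has reported newer stats in the meantime.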
Kefu Chai
8abc6e1bea
qa/tasks/rebuild_mondb: update to address ceph-mgr changes
...
- revive ceph-mgr after updating the keyring cap
- grant "mgr:allow *" to client.admin
- minor refactors
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-05-28 09:59:50 +08:00
Sage Weil
5ab996ab3c
qa/tasks/ceph_manager: 'ceph $service tell ...' is obsolete
...
This died forever ago; no need for the fallback here.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-05-23 22:53:53 -04:00
Kefu Chai
da1161cbd8
qa/tasks/ceph_manager: always fix pgp_num when done with thrashosd task
...
Fixes: http://tracker.ceph.com/issues/19771
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-05-03 18:28:27 +08:00
Sage Weil
27dd6530a2
Merge pull request #14559 from liewegas/wip-pg-map
...
mon: move 'pg map' to OSDMonitor
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-04-21 18:53:17 -05:00
Sage Weil
069182f91f
qa/tasks/ceph_manager: use 'pg map' for get_pg_{primary,replica}
...
Pulling this out of the 'pg dump' heap is inefficient.
Also, pg dump data comes from the mgr and may be stale.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-04-21 10:56:28 -04:00
Kefu Chai
6fa16c4477
Merge pull request #14584 from tchaikov/wip-19631
...
qa/suites: Revert "qa/suites: add mon-reweight-min-pgs-per-osd = 4"
Reviewed-by: Sage Weil <sage@redhat.com>
2017-04-21 22:56:21 +08:00
Kefu Chai
e6a436bb27
qa/tasks/ceph_manager: be able to store options with service type
...
so we are able to change options for services other than mon while
thrashing.
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-04-20 14:18:21 +08:00
Kefu Chai
ee653ba87c
Merge pull request #14608 from tchaikov/wip-19594
...
qa/tasks: assert on pg status with a timeout
Reviewed-by: Sage Weil <sage@redhat.com>
2017-04-20 10:49:12 +08:00
Kefu Chai
960032e513
qa/tasks: update tests with helper to wait for pg-stats
...
and remove unused helpers
Fixes: http://tracker.ceph.com/issues/19594
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-04-20 09:35:05 +08:00
Kefu Chai
1207caf3a2
qa/tasks/ceph_manager: add a "wait_for_pg_stats()" decorator
...
and accompany it with two helpers to access the pg stats in a more
natural way
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-04-20 09:35:04 +08:00
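One way such a decorator could work is to retry an asserting check until the stats have had time to arrive. This is a sketch under assumptions: the name retry_on_assertion, the delay schedule, and the injectable sleep are illustrative and do not reproduce the decorator added by this commit.

```python
import functools
import time


def retry_on_assertion(delays=(1, 1, 2, 3, 5), sleep=time.sleep):
    """Retry the wrapped check while it raises AssertionError.

    Hypothetical decorator: delays is an assumed back-off schedule.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for delay in delays:
                try:
                    return func(*args, **kwargs)
                except AssertionError:
                    # Stats may not have been reported yet; wait and retry.
                    sleep(delay)
            # Last attempt: let any AssertionError propagate to the caller.
            return func(*args, **kwargs)
        return wrapper
    return decorator
```

A test helper that asserts on PG state can then be wrapped so that a stale reading is retried instead of failing the test immediately.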
Josh Durgin
6fba80c1fa
osd, OSDMonitor, qa: mark ec overwrites non-experimental
...
Keep the pool flag around so we can distinguish between a pool that
should maintain hashes for each chunk, and a missing one is a bug, vs
an overwrites pool where we rely on bluestore checksums for detecting
corruption.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2017-04-19 17:45:43 -07:00
Sage Weil
ee1bb01a54
Merge pull request #14556 from liewegas/wip-pgupmap
...
osd: pg-remap -> pg-upmap
Reviewed-by: David Zafman <dzafman@redhat.com>
2017-04-19 17:07:01 -05:00
Sage Weil
ce188e8fdf
osd: pg-remap -> pg-upmap
...
'remap' is too non-specific a name. In particular, it
sounds like it is related to the 'remapped' PG state
but in reality it is not related.
'upmap' or 'pg-upmap' is more specific: it maps a pgid
to the 'up' set value (or item)
Signed-off-by: Sage Weil <sage@redhat.com>
2017-04-18 12:59:40 -04:00
Kefu Chai
1b54b5f3f1
Merge pull request #14415 from smithfarm/wip-19556
...
tests: Thrasher: handle "OSD has the store locked" gracefully
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-04-18 23:18:35 +08:00
David Zafman
a5731076ad
osd: Handle backfillfull_ratio just like nearfull and full
...
Add BACKFILLFULL as a local OSD cur_state
Notify monitor of this new fullness state
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-04-17 08:00:24 -07:00
Nathan Cutler
a5b19d2d73
tests: Thrasher: handle "OSD has the store locked" gracefully
...
On slower machines (VPS, OVH) it takes time for the OSD to go down.
Fixes: http://tracker.ceph.com/issues/19556
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-04-11 16:09:45 +02:00
Sage Weil
2a08cbbed5
qa/tasks/thrashosds,ceph_manager: thrash pg_remap[_items]
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-03-28 10:12:10 -04:00
Sage Weil
296708091c
qa/tasks/ceph_manager: use new luminous set-full-ratio etc
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-03-07 16:39:09 -05:00
Sage Weil
a202b68d18
qa/tasks/thrashosds: chance_thrash_cluster_full
...
Induce a momentarily full cluster.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-03-07 13:33:44 -05:00
Samuel Just
44b26f6ab4
Merge pull request #13594 from athanatos/wip-snap-trim-sleep
...
osd: add snap trim reservation and re-implement osd_snap_trim_sleep
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-02-24 14:09:17 -08:00