Commit Graph

3014 Commits

Author SHA1 Message Date
Sage Weil
9255516b77 qa/suites/rados/upgrade: whitelist MON_DOWN, MGR_DOWN
They get restarted.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-12-05 12:26:12 -06:00
Sage Weil
0e81fa9f39 qa/suites/rados/singleton/all/threash-eio: whitelist OBJECT_*
Signed-off-by: Sage Weil <sage@redhat.com>
2017-12-04 21:16:44 -06:00
Kefu Chai
52bb6e8c6e
Merge pull request #19242 from liewegas/wip-21147
qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-12-02 14:14:29 +08:00
Kefu Chai
89c876d1b6
Merge pull request #19233 from tchaikov/wip-no-mon-health-to-clog-when-upgrading
qa: disable mon-health-to-clog in upgrade test

Reviewed-by: Sage Weil <sage@redhat.com>
2017-12-01 11:42:52 +08:00
Kefu Chai
4898c36a36
Merge pull request #19225 from tchaikov/wip-more-whitelist
qa/suites/rados/singleton: more whitelist

Reviewed-by: Sage Weil <sage@redhat.com>
2017-12-01 10:39:18 +08:00
Kefu Chai
3aaf5d24f1 qa: disable mon-health-to-clog in upgrade test
otherwise the cluster will complain that the quorum size is smaller than
that of monmap.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-12-01 01:15:59 +08:00
Yuri Weinstein
80d2a0de86 tests - Replaced requests for "centos 7.3" to centos_latest
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-11-30 07:52:11 -08:00
Patrick Donnelly
49028ee79f
Merge PR #19069 into master
* refs/pull/19069/head:
	qa/tasks/ceph_fuse: fix ceph-fuse mount failed

Reviewed-by: Yunfei Guan <yunfei.guan@xtaotech.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Amit Kumar <amitkuma@redhat.com>
2017-11-29 22:11:12 -08:00
Sage Weil
dda79ad1fa
Merge pull request #15482 from myoungwon/wip-chunked-manifest
osd,librados: add manifest, operations for chunked object

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-29 21:13:43 -06:00
Sage Weil
27e06ffa4c
Merge pull request #18974 from liewegas/wip-22145
osd/PG: restart recovery if NotRecovering and unfound found

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-11-29 12:48:37 -06:00
Sage Weil
431d1482ff qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected
Fixes: http://tracker.ceph.com/issues/21147
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-29 10:29:52 -06:00
John Spray
967fc9cf8a
Merge pull request #18649 from b-ranto/wip-restful-fixes
mgr/restful: A couple of restful fixes

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-29 12:11:58 +00:00
Kefu Chai
185d2f5375 qa/tasks/ceph: add mon-health-to-clog option to "restart"
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 20:04:44 +08:00
Kefu Chai
a406553a79 qa/tasks/ceph_manager: add inject_args() method
* move Thrasher._set_config() to CephManager, and make it a public
  method, and rename it to inject_args(),
* use this method instead of using 'tell ... injectargs ...' directly

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 18:44:16 +08:00
Kefu Chai
0e987665fe qa/suites/rados/singleton: more whitelist
* SLOW_OPS is normal in a cluster with flattering OSDs
* so is OBJECT_MISPLACED.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 13:53:44 +08:00
Kefu Chai
6a9e622cf9
Merge pull request #19199 from tchaikov/wip-22136
qa/ceph-disk: enlarge the simulated SCSI disk

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-11-28 23:46:49 +08:00
Kefu Chai
b3c159e9fb qa/ceph-disk: enlarge the simulated SCSI disk
100MB will be allocated for journal, and the remaining 100MB is for data
device. taking the inode into consideration, there will be approximately
87988 kB available for the activated OSD. and it will complain with a
"nearfull" state.

Fixes: http://tracker.ceph.com/issues/22136
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-28 14:55:33 +08:00
Kefu Chai
aa34321a4a qa/workunits/ceph-disk: do not redirect stderr to stdout
normally, if we care about the output of ceph-disk, we expect a json
string, and ceph-disk sends the output to stdout, and errors/warnings
to stderr. so everything works as expected. and the test should also
follow this tradition. for example, if deprecated warnings are printed,
the warning message should not be collected along with the json string.

see also: d44334f3

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-28 14:55:19 +08:00
Sage Weil
ddb274669c qa/suites/rados/rest: move rest_test from qa/suites/rest/
...and add more health whitelists.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-27 10:11:51 -06:00
Boris Ranto
059588e758 qa/restful: Test pg_num/pgp_num modifications
Signed-off-by: Boris Ranto <branto@redhat.com>
2017-11-27 14:35:26 +01:00
Kefu Chai
e25300cb6a
Merge pull request #18614 from tchaikov/wip-osd-mgrreport
osd,mgr: report slow requests and pending creating pgs to mgr

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-23 15:01:01 +08:00
Kefu Chai
4a1f2a5c78 qa: silence SLOW_OPS,PENDING_CREATING_PGS warnings
this is an intermediate step to deprecate REQUEST_SLOW warnings.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 13:59:42 +08:00
Joao Eduardo Luis
0b30a1538a qa: add simple and dirty script to find ports being used
Seriously, having this is way better than realizing that one needs to
find an unused port each time a new test is created.

It's far from a perfect script. It's downright not elegant.

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2017-11-22 16:58:34 +00:00
liyan
0a752e634a qa/tasks/ceph_fuse: fix ceph-fuse mount failed
Fixes: http://tracker.ceph.com/issues/22205

Signed-off-by: liyan <liyan.wang@xtaotech.com>
2017-11-22 15:02:47 +08:00
Mykola Golub
7fb9920316 qa/workunits/rbd: wait for status propagated only if daemon started
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-21 22:29:16 +02:00
Jason Dillaman
0502cb1573
Merge pull request #19073 from trociny/wip-qa-demote
qa/workunits/rbd: wait for demote status is propagated

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-21 12:52:08 -05:00
Andrew Schoen
7efdf3c50a
Merge pull request #19075 from tchaikov/wip-22154
qa/workunits: silence py warnings for ceph-disk tests

Reviewed-by: Andrew Schoen <aschoen@redhat.com>
2017-11-21 09:42:30 -06:00
Kefu Chai
d44334f317 qa/workunits: silence py warnings for ceph-disk tests
ceph-disk now prints "depreacted" warning message when it starts. but
the tests parses its stdout and stderr for a json string. so we need to
silence the warnings for the tests.

Fixes: http://tracker.ceph.com/issues/22154
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-21 21:49:48 +08:00
Mykola Golub
f4b88b2c9d qa/workunits/rbd: wait for demote status is propagated
(before promoting the image on another cluster)

Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-21 13:22:50 +02:00
Patrick Donnelly
df43e415c6
Merge PR #18274 into master
* refs/pull/18274/head:
	mds: fold mds_revoke_cap_timeout into mds_session_timeout
	client: add new delegation testcases
	client: add delegation support for cephfs
	common: remove data_dir_option from common_preinit and global_pre_init

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-11-20 21:48:19 -08:00
Kefu Chai
bcf12a6abd
Merge pull request #19024 from tchaikov/wip-21474
qa/tasks: prolong revive_osd() timeout to 6 min

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-20 22:54:27 +08:00
Kefu Chai
749bbda075 qa/tasks: prolong revive_osd() timeout to 6 min
see also #17902

Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-20 13:40:59 +08:00
Sage Weil
25b7965f88 qa/suites/rados: test for recovery_unfound bug
See http://tracker.ceph.com/issues/22145

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-19 21:32:57 -06:00
Matt Benjamin
c662dc3067
Merge pull request #17977 from cbodley/wip-rgw-beast-qa
qa/rgw: add beast frontend to some rgw suites
2017-11-16 15:14:28 -05:00
lvshanchun
2c1653ca37 radosgw-admin zonegroup get and zone get return defaults when there is no realm
Fixs: http://tracker.ceph.com/issues/21615
Signed-off-by: lvshanchun <lvshanchun@gmail.com>
2017-11-16 23:43:55 +08:00
Josh Durgin
e81a7ffcda
Merge pull request #18808 from tchaikov/wip-qa-max-pg-per-osd
qa: do not wait for down/out osd for pg convergence

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-11-15 18:05:58 -08:00
Yuri Weinstein
c9dbb86dc4
Merge pull request #17761 from cbodley/wip-rgw-bilog-trim
rgw multisite: automated trimming for bucket index logs

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2017-11-15 09:04:50 -08:00
Jason Dillaman
09070016f5 Merge pull request #17032 from yangdongsheng/rbd_qos
rbd: implement image qos in tokenbucket algorithm

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-14 19:58:13 -05:00
Jeff Layton
3321cc7b37 mds: fold mds_revoke_cap_timeout into mds_session_timeout
Right now, we have two different timeout settings -- one for when the
client is just not responding at all (mds_session_timeout), and one for
when the client is otherwise responding but isn't returning caps in a
timely fashion (mds_cap_revoke_timeout).

The default settings on them are equivalent (60s), but only the
mds_session_timeout is communicated via the mdsmap. The
mds_cap_revoke_timeout is known only to the MDS. Neither timeout results
in anything other than warnings in the current codebase.

There is also a third setting (mds_session_autoclose) that is also
communicated via the MDSmap. Exceeding that value (default of 300s)
could eventually result in the client being blacklisted from the
cluster. The code to implement that doesn't exist yet, however.

The current codebase doesn't do any real sanity checking of these
timeouts, so the potential for admins to get them wrong is rather high.
It's hard to concoct a use-case where we'd want to warn about these
events at different intervals.

Simplify this by just removing the mds_cap_revoke_timeout setting, and
replace its use in the code with the mds_session_timeout. With that, the
client can at least determine when warnings might start showing up in
the MDS' logs.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
2017-11-14 07:27:01 -05:00
Dongsheng Yang
bf4e454a22 librbd: limit IO per second by TokenBucketThrottle
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-11-14 09:19:00 +08:00
Sage Weil
7f0d82d93b
Merge pull request #18826 from liewegas/wip-22090
qa/suite/rados: fix balancer vs firefly tunables failures

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-11-12 19:50:14 -06:00
Casey Bodley
fa8a0713de qa/rgw: add kwargs for debug output
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-11-10 13:23:03 -05:00
Sage Weil
c1c81a04f5 qa/suites/rados/perf: whitelist health warnings
We may see these as cluster stabilizes.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-10 09:00:39 -06:00
Sage Weil
6455954d29 qa/suites/rados: stop testing firefly tunables
We can't mix the balancer compat-set testing with firefly tunables because
it requires that all buckets be straw2.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-10 07:10:11 -06:00
Jason Dillaman
5094838ab9
Merge pull request #18793 from idryomov/wip-qa-cram-path
qa/tasks/cram: include /usr/sbin in the PATH for all commands

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-08 11:00:41 -05:00
Kefu Chai
7f549af459 qa: do not wait for down/out osd for pg convergence
that osd is not invovlved in the PG state changes.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-08 14:50:10 +08:00
Ilya Dryomov
8cd42742b6 qa/suites/krbd: enable generic/050 and generic/448
generic/448 has been failing on ext4 and is now fixed upstream.
The fix for generic/050 is in testing, scheduled for 4.15-rc1.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-11-07 18:05:44 +01:00
Ilya Dryomov
1591426c30 qa/tasks/cram: include /usr/sbin in the PATH for all commands
/usr/sbin is not in non-login PATH on centos.  We already do this for
workunits, see commit 0e53f5f38b ("workunit: include /usr/sbin in the
PATH for all commands").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-11-06 20:05:38 +01:00
Mykola Golub
3a09801ceb qa/suites/rbd: enable fsx deep_copy test case
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-06 10:29:45 +02:00
myoungwon oh
93be6f79e0 qa/suites/rados/thrash: add set_chunk test case
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2017-11-06 15:53:46 +09:00