Commit Graph

1735 Commits

Author SHA1 Message Date
Dongsheng Yang
4447340e8d qa: add a test for rbd rm NOT_EXIST
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-02-23 11:10:59 +08:00
John Spray
de5249436c Merge pull request #13359 from jcsp/wip-logrotate-sshexception
qa: handle SSHException in logrotate

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-02-22 10:05:07 +00:00
Dongsheng Yang
3d26d932af librbd: remove header when id missing
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-02-21 19:17:26 +08:00
Samuel Just
2ed7759cfe PrimaryLogPG: reimplement osd_snap_trim_sleep within the state machine
Rather than blocking the main op queue, just pause for that amount of
time between state machine cycles.

Also, add osd_snap_trim_sleep to a few of the thrasher yamls.

Signed-off-by: Samuel Just <sjust@redhat.com>
2017-02-20 21:24:33 -08:00
Kefu Chai
b3e516fc38 Merge pull request #13518 from tchaikov/wip-fix-pgp-num
test: Thrasher: do not update pools_to_fix_pgp_num if nothing happens

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-21 00:46:26 +08:00
Sage Weil
d6950a413f Merge pull request #13517 from liewegas/wip-kraken-x
qa/suites/upgrade/kraken-x

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-02-20 10:20:26 -06:00
Vikhyat Umrao
90144aa64c auth: 'ceph auth import -i' overwrites caps, if caps are not specified
in given keyring file, should alert user and should not allow this import.
Because in 'ceph auth list' we keep all the keyrings with caps and importing
'client.admin' user keyring without caps locks the cluster with error[1]
because admin keyring caps are missing in 'ceph auth'.

[1] Error connecting to cluster: PermissionDeniedError

Fixes: http://tracker.ceph.com/issues/18932

Signed-off-by: Vikhyat Umrao <vumrao@redhat.com>
2017-02-20 03:48:31 +05:30
Dongsheng Yang
4ee764f9e1 qa/workunit/rbd: import_export.sh: add image priorities exporting and importing test cases.
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-02-19 20:42:03 +08:00
Dongsheng Yang
55edd81658 qa/workunit/rbd: add test case for --import/export-format in import_export.sh
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-02-19 20:42:03 +08:00
Kefu Chai
c0f0cde399 test: Thrasher: do not update pools_to_fix_pgp_num if nothing happens
we should not update pools_to_fix_pgp_num if the pool is not expanded or
the pg_num is not increased due to pgs being created. this prevent us
from fixing the pgp_num after done with thrashing if we actually did
nothing when fixing the pgp_num when thrashing, but we removed the pool
from pools_to_fix_pgp_num after set_pool_pgpnum() returns.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-19 13:10:46 +08:00
Sage Weil
3f9e6cfce4 qa/suites/upgrade/kraken-x: test bluestore and filestore
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-18 21:12:23 -05:00
Sage Weil
86c0d07e32 qa/tasks/ceph.py: fix timing of wait-for-* and osd markdown
Mark down osds, *then* wait for them to come up or for the cluster to be
healthy!

Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-18 21:12:23 -05:00
Yuri Weinstein
ccc885ac59 qa/suites/upgrade/kraken-x: Initial checkin
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-02-18 21:12:23 -05:00
Jason Dillaman
2c2b7abf93 Merge pull request #13482 from idryomov/wip-qa-krbd-data-pool
qa: add workunit to test krbd data-pool support

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-02-18 09:00:59 -05:00
Sage Weil
96bc86b537 Revert "qa/tasks/workunit: use the suite repo for cloning workunit" 2017-02-17 11:54:27 -06:00
Kefu Chai
929c6a6af9 Merge pull request #13456 from tchaikov/wip-qa-objectstore
qa/suites/rados/objectstore: enable experimental features for testing bluestore

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-17 23:16:05 +08:00
Kefu Chai
8ae48637f7 qa/workunits/objectstore/test_fuse.sh: enable experimental features
so we can test bluestore

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-17 11:23:41 +08:00
Dan Mick
6582f42dfd qa/workunits/objectstore/test_fuse.sh: use portable function decl
function f() is illegal in strict POSIX shells, like dash, which is
the default /bin/sh on Ubuntu

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2017-02-17 11:18:20 +08:00
Dan Mick
ce5d375e61 qa/workunits/objectstore/test_fuse.sh: allow failure
This script currently has a syntax error, but still exits with
success, which is hiding that failure.  Expose it by allowing
the 'sudo' exit code to be the script's exit code.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2017-02-17 11:18:20 +08:00
Ilya Dryomov
4db31d53d9 qa: add workunit to test krbd data-pool support
This is based on a script that I've been using for a while for basic
smoke testing.  The matrix has exploded with the addition of data-pool
and now it's primarily a data-pool test fixture that takes minutes to
run, so turning it into a workunit.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-02-16 22:42:46 +01:00
Nathan Cutler
a6ce4dbd12 Merge pull request #13200 from smithfarm/wip-17193-reenable-test
tests: re-enable cephfs python tests on kclient

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
2017-02-16 17:22:50 +01:00
Kefu Chai
8f8c06096c Merge pull request #13452 from tchaikov/wip-suite-repo-for-workunit
qa/tasks/workunit: use the suite repo for cloning workunit

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-16 23:40:15 +08:00
Kefu Chai
1f82b9b944 qa/tasks/workunit: use the suite repo for cloning workunit
as "workunits" reside in ceph/qa/workunits, it's more intuitive to
respect suite-repo option when cloning workunits.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-16 15:05:51 +08:00
Kefu Chai
f91fa5ae96 Merge pull request #13406 from vumrao/wip-vumrao-18486
osd/Pool: Disallow enabling 'hashpspool' option to a pool without  '--yes-i-really-mean-it'

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-16 12:48:20 +08:00
Sage Weil
eb491a13dc Merge pull request #13235 from liewegas/wip-pg-split-interval
osd: have clients resend ops on pg split

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
2017-02-15 16:20:18 -06:00
Sage Weil
f5cfba602b qa/suites/rgw/thrash: add rgw with osd trashing test
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-15 12:07:20 -05:00
Sage Weil
967b3cbcb7 qa/suites/rgw/multifs: drop redundant civetweb override
This probably breaks the frontend/apache.yaml case.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-15 12:00:13 -05:00
Mykola Golub
22c2614195 Merge pull request #13421 from dillaman/wip-18935
qa/workunits/rbd: resolve potential rbd-mirror race conditions

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-02-15 16:00:45 +02:00
John Spray
dfbc6757f8 Merge pull request #13089 from jcsp/wip-18600
#18600: Clear out tasks that don't make sense from multimds suite

Reviewed-by: Yan, Zheng <zyan@redhat.com>
2017-02-14 22:28:06 +00:00
Jason Dillaman
63eae97afc qa/workunits/rbd: resolve potential rbd-mirror race conditions
Fixes: http://tracker.ceph.com/issues/18935
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-02-14 15:58:31 -05:00
Sage Weil
d708041adc osd: rename backoff config options
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-13 23:03:52 -05:00
Vikhyat Umrao
3715362883 osd/Pool: Disallow enabling 'hashpspool' option to a pool without
'--yes-i-really-mean-it'

Fixes: http://tracker.ceph.com/issues/18468

Signed-off-by: Vikhyat Umrao <vumrao@redhat.com>
2017-02-14 04:09:32 +05:30
Samuel Just
4aebf59d90 rados: check that pool is done trimming before removing it
Signed-off-by: Samuel Just <sjust@redhat.com>
2017-02-13 09:47:02 -08:00
Kefu Chai
de59b5102c test: Thrasher: restore changed options after done with thrash
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-13 09:25:51 +08:00
Kefu Chai
761a1dc391 tests: Thrasher: extract _set_config() method
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-13 09:25:50 +08:00
Kefu Chai
995e144e3e tests: CephManager: add get_config() method
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-13 09:25:50 +08:00
Kefu Chai
136483a8f9 test: Thrasher: update pgp_num of all expanded pools if not yet
otherwise wait_until_healthy will fail after timeout as seeing warning
like:

HEALTH_WARN pool cephfs_data pg_num 182 > pgp_num 172

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-13 09:25:50 +08:00
Yuri Weinstein
05f1a1987e Merge pull request #13378 from liewegas/wip-luminous
qa/rados/upgrade/jewel-x-singleton: run luminous.yaml at the end

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-02-12 08:13:26 -08:00
Sage Weil
6e998b2f8c qa/rados/upgrade/jewel-x-singleton: run luminous.yaml at the end
renumber too

Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-12 09:47:41 -05:00
Jason Dillaman
e95fe0dfad Merge pull request #13229 from trociny/wip-18335
rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-02-11 09:29:23 -05:00
Sage Weil
105e9d4234 qa/suites/rados/thrash: vary backoff settings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-10 18:55:58 -05:00
Kefu Chai
108f2e20a8 Merge pull request #13264 from tchaikov/wip-print-failed-test-name
qa/workunits/rados/test.sh: print test name when it fails

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-02-11 03:57:45 +08:00
John Spray
880cbf09aa Merge pull request #13137 from jcsp/wip-18661
qa: fix race in Mount.open_background

Reviewed-by: Yan, Zheng <zyan@redhat.com>
2017-02-10 17:48:05 +00:00
John Spray
a3fd3f225c Merge pull request #13099 from jcsp/wip-18663
qa/tasks: force umount during kclient teardown
2017-02-10 17:42:37 +00:00
John Spray
6f9e11f03d qa: handle SSHException in logrotate
Yet another different type of exception we may get when
orchestra.run can't talk to a remote host.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-10 17:16:24 +00:00
Kefu Chai
787f6ac699 qa/workunits/rados/test.sh: print test name when it fails
we have

2017-02-04T16:15:46.090 INFO:tasks.workunit.client.0.mira032.stdout:error in 22088
2017-02-04T16:15:46.092 INFO:tasks.workunit.client.0.mira032.stderr:bash: line 1: 22092 Alarm clock             ceph_test_rados_api_aio 2>&1
2017-02-04T16:15:46.096 INFO:tasks.workunit.client.0.mira032.stderr:     22093 Done                    | tee ceph_test_rados_api_aio.log
2017-02-04T16:15:46.099 INFO:tasks.workunit.client.0.mira032.stderr:     22094 Done                    | sed "s/^/                  api_aio: /"
2017-02-04T16:15:46.102 INFO:tasks.workunit.client.0.mira032.stderr:+

if a unittest in rados/test.sh fails in teuthology.log, but it would
be desirable to have the failed test name in the line of "error in
22088".

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-10 23:54:29 +08:00
Nathan Cutler
a605089522 Merge pull request #13313 from smithfarm/wip-18856
tests: update SUSE yaml facets in qa/distros/all

Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
2017-02-09 10:14:20 +01:00
Josh Durgin
d0792bc00f Merge pull request #13278 from liewegas/wip-luminous
mon,osd: luminous feature bits, require flags, upgrade gates

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-02-08 16:18:36 -08:00
Yuri Weinstein
f15639b679 Merge pull request #13296 from liewegas/wip-kill-sortbitwise
osd: remove sortbitwise thrashing

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-02-08 16:12:10 -08:00
Nathan Cutler
6b7443fb50 tests: drop buildpackages.py
The buildpackages suite has been moved to teuthology. This cleans up a file
that was left behind by https://github.com/ceph/ceph/pull/13297

Fixes: http://tracker.ceph.com/issues/18846
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-02-08 21:23:54 +01:00
Sage Weil
c1f26b330b qa/suites/rados/upgrade/jewel-x-singleton: upgrade mons first; box thrasher
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-08 15:05:15 -05:00
Nathan Cutler
0bd56e871a tests: update SUSE yaml facets in qa/distros/all
Fixes: http://tracker.ceph.com/issues/18856
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-02-08 15:27:00 +01:00
Sage Weil
8226159e7c mon: prevent sortbitwise flag from being unset
Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-07 14:00:32 -05:00
Loic Dachary
5a43f8d579 buildpackages: remove because it does not belong
It should live in teuthology, not in Ceph. And it is currently broken:
there is no need to keep it around.

Fixes: http://tracker.ceph.com/issues/18846

Signed-off-by: Loic Dachary <loic@dachary.org>
2017-02-07 18:37:26 +01:00
Sage Weil
c5a3bc07b9 qa/suites/rados: stop thrashing sortbitwise
This is required to upgrade past jewel; no need to keep testing it
on master.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-07 12:14:13 -05:00
John Spray
76b73befd9 qa: remove simple functional tests from multimds
These were running so few ops that they weren't
giving any meaningful exercise to a multimds
system beyond what we're already covering in
the fs suite.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-07 13:51:47 +00:00
John Spray
6203f33df4 tasks/cephfs: tear down on mount() failure
There were some cases where we would leave a mountpoint
that would cause the teuthology teardown to get hung up
when it tried to look inside cephtest/

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-06 22:53:21 +00:00
Patrick Donnelly
1183f09e47
qa: increase debug_ms level for thrashing
This is to help locate the cause of [1].

[1] http://tracker.ceph.com/issues/18690

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:17:50 -05:00
Patrick Donnelly
d748226f00
qa: add DaemonWatchdog to stop tests on failure
Thrashing MDS will often result in failures which often do not stop the
test. The failure may also cause the test to stall which will force the
machines to needlessly be locked until a timeout is reached. This
watchdog will unmount mounts and kill daemons when a failure is
detected.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
f005e8af6b
qa: disable max_mds changes during thrashing
While the trasher supports the behavior desired by issue 10792 [1], the
bugs uncovered due to deactivating MDS (and sometimes killing
deactivating MDS) are presently a distraction from addressing issues
during normal failures. So now thrashing max_mds is turned off by
default. I have added a TODO to deactivate ranks in order (configurably)
as random deactivation causes a lot of other problems.

This also fixes a bug: random.randrange(0.0, 1.0) always returns 0.
Oops.

[1] http://tracker.ceph.com/issues/10792

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
82662edd7f
qa: do not pretty the json to shorten stdout log
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
a0052fc2d6
qa: use gevent.sleep so greenlet yields
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
cf9e0da078
qa: use fs methods for setting configs
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
0098873fb7
qa: remove old comment
Filesystem is now cluster aware.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
9e84431cfe
qa: add standbys to take over during thrashing
The thrasher expects in some scenarios for the cluster to stabilize with
a new MDS taking over when there are no standbys available. This can
cause the thrasher to quit because the cluster never stabilizes.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
fd4b61890d
qa: allow revived MDS to be up:active
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
884215d933
qa: timeout waiting for thrashed MDS to revive
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
8e9ea7b6ac
qa: configure thrashing while MDS are stopping
Currently multimds is prone to many failures when killing an active or
stopping MDS when there are MDS in the cluster which have been
deactivated (stopping). Have this turned off by default for now.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
6304b6ed5d
qa: add deactivation log message
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
1185326c45
qa: avoid infinite wait if no repl. can be made
The thrasher can enter an infinite loop waiting for an MDS to take a
certain rank when a replacement may not be possible. For example,
max_mds actives are already running.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
638bccb2bb
qa: timeout thrasher if fs does not stabilize
After 5 minutes of waiting, it's reasonable to stop as the cluster is
probably stuck.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
8f3e745344
qa: check replacement MDS is active in thrasher
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
19289725c8
qa: handle thrashing ranks with holes
During the course of thrashing max_mds, the ranks assigned to MDSs may
develop holes. This causes the thrasher to try to wrongly deactivate
ranks that are not assigned.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
9f467b1f4c
qa: remove snap tests from multimds:thrash
Snapshots are known to not work with multimds presently.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
4bfa8d8bc7
qa: turn on multimds thrashing
Fixes: http://tracker.ceph.com/issues/10792

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Nathan Cutler
db2582e25e tests: fix regression in qa/tasks/ceph_master.py
https://github.com/ceph/ceph/pull/13194 introduced a regression:

2017-02-06T16:14:23.162 INFO:tasks.thrashosds.thrasher:Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 722, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 839, in do_thrash
    self.choose_action()()
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 305, in kill_osd
    output = proc.stderr.getvalue()
AttributeError: 'NoneType' object has no attribute 'getvalue'

This is because the original patch failed to pass "stderr=StringIO()" to run().

Fixes: http://tracker.ceph.com/issues/16263
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-06 19:37:38 +01:00
Sage Weil
5fc3dd36e2 Merge pull request #13237 from smithfarm/wip-18799
tests: Thrasher: eliminate a race between kill_osd and __init__

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-05 12:49:30 -06:00
Mykola Golub
596e5ea8a5 rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly
Fixes: http://tracker.ceph.com/issues/18335
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-04 15:40:51 +01:00
Kefu Chai
389bd00da3 tests: ceph-helpers.sh reduce get_timeout_delays() verbosity
`set +o` prints out the full command line which is echoed if "xtrace" is
enabled. this increases the verbosity of get_timeout_delays().
in this change, we follow the way of kill_daemons() to kill the extra
output. see aefcf6d.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-04 17:10:49 +08:00
Josh Durgin
21cdcfcc66 Merge pull request #13194 from smithfarm/wip-16263
tests: ignore bogus ceph-objectstore-tool error in ceph_manager

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
2017-02-02 15:31:29 -08:00
Nathan Cutler
b519d38fb1 tests: Thrasher: eliminate a race between kill_osd and __init__
If Thrasher.__init__() spawns the do_thrash thread before initializing the
ceph_objectstore_tool property, do_thrash races with the rest
of Thrasher.__init__() and in some cases do_thrash can call kill_osd() before
Trasher.__init__() progresses much further. This can lead to an exception
("AttributeError: Thrasher instance has no attribute 'ceph_objectstore_tool'")
being thrown in kill_osd().

This commit eliminates the race by making sure the ceph_objectstore_tool
attribute is initialized before the do_thrash thread is spawned.

Fixes: http://tracker.ceph.com/issues/18799
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-02-02 23:23:54 +01:00
John Spray
3c9f16d8ab tasks/kclient: apply timeout to umount
The umount process can get stuck, in which case
we want to fail the test rather than waiting around for it.

During teardown of the kclient task catch this
timeout explicitly so that we will powercycle the node if
needed.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-02 15:09:48 +00:00
Mykola Golub
58798be83a qa/workunits/rbd: tweak rbd-mirror config to spead up testing
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-02 08:45:19 +01:00
Mykola Golub
93f7b5ef3f Merge pull request #13158 from dillaman/wip-18594
qa: integrate OpenStack 'gate-tempest-dsvm-full-devstack-plugin-ceph'

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-02-02 08:27:49 +02:00
Tamil Muthamizhan
f99ca7a812 do not mention ceph branch explicitly
no need to mention ceph_dev_branch explicitly. it will be taken from the
ceph branch value mentioned in the teuthology-suite command

Signed-off-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-02-01 16:47:01 -08:00
Jason Dillaman
a1b088a727 Merge pull request #13189 from gaurav36/bug-18326-image-rename-non-rbd-pool
rbd: destination pool should be source pool if it is not specified

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-02-01 12:47:01 -05:00
Mykola Golub
4e234921b1 qa/suites/rbd: add rbd-mirror-ha test
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 11:16:34 +01:00
Mykola Golub
2bb480e8c3 qa/workunits/rbd: tweaks to use RBD_MIRROR_USE_EXISTING_CLUSTER locally
This var is mostly used when running rbd_mirror test scripts on
teuthology.  It can be used locally though to speedup re-running the
tests:

Set a test temp directory:

  export RBD_MIRROR_TEMDIR=/tmp/tmp.rbd_mirror

Run the tests the first time with NOCLEANUP flag (the cluster and
daemons are not stopped on finish):

  RBD_MIRROR_NOCLEANUP=1 ../qa/workunits/rbd/rbd_mirror.sh

Now, to re-run the test without restarting the cluster, run cleanup
with USE_EXISTING_CLUSTER flag:

  RBD_MIRROR_USE_EXISTING_CLUSTER=1 \
    ../qa/workunits/rbd/rbd_mirror_ha.sh cleanup

and then run the tests:

  RBD_MIRROR_USE_EXISTING_CLUSTER=1
    ../qa/workunits/rbd/rbd_mirror_ha.sh

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Mykola Golub
c802b9df0b qa/workunits/rbd: tests for rbd mirror HA
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Mykola Golub
7c21275f4f qa/workunits/rbd: allow to start several daemons for the same cluster
by optionally specifyning daemon instance after cluster name and
colon, like:

  start_mirror ${cluster}:${instance}

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Gaurav Kumar Garg
01f23aa99f rbd: destination pool should be source pool if it is not specified
Currently if user perform image rename operation and user give pool
name as a optional parameter (--pool=<pool_name>) then currently
its taking this optional pool name for source pool and making
destination pool name default pool name.
With this fix if user provide pool name as a optional pool name
parameter then it  will consider both soruce and destination pool
name as optional parameter pool name.

Fixes: http://tracker.ceph.com/issues/18326

Reported-by: МАРК КОРЕНБЕРГ <socketpair@gmail.com>
Signed-off-by: Gaurav Kumar Garg <garg.gaurav52@gmail.com>
2017-02-01 10:20:39 +01:00
John Spray
a027dba78f tasks/cephfs: switch open vs. write in test_open_inode
Do the write after opening the file, so that we get good
behaviour wrt the change in Mount.open_background that uses
file existence to confirm that the open happened.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-01 00:38:08 +00:00
John Spray
7f7f44ea5c qa/tasks: force umount during kclient teardown
Previously we could readily end up hanging on teardown
when something had gone wrong with umount.  Forcing
is a big hammer (umount_wait will power cycle the node
if umount isn't working), so if we had to do that
then raise an exception to indicate that something
was wrong with the test.

Fixes: http://tracker.ceph.com/issues/18663
Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-01 00:26:59 +00:00
Nathan Cutler
5bfb0fca0d tests: re-enable cephfs python tests on kclient
Because http://tracker.ceph.com/issues/17193 and
http://tracker.ceph.com/issues/18161 are fixed.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-31 14:23:09 +01:00
Sage Weil
2cc9fac7ba Merge pull request #13071 from liewegas/wip-omap-getkeys-rados-test
qa/suites/rados/basic: set low omap limit for rgw workload

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-01-30 21:43:17 -06:00
John Spray
d4f6385b85 Merge pull request #12800 from jcsp/wip-vstart-qasuite
Improve vstart_runner to (optionally) create its own cluster

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-01-31 02:02:49 +01:00
Nathan Cutler
046e873026 tests: ignore bogus ceph-objectstore-tool error in ceph_manager
Fixes: http://tracker.ceph.com/issues/16263
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-31 00:49:05 +01:00
Sage Weil
388a7ff8ef qa/workunits/rest: use unique pool names for cephfs test
Using cephfs_[meta]data collides with the pools that teuthology
already creates if an mds is defined.

This became a (noticeable) problem with 052c3d3f68

Signed-off-by: Sage Weil <sage@redhat.com>
2017-01-30 11:14:31 -05:00
Jason Dillaman
deaaba8fcf qa/suites/rbd: OpenStack devstack tempest full suite
This mimics the OpenStack tempest gate tests that OpenStack
Zuul executes as a gate test.

Fixes: http://tracker.ceph.com/issues/18594
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-27 14:22:48 -05:00
John Spray
48adc77476 Merge pull request #12825 from stiopaa1/r11124_refuseToDoFsNew
mon/MDSMonitor.cc:refuse fs new on pools with obj

Reviewed-by: John Spray <john.spray@redhat.com>
2017-01-27 17:35:38 +01:00