Commit Graph

1782 Commits

Author SHA1 Message Date
Loic Dachary
5a43f8d579 buildpackages: remove because it does not belong
It should live in teuthology, not in Ceph. And it is currently broken:
there is no need to keep it around.

Fixes: http://tracker.ceph.com/issues/18846

Signed-off-by: Loic Dachary <loic@dachary.org>
2017-02-07 18:37:26 +01:00
Sage Weil
c5a3bc07b9 qa/suites/rados: stop thrashing sortbitwise
This is required to upgrade past jewel; no need to keep testing it
on master.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-02-07 12:14:13 -05:00
John Spray
76b73befd9 qa: remove simple functional tests from multimds
These were running so few ops that they weren't
giving any meaningful exercise to a multimds
system beyond what we're already covering in
the fs suite.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-07 13:51:47 +00:00
John Spray
6203f33df4 tasks/cephfs: tear down on mount() failure
There were some cases where we would leave a mountpoint
that would cause the teuthology teardown to get hung up
when it tried to look inside cephtest/

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-06 22:53:21 +00:00
Patrick Donnelly
1183f09e47
qa: increase debug_ms level for thrashing
This is to help locate the cause of [1].

[1] http://tracker.ceph.com/issues/18690

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:17:50 -05:00
Patrick Donnelly
d748226f00
qa: add DaemonWatchdog to stop tests on failure
Thrashing MDS will often result in failures which often do not stop the
test. The failure may also cause the test to stall which will force the
machines to needlessly be locked until a timeout is reached. This
watchdog will unmount mounts and kill daemons when a failure is
detected.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
f005e8af6b
qa: disable max_mds changes during thrashing
While the trasher supports the behavior desired by issue 10792 [1], the
bugs uncovered due to deactivating MDS (and sometimes killing
deactivating MDS) are presently a distraction from addressing issues
during normal failures. So now thrashing max_mds is turned off by
default. I have added a TODO to deactivate ranks in order (configurably)
as random deactivation causes a lot of other problems.

This also fixes a bug: random.randrange(0.0, 1.0) always returns 0.
Oops.

[1] http://tracker.ceph.com/issues/10792

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
82662edd7f
qa: do not pretty the json to shorten stdout log
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
a0052fc2d6
qa: use gevent.sleep so greenlet yields
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:14 -05:00
Patrick Donnelly
cf9e0da078
qa: use fs methods for setting configs
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
0098873fb7
qa: remove old comment
Filesystem is now cluster aware.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
9e84431cfe
qa: add standbys to take over during thrashing
The thrasher expects in some scenarios for the cluster to stabilize with
a new MDS taking over when there are no standbys available. This can
cause the thrasher to quit because the cluster never stabilizes.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
fd4b61890d
qa: allow revived MDS to be up:active
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
884215d933
qa: timeout waiting for thrashed MDS to revive
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
8e9ea7b6ac
qa: configure thrashing while MDS are stopping
Currently multimds is prone to many failures when killing an active or
stopping MDS when there are MDS in the cluster which have been
deactivated (stopping). Have this turned off by default for now.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
6304b6ed5d
qa: add deactivation log message
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:13 -05:00
Patrick Donnelly
1185326c45
qa: avoid infinite wait if no repl. can be made
The thrasher can enter an infinite loop waiting for an MDS to take a
certain rank when a replacement may not be possible. For example,
max_mds actives are already running.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
638bccb2bb
qa: timeout thrasher if fs does not stabilize
After 5 minutes of waiting, it's reasonable to stop as the cluster is
probably stuck.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
8f3e745344
qa: check replacement MDS is active in thrasher
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
19289725c8
qa: handle thrashing ranks with holes
During the course of thrashing max_mds, the ranks assigned to MDSs may
develop holes. This causes the thrasher to try to wrongly deactivate
ranks that are not assigned.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
9f467b1f4c
qa: remove snap tests from multimds:thrash
Snapshots are known to not work with multimds presently.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Patrick Donnelly
4bfa8d8bc7
qa: turn on multimds thrashing
Fixes: http://tracker.ceph.com/issues/10792

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-02-06 14:07:12 -05:00
Nathan Cutler
db2582e25e tests: fix regression in qa/tasks/ceph_master.py
https://github.com/ceph/ceph/pull/13194 introduced a regression:

2017-02-06T16:14:23.162 INFO:tasks.thrashosds.thrasher:Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 722, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 839, in do_thrash
    self.choose_action()()
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/ceph_manager.py", line 305, in kill_osd
    output = proc.stderr.getvalue()
AttributeError: 'NoneType' object has no attribute 'getvalue'

This is because the original patch failed to pass "stderr=StringIO()" to run().

Fixes: http://tracker.ceph.com/issues/16263
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-06 19:37:38 +01:00
Sage Weil
5fc3dd36e2 Merge pull request #13237 from smithfarm/wip-18799
tests: Thrasher: eliminate a race between kill_osd and __init__

Reviewed-by: Sage Weil <sage@redhat.com>
2017-02-05 12:49:30 -06:00
Mykola Golub
596e5ea8a5 rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly
Fixes: http://tracker.ceph.com/issues/18335
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-04 15:40:51 +01:00
Kefu Chai
389bd00da3 tests: ceph-helpers.sh reduce get_timeout_delays() verbosity
`set +o` prints out the full command line which is echoed if "xtrace" is
enabled. this increases the verbosity of get_timeout_delays().
in this change, we follow the way of kill_daemons() to kill the extra
output. see aefcf6d.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-02-04 17:10:49 +08:00
Josh Durgin
21cdcfcc66 Merge pull request #13194 from smithfarm/wip-16263
tests: ignore bogus ceph-objectstore-tool error in ceph_manager

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
2017-02-02 15:31:29 -08:00
Nathan Cutler
b519d38fb1 tests: Thrasher: eliminate a race between kill_osd and __init__
If Thrasher.__init__() spawns the do_thrash thread before initializing the
ceph_objectstore_tool property, do_thrash races with the rest
of Thrasher.__init__() and in some cases do_thrash can call kill_osd() before
Trasher.__init__() progresses much further. This can lead to an exception
("AttributeError: Thrasher instance has no attribute 'ceph_objectstore_tool'")
being thrown in kill_osd().

This commit eliminates the race by making sure the ceph_objectstore_tool
attribute is initialized before the do_thrash thread is spawned.

Fixes: http://tracker.ceph.com/issues/18799
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-02-02 23:23:54 +01:00
John Spray
3c9f16d8ab tasks/kclient: apply timeout to umount
The umount process can get stuck, in which case
we want to fail the test rather than waiting around for it.

During teardown of the kclient task catch this
timeout explicitly so that we will powercycle the node if
needed.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-02 15:09:48 +00:00
Mykola Golub
58798be83a qa/workunits/rbd: tweak rbd-mirror config to spead up testing
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-02 08:45:19 +01:00
Mykola Golub
93f7b5ef3f Merge pull request #13158 from dillaman/wip-18594
qa: integrate OpenStack 'gate-tempest-dsvm-full-devstack-plugin-ceph'

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-02-02 08:27:49 +02:00
Tamil Muthamizhan
f99ca7a812 do not mention ceph branch explicitly
no need to mention ceph_dev_branch explicitly. it will be taken from the
ceph branch value mentioned in the teuthology-suite command

Signed-off-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-02-01 16:47:01 -08:00
Jason Dillaman
a1b088a727 Merge pull request #13189 from gaurav36/bug-18326-image-rename-non-rbd-pool
rbd: destination pool should be source pool if it is not specified

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-02-01 12:47:01 -05:00
Mykola Golub
4e234921b1 qa/suites/rbd: add rbd-mirror-ha test
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 11:16:34 +01:00
Mykola Golub
2bb480e8c3 qa/workunits/rbd: tweaks to use RBD_MIRROR_USE_EXISTING_CLUSTER locally
This var is mostly used when running rbd_mirror test scripts on
teuthology.  It can be used locally though to speedup re-running the
tests:

Set a test temp directory:

  export RBD_MIRROR_TEMDIR=/tmp/tmp.rbd_mirror

Run the tests the first time with NOCLEANUP flag (the cluster and
daemons are not stopped on finish):

  RBD_MIRROR_NOCLEANUP=1 ../qa/workunits/rbd/rbd_mirror.sh

Now, to re-run the test without restarting the cluster, run cleanup
with USE_EXISTING_CLUSTER flag:

  RBD_MIRROR_USE_EXISTING_CLUSTER=1 \
    ../qa/workunits/rbd/rbd_mirror_ha.sh cleanup

and then run the tests:

  RBD_MIRROR_USE_EXISTING_CLUSTER=1
    ../qa/workunits/rbd/rbd_mirror_ha.sh

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Mykola Golub
c802b9df0b qa/workunits/rbd: tests for rbd mirror HA
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Mykola Golub
7c21275f4f qa/workunits/rbd: allow to start several daemons for the same cluster
by optionally specifyning daemon instance after cluster name and
colon, like:

  start_mirror ${cluster}:${instance}

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-02-01 10:55:03 +01:00
Gaurav Kumar Garg
01f23aa99f rbd: destination pool should be source pool if it is not specified
Currently if user perform image rename operation and user give pool
name as a optional parameter (--pool=<pool_name>) then currently
its taking this optional pool name for source pool and making
destination pool name default pool name.
With this fix if user provide pool name as a optional pool name
parameter then it  will consider both soruce and destination pool
name as optional parameter pool name.

Fixes: http://tracker.ceph.com/issues/18326

Reported-by: МАРК КОРЕНБЕРГ <socketpair@gmail.com>
Signed-off-by: Gaurav Kumar Garg <garg.gaurav52@gmail.com>
2017-02-01 10:20:39 +01:00
John Spray
a027dba78f tasks/cephfs: switch open vs. write in test_open_inode
Do the write after opening the file, so that we get good
behaviour wrt the change in Mount.open_background that uses
file existence to confirm that the open happened.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-01 00:38:08 +00:00
John Spray
7f7f44ea5c qa/tasks: force umount during kclient teardown
Previously we could readily end up hanging on teardown
when something had gone wrong with umount.  Forcing
is a big hammer (umount_wait will power cycle the node
if umount isn't working), so if we had to do that
then raise an exception to indicate that something
was wrong with the test.

Fixes: http://tracker.ceph.com/issues/18663
Signed-off-by: John Spray <john.spray@redhat.com>
2017-02-01 00:26:59 +00:00
Nathan Cutler
5bfb0fca0d tests: re-enable cephfs python tests on kclient
Because http://tracker.ceph.com/issues/17193 and
http://tracker.ceph.com/issues/18161 are fixed.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-31 14:23:09 +01:00
Sage Weil
2cc9fac7ba Merge pull request #13071 from liewegas/wip-omap-getkeys-rados-test
qa/suites/rados/basic: set low omap limit for rgw workload

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-01-30 21:43:17 -06:00
John Spray
d4f6385b85 Merge pull request #12800 from jcsp/wip-vstart-qasuite
Improve vstart_runner to (optionally) create its own cluster

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-01-31 02:02:49 +01:00
Nathan Cutler
046e873026 tests: ignore bogus ceph-objectstore-tool error in ceph_manager
Fixes: http://tracker.ceph.com/issues/16263
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-31 00:49:05 +01:00
Sage Weil
388a7ff8ef qa/workunits/rest: use unique pool names for cephfs test
Using cephfs_[meta]data collides with the pools that teuthology
already creates if an mds is defined.

This became a (noticeable) problem with 052c3d3f68

Signed-off-by: Sage Weil <sage@redhat.com>
2017-01-30 11:14:31 -05:00
Jason Dillaman
deaaba8fcf qa/suites/rbd: OpenStack devstack tempest full suite
This mimics the OpenStack tempest gate tests that OpenStack
Zuul executes as a gate test.

Fixes: http://tracker.ceph.com/issues/18594
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-27 14:22:48 -05:00
John Spray
48adc77476 Merge pull request #12825 from stiopaa1/r11124_refuseToDoFsNew
mon/MDSMonitor.cc:refuse fs new on pools with obj

Reviewed-by: John Spray <john.spray@redhat.com>
2017-01-27 17:35:38 +01:00
John Spray
5f67cf97f9 Merge pull request #13088 from jcsp/wip-16397-unpin
qa: unpin knfs from ubuntu

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2017-01-27 15:39:41 +01:00
Jason Dillaman
ce675383b3 qa/tasks/qemu: allow tests to customize the number of CPUs
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-26 14:18:48 -05:00
Jason Dillaman
42e967f0bb qa/tasks/qemu: copy ceph configuration to VM image
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-26 14:17:43 -05:00
Jason Dillaman
d98aa1a39a qa/tasks/qemu: attach all disks as rbd block devices
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-26 14:17:30 -05:00
Jason Dillaman
67a4a6c519 qa/tasks/qemu: support overriding the cloud image
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-26 14:16:16 -05:00
Jason Dillaman
454348004b qa/tasks/qemu: support arbitrary additions to cloud-init-archive
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-26 14:16:10 -05:00
John Spray
c6d91dd912 qa: fix race in Mount.open_background
Previously a later remote call could end up executing
before the remote python program in open_background
had actually got as far as opening the file.

Fixes: http://tracker.ceph.com/issues/18661
Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-26 16:48:58 +00:00
Sage Weil
9080596335 Merge pull request #9348 from LiumxNL/fix-mksnap
mon/OSDMonitor: make snaps on tier pool should not be allowed
2017-01-25 06:35:36 -06:00
John Spray
50bdd55512 qa: disable quota tests in multimds
Quotas don't work with kclient, and multimds tasks
are run against kclient.  We don't need to run this
against fuse here because it's a basic correctness
test that's run against fuse in the fs suite.

Fixes: http://tracker.ceph.com/issues/18600
Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-24 23:25:55 +00:00
John Spray
d0963f8575 qa: break out tasks/ in multimds suite
...so that we can selectively disable those
which are not appropriate for multimds testing, or
which are not kclient compatible (all multimds workunits
run against both kclient and fuse).

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-24 23:25:51 +00:00
John Spray
4dd3666b2d qa: unpin knfs from ubuntu
We have an updated nfs-utils that is no longer
generating spurious selinux warnings on CentOS.

Fixes: http://tracker.ceph.com/issues/16397
Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-24 22:06:29 +00:00
David Zafman
2c7de6c5dd Merge pull request #12866 from dzafman/wip-18471
qa/workunits/ceph-helpers: wait_for_clean() races with pg creation

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-01-24 11:02:38 -08:00
Michal Jarzabek
052c3d3f68 mon/MDSMonitor.cc:refuse fs new on pools with obj
Fixes: http://tracker.ceph.com/issues/11124
Signed-off-by: Michal Jarzabek <stiopa@gmail.com>
2017-01-23 19:48:53 +00:00
Sage Weil
88df169500 qa/suites/rados/basic: set low omap limit for rgw workload
This gets *some* coverage for the omap limits in the rados suite.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-01-23 12:07:57 -05:00
Sage Weil
4bad4bd86a Merge pull request #12981 from liewegas/wip-crush-move-osd
mon/OSDMonitor: make 'osd crush move ...' work on osds

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-01-22 10:42:08 -06:00
Mingxin Liu
0de7ac79b5 test: cannot create pool snap on tier
Signed-off-by: Mingxin Liu <mingxin@xsky.com>
2017-01-22 01:26:43 +08:00
Sage Weil
47956475de mon/OSDMonitor: make 'osd crush move ...' work on osds
Currently it only allows you to move buckets, which is annoying and much
less useful.  To move an OSD you need to use create-or-move, which is
harder to use.

Fixes: http://tracker.ceph.com/issues/18587
Signed-off-by: Sage Weil <sage@redhat.com>
2017-01-19 06:54:54 -06:00
John Spray
fe219df2a2 qa: update vstart_runner docstring
...to use paths pointing to ceph tree, not
ceph-qa-suite tree.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-19 06:30:20 +01:00
John Spray
549d993d3f qa: update remaining ceph.com to download.ceph.com
Fixes: http://tracker.ceph.com/issues/18574
Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-17 17:14:50 +01:00
Mykola Golub
9107d31728 Merge pull request #12953 from dillaman/wip-18542
qa/tasks/qemu: update default image url after ceph.com redesign

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-01-17 10:03:11 +02:00
Jason Dillaman
6d17befb3b qa/tasks/qemu: update default image url after ceph.com redesign
Fixes: http://tracker.ceph.com/issues/18542
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-01-16 22:12:51 -05:00
Yuri Weinstein
6886859cc2 Merge pull request #12928 from tmuthamizhan/wip-ansible-fs-suite
qa/suites/ceph-ansible: removing fs workunit

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-01-16 16:55:34 -08:00
Sage Weil
d22cdde36b qa/distros: make centos_latest 7.3
This will fix smithi runs and break vps runs until we get a 7.3 cloud-init
image ready.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-01-16 09:36:19 -05:00
Tamil Muthamizhan
728411f5ae removing fs workunit
removing fs workunit for now until tracker issue 18528 is resolved.

Signed-off-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-01-13 15:25:32 -08:00
Alfredo Deza
7172b55ad9 Merge pull request #12892 from ceph/wip-cd-fs-fix
qa/tasks/ceph-deploy: use the new create option during instantiation

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-01-13 16:06:24 -05:00
Kefu Chai
5a393a9d85 Merge pull request #12919 from tchaikov/wip-qa-cleanup
tests: remove temporary file

Reviewed-by: Abhishek Lekshmanan <alekshmanan@suse.com>
2017-01-14 00:27:32 +08:00
Jason Dillaman
5ecfc2cf26 Merge pull request #11879 from trociny/wip-15371
rbd-mirror: delayed replication support

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-13 09:36:13 -05:00
John Spray
1e62467d09 Merge pull request #12833 from ukernel/wip-18396
tasks/cephfs: fix kernel force umount

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-01-13 11:20:00 +00:00
John Spray
2076cda04a Merge pull request #12749 from ukernel/wip-18179
mds: propagate error encountered during opening inode by number

Reviewed-by: John Spray <john.spray@redhat.com>
2017-01-13 11:18:59 +00:00
Kefu Chai
415580f3a5 tests: remove temporary file
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-01-13 18:10:49 +08:00
Yuri Weinstein
8bdbf88ad2 Merge pull request #12913 from zmc/wip-smoke-openstack
Add openstack requirements to smoke suite

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-01-12 16:18:00 -08:00
Tamil Muthamizhan
8f6fd3a766 qa: specify client for fs workunit
s/all/client.0

Signed-off-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-01-12 15:29:56 -08:00
Zack Cerza
fe9b7552d1 Add openstack requirements to smoke suite
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-01-12 14:55:26 -07:00
Mykola Golub
72158bd9a3 qa/workunits/rbd: allow to tweak rbd-mirror test setup
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2017-01-12 18:09:02 +01:00
Yan, Zheng
6526ecc084 qa/tasks: add test_open_ino_errors
Validate that errors encountered during opening inos are properly
propagated

Signed-off-by: Yan, Zheng <zyan@redhat.com>
2017-01-12 20:15:53 +08:00
Vasu Kulkarni
2d4ed95f2b use the create option during instantiation
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-01-10 15:43:12 -08:00
Alfredo Deza
ebb02c8ef5 Merge pull request #12867 from ceph/wip-ceph-deploy-workaround
qa/tasks/ceph-deploy: create-keys explicitly

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-01-10 15:47:26 -05:00
David Zafman
1009a16291 wait_for_clean: Racing with pg creation might cause increasing num PGs
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-01-10 09:43:09 -08:00
Jason Dillaman
60b79473fe Merge pull request #12821 from SUSE/wip-drop-rbd_cli_tests
tests: drop rbd_cli_tests.pl and RbdLib.pm

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-10 09:29:11 -05:00
Brad Hubbard
fbad77c727 Merge pull request #12855 from badone/wip-md_config_t-_apply_changes-unchangeable
config: Improve warning for unobserved value

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-01-10 18:17:52 +10:00
Brad Hubbard
1e8e754007 config: Improve warning for unobserved value
When a variable is not being observed we currently mark it
"unchangable". This can be misleading so try something hopefully a
little more informative.

Fixes: http://tracker.ceph.com/issues/18424
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2017-01-10 15:35:19 +10:00
Vasu Kulkarni
127e29e1be Remove debug overrides
the high level of debug for mon/osd is causing remoto to hang during get key

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-01-09 17:59:20 -08:00
Vasu Kulkarni
2d6c3fa8b2 Add ceph-create-keys to explicitly create admin/bootstrap keys
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-01-09 17:14:33 -08:00
Yan, Zheng
4cdeeaac10 qa/tasks/cephfs: fix kernel force umount
Fixes: http://tracker.ceph.com/issues/18396
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2017-01-10 08:31:25 +08:00
John Spray
6542a2e0d0 Merge pull request #12588 from jcsp/wip-18311
mds: check for errors decoding backtraces

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-01-09 11:02:32 +00:00
Nathan Cutler
4f4f35e37c tests: drop rbd_cli_tests.pl and RbdLib.pm
The rbd_cli_tests Perl script is not maintained and currently serves no
purpose. The RbdLib.pm module was only used by rbd_functional_tests.pl (which
was dropped by 276ffb4631) and rbd_cli_tests.pl
so drop it as well.

Fixes: http://tracker.ceph.com/issues/14825
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-08 16:17:08 +01:00
Jason Dillaman
0fb99b79b3 Merge pull request #12816 from SUSE/wip-18440
tests: subst repo and branch in git.ceph.com URL in qa/tasks/cram.py and qa/tasks/qemu.py

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-08 09:57:36 -05:00
Jason Dillaman
43f401365c Merge pull request #12818 from SUSE/wip-drop-rbd_functional_tests
tests: drop unused rbd_functional_tests.pl script

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-08 09:56:59 -05:00
Nathan Cutler
276ffb4631 tests: drop unused rbd_functional_tests.pl script
This script is no longer used.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-07 23:33:12 +01:00
Nathan Cutler
2484a55461 tests: subst repo and branch in qemu test urls
References: http://tracker.ceph.com/issues/18440
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-07 22:49:56 +01:00
Nathan Cutler
74689df754 tests: subst branch and repo in qa/tasks/qemu.py
References: http://tracker.ceph.com/issues/18440
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-07 22:49:54 +01:00
Nathan Cutler
ed0e8be0b2 tests: subst repo name in krbd/unmap/tasks/unmap.yaml
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-07 13:40:06 +01:00
Nathan Cutler
56e37e41f4 tests: subst repo name in qa/tasks/cram.py
Inspired by bcbe45d948

Fixes: http://tracker.ceph.com/issues/18440
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-07 13:40:06 +01:00
Jason Dillaman
a159f3d1c0 Merge pull request #12549 from trociny/wip-16555
librbd: permit removal of image being bootstrapped by rbd-mirror

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-06 17:57:53 -05:00
Kefu Chai
0e0b7a0d58 Merge pull request #12628 from SUSE/wip-18309-alt
ceph_fuse: pid_file default to empty

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-01-06 23:28:49 +08:00
Nathan Cutler
53ec76c7d0 tests: add no_client_pidfile override to fs/recovery tests
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-06 12:06:25 +01:00
Nathan Cutler
fe13664c8e tests: override yaml to set client pid file to empty string
Due to http://tracker.ceph.com/issues/18309 the pid file for fuse clients
should always be set to the empty string. (Teuthology's default ceph.conf
sets it to /var/run/ceph/$cluster-$name.pid)

This commit adds a reusable yaml facet for this purpose.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-06 12:05:31 +01:00
Yuri Weinstein
6c00a64e87 Using centos 7.2 as latest version
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-01-06 00:26:13 +00:00
Yuri Weinstein
b29faef354 qa: Cleaned up distros to use latest versions
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-01-05 13:38:04 -08:00
John Spray
aa01f44022 qa: enable cluster creation in vstart_runner
Convenient when you want to create a fresh cluster
each test run: just pass --create and you'll get
a cluster with the right number of daemons for
the tests you're running.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-05 13:43:40 +00:00
John Spray
5d945fb71e qa/vstart_runner: more robust stop() on daemons
Previously this could get hung up if we killed one
PID and then the daemon reappears with a different
one (perhaps because we caught it during
daemonization?)

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-05 13:43:39 +00:00
John Spray
081038ef53 qa: fix vstart_runner tasks import
Instead of hunting around the filesystem for
ceph-qa-suite, get it from our own location.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-05 13:43:39 +00:00
John Spray
5f6cdab80f qa/tasks: add test_corrupt_backtrace
Validate that we get EIO and a damage table entry
when seeing a decode error on a backtrace.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-01-05 13:41:59 +00:00
Yuri Weinstein
dd51d2e09b Merge pull request #12610 from tmuthamizhan/wip-ansible-suite
qa/suite: Added a smoke suite for ceph-ansible

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-01-04 12:51:51 -08:00
Tamil Muthamizhan
ad9b75f8d0 Added a smoke suite for ceph-ansible
Smoke suite with basic tests for ceph ansible deployment

Signed-off-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-01-03 16:11:28 -08:00
Yuri Weinstein
fbb560c901 qa: fixed distros links
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-01-03 15:59:18 -08:00
Jason Dillaman
e4ca3c17dd Merge pull request #12747 from SUSE/wip-18388
tests: rbd/test_lock_fence.sh: fix rbdrw.py relative path

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-01-02 18:47:53 -05:00
Nathan Cutler
91231de16d tests: rbd/test_lock_fence.sh: fix rbdrw.py relative path
This commit fixes a regression introduced by
cf294777ea

Fixes: http://tracker.ceph.com/issues/18388
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-01-02 22:53:13 +01:00
Kefu Chai
8b07aad0f0 Merge pull request #12706 from Liuchang0812/wip-qa-rados-1228
test: update test_rados_tool.sh, use POOL and OBJ var

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-01-02 14:21:03 +08:00
Sage Weil
bc197d75cc Merge pull request #12451 from liewegas/wip-osd-on-applied
osd: never send rados ack (only commit)

Reviewed-by: Samuel Just <sjust@redhat.com>
2016-12-29 12:43:01 -06:00
liuchang0812
96f5c92dbf test: update test_rados_tool.sh, use POOL and OBJ var
* replace hard-code pool name with $POOL
* replace hard-code object name with $OBJ
* introduce a new variable called $POOL_EC
* clean up pool
* simplify test case

Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
2016-12-29 16:05:37 +00:00
Sage Weil
f706819d6a Merge pull request #12403 from liewegas/wip-mon-mgr-cleanups
mon: clean up some osdmon/pgmon interactions

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-12-29 09:38:59 -06:00
Sage Weil
265c2ff446 qa/workunits: remove crash_replay_interval tests
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-29 10:30:39 -05:00
Sage Weil
13c01a0350 Merge pull request #12670 from liewegas/wip-nits
mon: fix a few nits

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-12-29 09:27:32 -06:00
Willem Jan Withagen
4b9b562465 qa/test_rados_tool.sh: POSIX dd only accepts 'k' as multiplier
FreeBSD does not want KB as 1k multiplier.
see also: http://www.unix.com/man-page/posix/1posix/dd/

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
2016-12-28 12:13:29 +01:00
Sage Weil
2e1a096a82 Merge pull request #12674 from Liuchang0812/tools-rados-offset-support
tools/rados: add a parameter "--offset" to rados put command

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-12-27 16:35:50 -06:00
Sage Weil
c467d7fb23 Merge pull request #12675 from Liuchang0812/fix-test-case-in-rados
test: replace hard-code binary names with varibles

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-12-27 16:35:29 -06:00
Sage Weil
88ae4bb067 mon/OSDMonitor: ceph osd pool get/set crush_rule by name
This means users don't have to manually translate a rule
they just created to a ruleset ID in order to map a pool
to it.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-27 11:02:19 -05:00
Sage Weil
88d85ad726 mon/PGMap: remove CATEGORY column from 'df detail'
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-27 11:02:19 -05:00
liuchang0812
fe3d3821cb tools/rados: add parameter offset for put cmd and test case for it
add a parameter "--offset" for put command
add a test case for rados put command

Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
2016-12-27 16:06:41 +08:00
liuchang0812
14de3a1ee1 test: replace hard-code binary names with varibles
This test case couldn't run under customize env CEPH_BIN.

Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
2016-12-27 14:18:29 +08:00
Sage Weil
a03c5be452 Merge pull request #12149 from adamemerson/wip-g_ceph_context-exterminate
global: start removing g_ceph_context

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Loic Dachary <ldachary@redhat.com>
2016-12-23 08:40:48 -06:00
Sage Weil
2861a2188a Merge pull request #12630 from liewegas/wip-workunit-retry
qa/tasks/workunit: clear clone dir before retrying checkout

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2016-12-23 08:12:35 -06:00
Sage Weil
8bbdcae9b1 Merge pull request #12614 from liewegas/wip-task-peer
qa/tasks/peer: update task based on current peering behavior

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2016-12-23 08:11:36 -06:00
Sage Weil
004ef2c648 qa/distros: centos_7.yaml -> centos.yaml
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 17:19:42 -05:00
Sage Weil
e9f119bda7 qa/suites: centos_7.2.yaml -> centos_7.yaml
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 15:41:25 -05:00
Sage Weil
c168ce92aa qa/distros: add centos 7.3
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 15:39:41 -05:00
Sage Weil
6e7db2329b qa/distros: add centos 7 yaml; use that instead
No need to be picky about the point release here.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 15:39:35 -05:00
Adam C. Emerson
750ad8340c common: Unskew clock
In preparation to deglobalizing CephContext, remove the CephContext*
parameter to ceph_clock_now() and ceph::real_clock::now() that carries
a configurable offset.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2016-12-22 13:55:37 -05:00
Sage Weil
2a7013cd5a qa/tasks/workunit: clear clone dir before retrying checkout
If we checkout ceph-ci.git, and don't find a branch,
we'll try again from ceph.git. But the checkout will
already exist and the clone will fail, so we'll still
fail to find the branch.

The same can happen if a previous workunit task already
checked out the repo.

Fix by removing the repo before checkout (the first and
second times).  Note that this may break if there are
multiple workunit tasks running in parallel on the same
role.  That is already racy, so if it's happening, we'll
want to switch to using a truly unique clonedir for each
instantiation.

Fixes: http://tracker.ceph.com/issues/18336
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 13:05:22 -05:00
Sage Weil
e1781dd573 qa/tasks/peer: update task based on current peering behavior
This changed in 0be3f5f72e.

Fixes: http://tracker.ceph.com/issues/18330
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-22 08:40:45 -05:00
Jason Dillaman
12706d7622 Merge pull request #12587 from yuriw/wip-fix-subset-script-master
qa: fixed script to schedule rados and other suites with --subset option

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2016-12-20 18:47:39 -05:00
Yuri Weinstein
56975c039b changed script name
added kraken
parameterized suite name to be able to use for rados, rbd etc.

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2016-12-20 15:34:56 -08:00
Mykola Golub
11d6caf367 librbd: permit removal of image being bootstrapped by rbd-mirror
Fixes: http://tracker.ceph.com/issues/16555
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-12-20 20:07:51 +02:00
Sage Weil
c922404a03 qa/tasks/osd_backfill.py: wait for osd.[12] to start
...before sending a tell command.  Otherwise osd.2 might
start without 1, the io unblocks, and the tell fails
because osd.1 is still down.

Fixes: http://tracker.ceph.com/issues/18303
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-19 21:56:11 -05:00
Samuel Just
dc639fd9e5 Merge pull request #12571 from athanatos/wip-15943
osd: Fix map gaps again (bug 15943)

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2016-12-19 13:50:15 -08:00
Sage Weil
5666fd61d6 qa/workunits: include extension for nose tests
When you have a relative path you have to include the extension.
Weird.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-19 14:08:11 -05:00
Sage Weil
904913b37f mon/OSDMonitor: remove 'osd thrash' command
This is a dev hack to generate a bunch of bogus osdmaps.  The maps are
all screwed up anyway (e.g., invalid addrs) and this is minimally useful.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-18 23:11:56 -05:00
Loic Dachary
c83d030d30 ceph-disk: do not create bluestore wal/db partitions by default
Fixes: http://tracker.ceph.com/issues/18291

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-12-17 20:55:36 +01:00
Sage Weil
72d73b8c88 qa/tasks/workunit: retry on ceph.git if checkout fails
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-16 15:06:16 -05:00
Vasu Kulkarni
9f04a7b32e use dev option instead of dev-commit
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2016-12-15 14:11:00 -08:00
Sage Weil
6bb3a037e5 Merge pull request #12511 from liewegas/wip-workunits
qa/workunits/rbd: fix

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2016-12-15 14:15:31 -06:00
Sage Weil
cf294777ea qa/workunits: use relative path instead of wget from git
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-15 15:10:28 -05:00
Sage Weil
c6698c95b8 Merge pull request #12508 from liewegas/wip-qa-admin-socket
qa/tasks/admin_socket: subst in repo name
2016-12-15 13:53:10 -06:00
Sage Weil
27b8eac249 qa/tasks/workunit.py: add CEPH_BASE env var
Root of git checkout

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-15 13:52:03 -05:00
Sage Weil
4602884ab8 qa/tasks/workunit: leave workunits inside git checkout
Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-15 13:52:03 -05:00
Samuel Just
d4b6615a49 qa/config/rados.yaml: enable osd_debug_verify_cached_snaps
Also, make map gaps more likely.

Signed-off-by: Samuel Just <sjust@redhat.com>
2016-12-15 10:50:49 -08:00
Sage Weil
bcbe45d948 qa/tasks/admin_socket: subst in repo name
It is either ceph.git or ceph-ci.git.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-12-15 13:35:02 -05:00
Sage Weil
61e592ad0b Merge pull request #12286 from dillaman/wip-18123
rados: optionally support reading omap key from file

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-12-15 10:00:47 -06:00
Nathan Cutler
a8a2a8c3e6 Revert "dummy: reduce run time, run user.yaml playbook"
This reverts commit d4e3cec185.

Fixes: http://tracker.ceph.com/issues/18259
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-12-15 13:01:02 +01:00
Samuel Just
ae40602c14 Merge remote-tracking branch 'ceph-qa-suite/master' into wip-18113-qa 2016-12-14 16:05:35 -08:00
Samuel Just
c2c5dace33 Merge pull request #12268 from dzafman/wip-18113
osd: When deep-scrub errors present upgrade regular scrubs

Reviewed-by: Samuel Just <sjust@redhat.com>
2016-12-14 11:34:24 -08:00
Sage Weil
a1eaa3f00f merge ceph-qa-suite 2016-12-14 11:29:58 -06:00
Sage Weil
c01f2ee0e2 move ceph-qa-suite dirs into qa/ 2016-12-14 11:29:55 -06:00
Jason Dillaman
4314cb945a qa/workunits/rbd: use more recent qemu-iotests that support Xenial
Fixes: http://tracker.ceph.com/issues/18149
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2016-12-10 20:42:52 -05:00
David Zafman
64a7012e98 test: Add test for keeping deep-scrub information
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-12-09 16:51:20 -08:00
Jason Dillaman
2c70df978d qa/workunits/rbd: removed qemu-iotest case 077
The test case is not stable due to racing console output. This
results in spurious failures.

Fixes: http://tracker.ceph.com/issues/10773
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2016-12-07 09:59:39 -05:00
Jason Dillaman
286ceb1e03 rados: optionally support reading omap key from file
Fixes: http://tracker.ceph.com/issues/18123
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2016-12-07 09:57:16 -05:00
Mykola Golub
6cb1ed485f rbd-mirror: make 'rbd mirror image resync' work after split-brain
Fixes: http://tracker.ceph.com/issues/18051
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-12-03 19:50:27 +02:00
Mykola Golub
cccca67d5f rbd-mirror: split-brain issues should be clearly visible in mirror status
Fixed: http://tracker.ceph.com/issues/16991
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-12-03 19:37:10 +02:00
Mykola Golub
7c078eda0f qa/workunits/rbd: test_status_in_pool_dir: explicitly check grep return value
Otherwise, it does not work as supposed to work in statements like below:

 set -e
 test_status_in_pool_dir ... && ...

(e.g. in wait_for_status_in_pool_dir)

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-12-03 19:37:10 +02:00
Jason Dillaman
fbccd1e88e Merge pull request #12195 from trociny/wip-qa-rbd-wait_for_image_present
qa/workunits/rbd: use image id when probing for image presence

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2016-12-02 09:01:43 -05:00
Loic Dachary
d5d7e3665f Merge pull request #12180 from tchaikov/wip-silence-get_timeout_delays
tests: disable the echo when running get_timeout_delays()

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2016-11-29 09:06:43 +01:00
Mykola Golub
c23f56b983 qa/workunits/rbd: use image id when probing for image presence
This fixes a race in resync tests leading to false negative results.

Fixes: http://tracker.ceph.com/issues/18048
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-11-29 10:04:37 +02:00
Loic Dachary
f491ea062d tests: facilitate background process debug in ceph-helpers.sh
When displaying the output of a background process, do it on stderr so
that it is not bufferized. Otherwise the output of the background
process may be displayed after it completed.

Prefix the output of a background process with the PID of the process
known to the parent instead of the PID of the awk process processing the
output. When wait_background loops, it will print the process on which
it is waiting and it is confusing that they do not match with the PID
prefixing the process output.

Refs: http://tracker.ceph.com/issues/17830

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-11-24 19:52:14 +01:00
Kefu Chai
1b9bc0501c tests: disable the echo when running get_timeout_delays()
this function is very distracting when one is looking at the log

Signed-off-by: Kefu Chai <kchai@redhat.com>
2016-11-25 00:17:23 +08:00
Loic Dachary
cca0f59156 Merge pull request #12085 from wjwithagen/wip-freebsd-ceph-helpers-2
workunits/ceph-helpers.sh: Fixes for FreeBSD

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2016-11-24 08:01:51 +01:00
Willem Jan Withagen
e4629b3397 workunits/ceph-helpers.sh: Fixes for FreeBSD
- stat(1) does not have '%T'

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
2016-11-24 01:57:05 +01:00
Sage Weil
4af4ef09fe Merge pull request #10402 from ceph/wip-openstack-wusui
qa: Add ceph-ansible installer.
2016-11-23 16:42:34 -05:00
David Zafman
dcb5fb9b5a test: CLEANUP: Make wait_for_clean() clearer changing variable name
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-22 21:38:42 -08:00
David Zafman
c1eb8746bc test: Return wait_for_clean() to start sleeping at .1
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-22 21:36:13 -08:00
David Zafman
453942946a test: Enhance get_timeout_delays()
Do all math using bc so we can have fractions
Allow caller to specify the first step (default 1)
Add testing of fractional first step

Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-22 21:32:34 -08:00
Jason Dillaman
5cd929aee3 Merge pull request #11960 from vshankar/wip-librbd-ec-support
librbd: bug fixes for optional data pool support

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2016-11-21 12:36:54 -05:00
Kefu Chai
23c21238b8 Merge pull request #12005 from wjwithagen/wip-wjw-freebsd-ceph-helpers
workunits/ceph-helpers.sh: FreeBSD returns a different errorstring.

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-11-21 22:53:23 +08:00
Loic Dachary
5e625674a8 tests: fix ceph-helpers.sh wait_for_clean delays
The TENTH_TIMEOUT was not delcared as an int and failed to be set with
the correct number. The test of the function did not catch this.

Implement computing of the increasingly large sleep delays in a separate
function so that it can be tested more easily. Give up on sub-second
sleep because a the function will not sleep at all if the cluster is
already clean. And if it is not already clean, it is very unlikely to
become clean within less than a second. The downside of having very
short sleep time is that it needlessly stress the machine and also
possibly spam the logs.

Refs: http://tracker.ceph.com/issues/17830

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-11-21 11:42:42 +01:00
Venky Shankar
3a91d7bd2e qa / workunits: make rbd import/export work with image data pool
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2016-11-20 21:21:50 +05:30
Loic Dachary
cd72ff9f74 tests: save 9 characters for asok paths
For vstart.sh powered tests, save 9 characters in the path name
by replacing testdir/test- with td/t-

60 characters imposed by jenkins
9 characters for src/test
5 characters for td/t-

33 left (instead of 24) for the test to create asok such as out/client.admin.25327.asok

Moving these files outside of the build directory is a bad idea because
tests should only create/use files within the builddir and not write
outside of this directory. Doing so would make things more complicated
for cleanup in case the test fail and create other problems as a
consequence (filling out disk space, conflicting directories between
runs etc.).

For ceph-helpers.sh tests replace testdir with td, saving 5 characters.
This is not strictly necessary but keeps the directory names consistent:
if the developer wants to get rid of all the test leftovers, it is
enough to remove the a single directory: td.

Fixes: http://tracker.ceph.com/issues/16014

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-11-18 09:19:18 +01:00
Kefu Chai
2c7f08b849 Merge pull request #9613 from dzafman/wip-16064
common osd: Improve scrub analysis, list-inconsistent-obj output and osd-scrub-repair test

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-11-17 15:48:32 +08:00
David Zafman
1a75696065 test: activate_osd() doesn't need to set crush
Tests use objectstore_tool() which stops and starts OSDs,
but may assume consistency of object locations.

Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-16 11:01:43 -08:00
David Zafman
f3def4a0e5 test: wait_for_clean() add sleep backoff
Reduce size of log on timeout by doing a backoff so that
we don't log 3000 loops at 1/10 second sleeps.

Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-16 11:01:43 -08:00
Willem Jan Withagen
e34e18609f workunits/ceph-helpers.sh: FreeBSD returns a different errorstring.
Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
2016-11-15 12:10:18 +01:00
Sage Weil
f9882f4abe qa/workunits/rados/test_envlibrados_for_rocksdb: force librados-dev install
On trusty we see

 WARNING: The following packages cannot be authenticated!
   librados-dev
 E: There are problems and -y was used without --force-yes

Signed-off-by: Sage Weil <sage@redhat.com>
2016-11-12 08:30:59 -05:00
Kefu Chai
36bf16d073 Merge pull request #11293 from liewegas/wip-mon-markout
mon: preserve osd weight when marking osd out, then in

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-11-11 22:00:39 +08:00
John Spray
36905d7caf Merge pull request #11783 from jcsp/wip-17796
mds: enable rmxattr on pool_namespace attrs 

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2016-11-11 12:09:57 +01:00
Sage Weil
573e5b060e qa/workunits/ceph-helpers.sh: allow pool deletes
Signed-off-by: Sage Weil <sage@redhat.com>
2016-11-10 11:43:41 -05:00
Kefu Chai
f669cf8261 Merge pull request #11633 from ddiss/tcc17684_rados_segv_without_pool
rados: fix segfaults when run without --pool

Reviewed-by: Nathan Cutler <ncutler@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-11-10 19:53:44 +08:00
Kefu Chai
37f3897c55 Merge pull request #11702 from wjwithagen/wip-wjw-freebsd-cephtool-test
qa/workunits/cephtool/test.sh: FreeBSD has no distro.

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-11-10 10:18:23 +08:00
David Zafman
907e79e2b7 test: Add test support for deep-scrub
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-11-08 15:16:52 -08:00
John Spray
f80c7a8f20 Merge pull request #11647 from ceph/wip-jlayton-cephfs
libcephfs client API overhaul and update

Reviewed-by: John Spray <john.spray@redhat.com>
2016-11-07 21:07:40 +01:00
Sage Weil
73a2c11e94 mon/OSDMonitor: remember old weight on 'osd out'
If we have an OSD with a weight that's not 1.0 and mark it out,
we should restore the same weight when we mark it back in. We
already do this when an OSD is automatically marked out, just
not when it is explicitly marked out.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-11-07 11:28:43 -05:00
Mykola Golub
a3df091595 qa/workunits/rbd: simplify running nbd test under build env
Previously running the script as unprivileged user was not very useful
due to difficulty to change path sudo was looking for a command to
execute.

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-11-04 16:22:51 +02:00
John Spray
77177530f2 mds: enable rmxattr on pool_namespace attrs
So that a user has a natural way of undoing a setxattr
which set a pool_namespace.

Fixes: http://tracker.ceph.com/issues/17797
Signed-off-by: John Spray <john.spray@redhat.com>
2016-11-04 13:00:05 +00:00
Mykola Golub
2012b4dfc6 rbd-nbd: disallow mapping images >2TB in size
Fixes: http://tracker.ceph.com/issues/17219
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
2016-11-03 14:50:47 +02:00