Commit Graph

581 Commits

Author SHA1 Message Date
Yan, Zheng
98d86a0752 qa/cephfs: kill mount if it gets evicted by mds
otherwise, teardown() hange at umount

Fixes: http://tracker.ceph.com/issues/21275
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-09-13 21:30:51 +08:00
Yan, Zheng
8433ced847 qa/cephfs: fix test_evict_client
executing mount_a.kill() twice, then executing mount_b.kill_cleanup()
twice do not make sense.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-09-13 16:17:42 +08:00
Patrick Donnelly
06c94de584
mds: support limiting cache by memory
This introduces two config parameters:

    mds_cache_memory_limit: Sets the soft maximum of the cache to the given
    byte count. (Like mds_cache_size, this doesn't actually limit the maximum
    size of the cache. It just dictates the steady-state size.)

    mds_cache_reservation: This replaces mds_health_cache_threshold everywhere
    except the Beacon heartbeat sent to the mons. The idea here is to specify a
    reservation of memory (5% by default) for operations and the MDS tries to
    always maintain that reservation. So, the MDS will recall caps from clients
    when it begins dipping into its reservation of memory.

mds_cache_size still limits the cache by Inode count but is now by-default 0
(i.e. unlimited). The new preferred way of specifying cache limits is by memory
size. The default is 1GB.

Fixes: http://tracker.ceph.com/issues/20594
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1464976

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 20:02:41 -07:00
yonghengdexin735
fc5ac9ea69 common:fix error word
Signed-off-by: yonghengdexin735 <zhang.zezhu@zte.com.cn>
2017-09-13 10:22:08 +08:00
Patrick Donnelly
f48113f50f
Merge PR #17679 into master
* refs/remotes/upstream/pull/17679/head:
	qa: get asok path from ceph.conf
	qa: use config_path property instead of literal

Reviewed-by: John Spray <john.spray@redhat.com>
2017-09-12 15:56:32 -07:00
Patrick Donnelly
bc3a00af0f
qa: get asok path from ceph.conf
The asok path was recently moved to /tmp to resolve unix path too long errors.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 15:52:54 -07:00
Patrick Donnelly
f95798b3ad
qa: use config_path property instead of literal
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 15:50:33 -07:00
Tamilarasi Muthamizhan
4d8971e1c8 Merge pull request #17650 from ceph/wip-ceph-deploy-fix
qa/tests: Fix get_system_type failure due to invalid remote name
2017-09-12 15:07:24 -07:00
Neha Ojha
59531d81c5 qa: avoid using make install for fio
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-12 08:26:27 -07:00
David Zafman
44f51024cc Merge pull request #17538 from dzafman/wip-21272
Add export and remove ceph-objectstore-tool command option

Reviewed-by: Sage Weil <sage@redhat.com>
2017-09-11 20:12:27 -07:00
Vasu Kulkarni
9fe58d5ed6 Fix get_system_type failure due to invalid remote name
recent changes caused the remote name to be invalid, fix the
arg passed to get_system_type

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-10 12:42:16 -07:00
Sage Weil
14e783c0b9 Merge pull request #17571 from jcsp/wip-21260
mon: fix dropping mgr metadata for active mgr (#21260)

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-09-10 14:20:30 -05:00
David Zafman
3bb20f6d75 ceph-objectstore-tool: Make pg removal require --force
Add new export-remove to combine the 2 operations

Fixes: http://tracker.ceph.com/issues/21272

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-09-08 17:56:05 -07:00
Patrick Donnelly
1b7aae6cdc
Merge PR #17533 into master
* refs/remotes/upstream/pull/17533/head:
	qa: log ceph-fuse kill/cleanup

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-08 11:23:23 -07:00
John Spray
28cb4bb2e6 qa/tasks/mgr: regression test for 21260
(aka http://tracker.ceph.com/issues/21260)

Fixes: http://tracker.ceph.com/issues/21260
Signed-off-by: John Spray <john.spray@redhat.com>
2017-09-08 15:45:40 +02:00
Neha Ojha
2793ee7d7e qa: add cbt task for performance testing
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-07 15:44:48 -07:00
Tamilarasi Muthamizhan
594b736228 Merge pull request #17536 from ceph/wip-cd-rbd-pool
qa/tests: fix rbd pool creation for systemd tests
2017-09-07 15:13:32 -07:00
Vasu Kulkarni
5c43c5972d Add workaround for http://tracker.ceph.com/issues/20950
mgr bootstrap key differs on disk, rewrite the new key
using auth get.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
2116f76a1f Restart services after upgrade
This seems to be an issue and should be fixed in our systemd scripts,
need to discuss more on this.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
8a2b9a0809 since the roles are mapped inside ceph-deploy, store the roles that
are mapped and use the new mapped role for upgrades during later
stage.

eg: mon.a is mapped to mon.mira002 during install, store this mapping
and durig upgrade map it back to appropriate name to find the hostname
with that role

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
e7a0a4f87f Add doc string for missing ceph-deploy-branch
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
407ce2613d use mon.a as ceph-admin node, the previous get_first_mon returns
a sorted(mons)[0] as first mon.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
f3ec7cabf6 use elif instead of else to handle cases for jewel install
jewel needs neither filestore or bluestore as an option, so provide none
when running with jewel branch.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
9a73127c16 Add upgrade functionality using ceph-deploy
This is to test for customer like upgrade scenarios and to find
any issues that may be related to systemd, packaging etc

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
b409c3d163 Fix old flake8 issues
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
a9be0c387e Add option to skip mgr install for old release
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
f12127751d Add mgr create to fix single node test to reach healthy state
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-06 18:13:39 -07:00
Vasu Kulkarni
49f95b3d8b create the rbd pool right after install
rbd pool should exist for many rbd tests to work properly, create
the pool right after install is successful.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-06 18:07:54 -07:00
Patrick Donnelly
b4f962a486
qa: log ceph-fuse kill/cleanup
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-06 13:40:11 -07:00
Patrick Donnelly
28ca766cd2
Merge PR #16305 into master
* refs/remotes/upstream/pull/16305/head:
	qa/cephfs: test CephFS recovery pools
	qa/cephfs: support CephFS recovery pools
	qa/ceph_test_case: support CephFS recovery pools
	qa/cephfs: Allow deferred fs creation
	qa/cephfs: Refactor alternate pool test

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-05 20:42:30 -07:00
Sage Weil
21027233b2 qa/tasks/ceph_manager: revive osds before doing final rerr reset
We assume below that rerrosd is up, but it may not be when we exit the
loop.

Fixes: http://tracker.ceph.com/issues/21206
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-31 14:55:46 -04:00
Douglas Fuller
6af2ae80d3 qa/cephfs: test CephFS recovery pools
Test recovering metadata in to a separate RADOS pool with
cephfs_data_scan and friends.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
8f9a252020 qa/cephfs: support CephFS recovery pools
Add support for testing recovery of CephFS metadata into an alternate
RADOS pool, useful as a disaster recovery mechanism that avoids
modifying the metadata in-place.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
c85562c94a qa/ceph_test_case: support CephFS recovery pools
Add support for testing recovery of CephFS metadata into an alternate
RADOS pool, useful as a disaster recovery mechanism that avoids
modifying the metadata in-place.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
5fafc03cb9 qa/cephfs: Allow deferred fs creation
Permit Filesystem objects to be created and settings modified before
calling Filesystem.create().

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
47318f8ac4 qa/cephfs: Refactor alternate pool test
Remove the alternate pool recovery test from test_data_scan. Newer
commits will place the test in its own file.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Jason Dillaman
1b2f1358f7 Merge pull request #17346 from idryomov/wip-krbd-xfstests-update
qa/suites/krbd: rbd_xfstests job overhaul

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-08-29 18:28:48 -04:00
Ilya Dryomov
01914be5a3 qa/tasks/rbd.xfstests: take exclude list from yaml
Different filesystems (and further, different configurations of the
same filesystem) need different exclude lists.  Hard coding the list in
a wrapper script is inflexible.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-08-29 16:09:35 +02:00
Casey Bodley
0542974504 qa: create_cache_pool no longer runs 'pool application enable'
Fixes: http://tracker.ceph.com/issues/21155

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-29 10:01:04 -04:00
Xie Xingguo
3e0506a9a5 Merge pull request #17253 from hjwsm1989/fix-qa-bug
qa: fix the potential delay of pg state change

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-28 08:35:01 +08:00
Kefu Chai
30b5b4627c Merge pull request #16494 from asomers/bin_bash
misc: Fix bash path in shebangs

Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-27 10:14:14 +08:00
huangjun
3bff102d4b qa: fix the potential delay of pg state change
If start osd process first and then mark it in, the
pg state may remain all active+clean when doing
wait_for_clean() check, which may fail the next
osd_scrub_pgs() process.
So faster pg state change by marking osd in first.

Signed-off-by: huangjun <huangjun@xsky.com>
2017-08-25 17:07:37 +08:00
Sage Weil
800fdd9953 qa/tasks/ceph_deploy: gatherkeys before mgr deploy
Otherwise we may be missing the bootstrap-mgr key.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-24 09:52:17 -04:00
Casey Bodley
47ffcb1caf Merge pull request #17162 from cbodley/wip-qa-rgw-pool-application
qa/rgw: use 'ceph osd pool application enable' on created pools

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2017-08-23 12:52:33 -04:00
Casey Bodley
568c3f3a23 qa/rgw: enable 'rgw' application on created pools
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-22 14:37:02 -04:00
Casey Bodley
7c865effd8 qa: add optional 'application' to pool creation helpers
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-22 14:37:02 -04:00
Radoslaw Zarzynski
43a7399720 qa/tasks/rgw: make the frontend_prefix per-client configurable.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-14 01:11:18 +00:00
Radoslaw Zarzynski
3dbacd1387 qa/tasks/keystone: make possible to deploy Keystone on CentOS as well.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
afe1ad3010 qa, rgw: Keystone's instances can be now accessed via non-local network interfaces.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
f81669f673 qa/tasks/keystone: handle packaged dependencies of Keystone.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
1d6be97861 qa/tasks/tempest: make the Tempest blacklisting configurable.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
1d8a4e5921 qa/tasks/tox: separate tox deployment from Keystone.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
2a845344e7 qa/tasks/keystone: comply wih the CLI interface older openstack-client.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
c1d93e80c8 qa/tasks/keystone: deploy tox.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
979d936ad1 qa/tasks/keystone: the task can be now executed also on remotes.
Initially it was compatible with local_runner only.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
728d642bc4 qa/tasks/tempest: properly update tempest.conf on remotes.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
c1fe31d327 qa/tasks/rgw: add the "frontend_prefix" configurable.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
b6a1533962 qa/tasks/tempest: integrate OpenStack Tempest for testing RadosGW's Swift API.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
2ee29a7c6a qa/tasks/keystone: deploy a Keystone instance for testing RadosGW.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Sage Weil
22a46aab3c Merge pull request #16954 from gregsfortytwo/wip-20891-pool-metadata
mon: fix "fs new" pool metadata update, tests

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-10 21:29:37 -05:00
Greg Farnum
c85af7b146 qa: test that "fs new" correctly set the application_metadata
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2017-08-10 11:09:38 -07:00
Sage Weil
c8c837f665 Merge pull request #16905 from liewegas/wip-upgrade-sloppy-flush
qa/tasks/ceph.py: tolerate flush pg stats exception
2017-08-09 21:21:51 -05:00
Matt Benjamin
0956b3aafd Merge pull request #16834 from mdw-at-linuxbox/policy
radosgw: usage: fix bytes_sent bug.
2017-08-09 14:24:01 -04:00
Marcus Watts
a45ab45f74 Test bytes_sent bugs.
Rearrange logic to make it easier to measure accumulation.
Instrument the boto request/response loop to count bytes in and out.
Accumulate byte counts in usage like structure.
Compare actual usage reported by ceph against local usage measured.
Report and assert if there are any short-comings.
Remove zone placement rule that was newly added at end: tests should be rerunable.

Nit: the logic to wait for "delete_obj" is not quite right.

Fixes: http://tracker.ceph.com/issues/19870
Signed-off-by: Marcus Watts <mwatts@redhat.com>
2017-08-08 21:56:01 -04:00
Patrick Donnelly
eabe662614
Merge PR #16378 into master
* refs/remotes/upstream/pull/16378/head:
	doc: remove accidental additions to release notes
	qa/cephfs: Fix race in test_volume_client
	qa/cephfs: Test filtered df
	PendingReleaseNotes: add note about df filtering
	client: Support new, filtered MStatfs
	objecter: Support new, filtered MStatfs
	mon/PGMap stats: Support new, filtered MStatfs
	messages: Add optional data pool to MStatfs

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-08 09:33:52 -07:00
Sage Weil
9da7e63c64 qa/tasks/ceph.py: tolerate flush pg stats exception
If the OSD doesn't see IO, it won't flush more pg/osd stats when the
luminous flag is not yet set (legacy pgmonitor mode).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-08 12:08:31 -04:00
Sage Weil
d9a0145f8f Merge pull request #16824 from liewegas/wip-more-scrub-time
qa/tasks/ceph: wait longer for scrub
2017-08-05 13:35:55 -05:00
Sage Weil
62482ce82c qa/tasks/ceph: debug osd setup
I've seen a couple rbd runs that seem to skip the next block :/

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-05 13:53:26 -04:00
Douglas Fuller
552225f329 qa/cephfs: Fix race in test_volume_client
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-04 14:38:50 -04:00
Sage Weil
d09606619f qa/tasks/ceph: wait longer for scrub
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 12:06:27 -04:00
Sage Weil
f681d6e794 Merge pull request #16794 from liewegas/wip-rbd-app
qa/tasks/ceph: enable rbd on rbd pool
2017-08-04 08:40:45 -05:00
Sage Weil
7ed9d1e8aa qa/tasks/ceph: enable rbd on rbd pool
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:40:26 -04:00
Patrick Donnelly
d89af4a3e8
Merge PR #16802 into master
* refs/remotes/upstream/pull/16802/head:
	qa: update wait_for_health for new health json syntax

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 16:20:20 -07:00
John Spray
0613d411aa qa: update wait_for_health for new health json syntax
Fixes: http://tracker.ceph.com/issues/20890
Signed-off-by: John Spray <john.spray@redhat.com>
2017-08-03 23:46:41 +01:00
Patrick Donnelly
66756c4f65
Merge PR #16292 into master
* refs/remotes/upstream/pull/16292/head:
	qa: use new hex rep of inode
	qa: fix whitelist error message
	mds: refine "Scrub error" cluster log message
	mds: polish clog messages
	doc: developer logging guidance

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 13:55:21 -07:00
Yuri Weinstein
09fd18d031 Merge pull request #16760 from cbodley/wip-rgw-disable-lifecycle-s3tests
qa/rgw: disable lifecycle tests because of expiration failures

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-08-03 13:25:39 -07:00
Douglas Fuller
b9d11af92b qa/cephfs: Test filtered df
Add a test for filtered df for file systems with single data pools.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-03 14:11:47 -04:00
Josh Durgin
ae48c75065 Merge pull request #16797 from jdurgin/wip-upgrade-jewel-x
qa: timeout when waiting for mgr to be available in healthy()

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-03 11:11:41 -07:00
Patrick Donnelly
8d33cbbf5c
qa: use new hex rep of inode
Resolves a failure from QA:

    2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:======================================================================
    2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:FAIL: test_oversize (tasks.cephfs.test_fragment.TestFragmentation)
    2017-08-02T19:23:27.489 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
    2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
    2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-pdonnell-testing-20170802/qa/tasks/cephfs/test_fragment.py", line 71, in test_oversize
    2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:    self.assertEqual(frags[0]['dirfrag'], "10000000000.0*")
    2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:AssertionError: u'0x10000000000.0*' != '10000000000.0*'
    2017-08-02T19:23:27.490 INFO:tasks.cephfs_test_runner:

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 21:39:48 -07:00
Patrick Donnelly
6cad5be68c
Merge PR #16714 into master
* refs/remotes/upstream/pull/16714/head:
	qa: test export_pin is correct in dumped subtree
	mds: print export_pin for dumped subtree

Reviewed-by: Douglas Fuller <dfuller@redhat.com>
Reviewed-by: huanwen ren <ren.huanwen@zte.com.cn>
2017-08-02 18:41:12 -07:00
Casey Bodley
0debf4dc6e qa/rgw: disable lifecycle tests because of expiration failures
lifecycle expiration tests are too reliant on timing, and have been
failing consistently for a long time

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-02 11:06:35 -04:00
Josh Durgin
63693779fc qa: timeout when waiting for mgr to be available
Otherwise during upgrades we wait forever.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2017-08-02 02:18:28 -04:00
Kefu Chai
1ff1f836da Merge pull request #16722 from tchaikov/wip-qa-fixes
qa/suites: escape the parenthesis of the whitelist text

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-02 13:00:01 +08:00
Jason Dillaman
2589f57ecd Merge pull request #16656 from idryomov/wip-qa-newer-fio
qa/tasks/rbd_fio: bump default fio version to 2.21

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-08-01 10:14:46 -04:00
Kefu Chai
d12c51ca91 qa/suites: escape the parenthesis of the whitelist text
so we can avoid the warnings like

grep: Unmatched ( or \(

because we pass the whitelisted string to `egrep -v "$1"` directly.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-01 21:54:44 +08:00
Patrick Donnelly
8db2c43e79
qa: test export_pin is correct in dumped subtree
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-31 15:33:49 -07:00
Patrick Donnelly
019f20ff98
Merge PR #16640 into master
* refs/remotes/upstream/pull/16640/head:
	qa: fix wait for wrong health message

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-28 09:55:49 -07:00
Patrick Donnelly
6fc2ee383f
Merge PR #16413 into master
* refs/remotes/upstream/pull/16413/head:
	qa/cephfs: lsof if umount fails

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-28 09:55:23 -07:00
Patrick Donnelly
ced01a2335
qa: fix wait for wrong health message
Fixes: http://tracker.ceph.com/issues/20805

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-27 14:40:05 -07:00
Sage Weil
41bcf2fee5 Merge pull request #16281 from badone/wip-PG-cluster-log-audit
osd: Log audit

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-27 16:25:30 -05:00
Alan Somers
3aae5ca6fd scripts: fix bash path in shebangs
/bin/bash is a Linuxism.  Other operating systems install bash to
different paths.  Use /usr/bin/env in shebangs to find bash.

Signed-off-by: Alan Somers <asomers@gmail.com>
2017-07-27 13:24:26 -06:00
Ilya Dryomov
bd6e3e5f1f qa/tasks/rbd_fio: bump default fio version to 2.21
I'm seeing sporadic single thread deadlocks on fio stat_mutex during krbd
thrash runs:

  (gdb) info threads
    Id   Target Id         Frame
  * 1    Thread 0x7f89ee730740 (LWP 15604) 0x00007f89ed9f41bd in __lll_lock_wait () from /lib64/libpthread.so.0
  (gdb) bt
  #0  0x00007f89ed9f41bd in __lll_lock_wait () from /lib64/libpthread.so.0
  #1  0x00007f89ed9f17b2 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
  #2  0x00000000004429b9 in fio_mutex_down (mutex=0x7f89ee72d000) at mutex.c:170
  #3  0x0000000000459704 in thread_main (data=<optimized out>) at backend.c:1639
  #4  0x000000000045b013 in fork_main (offset=0, shmid=<optimized out>, sk_out=0x0) at backend.c:1778
  #5  run_threads (sk_out=sk_out@entry=0x0) at backend.c:2195
  #6  0x000000000045b47f in fio_backend (sk_out=sk_out@entry=0x0) at backend.c:2400
  #7  0x000000000040cb0c in main (argc=2, argv=0x7fffad3e3888, envp=<optimized out>) at fio.c:63
  (gdb) up 2
  170                     pthread_cond_wait(&mutex->cond, &mutex->lock);
  (gdb) p mutex.lock.__data.__owner
  $1 = 15604

Upgrading to 2.21 seems to make these go away.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-27 18:57:43 +02:00
Sage Weil
a40d94b163 qa/tasks/ceph: wait for pg stats to flush in healthy check
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:27 -04:00
Sage Weil
80978dea8a qa/tasks/ceph_manager: wait_for_all_up -> wait_for_all_osds_up
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
Sage Weil
7648894e55 qa/tasks/ceph_manager: expose flush_all_pg_stats
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
David Zafman
e92c953d7b Merge pull request #16610 from dzafman/wip-fix-reg11184
test: reg11184 might not always find pg 2.0 prior to import

Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-26 11:42:15 -07:00
Sage Weil
59a3a4a40e Merge pull request #16559 from hjwsm1989/dump-stuck
qa/tasks/dump_stuck: fix dump_stuck test bug

Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-26 11:59:21 -05:00
David Zafman
7c43840399 test: reg11184 might not always find pg 2.0 prior to import
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-07-26 09:46:15 -07:00
Sage Weil
56ffd7a727 Merge pull request #16571 from ceph/wip-cd-bluestore-2
qa/tasks/ceph-deploy: Fix bluestore options for ceph-deploy

Reviewed-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-07-26 11:43:50 -05:00
Brad Hubbard
f8acc53d82 osd: Log audit
Review current log messages for consistency, accuracy and necessesity as
part of usability initiative. First in a series.

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2017-07-26 17:34:28 +10:00
Vasu Kulkarni
2fa0fae72f Add option to specify bluestore/filestore options
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-07-25 12:26:03 -07:00
huangjun
daf8efee32 qa/tasks/dump_stuck: fix dump_stuck test bug
Test cluster with 2 osds, stop osd.0, if osd.1
  report the pg stats during pg peering, mon will
  record pg state to 'peering',then stop osd.1,
  finally the pg state will stuck in 'stale+peering',
  which is unexpected.

  Let's wait_for_active() after stop osd.0.

  Signed-off-by: huangjun <huangjun@xsky.com>
2017-07-25 11:14:07 +00:00
Sage Weil
2e5955212d qa/tasks/workunit: allow alt basedir
Instead of 'qa/workunits' allow something like 'qa/standalone'.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-24 15:44:51 -04:00
Sage Weil
02c2e853d3 Merge pull request #16509 from liewegas/wip-rgw-wait
qa/suits/rados/basic/tasks/rgw_snaps: wait for pools to be created

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-24 11:55:54 -05:00
Sage Weil
29549e6834 Merge pull request #13723 from ovh/bp-forced-recovery
osd/PG: make prioritized recovery possible

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-24 09:01:03 -05:00
Sage Weil
ecd1193ab9 qa/suites/rados/basic/tasks/rgw_snaps: wait for pools to be be created
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-22 18:54:46 -04:00
Patrick Donnelly
9506789ce1
Merge PR 16379 into master
* refs/remotes/upstream/pull/16379/head:
	qa: fix MDS_CLIENT_RECALL copy error

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-21 13:23:07 -07:00
Patrick Donnelly
23e3d40751
Merge PR 16226 into master
* refs/remotes/upstream/pull/16226/head:
	qa: wait for OSDMap to propagate for snap purge

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-21 13:22:47 -07:00
Kefu Chai
4599eb7963 Merge pull request #16454 from liewegas/wip-fix-ceph-scrub
qa/tasks/ceph_manager: wait for osd to start after objectstore-tool sequence

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-21 19:31:19 +08:00
Sage Weil
59e3827be7 qa/tasks/reg11184: import run
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 17:42:59 -04:00
Sage Weil
27e8d75f61 Merge pull request #16429 from liewegas/wip-jewel-x
qa/suites/upgrade/jewel-x: misc fixes for new health checks
2017-07-20 10:47:05 -05:00
Sage Weil
3de9f22ce0 Merge pull request #16423 from liewegas/wip-ls
mon: '* list' -> '* ls'

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-20 10:43:34 -05:00
Sage Weil
583a38bca2 qa/tasks/ceph_manager: wait for osd to start after objectstore-tool sequence
Fixes: http://tracker.ceph.com/issues/20705
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 11:41:36 -04:00
Kefu Chai
a1d16185a2 qa/tasks/reg11184: use literal 'foo' instead pool_name
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-07-20 21:35:41 +08:00
Ilya Dryomov
67db89f6c2 Merge pull request #16428 from idryomov/wip-krbd-luminous-thrash
qa: thrash tests for backoff and upmap

Reviewed-by: Vasu Kulkarni <vasu@redhat.com>
2017-07-20 11:28:22 +02:00
Piotr Dałek
b0134cc7a8 qa: add force/cancel recovery/backfill to QA testing
This randomly issues pg force-recovery/force-backfill and
pg cancel-force-recovery/cancel-force-backfill during QA
testing. Disabled for upgrades from hammer, jewel and kraken.

Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
2017-07-20 09:35:55 +02:00
Jason Dillaman
836ab7ad95 test: skip pool application metadata tests if OSDs not at min luminous
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-19 13:13:01 -04:00
Sage Weil
572a942f8f mon: 'auth list' -> 'auth ls'
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-19 12:33:14 -04:00
Ilya Dryomov
682c5a42e1 qa/tasks/rbd_fio: dump fio options before starting
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-19 12:18:16 +02:00
Ilya Dryomov
03f69b3275 qa/tasks/rbd_fio: support libaio engine
Want to set iodepth and do direct AIO.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-19 12:18:16 +02:00
Yan, Zheng
b49d6d8ead qa/cephfs: lsof if umount fails
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-07-19 15:32:37 +08:00
Patrick Donnelly
f8e0571982
qa: fix MDS_CLIENT_RECALL copy error
Fixes: http://tracker.ceph.com/issues/20682

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-18 16:06:20 -07:00
Patrick Donnelly
5b1a229fca
Merge PR 16200 into master
* refs/remotes/upstream/pull/16200/head:
	qa: thrash max_mds and deactivate ranks

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-17 22:33:34 -07:00
Patrick Donnelly
b94d1dc385
Merge PR 16288 into master
* refs/remotes/upstream/pull/16288/head:
	qa/cephfs: don't use int() to convert string of float point number

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-17 22:31:05 -07:00
Sage Weil
dfc9c36606 fix ceph.py 2017-07-17 16:27:13 -04:00
Sage Weil
c66da972df qa/tasks/ceph.py: create osds in order
We aren't passing id to legacy 'osd create', which means we have to go
in order!

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-17 15:22:47 -04:00
Ilya Dryomov
0f75d79c34 qa/tasks/rbd_fio: use teuthology.packaging for handling packages
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-17 15:32:51 +02:00
Jason Dillaman
4fa1918717 qa/tasks: rbd-mirror daemon not properly run in foreground mode
Fixes: http://tracker.ceph.com/issues/20630
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-14 10:32:28 -04:00
Sage Weil
56e2965502 qa/tasks/ceph_manager: wait longer for pg stats to flush
An ill-timed mgr restart could blow the current 15s wait.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-13 12:13:45 -04:00
Yan, Zheng
e4844706b0 qa/cephfs: don't use int() to convert string of float point number
Fixes: http://tracker.ceph.com/issues/20582
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-07-13 15:55:22 +08:00
Sage Weil
fd39700d55 qa/tasks/ceph: wait for osds to come up before creating pool
Avoid health warnings.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
25717f7e84 qa/tasks/ceph_test_case.py: update health check helpers
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
93de19adcf qa: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
63f97ddcf6 qa/suites/rados: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b25ddafa16 qa/tasks/mon_clock_skew_check: vastly simplify
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
83aaa75991 qa/tasks/ceph: stop logging health on shutdown
Don't log health during actual teardown or we'll see
various scary messages unrelated to our test run.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
64f47c89fe Merge pull request #16265 from dzafman/wip-20470
test: Fix reg11184 test to remove extraneous pg

Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-11 13:46:33 -05:00
David Zafman
8b4b11a905 test: Fix reg11184 test to remove extraneous pg
Minor cleanups
Remove unused pool to aid future analysis

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-07-11 10:41:09 -07:00
Sage Weil
b5f81e7313 qa/tasks/ceph.py: no osd id to 'osd create' command
This isn't recognized by hammer, and we don't need it for jewel.

Fixes: http://tracker.ceph.com/issues/20548
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-10 10:43:16 -04:00
Sage Weil
8b21c6b6fd Merge pull request #16027 from liewegas/wip-crush-rule-class
mon,crush: create crush rules using device classes for replicated and ec pools via cli

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2017-07-08 21:34:13 -05:00
Sage Weil
0c79c4ffac Merge pull request #16228 from smithfarm/wip-rados-upgrade-2
tests: fix rados/upgrade/jewel-x-singleton and make workunit task handle repo URLs not ending in ".git"

Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-08 21:32:36 -05:00
Nathan Cutler
98eda9a4dc qa: workunit: handle repo ending in "/ceph-ci"
If the repo ends in "/ceph-ci" it's the same as if it ended in "/ceph-ci.git"

Before this change, the following command was broken if the workunit specified,
e.g., "branch: jewel":

teuthology-suite --ceph-repo https://github.com/ceph/ceph --ceph master
--suite-repo https://github.com/ceph/ceph-ci --suite-branch wip-foo . . .

Fixes: http://tracker.ceph.com/issues/20554
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-07-08 19:03:30 +02:00
Sage Weil
f17516656e Merge pull request #16212 from dillaman/wip-qa-ceph-task
qa/tasks/ceph: don't hard-code cluster name when copying fsid

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-08 08:54:18 -05:00
Sage Weil
704d13d36f Merge pull request #16213 from liewegas/wip-radosbench-timeout
qa/tasks/radosbench: longer timeout
2017-07-08 08:53:55 -05:00
Patrick Donnelly
62d008436b
qa: wait for OSDMap to propagate for snap purge
Note: unmounting the client is not necessary for purging snapshots.

Fixes: http://tracker.ceph.com/issues/20072

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-07 15:12:42 -07:00
Jason Dillaman
a06f8844c7 qa/tasks/ceph: supply cluster name to ceph CLI
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-07 11:21:51 -04:00
Vasu Kulkarni
9b81349985 create mgr nodes explicitly now, previously they where
started alongside monitor daemons due to systemd rules.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-07-07 08:16:21 -07:00
Sage Weil
597a971686 qa/tasks/radosbench: longer timeout
Hit another timeout due to aggressive thrashing with size==min_size.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-07 11:04:36 -04:00
Jason Dillaman
5a8967cde7 qa/tasks/ceph: don't hard-code cluster name when copying fsid
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-07 10:44:20 -04:00
Patrick Donnelly
a84e3c89bf
qa: thrash max_mds and deactivate ranks
Fixes: http://tracker.ceph.com/issues/10792

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-06 22:29:41 -07:00
Patrick Donnelly
5b87301192
Merge remote-tracking branch 'upstream/pull/15822/head' into master
* upstream/pull/15822/head:
  qa: add timeout/repeat to pool df

Reviewed-by: John Spray <jspray@redhat.com>
2017-07-06 22:14:32 -07:00
Patrick Donnelly
77b707a6ea
Merge remote-tracking branch 'upstream/pull/15950/head' into master
* upstream/pull/15950/head:
  qa: update thrash max mds testing

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-06 22:11:18 -07:00
Patrick Donnelly
97cdb1e34a
Merge remote-tracking branch 'upstream/pull/15817/head' into master
* upstream/pull/15817/head:
  qa: wait for healthy cluster before testing pins

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-06 21:36:34 -07:00
Patrick Donnelly
2cb42a4dbf
Merge remote-tracking branch 'upstream/pull/13770/head' into master
* upstream/pull/13770/head:
  tasks/cephfs: add TestStrays.test_replicated_delete_speed

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-06 21:33:03 -07:00
Sage Weil
dc7a2aaf7a erasure-code: ruleset-* -> crush-*
1) ruleset is an obsolete term, and
2) crush-{rule,failure-domain,...} is more descriptive.

Note that we are changing the names of the erasure code profile keys
from ruleset-* to crush-*.  We will update this on upgrade when the
luminous flag is set, but that means that during mon upgrade you cannot
create EC pools that use these fields.

When the upgrade completes (users sets require_osd_release = luminous)
existing ec profiles are updated automatically.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-06 15:01:03 -04:00
Sage Weil
d02c9a5b01 Merge pull request #16122 from tchaikov/wip-qa-warning-msg
qa/tasks/ceph: should be "Waiting for all PGs", not "all osds"

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-07-05 10:26:56 -05:00
Mykola Golub
2a9f56f818 Merge pull request #15860 from dillaman/wip-20168
librbd: fail IO request when exclusive lock cannot be obtained

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-07-05 14:52:55 +03:00
Kefu Chai
2cae544cf2 qa/tasks/ceph: should be "Waiting for all PGs", not "all osds"
amend the warning message

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-07-05 18:58:58 +08:00
Mykola Golub
866cf72440 Merge pull request #15956 from dillaman/wip-librbd-devstack
test: fix failing rbd devstack teuthology test

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-07-01 15:02:29 +03:00
Josh Durgin
4bcd6f666b Merge pull request #14760 from dzafman/wip-19657
osd: On EIO from read recover the primary replica from another copy

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-06-29 14:00:18 -07:00
John Spray
623f1240a2 tasks/cephfs: add TestStrays.test_replicated_delete_speed
Reproducer for http://tracker.ceph.com/issues/16914

Signed-off-by: John Spray <john.spray@redhat.com>
2017-06-29 17:21:57 +01:00
Sage Weil
5712698787 Merge pull request #15894 from liewegas/wip-no-default-pool
mon,osd: do not create rbd pool by default
2017-06-29 11:21:42 -05:00
Yuri Weinstein
ea87f42cac Merge pull request #15913 from smithfarm/wip-20417
tests: rgw.py: put client roles in a separate list

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-06-28 08:07:54 -07:00
Sage Weil
93918b0be5 test,qa/workunits: fix a zillion tests
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-28 10:52:49 -04:00
Sage Weil
fc486c6fbd qa/tasks/ceph: create rbd pool after starting the osds
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-28 10:52:49 -04:00
Sage Weil
fc7afc239f Merge pull request #15853 from liewegas/wip-simpler-ceph
qa/tasks/ceph: simplify ceph deployment slightly

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-06-27 14:13:53 -05:00
Jason Dillaman
fdc4c9221f qa/tasks/qemu: rbd cache is enabled by default
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-06-27 14:03:34 -04:00
Sage Weil
e7006d06fb qa/tasks/ceph: explicitly add osds to crush map for upgrades
Before kraken, ceph-osd didn't add itself to crush... ceph-osd-prestart.sh
did it.  And ceph.py doesn't use that.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-27 12:52:35 -04:00
Jason Dillaman
6d4b32638c qa/task/qemu: ensure test has access to all generated log files
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-06-27 12:48:11 -04:00
Jason Dillaman
f4f26b86d5 qa/suites/rbd: ensure guest VM updates to Ceph version under test
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-06-27 12:44:49 -04:00
Sage Weil
80e2b156e0 qa/tasks/dump_stuck: fix for active+clean+remapped
In d24a888665 we made remapped a clean
state but didn't fix this test.

Fixes: http://tracker.ceph.com/issues/20431
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-27 12:01:07 -04:00
Yan, Zheng
8d1828dc60 qa: update thrash max mds testing
Current monitor only allows deactivating one mds at a time. Besides,
the mds to deactivate should have max rank id.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-06-27 22:08:26 +08:00
Patrick Donnelly
95c0ca6a2b
qa: add timeout/repeat to pool df
Fixes: http://tracker.ceph.com/issues/20212

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-06-27 06:50:15 -07:00
Nathan Cutler
9778643c67 tests: rgw.py: put client roles in a separate list
Fixes: http://tracker.ceph.com/issues/20417
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-06-27 14:17:56 +02:00
Sage Weil
dfa95db57d qa/tasks/radosbench: increase timeout
The current timeout isn't enough in some cases (powercycle thrashing leaves
osds down for a long time because rebooting is so slow).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-26 15:09:19 -04:00
Zack Cerza
5442b5ff25 Merge pull request #15859 from smithfarm/wip-swift-task-move-master
tests: move swift.py task from teuthology to ceph, phase one (master)
2017-06-26 10:16:42 -06:00
Nathan Cutler
2bc0c7c617 tests: swift.py: tweak imports
The ".." form only works within the teuthology repo. With swift.py now in the
Ceph repo, we have to be explicit.

Error message was: "ValueError: Attempted relative import beyond toplevel
package

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-06-25 12:42:36 +02:00
Casey Bodley
6b42352b70 Merge pull request #14624 from ceph/wip-s3a-hadoop
qa/tasks: S3A hadoop task to test s3a with Ceph

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-06-23 13:46:05 -04:00
John Spray
b6cfa35458 qa: no longer need to explicitly enable multimds
Signed-off-by: John Spray <john.spray@redhat.com>
2017-06-23 17:07:34 +01:00
David Zafman
33edfe3a0f test: Add two new singleton test yamls radom-eio and thrash-eio
New option "random_eio" to Thrasher, sets 1 osd random read percentage
New option "objectsize" to radosbench task (-o bench option)
New option "type" to radosbench specify write, seq or rand

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-06-23 08:09:15 -07:00
Nathan Cutler
aab3920977 Merge branch 'master' of /home/smithfarm/src/ceph/upstream/teuthology into wip-swift-task-move-master 2017-06-23 08:30:38 +02:00
Nathan Cutler
7b58ac97e9 tests: move swift.py task to qa/tasks
In preparation for moving this task from ceph/teuthology.git into ceph/ceph.git

The move is necessary because jewel-specific changes are needed, yet teuthology
does not maintain a separate branch for jewel. Also, swift.py is a
Ceph-specific task so it makes more sense to have it in Ceph.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-06-23 08:27:42 +02:00
Sage Weil
2c8904506a qa/tasks/ceph: create osds before starting
This ensure they are in the OSDMap before the CRUSH
map, which avoids the OSD_ORPHAN health warning.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-22 13:45:27 -04:00
Sage Weil
0eed85515a qa/tasks/ceph: do not pregenerate osdmap
This is useless, and further leads to a situation
where we have a populated CRUSH map before any OSDs,
triggering the OSD_ORPHAN error.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-22 13:45:26 -04:00
John Spray
38dccd2c72 Merge pull request #15548 from ukernel/wip-20196
mds: improvements for stray reintegration

Reviewed-by: John Spray <john.spray@redhat.com>
2017-06-22 06:46:27 -04:00
Orit Wasserman
c320fbd9f8 Merge pull request #15753 from pritha-srivastava/wip-rgw-s3tests-conf
rgw: Changes for s3test config file, to add user under a tenant.
Reviewed-by: Casey Bodely <cbodley@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-06-22 11:00:26 +03:00
Patrick Donnelly
d4870a093c
qa: wait for healthy cluster before testing pins
Fixes: http://tracker.ceph.com/issues/20318

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-06-21 13:21:32 -07:00
Vasu Kulkarni
14b6267cba s3a task to test radosgw compatibility with hadoop s3a interface
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-06-21 11:52:10 -07:00
Sage Weil
6a00ba0e26 qa/tasks/ceph_manager: get osds all in after thrashing
Otherwise we might end up with some PGs remapped, which means they won't
get scrubbed.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-20 12:07:25 -04:00
Yan, Zheng
57e82edc9c qa/cephfs: use ceph.dir.pin to trigger migration
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-06-20 17:39:46 +08:00
Pritha Srivastava
5e94a9852c rgw: Changes for s3test config file, to add user under a tenant.
Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
2017-06-20 12:57:24 +05:30
Sage Weil
04969eff23 qa/tasks/resolve_stuck_peering: start osd at end
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 14:28:28 -04:00
Sage Weil
cc902a1f6b qa/tasks/ceph: osd_scrub_pgs: reissue scrub requests in loop
The scrub commands are not reliable: if the OSD doesn't happen to
be connected at the time the command is issued it may not get
delivered.  Re-request scrubs for each PG that has not yet been
scrubbed so that we don't wait forever when the original request
is dropped.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 12:00:12 -04:00
Sage Weil
32361a798f qa/tasks/ceph: osd_scrub_pgs: tolerate down osd at initial scrub time
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 12:00:12 -04:00
Sage Weil
bdf40c546d Merge pull request #15717 from liewegas/wip-20326
qa/tasks/ceph.py: tolerate active+clean+something

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-06-16 16:12:20 -05:00
Sage Weil
1565b86dc0 qa/tasks/ceph.py: tolerate active+clean+something
where something is, say, snaptrim.  or maybe scrubbing.
or whatever.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 22:29:28 -04:00
Sage Weil
f870cc5f28 qa/tasks/thrashosds: wait before wait_for_recovery
Make sure OSDs are up *and* they have flushed their PG stats before
waiting for recovery to ensure that we do not see a stale 'clean' state.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 12:14:24 -04:00
Sage Weil
200abcee6d qa/tasks/ceph: raise exception if scrubs time out
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00
Sage Weil
0d80c88667 qa/tasks/ceph: raise an exception if pgs are not clean
If this happens the preceding test should have cleaned
up (e.g., ceph.healthy:).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00
Sage Weil
6fa9d32407 qa/tasks/ceph: osd_scrub_pgs: try a bit longer
I just saw a test fail that was still waiting for
scrubs to complete.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00