Commit Graph

601 Commits

Author SHA1 Message Date
Patrick Donnelly
37a7737712
qa: whitelist failover log warnings
Fixes: http://tracker.ceph.com/issues/21509

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-22 09:42:06 -07:00
Patrick Donnelly
8ecef57bb1
qa: ignore evicted client in client-recovery
Fixes: http://tracker.ceph.com/issues/21508

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-22 09:38:59 -07:00
Nathan Cutler
662168b5d7 Merge pull request #17910 from smithfarm/wip-21499
tests: point-to-point-x: upgrade client.1 to -x along with cluster nodes

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-09-22 17:07:02 +02:00
Nathan Cutler
afc5a7d320 qa: point-to-point-x: upgrade client.1 to -x along with cluster nodes
The client.1 rgw in workload_x had not been upgraded to -x.

Fixes: http://tracker.ceph.com/issues/21499
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-09-22 10:07:28 +02:00
Kefu Chai
e8741d5ed8 Merge pull request #17794 from hjwsm1989/wip-21425
qa/suites/rest/basic/tasks/rest_test: more whitelisting

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-09-22 12:42:27 +08:00
Vasu Kulkarni
56d1511bdf use ovh instead of vps
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-21 16:43:24 -07:00
Vasu Kulkarni
184d5e6bad qa/tests: update required ceph-ansbile vars
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-19 17:12:09 -07:00
Vasu Kulkarni
077904a0b5 qa/tests: Fix ceph-ansible upstream vars
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-19 17:11:56 -07:00
Neha Ojha
11d8dfe591 qa/suites/rados/perf: create pool with lower pg_num
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-19 16:40:45 -07:00
Josh Durgin
401ccd5966 Merge pull request #17786 from neha-ojha/wip-cbt-teuthology-integration-cntd
qa/suites/rados/perf: add optimized settings

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-19 08:35:10 -07:00
huangjun
4d30d02e52 qa/suites/rest/basic/tasks/rest_test: more whitelisting
Fixes: http://tracker.ceph.com/issues/21425

Signed-off-by: huangjun <huangjun@xsky.com>
2017-09-19 16:13:34 +08:00
Neha Ojha
2635e7a591 qa/suites/rados/perf: add optimized settings
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-18 15:53:28 -07:00
Casey Bodley
a97d5bea42 qa/rgw: hadoop suite targets centos_latest
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-09-18 13:29:40 -04:00
Casey Bodley
4a953a16d7 qa/rgw: use overrides for hadoop version
instead of having two separate tasks, add v27.yaml and v28.yaml with
overrides for hadoop_version

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-09-18 13:28:24 -04:00
Patrick Donnelly
064f1dfc3c
Merge PR #17676 into master
* refs/remotes/upstream/pull/17676/head:
	qa/tasks/cephfs: Whitelist POOL_APP_NOT_ENABLED for test_misc

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-15 14:12:29 -07:00
Josh Durgin
520a5a218c Merge pull request #17583 from neha-ojha/wip-cbt-teuthology-integration
qa: add cbt task for performance testing

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-14 16:36:26 -07:00
Vasu Kulkarni
1f5aba674c Various whitelists for tests to pass
See tracker http://tracker.ceph.com/issues/21376 for details

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-14 13:32:25 -07:00
Xie Xingguo
b4bb7ce2da Merge pull request #17371 from xiexingguo/wip-per-pool-full-control
mon, osd: per pool space-full flag support

Reviewed-by: Sage Weil <sage@redhat.com>
2017-09-14 18:26:12 +08:00
Patrick Donnelly
d929dae49b
Merge PR #17657 into master
* refs/remotes/upstream/pull/17657/head:
	mds: optimize MDCache::rejoin_scour_survivor_replicas()
	mds: fix MDSCacheObject::clear_replica_map
	mds: support limiting cache by memory
	common: refactor of lru
	mds: resolve unsigned coercion compiler warning
	common: use safer uint64_t for list size
	common: add bytes2str pretty print function
	mds: check if waiting is allocated before use
	mds: go back to compact_map for replicas
	mds: use mempool for cache objects
	mds: cleanup replica_map access
	common: add alloc_ptr smart pointer
	common: add warning on base class use of mempool
	common: use atomic uin64_t for counter

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-09-13 20:08:51 -07:00
Casey Bodley
72b08e8f96 qa/smoke: add rgw crypto config for s3tests
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-09-13 11:22:28 -04:00
Xie Xingguo
0e604b112e Merge pull request #17515 from xiexingguo/wip-data-digest
osd/PrimaryLogPG: do not set data digest for bluestore

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2017-09-13 18:31:10 +08:00
xie xingguo
afcb617dc9 osd/PrimaryLogPG: do not generate data digest for BlueStore by default
BlueStore enables CRC by default, so this is a dup and gains
no more benefits.

Turn this off by default, which is good for performance.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-09-13 12:17:16 +08:00
Patrick Donnelly
06c94de584
mds: support limiting cache by memory
This introduces two config parameters:

    mds_cache_memory_limit: Sets the soft maximum of the cache to the given
    byte count. (Like mds_cache_size, this doesn't actually limit the maximum
    size of the cache. It just dictates the steady-state size.)

    mds_cache_reservation: This replaces mds_health_cache_threshold everywhere
    except the Beacon heartbeat sent to the mons. The idea here is to specify a
    reservation of memory (5% by default) for operations and the MDS tries to
    always maintain that reservation. So, the MDS will recall caps from clients
    when it begins dipping into its reservation of memory.

mds_cache_size still limits the cache by Inode count but is now by-default 0
(i.e. unlimited). The new preferred way of specifying cache limits is by memory
size. The default is 1GB.

Fixes: http://tracker.ceph.com/issues/20594
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1464976

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 20:02:41 -07:00
Douglas Fuller
eba01c48ca qa/tasks/cephfs: Whitelist POOL_APP_NOT_ENABLED for test_misc
test_misc verifies that ceph fs new will not create a filesystem
on a pool that already contains objects. As part of the test, it
inserts a dummy object into a pool and then attempts to use it for
CephFS. This triggers POOL_APP_NOT_ENABLED. Setting the application
metadata for the pool (and having ceph fs new fail because of the
existing metadata) would then exercise a different failure case.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-09-12 13:22:09 -04:00
Neha Ojha
59531d81c5 qa: avoid using make install for fio
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-12 08:26:27 -07:00
Mykola Golub
b196d38783 Merge pull request #17621 from idryomov/wip-krbd-xfstests-blockdev
qa/suites/krbd: enable xfstests blockdev tests

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-09-11 14:09:32 +03:00
Tamilarasi Muthamizhan
ec423100dc Merge pull request #17096 from ceph/wip-ansbile-fixes
[qa/tests]: misc ceph-ansible fixes and udpate
2017-09-08 14:47:12 -07:00
Patrick Donnelly
52c79aecdd
Merge PR #17466 into master
* refs/remotes/upstream/pull/17466/head:
	qa: whitelist expected MDS_READ_ONLY for kcephfs test

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-08 11:23:19 -07:00
Neha Ojha
1dfd12e852 qa/suites/rados: add perf suite
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-08 11:15:11 -07:00
Vasu Kulkarni
b1fd615e9f Stop the mgr on node which is not client, this will ensure
the client.0's mgr is active mgr for workunit to work.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-08 10:00:49 -07:00
Ilya Dryomov
131c1189d6 qa/suites/krbd: enable xfstests blockdev tests
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-09-08 15:45:34 +02:00
xie xingguo
b4ca5ae462 mon, osd: per pool space-full flag support
The newly introduced 'device-class' can be used to separate
different type of devices into different pools, e.g, hdd-pool
for backup data and all-flash-pool for DB applications.

However, if any osd of the cluster is currently running out
of space (exceeding the predefined 'full' threshold), Ceph
will mark the whole cluster as full and prevent writes to all pools,
which turns out to be very wrong.

This patch instead makes the space 'full' control at pool granularity,
which exactly leverages the pool quota logic but shall solve
the above problem.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-09-08 10:03:17 +08:00
Vasu Kulkarni
e5b5a1b8c8 Test ceph-mgr RESTful api
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 17:10:04 -07:00
Vasu Kulkarni
e963bfd6ca Add additional variables as required by ceph-ansible for upstream
installation.

possible revert if those are made default
issue: https://github.com/ceph/ceph-ansible/issues/1834

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 16:54:51 -07:00
Vasu Kulkarni
71ea4987e0 Add kraken to luminous upgrade case
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
526d4aef1c Add kernel tar/untar and systemd task
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
9c85c76a90 Jewel to luminous upgrade scenario using ceph-deploy
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
30dbbfe4ae Remove unsupported 2-size-1-min-size config
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 09:47:15 -07:00
Mykola Golub
f618508624 Merge pull request #17504 from dillaman/wip-21251
qa/suites/rbd: miscellaneous test fixes

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-09-07 11:34:01 +03:00
Patrick Donnelly
73e927293c
Merge PR #17319 into master
* refs/remotes/upstream/pull/17319/head:
	qa: whitelist expected rstat warning
	qa: sync whitelist with fs/basic_functional
	qa: whitelist expected MDS_CACHE_OVERSIZED

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-09-05 20:49:54 -07:00
Patrick Donnelly
28ca766cd2
Merge PR #16305 into master
* refs/remotes/upstream/pull/16305/head:
	qa/cephfs: test CephFS recovery pools
	qa/cephfs: support CephFS recovery pools
	qa/ceph_test_case: support CephFS recovery pools
	qa/cephfs: Allow deferred fs creation
	qa/cephfs: Refactor alternate pool test

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-05 20:42:30 -07:00
Jason Dillaman
27194c30c5 qa/suites/rbd: reduce monitor requirements for mirroring tests
Fixes: http://tracker.ceph.com/issues/21251
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-09-05 16:43:34 -04:00
Jason Dillaman
98061bb3d7 qa/suites/rbd: fixed cache override
Fixes: http://tracker.ceph.com/issues/21251
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-09-05 16:43:34 -04:00
Jason Dillaman
763f1877ec qa/suites/rbd: reduce OSD debug log levels
The teuthology machines are periodically running out of space
due to the aggressive log settings.

Fixes: http://tracker.ceph.com/issues/21251
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-09-05 16:43:30 -04:00
Yan, Zheng
584241285f qa: whitelist expected MDS_READ_ONLY for kcephfs test
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-09-05 17:13:59 +08:00
Josh Durgin
317a8bae5e Merge pull request #17160 from ceph/wip-luminous-x-master
qa: add upgrade/luminous-x suite

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-01 14:19:48 -07:00
Yuri Weinstein
aff342627c Initial check in
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-09-01 08:56:19 -07:00
Sage Weil
ec5e9c8d76 Merge pull request #17379 from liewegas/wip-div-p
qa/suites/rados/singleton/diverget_priors*: broaden whitelist
2017-08-31 13:56:59 -05:00
Sage Weil
9d9c4bcbba Merge pull request #17247 from liewegas/wip-post-luminous-cleanup
osd: clean up pre-luminous compat cruft (part 1 of many)

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-30 14:15:23 -05:00
Sage Weil
39e5efbad2 qa/suites/rados/singleton/diverget_priors*: broaden whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-30 15:13:08 -04:00
Douglas Fuller
6af2ae80d3 qa/cephfs: test CephFS recovery pools
Test recovering metadata in to a separate RADOS pool with
cephfs_data_scan and friends.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Ilya Dryomov
6647c65ebe qa/suites/krbd: new rbd_xfstests job definition
ext4 seems to be a better choice for our purposes -- less test churn,
rather small and reliable exclude list.

All excluded tests but generic/050 fail with no krbd in the mix, most
have popped up on linux-ext4 list at least once.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-08-29 16:09:35 +02:00
Sage Weil
7b51cedac6 qa/suites/rados/upgrade: jewel-x -> luminous-x
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-28 23:11:27 -04:00
Sage Weil
d8dead1aaf qa/suites/rados: remove luminous tests
- snapdir conversion (at-end) stuff
- merge luminous-specific collections that avoided the above back
into their normal locations

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-28 23:10:32 -04:00
Patrick Donnelly
c53d091e51
qa: whitelist expected rstat warning
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-28 11:52:36 -07:00
Patrick Donnelly
701e0910e3
qa: sync whitelist with fs/basic_functional
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-28 11:49:53 -07:00
Patrick Donnelly
71f0066f6e
qa: whitelist expected MDS_CACHE_OVERSIZED
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-28 11:49:33 -07:00
Sage Weil
5235f35ac8 qa/suites/powercycle/osd/whitelist_health: whitelist more
"2017-08-26 16:09:27.704418 mon.a mon.0 172.21.15.169:6789/0 876 : cluster [WRN] MDS health message (mds.0): Behind on trimming (66/30)" in cluster log

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-28 13:58:02 -04:00
Kefu Chai
b2d7f4f4c7 qa/suites/rados/upgrade/jewel-x-singleton: tolerate sloppy past_intervals
See-also: d5d5d7d1
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-28 15:19:41 +08:00
Sage Weil
893b3ac6fa Merge pull request #17227 from liewegas/wip-jewel-x
qa/suites/upgrade/jewel-x/parallel: tolerate laggy mgr
2017-08-24 09:30:31 -05:00
Sage Weil
bf296018ff qa/suites/upgrade/jewel-x/parallel: tolerate laggy mgr
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-24 10:30:01 -04:00
Sage Weil
d3632fd2f9 Merge pull request #17226 from liewegas/wip-jewel-x
qa/suites/upgrade/jewel-x/stress-split: tolerate sloppy past_intervals
2017-08-24 09:27:44 -05:00
Sage Weil
d5d5d7d1d2 qa/suites/upgrade/jewel-x/stress-split: tolerate sloppy past_intervals
This is harmless in general, esp during upgrade.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-24 10:23:22 -04:00
Sage Weil
4f1fca0483 Merge pull request #17203 from liewegas/wip-jewel-x
qa/suites/upgarde/jewel-x/parallel: tolerate mgr warning
2017-08-23 17:21:37 -05:00
Yuri Weinstein
304b492187 Initial check in luminous-x suite
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-08-23 14:53:55 -07:00
Matt Benjamin
1e1731e663 Merge pull request #16612 from cbodley/wip-20668
rgw: fixes for multisite replication of encrypted objects
2017-08-23 15:57:02 -04:00
Sage Weil
5455f599b3 qa/suites/upgrade/jewel-x/parallel: tolerate OBJECT_MISPLACED
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-23 14:24:00 -04:00
Sage Weil
2504ab1675 qa/suites/upgarde/jewel-x/parallel: tolerate mgr warning
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-23 14:22:34 -04:00
Patrick Donnelly
75967dbfe7
Merge PR #17111 into master
* refs/remotes/upstream/pull/17111/head:
	qa: add health whitelist for kcephfs suite

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-23 06:49:11 -07:00
Casey Bodley
5e67c681f7 Merge pull request #16344 from rzarzynski/wip-rgwqa-tempest
rgw, qa: integrate Tempest to verify RadosGW's compliance with Swift API

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-08-22 15:02:15 -04:00
Yan, Zheng
b10989209f qa: add health whitelist for kcephfs suite
Fixes: http://tracker.ceph.com/issues/20892
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-08-21 17:01:22 +08:00
Vasu Kulkarni
9cc00c5c1a Rename folders to fix task order
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-08-18 11:35:54 -07:00
Vasu Kulkarni
1041c803f1 use bluestore with dmcrypt option
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-08-18 11:09:50 -07:00
Vasu Kulkarni
f6de5d9f9e Add dmcrypt option
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-08-18 11:08:00 -07:00
Vasu Kulkarni
60d00e0ead Separate the main task from options
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-08-18 11:05:01 -07:00
Vasu Kulkarni
0395b84488 Catchup with recent changes with ceph-ansible
Adds osd_scenario and ceph_stable_release variables

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-08-18 10:47:22 -07:00
Casey Bodley
f27ebabe55 test/rgw: add kms encryption key for teuthology
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-16 12:22:31 -04:00
Kefu Chai
17d6e96a6f Merge pull request #16967 from liewegas/wip-upgrade-health
mon: fix legacy health checks in 'ceph status' during upgrade; fix jewel-x upgrade combo

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-16 19:44:36 +08:00
Radoslaw Zarzynski
ed8a6b89e4 qa/suites/rgw/tempest: use fixed-1 cluster instead of fixed-2.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-14 01:11:22 +00:00
Radoslaw Zarzynski
43a7399720 qa/tasks/rgw: make the frontend_prefix per-client configurable.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-14 01:11:18 +00:00
Radoslaw Zarzynski
09db786581 qa/suites/rgw: move the Tempest testing to its dedicated sub-suite.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
99e1d443a0 qa/suites/rgw: freeze the Tempest version for RGW testing.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
afe1ad3010 qa, rgw: Keystone's instances can be now accessed via non-local network interfaces.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Radoslaw Zarzynski
849f46f8cf qa/suites/rgw: integrate Tempest to verify Swift API compliance.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-08-13 23:14:40 +00:00
Sage Weil
d69f0e120b qa/suites/rados/objectstore/objectstore: less debug
Saw an ENOSPC.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-13 14:41:43 -04:00
Sage Weil
41e5a85308 qa/suites/rados/verify/validater/valgrind: whitelist PG_
Peering might be slow due to valgrind.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-12 14:18:59 -04:00
Sage Weil
12007044b1 qa/suites/rados/multimon/tasks/mon_lock_with_skew: whitelist PG_
Default pool pgs not up because mons too broken for OSDs to peer.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-12 14:15:15 -04:00
Sage Weil
ad23d7dc1f qa/suites/rados/multimon: whitelist mgr down vs clock skew test
Clock skew might make us fail the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-11 13:42:02 -04:00
Sage Weil
dd2fb6c40b Merge pull request #16944 from liewegas/wip-kraken-x
mon/Elector: force election epoch bump on start

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-08-10 11:12:43 -05:00
Sage Weil
c46bdf5efd Revert "qa/suites/upgrade/jewel-x/parallel: thrash layout"
This reverts commit 435777dbff.

This test combination is not yet stable.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-10 09:51:29 -04:00
Sage Weil
a0b9f37dbc qa/suites/upgrade/jewel-x/parallel: no loadgenbig
When we do the thrashing this leads to ENOSPC on smithi.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-10 09:44:17 -04:00
Sage Weil
435777dbff qa/suites/upgrade/jewel-x/parallel: thrash layout
We can't kill and restart osds because that will interfere with
the upgrade process.  We can, however, thrash the layout by
tweaking osd weights and so on.  This will exercise osd recovery
paths during the upgrade that aren't normally exercised (outside
of stress-split..which doesn't upgrade individual osds while they
are non-clean).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 22:07:48 -04:00
Sage Weil
b61be07d45 qa/suites/upgrade/kraken-x/stress-split: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 13:58:55 -04:00
Sage Weil
1043fca076 Merge pull request #16923 from liewegas/wip-20738
qa/suites/rados/objectstore: logs
2017-08-09 12:45:29 -05:00
Sage Weil
bbd5fe354c qa/suites/upgarde/jewel-x/point-to-point-x: disable app warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 09:18:54 -04:00
Sage Weil
c8d60396c7 qa/suites/rados/objectstore: logs
Hunting http://tracker.ceph.com/issues/20738

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-08 18:07:18 -04:00
Sage Weil
bf29142b08 qa/suites/upgrade/kraken-x/stress-split*: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 21:36:58 -04:00
Sage Weil
2234a0ed11 qa/suites/upgrade/kraken-x/parallel: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 21:36:58 -04:00
Sage Weil
3e7d157871 qa/suites/upgrade/jewel-x/parallel: fix POOL_APP_NOT_ENABLED disable
This code runs on the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 15:12:10 -04:00
Sage Weil
387ad56a69 qa/clusters/fixed-[23]: 4 osds per node, not 3
Smithi have 4 nvme partitions available for use.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 13:36:05 -04:00
Sage Weil
b5fae9a9ca Merge pull request #16873 from liewegas/wip-4-nodes
qa/suites: change fixed-2.yaml users to get 4 openstack disks

Reviewed-by: Zack Cerza <zcerza@redhat.com>
2017-08-07 11:27:40 -05:00
Sage Weil
3ffca50824 Merge pull request #16864 from smithfarm/wip-big-openstack
qa: big: add openstack.yaml
2017-08-07 11:02:59 -05:00
Sage Weil
f683d2d374 qa/suites: change fixed-2.yaml users to get 4 openstack disks
Follow-up for 4203c4f887

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 11:56:33 -04:00
Sage Weil
a872c44be7 Merge pull request #16842 from liewegas/wip-more-ec-map-discon
qa/suites/rados/thrash: fix thrashing with ec vs map discon
2017-08-07 10:48:56 -05:00
Nathan Cutler
8bb3d8444f qa: big: add openstack.yaml
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-08-07 12:07:36 +02:00
Sage Weil
ed2d984ad1 qa/suites/upgarde/jewel-x/parallel: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 10:04:14 -04:00
Sage Weil
58f15d2b98 qa/suites/upgrade/jewel-x/parallel: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
622e950e43 qa/suites/upgrade/*-x/parallel: whitelist more stuff
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
2d260443f0 qa/suites/upgrade/*/parallel: disable POOL_APP_NOT_ENABLED
There is some other random workload running (that creates pools)
while we upgrade and wait for healthy.  Just disable the warning
for these tests.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
f4c2863999 qa/suites/upgrade/jewel-x/parallel: whitelist OSD_DOWN
We restart OSDs during the upgrade.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
6307e03c6d qa/suites/rados/thrash/workloads/cache-agent-big: m=2
...because we do the test_map_discontinuity thing.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-05 14:33:13 -04:00
Patrick Donnelly
04d8ba4b04
Merge PR #16833 into master
* refs/remotes/upstream/pull/16833/head:
	qa: whitelist expected MDS_CLIENT_OLDEST_TID warn
	qa: ignore insufficient standby during failover
	qa: fix read-only whitelist
	mds: MDS_DAMAGED to MDS_DAMAGE
	doc: remove duplicate CephFS health check doc
2017-08-04 20:26:09 -07:00
Patrick Donnelly
29e5f0a450
qa: whitelist expected MDS_CLIENT_OLDEST_TID warn
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:21:43 -07:00
Patrick Donnelly
06f53e4a82
qa: ignore insufficient standby during failover
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:14:59 -07:00
Patrick Donnelly
42cd1c7122
qa: fix read-only whitelist
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-04 20:14:48 -07:00
Sage Weil
62e51661e6 Merge branch 'wip-qa-rbd-health' of git://github.com/dillaman/ceph
# Conflicts:
#	qa/tasks/ceph.py
2017-08-04 15:07:22 -04:00
Sage Weil
ffd171fd46 Merge pull request #16820 from liewegas/wip-more-whitelist
qa/suites/rados: a bit more whitelisting

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-08-04 13:44:08 -05:00
Sage Weil
82cf3046de qa/suites/rados/basic/tasks/rados_python: POOL_APP_NOT_ENABLED
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 13:39:13 -04:00
Sage Weil
9c7a653fee Merge pull request #16769 from liewegas/wip-20295-b
os/bluestore: allow multiple DeferredBatches in flight at once

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-08-04 11:04:38 -05:00
Sage Weil
c8af364699 Merge pull request #16739 from liewegas/wip-multi-backfill-reject
qa/suites/rados/singleton-nomsgr/all/multi-backfill-reject: sleep longer
2017-08-04 08:41:06 -05:00
Sage Weil
1ae9ff173b qa/suites/rados/upgrade: ignore FS_DEGRADED from mds restart
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:34:31 -04:00
Sage Weil
27a685f626 qa/suites/rados/monthrash: ignore MGR_DOWN
Heavily thrashing mons + mgr reconnect backoff may make us fail
to process the beacon.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-04 09:34:15 -04:00
Nathan Cutler
d919987caa tests: rbd: reproducer for rbd-on-EC issue
This introduces a new "rbd/singleton-bluestore" suite because creating an rbd
on an EC-backed datapool will fail on filestore.

References: http://tracker.ceph.com/issues/20295
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-08-03 22:54:17 -04:00
Patrick Donnelly
9d348ad8c9
qa: add health whitelist for all fs sub-suites
Fixes: http://tracker.ceph.com/issues/20892

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 14:01:28 -07:00
Patrick Donnelly
60fa9714d4
Merge PR #16768 into master
* refs/remotes/upstream/pull/16768/head:
	qa: fix log whitelist string

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-03 13:55:42 -07:00
Patrick Donnelly
66756c4f65
Merge PR #16292 into master
* refs/remotes/upstream/pull/16292/head:
	qa: use new hex rep of inode
	qa: fix whitelist error message
	mds: refine "Scrub error" cluster log message
	mds: polish clog messages
	doc: developer logging guidance

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-03 13:55:21 -07:00
Sage Weil
342607f4d5 Merge pull request #16749 from tchaikov/wip-restful-delete-key
mgr: handle "module.set_config(.., None)" correctly 

Reviewed-by: John Spray <john.spray@redhat.com>
2017-08-03 15:53:27 -05:00
Josh Durgin
b172642124 Merge pull request #16789 from liewegas/wip-ec-m-2
qa: avoid map-gap tests for k=2 m=1

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-08-03 11:20:13 -07:00
Sage Weil
ef21c9d7df qa/suites/rados/thrash-erasure-code: do not test map gap with m=1
We test EC profiles with m=1 here, and mapgap can lead to incomplete pgs
because it takes an osd down and waits for healthy.

Fixes: http://tracker.ceph.com/issues/20844
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:13:02 -04:00
Sage Weil
f74d71f708 qa/suites/rados/thrash-erasure-coe-big/clsuter: 12 osds on 3 nodes not 4
smithi have 4 nvme partitions available, not 3.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:11:43 -04:00
Sage Weil
63221e21f5 qa/suites/rados/thrash-erasure-code-big: add k=4 m=2
Get better coverage for larger codes.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 14:10:36 -04:00
Sage Weil
e994b03335 qa/suites/rados/monthrash/worklaods/rados_api_tests: whitelist SMALLER_PGP_NUM
The rados/test.sh fiddles with pg_num.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 13:31:39 -04:00
Sage Weil
7c350180b1 qa/suites/rados/mgr/tasks/failover: whitelist
remote/smithi025/log/ceph.log.gz:2017-08-03 07:02:15.049074 mon.b mon.0 172.21.15.25:6789/0 197 : cluster [INF] Manager daemon x is unresponsive, replacing it with standby daemon y
remote/smithi025/log/ceph.log.gz:2017-08-03 07:03:10.078032 mon.b mon.0 172.21.15.25:6789/0 226 : cluster [WRN] Manager daemon x is unresponsive.  No standby daemons available.

x and y may be swapped, so whitelist the rest of the string.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-03 12:40:01 -04:00
Jason Dillaman
c2b451e8cb qa: fix RBD-related POOL_APP_NOT_ENABLED health warnings
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-08-03 09:50:41 -04:00
Patrick Donnelly
d4ed085238
Merge PR #16713 into master
* refs/remotes/upstream/pull/16713/head:
	qa: ignore failed MDS message during upgrade
2017-08-02 19:41:42 -07:00
Patrick Donnelly
7f04d88af8
qa: fix whitelist error message
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 16:52:30 -07:00
Patrick Donnelly
8e975a6347
qa: fix log whitelist string
Fixes: http://tracker.ceph.com/issues/20889

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-08-02 16:32:19 -07:00
Sage Weil
5085dc1164 qa/suites/powercycle: whitelist health for thrashing
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-02 11:06:43 -04:00
Kefu Chai
da1a60ced1 qa: refactor suites/rados/rest/mgr-restful
- use "ceph restful restart" to restart the restful API server instead
of restarting the ceph-mgr
- test "ceph restful delete-key"
- test "ceph restful list-keys"

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-02 18:20:56 +08:00
Kefu Chai
1ff1f836da Merge pull request #16722 from tchaikov/wip-qa-fixes
qa/suites: escape the parenthesis of the whitelist text

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-02 13:00:01 +08:00
Kefu Chai
a70be4e00c qa/suites: more whitelisting
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-02 10:00:57 +08:00
Sage Weil
c955bf528f qa/suites/rados/singleton-nomsgr/all/multi-backfill-reject: sleep longer
I saw a failure where the 30% backfill probability was enough that we
just didn't manage to backfill all of the pgs during the 5 minute recovery
timeout during ceph.py shutdown.  Build in some additional time for the
test to recover.

http://pulpito.ceph.com/sage-2017-08-01_15:32:10-rados-wip-sage-testing-distro-basic-smithi/1469184

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-01 15:50:47 -04:00
Kefu Chai
d12c51ca91 qa/suites: escape the parenthesis of the whitelist text
so we can avoid the warnings like

grep: Unmatched ( or \(

because we pass the whitelisted string to `egrep -v "$1"` directly.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-01 21:54:44 +08:00
John Spray
ac2b9d63ca qa: include config help in admin socket test
Signed-off-by: John Spray <john.spray@redhat.com>
2017-08-01 13:38:40 +01:00
Patrick Donnelly
5e5ff5c086
qa: ignore failed MDS message during upgrade
The cluster is expected to become degraded during reboot.

Fixes: http://tracker.ceph.com/issues/20731
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-31 14:45:07 -07:00
Sage Weil
c3c2b31c87 Merge pull request #16568 from liewegas/wip-application-warn
qa,doc: document and fix tests for pool application warnings
2017-07-28 09:00:46 -05:00
Patrick Donnelly
fb039383e9
Merge PR #16435 into master
* refs/remotes/upstream/pull/16435/head:
	qa: whitelist trim error during powercycle tests

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-27 17:54:59 -07:00
Sage Weil
41bcf2fee5 Merge pull request #16281 from badone/wip-PG-cluster-log-audit
osd: Log audit

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-27 16:25:30 -05:00
Sage Weil
862392fbf9 Merge pull request #16514 from liewegas/wip-20744
qa/tasks/ceph: wait for mgr to activate and pg stats to flush in health()

Reviewed-by: John Spray <john.spray@redhat.com>
2017-07-27 16:24:59 -05:00
Patrick Donnelly
d7f5af40a2
qa: whitelist trim error during powercycle tests
Fixes: http://tracker.ceph.com/issues/20566

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-27 13:24:21 -07:00