Commit Graph

639 Commits

Author SHA1 Message Date
Sage Weil
3f922e79c3 qa/tasks/ceph: disable osd_debug_deep_scrub_sleep in case it is set
Otherwise the final scrub may take too long.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-01-14 10:47:24 -06:00
Mykola Golub
de97a3be48 rbd: adjusted "showmapped" JSON and XML formatted output
Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-01-12 21:36:00 +02:00
Yuri Weinstein
c30eca51a4
Merge pull request #19765 from liewegas/wip-force-backfill-cleanups
qa: tolerate failure to force backfill

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-01-08 12:50:18 -08:00
Patrick Donnelly
f6e1a797d4
Revert "Merge PR #19369 into master"
This reverts commit 3189ba19a7, reversing
changes made to b7620de020.

Despite the change in json format being positive, the unfortunate side-effect
is that it broke upgrade testing (because the QA framework must handle the
transition of mdsmap["info"] to a list from object) and the ceph-mgr.

Fixes: http://tracker.ceph.com/issues/22527
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-01-04 09:42:37 -08:00
Sage Weil
819a3578fa Merge tag 'v13.0.1' 2018-01-03 10:04:20 -06:00
Sage Weil
8651e15c93 qa/tasks/ceph_manager: tolerate failure to force backfill/recoery
The pool may have been deleted out from underneath us.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-01-03 08:37:02 -06:00
Kefu Chai
0cd700e64d
Merge pull request #19634 from jcsp/wip-22514
qa: configure zabbix properly before selftest

Reviewed-by: Sage Weil <sage@redhat.com>
2018-01-03 00:09:37 +08:00
Sage Weil
acf4ea1cad
Merge branch 'master' into lc-time-disable 2017-12-29 13:38:37 -06:00
Patrick Donnelly
3189ba19a7
Merge PR #19369 into master
* refs/pull/19369/head:
	qa: update handling of fs status format
	PendingReleaseNotes: add note for format change
	mds/MDSMap : use arrary_section for mds stat

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
Reviewed-by: Xiaoxi Chen <xiaoxchen@ebay.com>
2017-12-21 20:21:18 -08:00
Sage Weil
1a549a01ca
Merge pull request #19588 from batrick/i22475
qa: check pool full flags

Reviewed-by: Sage Weil <sage@redhat.com>
2017-12-21 15:16:54 -06:00
Patrick Donnelly
1f1a2a27ef
qa: update handling of fs status format
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-21 10:35:09 -08:00
John Spray
c64c9ff00d qa: configure zabbix properly before selftest
Even though the selftest routine doesn't care about
the settings, we should set them to avoid emitting
nasty log/health messages when enabling the module.

Fixes: http://tracker.ceph.com/issues/22514
Signed-off-by: John Spray <john.spray@redhat.com>
2017-12-21 08:28:55 -05:00
Sage Weil
f33ab7e03a Merge remote-tracking branch 'gh/mimic-dev1' 2017-12-20 15:08:30 -06:00
Kefu Chai
4d266e3347 qa/tasks: remove test_buildpackages
the buildpackages feature is not part of ceph. and its test was moved to
teuthology in 5a43f8d5. let's remove the leftovers from ceph/ceph. BTW,
we have not run this test suite since then. and its dependency cannot be
fulfilled.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 582f04508f)
2017-12-19 11:12:25 -06:00
Patrick Donnelly
6e046dfc90
qa: check pool full flags
Cluster-wide flag removed in b4ca5ae462.

Fixes: http://tracker.ceph.com/issues/22475

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-18 21:59:57 -08:00
Patrick Donnelly
b2284f23b8
qa: don't configure ec data pool with memstore
Fixes: http://tracker.ceph.com/issues/22436

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-18 21:12:22 -08:00
John Spray
213af8b666
Merge pull request #19533 from batrick/vstart-runner-fixes
vstart_runner: fixes for recent cephfs changes

Reviewed-by: John Spray <john.spray@redhat.com>
2017-12-18 13:18:12 +00:00
Patrick Donnelly
ee9d5f9a45
Merge PR #19440 into master
* refs/pull/19440/head:
	mds: update mds option descriptions
	mds: obsolete MDSMap option configs
	mds: organize Filesystem class def

Reviewed-by: John Spray <john.spray@redhat.com>
2017-12-15 11:07:58 -08:00
Patrick Donnelly
cd1e5f53f0
vstart_runner: ignore failed dentry invalidation
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-13 19:15:43 -08:00
Patrick Donnelly
0f507d8855
vstart_runner: set ec_profile attribute
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-13 19:15:39 -08:00
Patrick Donnelly
67ca6cd229
mds: obsolete MDSMap option configs
These configs were used for initialization but it is more appropriate to
require setting these file system attributes via `ceph fs set`. This is similar
to what was already done with max_mds. There are new variables added for `fs
set` where missing.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-13 18:30:52 -08:00
Kefu Chai
c7539de95c
Merge pull request #19365 from liewegas/wip-tell
qa/tasks/ceph_manager: tolerate tell osd.* error

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-12-13 19:58:41 +08:00
John Spray
91655239fa
Merge pull request #19114 from tchaikov/wip-rm-request-slow
mgr/PGMap: drop REQUEST_{SLOW,STUCK} HEALTH_WARNs

Reviewed-by: John Spray <john.spray@redhat.com>
2017-12-13 11:46:34 +00:00
Vasu Kulkarni
e7883c1fb2 qa/tests: when no branch is specified, use wip branch under test for upgrade
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-12-07 19:11:42 -08:00
Vasu Kulkarni
9430f352f4 qa/tests: Fix mapped_role attribute where it gets overwritten for subsequent calls.
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-12-07 19:11:42 -08:00
vasukulkarni
8a6434c20e
Merge pull request #19244 from ceph/wip-cd-volume
qa/tasks: update ceph-deploy task to use newer ceph-volume syntax
2017-12-07 10:54:47 -08:00
Sage Weil
aafb3a565d qa/tasks/ceph_manager: tolerate tell osd.* error
It's possible for tell osd.* to race against an osd we stopped but the
cluster doesn't know is down yet.  In tha case we'll get ENXIO on that
osd and the command will fail.

In this context, we don't care.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-12-06 17:51:20 -06:00
Neha Ojha
46dee4b102 qa: preserve cbt task results
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-12-06 13:23:44 -08:00
Vasu Kulkarni
9f1a1e0543 qa/tests: Add debug info when creating ceph volumes
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-12-06 12:14:27 -08:00
Kefu Chai
5418c98fb7
Merge pull request #19307 from tchaikov/wip-drop-ubuntu-trusty
qa,doc: drop support of ubuntu trusty

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-12-05 20:31:58 +08:00
Neha Ojha
877e3ccada qa: install collectl with cbt task
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-12-04 13:50:33 -08:00
Kefu Chai
582f04508f qa/tasks: remove test_buildpackages
the buildpackages feature is not part of ceph. and its test was moved to
teuthology in 5a43f8d5. let's remove the leftovers from ceph/ceph. BTW,
we have not run this test suite since then. and its dependency cannot be
fulfilled.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-12-04 14:06:18 +08:00
Vasu Kulkarni
3ecaebd0d8 qa/tests: update tests to use new ceph-volume syntax
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-12-01 14:11:55 -08:00
Kefu Chai
89c876d1b6
Merge pull request #19233 from tchaikov/wip-no-mon-health-to-clog-when-upgrading
qa: disable mon-health-to-clog in upgrade test

Reviewed-by: Sage Weil <sage@redhat.com>
2017-12-01 11:42:52 +08:00
Patrick Donnelly
49028ee79f
Merge PR #19069 into master
* refs/pull/19069/head:
	qa/tasks/ceph_fuse: fix ceph-fuse mount failed

Reviewed-by: Yunfei Guan <yunfei.guan@xtaotech.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Amit Kumar <amitkuma@redhat.com>
2017-11-29 22:11:12 -08:00
Sage Weil
dda79ad1fa
Merge pull request #15482 from myoungwon/wip-chunked-manifest
osd,librados: add manifest, operations for chunked object

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-29 21:13:43 -06:00
Kefu Chai
185d2f5375 qa/tasks/ceph: add mon-health-to-clog option to "restart"
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 20:04:44 +08:00
Kefu Chai
a406553a79 qa/tasks/ceph_manager: add inject_args() method
* move Thrasher._set_config() to CephManager, and make it a public
  method, and rename it to inject_args(),
* use this method instead of using 'tell ... injectargs ...' directly

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 18:44:16 +08:00
Kefu Chai
f5f2ced624 mgr/PGMap: drop REQUEST_{SLOW,STUCK} HEALTH_WARNs in mimic
SLOW_OPS unifies both of them since mimic

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 17:41:47 +08:00
Kefu Chai
e25300cb6a
Merge pull request #18614 from tchaikov/wip-osd-mgrreport
osd,mgr: report slow requests and pending creating pgs to mgr

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-23 15:01:01 +08:00
Kefu Chai
4a1f2a5c78 qa: silence SLOW_OPS,PENDING_CREATING_PGS warnings
this is an intermediate step to deprecate REQUEST_SLOW warnings.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 13:59:42 +08:00
liyan
0a752e634a qa/tasks/ceph_fuse: fix ceph-fuse mount failed
Fixes: http://tracker.ceph.com/issues/22205

Signed-off-by: liyan <liyan.wang@xtaotech.com>
2017-11-22 15:02:47 +08:00
Patrick Donnelly
df43e415c6
Merge PR #18274 into master
* refs/pull/18274/head:
	mds: fold mds_revoke_cap_timeout into mds_session_timeout
	client: add new delegation testcases
	client: add delegation support for cephfs
	common: remove data_dir_option from common_preinit and global_pre_init

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-11-20 21:48:19 -08:00
Kefu Chai
bcf12a6abd
Merge pull request #19024 from tchaikov/wip-21474
qa/tasks: prolong revive_osd() timeout to 6 min

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-20 22:54:27 +08:00
Kefu Chai
749bbda075 qa/tasks: prolong revive_osd() timeout to 6 min
see also #17902

Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-20 13:40:59 +08:00
Matt Benjamin
c662dc3067
Merge pull request #17977 from cbodley/wip-rgw-beast-qa
qa/rgw: add beast frontend to some rgw suites
2017-11-16 15:14:28 -05:00
lvshanchun
2c1653ca37 radosgw-admin zonegroup get and zone get return defaults when there is no realm
Fixs: http://tracker.ceph.com/issues/21615
Signed-off-by: lvshanchun <lvshanchun@gmail.com>
2017-11-16 23:43:55 +08:00
Josh Durgin
e81a7ffcda
Merge pull request #18808 from tchaikov/wip-qa-max-pg-per-osd
qa: do not wait for down/out osd for pg convergence

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-11-15 18:05:58 -08:00
Jeff Layton
3321cc7b37 mds: fold mds_revoke_cap_timeout into mds_session_timeout
Right now, we have two different timeout settings -- one for when the
client is just not responding at all (mds_session_timeout), and one for
when the client is otherwise responding but isn't returning caps in a
timely fashion (mds_cap_revoke_timeout).

The default settings on them are equivalent (60s), but only the
mds_session_timeout is communicated via the mdsmap. The
mds_cap_revoke_timeout is known only to the MDS. Neither timeout results
in anything other than warnings in the current codebase.

There is also a third setting (mds_session_autoclose) that is also
communicated via the MDSmap. Exceeding that value (default of 300s)
could eventually result in the client being blacklisted from the
cluster. The code to implement that doesn't exist yet, however.

The current codebase doesn't do any real sanity checking of these
timeouts, so the potential for admins to get them wrong is rather high.
It's hard to concoct a use-case where we'd want to warn about these
events at different intervals.

Simplify this by just removing the mds_cap_revoke_timeout setting, and
replace its use in the code with the mds_session_timeout. With that, the
client can at least determine when warnings might start showing up in
the MDS' logs.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
2017-11-14 07:27:01 -05:00
Casey Bodley
fa8a0713de qa/rgw: add kwargs for debug output
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-11-10 13:23:03 -05:00
Kefu Chai
7f549af459 qa: do not wait for down/out osd for pg convergence
that osd is not invovlved in the PG state changes.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-08 14:50:10 +08:00
Ilya Dryomov
1591426c30 qa/tasks/cram: include /usr/sbin in the PATH for all commands
/usr/sbin is not in non-login PATH on centos.  We already do this for
workunits, see commit 0e53f5f38b0c ("workunit: include /usr/sbin in the
PATH for all commands").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-11-06 20:05:38 +01:00
Mykola Golub
3a09801ceb qa/suites/rbd: enable fsx deep_copy test case
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-06 10:29:45 +02:00
myoungwon oh
93be6f79e0 qa/suites/rados/thrash: add set_chunk test case
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2017-11-06 15:53:46 +09:00
Sage Weil
e7d4f390d6
Merge pull request #18670 from liewegas/wip-min-in-4
qa/tasks/thrashosds: set min_in default to 4

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-11-03 06:55:23 -05:00
John Spray
4fb3025682 qa: fix mgr _load_module helper
I inadvertently broke this with the latest change
to the module ls output.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-02 07:42:56 -04:00
Sage Weil
d21809b14e qa/tasks/thrashosds: set min_in default to 4
We have EC tests with k=2,m=2, so we need a min of 4.

Fixes: http://tracker.ceph.com/issues/21997
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-01 08:32:48 -05:00
John Spray
9988ebed95 qa: fix mgr caps
This was still using Kraken era settings

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-01 08:21:43 -04:00
John Spray
05e648be6a qa: expand mgr testing
Some extra coverage of the dashboard, including its standby
redirect mode and the publishing of URIs.

Also invoking the command_spam mode of the selftest module.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-01 08:21:42 -04:00
Casey Bodley
607b72f553 qa/rgw: add beast frontend to some rgw suites
added a qa/rgw_frontend directory for civetweb.yaml and the new
beast.yaml. the rgw suites for multifs and singleton now symlink
rgw_frontend/civetweb.yaml. the multisite, tempest and verify suites
symlink rgw_frontend to test both. this doubles the number of jobs in
those suites

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-10-27 11:20:52 -04:00
Jason Dillaman
c2b45a9db8 Merge pull request #18583 from idryomov/wip-qa-xfstests-except
qa/tasks/rbd: run all xfstests runs to completion

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-10-27 11:12:18 -04:00
Josh Durgin
a8b000fc41 Merge pull request #18543 from neha-ojha/wip-add-cbt-repo
qa: add cbt repo parameter

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-26 20:24:29 -07:00
Ilya Dryomov
48784f3f32 qa/tasks/rbd: run all xfstests runs to completion
rbd.xfstests task allows spawning xfstests runs on multiple nodes.
Don't unwind task contexts if one of the runs fails -- let the other
runs finish.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-10-26 19:30:09 +02:00
Patrick Donnelly
2bba5d8e0f
Merge PR #18192 into master
* refs/pull/18192/head:
	qa/cephfs: test ec data pool
	qa/suites/fs/basic_functional/clusters: more osds

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-25 17:05:38 -07:00
Neha Ojha
973e21694f qa: add cbt repo parameter
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-10-25 10:22:43 -07:00
Patrick Donnelly
c58161f25b
Merge PR #17266 into master
* refs/pull/17266/head:
	qa: update test_ceph_argparse to test fs cmds
	qa: use fs rm_data_pool
	qa: fix mdsmap lookup
	qa: remove usage of mds dump
	PendingReleaseNotes: add obsoleted mds commands
	qa: remove use of obsolete mds commands
	ceph_volume_client: remove use of obsolete mds cmd
	doc: update on obsolete mds commands
	cephfs: obsolete deprecated mds commands

Reviewed-by: Douglas Fuller <dfuller@redhat.com>
2017-10-24 16:37:14 -07:00
Patrick Donnelly
3a5f090a1e
qa: remove usage of mds dump
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-24 11:32:43 -07:00
Sage Weil
d0732fc96f qa/cephfs: test ec data pool
Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-23 21:11:24 -05:00
John Spray
bd8814bf58 Merge pull request #18021 from jcsp/wip-prometheus-disk-occupation
mgr/prometheus: add ceph_disk_occupation series

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
2017-10-20 12:28:12 +01:00
Zack Cerza
f9acdc774d qa/tasks/ceph: Set owner on mds mount point
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
e606386626 qa/tasks/cephfs/filesystem: Check for mds failure
... inside Filesystem.are_daemons_healthy()

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
4516ebb6c0 qa/tasks/ceph: Remove /var/lib/ceph copypasta
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
a653720f50 qa/tasks/ceph: Set owner on osd mount point
This is needed when running with systemd

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
16dcceae5c qa/tasks/ceph: Set owner on mon mount point
This is needed when running with systemd

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Kefu Chai
4c7df944c7 osd: add max-pg-per-osd limit
osd will refused to create new pgs, until its pg number is lower
than the max-pg-per-osd upper bound setting.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 23:08:40 +08:00
Kefu Chai
ac1e13ce63 qa/tasks/ceph: add "create_rbd_pool" option
True by default, but we can opt not to create the "rbd" pool.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 11:15:32 +08:00
Kefu Chai
357fc97b67 Merge pull request #18235 from tchaikov/wip-qa-backfilling
qa: s/backfill/backfilling/

Reviewed-by: Sage Weil <sage@redhat.com>
2017-10-12 14:54:41 +08:00
John Spray
d96a59e74b qa/mgr: fix influx/prometheus test names
This was a typo: they were swapped around.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-10-11 17:00:01 +01:00
Kefu Chai
e21114274f qa: s/backfill/backfilling/
it's renamed "backfilling" in 4015343f .

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-11 11:52:43 +08:00
Casey Bodley
160240e0c1 qa/rgw: ignore errors from 'pool application enable'
Fixes: http://tracker.ceph.com/issues/21715

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-10-09 10:25:26 -04:00
Sage Weil
b6a5c09dba ceph-objectstore-tool: remove rm-past-intervals op
The OSD doesn't rebuild this on demand anymore.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-06 13:08:18 -05:00
John Spray
47bfe6cf17 Merge pull request #17735 from jcsp/wip-mgr-perf-interface
mgr: common interface for TSDB modules

Reviewed-by: My Do <mhdo@umich.edu>
Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2017-10-02 11:12:35 +01:00
Patrick Donnelly
183646c919
qa: remove use of obsolete mds commands
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-01 17:22:36 -07:00
Patrick Donnelly
534c30aca4
Merge PR #18041 into master
* refs/remotes/upstream/pull/18041/head:
	qa: relax cap expected value check
2017-09-30 17:43:56 -07:00
Patrick Donnelly
b37c7f7db7
qa: relax cap expected value check
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-29 08:48:14 -07:00
Ramana Raja
baf3b88800 ceph_volume_client: fix setting caps for IDs
... that have empty OSD and MDS caps. Don't add a ',' at the
start of OSD and MDS caps.

Fixes: http://tracker.ceph.com/issues/21501
Signed-off-by: Ramana Raja <rraja@redhat.com>
2017-09-29 17:06:05 +05:30
Patrick Donnelly
1aef50a1ed
Merge PR #17697 into master
* refs/remotes/upstream/pull/17697/head:
	pybind/ceph_volume_client: add get, put, and delete object interfaces
	pybind/ceph_volume_client: remove 'compat_version'
	pybind/ceph_volume_client: set the version

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-28 18:26:06 -07:00
Patrick Donnelly
1da4a5090a
Merge PR #16036 into HEAD
* refs/remotes/upstream/pull/16036/head:
	mds: improve cap min/max ratio descriptions
	mds: fix whitespace
	mds: cap client recall to min caps per client
	mds: fix conf types
	mds: fix whitespace
	doc/cephfs: add client min cache and max cache ratio describe
	mds: adding tunable features for caps_per_client

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-09-28 17:00:39 -07:00
Patrick Donnelly
538834171f
mds: cap client recall to min caps per client
Fixes: http://tracker.ceph.com/issues/21575

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-28 15:55:57 -07:00
John Spray
99352ceced qa: add mgr module selftest task
The module self test commands give us a chance to
catch any other ceph changes that change something
that a module was relying on reading.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-09-27 14:20:22 -04:00
Sage Weil
61799c4c8c Merge pull request #17810 from hjwsm1989/wip-21294
qa/ceph_manager: check pg state again before timedout

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-25 12:33:34 -05:00
Sage Weil
af5032e729 Merge pull request #17902 from tchaikov/wip-21474
qa/tasks: prolong revive_osd() timeout to 6 min

Reviewed-by: Sage Weil <sage@redhat.com>
2017-09-24 21:14:58 -05:00
John Spray
dc23bade5e Merge pull request #17850 from batrick/vstart-confg-path
qa: revert "qa: use config_path property instead of literal"

Reviewed-by: John Spray <john.spray@redhat.com>
2017-09-23 16:36:25 +01:00
vasukulkarni
d02de6b7fc Merge pull request #17757 from ceph/wip-rgw-ansible-var
qa/tests: Add additional required ceph-ansible vars due to upstream changes
2017-09-22 11:22:02 -07:00
Ramana Raja
d1bd171d6b pybind/ceph_volume_client: add get, put, and delete object interfaces
Wrap low-level rados APIs to allow ceph_volume_client to get, put, and
delete objects. The interfaces would allow OpenStack Manila's
cephfs driver to store config data in a shared storage to implement
highly available Manila deployments. Restrict  write(put) and
read(get) object sizes to 'osd_max_size' config setting.

Signed-off-by: Ramana Raja <rraja@redhat.com>
2017-09-22 16:24:38 +05:30
Kefu Chai
42be200c56 qa/tasks: prolong revive_osd() timeout to 6 min
bluestore_fsck_on_mount and bluestore_fsck_on_mount_deep are enabled by
default. and bluestore is used as the default store backend. it takes
longer to perform the deep fsck with verbose log. so prolong the
revive_osd()'s timeout from 150 sec to 360 sec.

Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-09-22 10:58:41 +08:00
Patrick Donnelly
614b0d9368
qa: revert "qa: use config_path property instead of literal"
This reverts commit f95798b3ad.

The config_path method wasn't available through inheritance as I thought. Oops.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-20 16:09:40 -07:00
Patrick Donnelly
8a535d9c72
qa: get config only on running MDS
Fixes: http://tracker.ceph.com/issues/21466

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-20 15:47:28 -07:00
Abhishek Lekshmanan
29de857c63 qa: s3: re enable LC tests on ceph-master
Needs to go in after https://github.com/ceph/s3-tests/pull/187 gets
merged

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
2017-09-20 10:02:37 +02:00
Vasu Kulkarni
e11a1f773d qa/tests: skip test that use public bucket landsat-pds from AWS
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-19 17:12:15 -07:00
Sage Weil
6767f841e5 Merge pull request #17427 from liewegas/wip-pg-num-limits
mon/OSDMonitor: implement cluster pg limit

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-19 12:57:10 -05:00
huangjun
fa40add7f0 qa/ceph_manager: check pg state again before timedout
Pg state maybe all in active+clean when no recovering going on,
so check it again before timedout.

Fixes: http://tracker.ceph.com/issues/21294

Signed-off-by: huangjun <huangjun@xsky.com>
2017-09-20 00:04:04 +08:00
Patrick Donnelly
3c727d9a36
Merge PR #17701 into master
* refs/remotes/upstream/pull/17701/head:
	qa/cephfs: Fix error in test_filtered_df

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-15 14:12:35 -07:00
Patrick Donnelly
8a54e101e5
Merge PR #17694 into master
* refs/remotes/upstream/pull/17694/head:
	qa/cephfs: kill mount if it gets evicted by mds
	qa/cephfs: fix test_evict_client

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-15 14:12:33 -07:00
Josh Durgin
520a5a218c Merge pull request #17583 from neha-ojha/wip-cbt-teuthology-integration
qa: add cbt task for performance testing

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-14 16:36:26 -07:00
Sage Weil
986b86fbeb mon: rename mon_pg_warn_max_per_osd -> mon_max_pg_per_osd
Signed-off-by: Sage Weil <sage@redhat.com>
2017-09-14 16:00:31 -04:00
Patrick Donnelly
d929dae49b
Merge PR #17657 into master
* refs/remotes/upstream/pull/17657/head:
	mds: optimize MDCache::rejoin_scour_survivor_replicas()
	mds: fix MDSCacheObject::clear_replica_map
	mds: support limiting cache by memory
	common: refactor of lru
	mds: resolve unsigned coercion compiler warning
	common: use safer uint64_t for list size
	common: add bytes2str pretty print function
	mds: check if waiting is allocated before use
	mds: go back to compact_map for replicas
	mds: use mempool for cache objects
	mds: cleanup replica_map access
	common: add alloc_ptr smart pointer
	common: add warning on base class use of mempool
	common: use atomic uin64_t for counter

Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-09-13 20:08:51 -07:00
Douglas Fuller
b059cb6290 qa/cephfs: Fix error in test_filtered_df
ceph df accounts for pool size, so there is no need to do it in the test.

Fixes: http://tracker.ceph.com/issues/21381
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-09-13 14:02:24 -04:00
Yan, Zheng
98d86a0752 qa/cephfs: kill mount if it gets evicted by mds
otherwise, teardown() hange at umount

Fixes: http://tracker.ceph.com/issues/21275
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-09-13 21:30:51 +08:00
Yan, Zheng
8433ced847 qa/cephfs: fix test_evict_client
executing mount_a.kill() twice, then executing mount_b.kill_cleanup()
twice do not make sense.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-09-13 16:17:42 +08:00
Patrick Donnelly
06c94de584
mds: support limiting cache by memory
This introduces two config parameters:

    mds_cache_memory_limit: Sets the soft maximum of the cache to the given
    byte count. (Like mds_cache_size, this doesn't actually limit the maximum
    size of the cache. It just dictates the steady-state size.)

    mds_cache_reservation: This replaces mds_health_cache_threshold everywhere
    except the Beacon heartbeat sent to the mons. The idea here is to specify a
    reservation of memory (5% by default) for operations and the MDS tries to
    always maintain that reservation. So, the MDS will recall caps from clients
    when it begins dipping into its reservation of memory.

mds_cache_size still limits the cache by Inode count but is now by-default 0
(i.e. unlimited). The new preferred way of specifying cache limits is by memory
size. The default is 1GB.

Fixes: http://tracker.ceph.com/issues/20594
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1464976

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 20:02:41 -07:00
yonghengdexin735
fc5ac9ea69 common:fix error word
Signed-off-by: yonghengdexin735 <zhang.zezhu@zte.com.cn>
2017-09-13 10:22:08 +08:00
Patrick Donnelly
f48113f50f
Merge PR #17679 into master
* refs/remotes/upstream/pull/17679/head:
	qa: get asok path from ceph.conf
	qa: use config_path property instead of literal

Reviewed-by: John Spray <john.spray@redhat.com>
2017-09-12 15:56:32 -07:00
Patrick Donnelly
bc3a00af0f
qa: get asok path from ceph.conf
The asok path was recently moved to /tmp to resolve unix path too long errors.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 15:52:54 -07:00
Patrick Donnelly
f95798b3ad
qa: use config_path property instead of literal
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-12 15:50:33 -07:00
Tamilarasi Muthamizhan
4d8971e1c8 Merge pull request #17650 from ceph/wip-ceph-deploy-fix
qa/tests: Fix get_system_type failure due to invalid remote name
2017-09-12 15:07:24 -07:00
Neha Ojha
59531d81c5 qa: avoid using make install for fio
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-12 08:26:27 -07:00
David Zafman
44f51024cc Merge pull request #17538 from dzafman/wip-21272
Add export and remove ceph-objectstore-tool command option

Reviewed-by: Sage Weil <sage@redhat.com>
2017-09-11 20:12:27 -07:00
Vasu Kulkarni
9fe58d5ed6 Fix get_system_type failure due to invalid remote name
recent changes caused the remote name to be invalid, fix the
arg passed to get_system_type

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-10 12:42:16 -07:00
Sage Weil
14e783c0b9 Merge pull request #17571 from jcsp/wip-21260
mon: fix dropping mgr metadata for active mgr (#21260)

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-09-10 14:20:30 -05:00
David Zafman
3bb20f6d75 ceph-objectstore-tool: Make pg removal require --force
Add new export-remove to combine the 2 operations

Fixes: http://tracker.ceph.com/issues/21272

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-09-08 17:56:05 -07:00
Patrick Donnelly
1b7aae6cdc
Merge PR #17533 into master
* refs/remotes/upstream/pull/17533/head:
	qa: log ceph-fuse kill/cleanup

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-08 11:23:23 -07:00
John Spray
28cb4bb2e6 qa/tasks/mgr: regression test for 21260
(aka http://tracker.ceph.com/issues/21260)

Fixes: http://tracker.ceph.com/issues/21260
Signed-off-by: John Spray <john.spray@redhat.com>
2017-09-08 15:45:40 +02:00
Neha Ojha
2793ee7d7e qa: add cbt task for performance testing
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-09-07 15:44:48 -07:00
Tamilarasi Muthamizhan
594b736228 Merge pull request #17536 from ceph/wip-cd-rbd-pool
qa/tests: fix rbd pool creation for systemd tests
2017-09-07 15:13:32 -07:00
Vasu Kulkarni
5c43c5972d Add workaround for http://tracker.ceph.com/issues/20950
mgr bootstrap key differs on disk, rewrite the new key
using auth get.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
2116f76a1f Restart services after upgrade
This seems to be an issue and should be fixed in our systemd scripts,
need to discuss more on this.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
8a2b9a0809 since the roles are mapped inside ceph-deploy, store the roles that
are mapped and use the new mapped role for upgrades during later
stage.

eg: mon.a is mapped to mon.mira002 during install, store this mapping
and durig upgrade map it back to appropriate name to find the hostname
with that role

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
e7a0a4f87f Add doc string for missing ceph-deploy-branch
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
407ce2613d use mon.a as ceph-admin node, the previous get_first_mon returns
a sorted(mons)[0] as first mon.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
f3ec7cabf6 use elif instead of else to handle cases for jewel install
jewel needs neither filestore or bluestore as an option, so provide none
when running with jewel branch.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
9a73127c16 Add upgrade functionality using ceph-deploy
This is to test for customer like upgrade scenarios and to find
any issues that may be related to systemd, packaging etc

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
b409c3d163 Fix old flake8 issues
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
a9be0c387e Add option to skip mgr install for old release
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Vasu Kulkarni
f12127751d Add mgr create to fix single node test to reach healthy state
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-06 18:13:39 -07:00
Vasu Kulkarni
49f95b3d8b create the rbd pool right after install
rbd pool should exist for many rbd tests to work properly, create
the pool right after install is successful.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-06 18:07:54 -07:00
Patrick Donnelly
b4f962a486
qa: log ceph-fuse kill/cleanup
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-06 13:40:11 -07:00
Patrick Donnelly
28ca766cd2
Merge PR #16305 into master
* refs/remotes/upstream/pull/16305/head:
	qa/cephfs: test CephFS recovery pools
	qa/cephfs: support CephFS recovery pools
	qa/ceph_test_case: support CephFS recovery pools
	qa/cephfs: Allow deferred fs creation
	qa/cephfs: Refactor alternate pool test

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-05 20:42:30 -07:00
Sage Weil
21027233b2 qa/tasks/ceph_manager: revive osds before doing final rerr reset
We assume below that rerrosd is up, but it may not be when we exit the
loop.

Fixes: http://tracker.ceph.com/issues/21206
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-31 14:55:46 -04:00
Douglas Fuller
6af2ae80d3 qa/cephfs: test CephFS recovery pools
Test recovering metadata in to a separate RADOS pool with
cephfs_data_scan and friends.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
8f9a252020 qa/cephfs: support CephFS recovery pools
Add support for testing recovery of CephFS metadata into an alternate
RADOS pool, useful as a disaster recovery mechanism that avoids
modifying the metadata in-place.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
c85562c94a qa/ceph_test_case: support CephFS recovery pools
Add support for testing recovery of CephFS metadata into an alternate
RADOS pool, useful as a disaster recovery mechanism that avoids
modifying the metadata in-place.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
5fafc03cb9 qa/cephfs: Allow deferred fs creation
Permit Filesystem objects to be created and settings modified before
calling Filesystem.create().

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Douglas Fuller
47318f8ac4 qa/cephfs: Refactor alternate pool test
Remove the alternate pool recovery test from test_data_scan. Newer
commits will place the test in its own file.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2017-08-30 09:02:44 -04:00
Jason Dillaman
1b2f1358f7 Merge pull request #17346 from idryomov/wip-krbd-xfstests-update
qa/suites/krbd: rbd_xfstests job overhaul

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-08-29 18:28:48 -04:00
Ilya Dryomov
01914be5a3 qa/tasks/rbd.xfstests: take exclude list from yaml
Different filesystems (and further, different configurations of the
same filesystem) need different exclude lists.  Hard coding the list in
a wrapper script is inflexible.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-08-29 16:09:35 +02:00
Casey Bodley
0542974504 qa: create_cache_pool no longer runs 'pool application enable'
Fixes: http://tracker.ceph.com/issues/21155

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-08-29 10:01:04 -04:00
Xie Xingguo
3e0506a9a5 Merge pull request #17253 from hjwsm1989/fix-qa-bug
qa: fix the potential delay of pg state change

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-28 08:35:01 +08:00
Kefu Chai
30b5b4627c Merge pull request #16494 from asomers/bin_bash
misc: Fix bash path in shebangs

Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-27 10:14:14 +08:00
huangjun
3bff102d4b qa: fix the potential delay of pg state change
If start osd process first and then mark it in, the
pg state may remain all active+clean when doing
wait_for_clean() check, which may fail the next
osd_scrub_pgs() process.
So faster pg state change by marking osd in first.

Signed-off-by: huangjun <huangjun@xsky.com>
2017-08-25 17:07:37 +08:00