Commit Graph

3105 Commits

Author SHA1 Message Date
Sage Weil
431d1482ff qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected
Fixes: http://tracker.ceph.com/issues/21147
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-29 10:29:52 -06:00
John Spray
967fc9cf8a
Merge pull request #18649 from b-ranto/wip-restful-fixes
mgr/restful: A couple of restful fixes

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-29 12:11:58 +00:00
Kefu Chai
185d2f5375 qa/tasks/ceph: add mon-health-to-clog option to "restart"
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 20:04:44 +08:00
Kefu Chai
a406553a79 qa/tasks/ceph_manager: add inject_args() method
* move Thrasher._set_config() to CephManager, and make it a public
  method, and rename it to inject_args(),
* use this method instead of using 'tell ... injectargs ...' directly

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 18:44:16 +08:00
Kefu Chai
0e987665fe qa/suites/rados/singleton: more whitelist
* SLOW_OPS is normal in a cluster with flattering OSDs
* so is OBJECT_MISPLACED.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 13:53:44 +08:00
Kefu Chai
6a9e622cf9
Merge pull request #19199 from tchaikov/wip-22136
qa/ceph-disk: enlarge the simulated SCSI disk

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-11-28 23:46:49 +08:00
Kefu Chai
b3c159e9fb qa/ceph-disk: enlarge the simulated SCSI disk
100MB will be allocated for journal, and the remaining 100MB is for data
device. taking the inode into consideration, there will be approximately
87988 kB available for the activated OSD. and it will complain with a
"nearfull" state.

Fixes: http://tracker.ceph.com/issues/22136
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-28 14:55:33 +08:00
Kefu Chai
aa34321a4a qa/workunits/ceph-disk: do not redirect stderr to stdout
normally, if we care about the output of ceph-disk, we expect a json
string, and ceph-disk sends the output to stdout, and errors/warnings
to stderr. so everything works as expected. and the test should also
follow this tradition. for example, if deprecated warnings are printed,
the warning message should not be collected along with the json string.

see also: d44334f3

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-28 14:55:19 +08:00
Sage Weil
ddb274669c qa/suites/rados/rest: move rest_test from qa/suites/rest/
...and add more health whitelists.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-27 10:11:51 -06:00
Boris Ranto
059588e758 qa/restful: Test pg_num/pgp_num modifications
Signed-off-by: Boris Ranto <branto@redhat.com>
2017-11-27 14:35:26 +01:00
Kefu Chai
f5f2ced624 mgr/PGMap: drop REQUEST_{SLOW,STUCK} HEALTH_WARNs in mimic
SLOW_OPS unifies both of them since mimic

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 17:41:47 +08:00
Kefu Chai
e25300cb6a
Merge pull request #18614 from tchaikov/wip-osd-mgrreport
osd,mgr: report slow requests and pending creating pgs to mgr

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-23 15:01:01 +08:00
Kefu Chai
4a1f2a5c78 qa: silence SLOW_OPS,PENDING_CREATING_PGS warnings
this is an intermediate step to deprecate REQUEST_SLOW warnings.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 13:59:42 +08:00
Joao Eduardo Luis
0b30a1538a qa: add simple and dirty script to find ports being used
Seriously, having this is way better than realizing that one needs to
find an unused port each time a new test is created.

It's far from a perfect script. It's downright not elegant.

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2017-11-22 16:58:34 +00:00
liyan
0a752e634a qa/tasks/ceph_fuse: fix ceph-fuse mount failed
Fixes: http://tracker.ceph.com/issues/22205

Signed-off-by: liyan <liyan.wang@xtaotech.com>
2017-11-22 15:02:47 +08:00
Mykola Golub
7fb9920316 qa/workunits/rbd: wait for status propagated only if daemon started
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-21 22:29:16 +02:00
Jason Dillaman
0502cb1573
Merge pull request #19073 from trociny/wip-qa-demote
qa/workunits/rbd: wait for demote status is propagated

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-21 12:52:08 -05:00
Andrew Schoen
7efdf3c50a
Merge pull request #19075 from tchaikov/wip-22154
qa/workunits: silence py warnings for ceph-disk tests

Reviewed-by: Andrew Schoen <aschoen@redhat.com>
2017-11-21 09:42:30 -06:00
Kefu Chai
d44334f317 qa/workunits: silence py warnings for ceph-disk tests
ceph-disk now prints "depreacted" warning message when it starts. but
the tests parses its stdout and stderr for a json string. so we need to
silence the warnings for the tests.

Fixes: http://tracker.ceph.com/issues/22154
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-21 21:49:48 +08:00
Mykola Golub
f4b88b2c9d qa/workunits/rbd: wait for demote status is propagated
(before promoting the image on another cluster)

Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-21 13:22:50 +02:00
Patrick Donnelly
df43e415c6
Merge PR #18274 into master
* refs/pull/18274/head:
	mds: fold mds_revoke_cap_timeout into mds_session_timeout
	client: add new delegation testcases
	client: add delegation support for cephfs
	common: remove data_dir_option from common_preinit and global_pre_init

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-11-20 21:48:19 -08:00
Kefu Chai
bcf12a6abd
Merge pull request #19024 from tchaikov/wip-21474
qa/tasks: prolong revive_osd() timeout to 6 min

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-20 22:54:27 +08:00
Kefu Chai
749bbda075 qa/tasks: prolong revive_osd() timeout to 6 min
see also #17902

Fixes: http://tracker.ceph.com/issues/21474
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-20 13:40:59 +08:00
Sage Weil
25b7965f88 qa/suites/rados: test for recovery_unfound bug
See http://tracker.ceph.com/issues/22145

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-19 21:32:57 -06:00
Matt Benjamin
c662dc3067
Merge pull request #17977 from cbodley/wip-rgw-beast-qa
qa/rgw: add beast frontend to some rgw suites
2017-11-16 15:14:28 -05:00
lvshanchun
2c1653ca37 radosgw-admin zonegroup get and zone get return defaults when there is no realm
Fixs: http://tracker.ceph.com/issues/21615
Signed-off-by: lvshanchun <lvshanchun@gmail.com>
2017-11-16 23:43:55 +08:00
Josh Durgin
e81a7ffcda
Merge pull request #18808 from tchaikov/wip-qa-max-pg-per-osd
qa: do not wait for down/out osd for pg convergence

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-11-15 18:05:58 -08:00
Yuri Weinstein
c9dbb86dc4
Merge pull request #17761 from cbodley/wip-rgw-bilog-trim
rgw multisite: automated trimming for bucket index logs

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2017-11-15 09:04:50 -08:00
Jason Dillaman
09070016f5 Merge pull request #17032 from yangdongsheng/rbd_qos
rbd: implement image qos in tokenbucket algorithm

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-14 19:58:13 -05:00
Jeff Layton
3321cc7b37 mds: fold mds_revoke_cap_timeout into mds_session_timeout
Right now, we have two different timeout settings -- one for when the
client is just not responding at all (mds_session_timeout), and one for
when the client is otherwise responding but isn't returning caps in a
timely fashion (mds_cap_revoke_timeout).

The default settings on them are equivalent (60s), but only the
mds_session_timeout is communicated via the mdsmap. The
mds_cap_revoke_timeout is known only to the MDS. Neither timeout results
in anything other than warnings in the current codebase.

There is also a third setting (mds_session_autoclose) that is also
communicated via the MDSmap. Exceeding that value (default of 300s)
could eventually result in the client being blacklisted from the
cluster. The code to implement that doesn't exist yet, however.

The current codebase doesn't do any real sanity checking of these
timeouts, so the potential for admins to get them wrong is rather high.
It's hard to concoct a use-case where we'd want to warn about these
events at different intervals.

Simplify this by just removing the mds_cap_revoke_timeout setting, and
replace its use in the code with the mds_session_timeout. With that, the
client can at least determine when warnings might start showing up in
the MDS' logs.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
2017-11-14 07:27:01 -05:00
Dongsheng Yang
bf4e454a22 librbd: limit IO per second by TokenBucketThrottle
Signed-off-by: Dongsheng Yang <dongsheng.yang@easystack.cn>
2017-11-14 09:19:00 +08:00
Sage Weil
7f0d82d93b
Merge pull request #18826 from liewegas/wip-22090
qa/suite/rados: fix balancer vs firefly tunables failures

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-11-12 19:50:14 -06:00
Casey Bodley
fa8a0713de qa/rgw: add kwargs for debug output
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-11-10 13:23:03 -05:00
Sage Weil
c1c81a04f5 qa/suites/rados/perf: whitelist health warnings
We may see these as cluster stabilizes.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-10 09:00:39 -06:00
Sage Weil
6455954d29 qa/suites/rados: stop testing firefly tunables
We can't mix the balancer compat-set testing with firefly tunables because
it requires that all buckets be straw2.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-10 07:10:11 -06:00
Jason Dillaman
5094838ab9
Merge pull request #18793 from idryomov/wip-qa-cram-path
qa/tasks/cram: include /usr/sbin in the PATH for all commands

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-08 11:00:41 -05:00
Kefu Chai
7f549af459 qa: do not wait for down/out osd for pg convergence
that osd is not invovlved in the PG state changes.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-08 14:50:10 +08:00
Ilya Dryomov
8cd42742b6 qa/suites/krbd: enable generic/050 and generic/448
generic/448 has been failing on ext4 and is now fixed upstream.
The fix for generic/050 is in testing, scheduled for 4.15-rc1.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-11-07 18:05:44 +01:00
Ilya Dryomov
1591426c30 qa/tasks/cram: include /usr/sbin in the PATH for all commands
/usr/sbin is not in non-login PATH on centos.  We already do this for
workunits, see commit 0e53f5f38b ("workunit: include /usr/sbin in the
PATH for all commands").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-11-06 20:05:38 +01:00
Mykola Golub
3a09801ceb qa/suites/rbd: enable fsx deep_copy test case
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-11-06 10:29:45 +02:00
myoungwon oh
93be6f79e0 qa/suites/rados/thrash: add set_chunk test case
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2017-11-06 15:53:46 +09:00
Sage Weil
b6fe5d3d72
Merge pull request #18691 from liewegas/wip-fix-balancer
mgr/balancer: fix upmap; default balancer module enabled
2017-11-03 21:54:58 -05:00
Sage Weil
e7d4f390d6
Merge pull request #18670 from liewegas/wip-min-in-4
qa/tasks/thrashosds: set min_in default to 4

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-11-03 06:55:23 -05:00
Sage Weil
26710f0a9b mgr/balancer: enable module by default
It will still be "off".

Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-02 16:11:26 -05:00
John Spray
4fb3025682 qa: fix mgr _load_module helper
I inadvertently broke this with the latest change
to the module ls output.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-02 07:42:56 -04:00
Ilya Dryomov
c1d6f8364f
Merge pull request #18652 from idryomov/wip-qa-krbd-blkroset
qa/suites/krbd: add krbd BLKROSET test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-02 10:10:39 +01:00
Jason Dillaman
540b4cf575
Merge pull request #18619 from trociny/wip-qa-mirror-remove
qa/workunits/rbd: unnecessary sleep after failed remove

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-11-01 21:34:35 -04:00
Sage Weil
aa828be008
Merge pull request #17983 from liewegas/wip-balancer
mgr/balancer: make crush-compat mode work

Reviewed-by: John Spray <john.spray@redhat.com>
2017-11-01 14:46:07 -05:00
Yuri Weinstein
223f45e4e1
Merge pull request #18610 from cbodley/wip-osd-subop-num
osd: add processed_subop_count for cls_cxx_subop_version()

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-11-01 08:24:13 -07:00
Kefu Chai
2a3d3b87fe
Merge pull request #18659 from tchaikov/wip-21993
qa: stop testing deprecated "ceph osd create"

Reviewed-by: Sage Weil <sage@redhat.com>
2017-11-01 23:08:55 +08:00
Sage Weil
d21809b14e qa/tasks/thrashosds: set min_in default to 4
We have EC tests with k=2,m=2, so we need a min of 4.

Fixes: http://tracker.ceph.com/issues/21997
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-01 08:32:48 -05:00
Sage Weil
2c9c18d1ec qa/suites/rados/thrash/d-balancer: enable balancer in various modes
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-01 07:28:49 -05:00
John Spray
9988ebed95 qa: fix mgr caps
This was still using Kraken era settings

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-01 08:21:43 -04:00
John Spray
05e648be6a qa: expand mgr testing
Some extra coverage of the dashboard, including its standby
redirect mode and the publishing of URIs.

Also invoking the command_spam mode of the selftest module.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-11-01 08:21:42 -04:00
Kefu Chai
371d3f4091 qa: stop testing deprecated "ceph osd create"
"ceph osd create" is not idempotent, and is considered deprecated.

Fixes: http://tracker.ceph.com/issues/21993
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-01 10:45:11 +08:00
vasukulkarni
e12f7a980b
Merge pull request #18634 from tchaikov/wip-qa-add-restful
qa: add "restful" to ceph_mgr_modules in ceph-ansible suite
2017-10-31 12:36:06 -07:00
Ilya Dryomov
c4888ddc1f qa/workunits/rbd: remove set_ro.py
This test is incomplete and has been obsoleted by krbd_blkroset.t.
It's also not wired up, so it's not actually being run.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-10-31 13:40:16 +01:00
Ilya Dryomov
2275381286 qa/suites/krbd: add krbd BLKROSET test
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-10-31 13:40:16 +01:00
Kefu Chai
b4ffd03ca3
Merge pull request #18283 from tchaikov/wip-laggy-mgr
qa: reduce mon-client-hunt-interval-max-multiple to 2

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-10-31 19:03:13 +08:00
Kefu Chai
caf9ee5c60 qa: add "restful" to ceph_mgr_modules in ceph-ansible suite
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-31 11:21:58 +08:00
Casey Bodley
83d6b493f7 qa: add ceph_test_cls_log to cls workunit
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-10-30 15:31:03 -04:00
Mykola Golub
cb90069604 qa/workunits/rbd: unnecessary sleep after failed remove
Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-10-30 08:57:29 +02:00
Yuri Weinstein
51d0dbe315
Merge pull request #18594 from zmc/wip-cephmetrics
suites/cephmetrics: Add Centos 7

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2017-10-29 18:25:43 -07:00
Mykola Golub
9625274d3f
Merge pull request #18592 from dillaman/wip-21956
cls/journal: fixed possible infinite loop in expire_tags

Reviewed-by: Mykola Golub <to.my.trociny@gmail.com>
2017-10-28 09:36:30 +03:00
Jason Dillaman
27613a63d1 qa/suites/rbd: run cls tests for all dependencies
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-10-27 15:23:45 -04:00
David Zafman
f94322066f
Merge pull request #18449 from dzafman/wip-zafman-misc
mark_unfound_lost fix and some other minor changes

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-27 10:21:25 -07:00
Zack Cerza
58b7e4bbb1 Use a symlink for Ubuntu fragment
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-27 10:34:06 -06:00
Zack Cerza
53abe7119c suites/cephmetrics: Add Centos 7
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-27 10:34:06 -06:00
Casey Bodley
607b72f553 qa/rgw: add beast frontend to some rgw suites
added a qa/rgw_frontend directory for civetweb.yaml and the new
beast.yaml. the rgw suites for multifs and singleton now symlink
rgw_frontend/civetweb.yaml. the multisite, tempest and verify suites
symlink rgw_frontend to test both. this doubles the number of jobs in
those suites

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-10-27 11:20:52 -04:00
Jason Dillaman
c2b45a9db8 Merge pull request #18583 from idryomov/wip-qa-xfstests-except
qa/tasks/rbd: run all xfstests runs to completion

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-10-27 11:12:18 -04:00
Josh Durgin
a8b000fc41 Merge pull request #18543 from neha-ojha/wip-add-cbt-repo
qa: add cbt repo parameter

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-26 20:24:29 -07:00
Josh Durgin
af97e2f757 Merge pull request #18573 from neha-ojha/wip-add-cbt-workloads
qa/suites/rados/perf: add workloads

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-26 20:13:40 -07:00
Neha Ojha
9cf94276cf qa/suites/rados/perf: add workloads
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-10-26 18:10:49 -07:00
Sage Weil
7740af808c Merge pull request #18552 from liewegas/wip-mds-cluster-down
qa/workunits/rest/test.py: stop trying to test obsolte cluster_up/down

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-26 17:47:51 -05:00
vasukulkarni
b2a1fd1972 Merge pull request #18451 from zmc/wip-cephmetrics
Add cephmetrics suite
2017-10-26 11:31:19 -07:00
Ilya Dryomov
48784f3f32 qa/tasks/rbd: run all xfstests runs to completion
rbd.xfstests task allows spawning xfstests runs on multiple nodes.
Don't unwind task contexts if one of the runs fails -- let the other
runs finish.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-10-26 19:30:09 +02:00
Zack Cerza
41f55068cd Add cephmetrics suite
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-26 10:55:30 -06:00
Sage Weil
810a138133 qa/workunits/rest/test.py: stop trying to test obsolete 'mds *' commands
These were recently removed.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-26 11:22:20 -05:00
Sage Weil
c337b8739d Merge pull request #18533 from tchaikov/wip-qa-cephtool
qa: do not "ceph fs get cephfs" w/o a cephfs fs

Reviewed-by: Douglas Fuller <dfuller@redhat.com>
2017-10-26 09:19:11 -05:00
Sage Weil
f1afb1684c Merge pull request #18527 from xiexingguo/wip-hunt-scrub-mismatch
osd/PrimaryLogPG: several oi.extents fixes

Reviewed-by: Sage Weil <sage@redhat.com>
2017-10-25 21:49:45 -05:00
Patrick Donnelly
2bba5d8e0f
Merge PR #18192 into master
* refs/pull/18192/head:
	qa/cephfs: test ec data pool
	qa/suites/fs/basic_functional/clusters: more osds

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-25 17:05:38 -07:00
Neha Ojha
973e21694f qa: add cbt repo parameter
Signed-off-by: Neha Ojha <nojha@redhat.com>
2017-10-25 10:22:43 -07:00
Kefu Chai
3ceab4ca43 Merge pull request #16332 from badone/wip-warn-about-objects-with-too-many-omap-entries
osd: Warn about objects with too many omap entries

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-10-25 19:20:00 +08:00
Kefu Chai
0f1e97238c qa: do not "ceph fs get cephfs" w/o a cephfs fs
introduced by 183646c

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-25 19:12:39 +08:00
Patrick Donnelly
c58161f25b
Merge PR #17266 into master
* refs/pull/17266/head:
	qa: update test_ceph_argparse to test fs cmds
	qa: use fs rm_data_pool
	qa: fix mdsmap lookup
	qa: remove usage of mds dump
	PendingReleaseNotes: add obsoleted mds commands
	qa: remove use of obsolete mds commands
	ceph_volume_client: remove use of obsolete mds cmd
	doc: update on obsolete mds commands
	cephfs: obsolete deprecated mds commands

Reviewed-by: Douglas Fuller <dfuller@redhat.com>
2017-10-24 16:37:14 -07:00
Patrick Donnelly
b897ef9445
qa: use fs rm_data_pool
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-24 11:51:02 -07:00
Patrick Donnelly
3a5f090a1e
qa: remove usage of mds dump
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-24 11:32:43 -07:00
Brad Hubbard
71bf04775b osd: Warn about objects with too many omap entries
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2017-10-24 17:27:57 +10:00
xie xingguo
f82228c4af osd/osd_type.cc: dump extents map object_info_t
which is good for bug hunting and diagnosing.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-10-24 11:46:23 +08:00
Sage Weil
d0732fc96f qa/cephfs: test ec data pool
Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-23 21:11:24 -05:00
Jason Dillaman
918df91213 rbd: bench should flush prior to starting a test
For the new read-based bench tests, flushing prior to the start of the test
will result in the exclusive lock being acquired and the object map being
utilized.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-10-23 13:22:50 -04:00
Sage Weil
ddf3e9162d qa/suites/rados/rest/mgr-restful: whitelist more health
The test is fiddling with OSDs.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-20 23:14:36 -05:00
John Spray
bd8814bf58 Merge pull request #18021 from jcsp/wip-prometheus-disk-occupation
mgr/prometheus: add ceph_disk_occupation series

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
2017-10-20 12:28:12 +01:00
David Zafman
f918b1fac1 test: Remove bogus check in ceph_objectstore_tool.py
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-10-18 18:07:23 -07:00
Josh Durgin
a5e917fb9b Merge pull request #18145 from dzafman/wip-18162
osd: object added to missing set for backfill, but is not in recovering, error!

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-18 17:33:12 -07:00
vasukulkarni
3172a63c64 Merge pull request #18380 from zmc/wip-master-init
Changes required for teuthology's systemd support
2017-10-18 13:00:07 -07:00
Zack Cerza
f9acdc774d qa/tasks/ceph: Set owner on mds mount point
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
e606386626 qa/tasks/cephfs/filesystem: Check for mds failure
... inside Filesystem.are_daemons_healthy()

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
4516ebb6c0 qa/tasks/ceph: Remove /var/lib/ceph copypasta
Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
a653720f50 qa/tasks/ceph: Set owner on osd mount point
This is needed when running with systemd

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
Zack Cerza
16dcceae5c qa/tasks/ceph: Set owner on mon mount point
This is needed when running with systemd

Signed-off-by: Zack Cerza <zack@redhat.com>
2017-10-18 12:59:09 -06:00
David Zafman
69b5fc54fe test: Cleanup test-erasure-eio.sh code
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-10-18 11:12:14 -07:00
David Zafman
c2572bee3c test: Add replicated recovery/backfill test
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-10-18 11:12:14 -07:00
David Zafman
bb2bcb95f5 osd: Add new UnfoundBackfill and UnfoundRecovery pg transitions
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-10-18 11:01:39 -07:00
David Zafman
b9de5eec26 test: Test case that reproduces tracker 18162
recover_replicas: object added to missing set for backfill, but is not in recovering, error!

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-10-18 10:58:23 -07:00
Yuri Weinstein
aa1d93fdb5 Merge pull request #18378 from ceph/wip-yuriw-21822-master
tests - ceph-ansible vars additions

Reviewed-by: Vasu Kulkarni <vasu@redhat.com>
2017-10-18 10:19:35 -07:00
Yuri Weinstein
fe4b0aaa33 tests - ceph-ansible vars additions
added symlinks for distros

Fixes http://tracker.ceph.com/issues/21822
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
(cherry picked from commit f30e9a5e6b)
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-10-18 09:06:35 -07:00
Kefu Chai
09010d2e4c Merge pull request #18358 from tchaikov/wip-max-pg-per-osd
osd,mon: add max-pg-per-osd limit

Reviewed-by: Sage Weil <sage@redhat.com>
2017-10-18 10:44:16 +08:00
Kefu Chai
9911e905e4 Merge pull request #16745 from Liuchang0812/object-store-repair
common, tool: update kvstore-tool to repair key/value database

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-10-18 00:24:39 +08:00
Kefu Chai
4c7df944c7 osd: add max-pg-per-osd limit
osd will refused to create new pgs, until its pg number is lower
than the max-pg-per-osd upper bound setting.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 23:08:40 +08:00
Kefu Chai
ac1e13ce63 qa/tasks/ceph: add "create_rbd_pool" option
True by default, but we can opt not to create the "rbd" pool.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 11:15:32 +08:00
Jason Dillaman
ecec3217b6 Merge pull request #17134 from PCzhangPC/metaexin
rbd: export/import image-meta when we export/import an image

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-10-16 21:00:08 -04:00
Chang Liu
8d5d9c6a62 test: new test case for ceph-kvstore-tool
Signed-off-by: Chang Liu <liuchang0812@gmail.com>
2017-10-16 22:52:10 +08:00
Kefu Chai
358f53cbc4 qa: reduce mon-client-hunt-interval-max-multiple to 2
monclient could experience frequent connection issues if "ms inject
socket failures" is enabled with high frequency. so reduce this number
to relieve the symptom.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-13 12:06:18 +08:00
Yuri Weinstein
862463dbfc Merge pull request #18246 from liewegas/wip-seesaw-warning
qa/suites/rados/singleton/all/mon-seesaw: whitelist MON_DOWN

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-10-12 12:52:42 -07:00
Kefu Chai
357fc97b67 Merge pull request #18235 from tchaikov/wip-qa-backfilling
qa: s/backfill/backfilling/

Reviewed-by: Sage Weil <sage@redhat.com>
2017-10-12 14:54:41 +08:00
John Spray
d96a59e74b qa/mgr: fix influx/prometheus test names
This was a typo: they were swapped around.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-10-11 17:00:01 +01:00
Sage Weil
0a886bc9f3 qa/suites/rados/singleton/all/mon-seesaw: whitelist MON_DOWN
Mgr can get marked down when mon weirdness is happening.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-11 08:24:50 -05:00
Kefu Chai
e21114274f qa: s/backfill/backfilling/
it's renamed "backfilling" in 4015343f .

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-11 11:52:43 +08:00
Sage Weil
998122c324 qa/suites/fs/basic_functional/clusters: more osds
Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-09 14:35:12 -05:00
Casey Bodley
160240e0c1 qa/rgw: ignore errors from 'pool application enable'
Fixes: http://tracker.ceph.com/issues/21715

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-10-09 10:25:26 -04:00
Sage Weil
96ddf5c3a0 Merge pull request #17708 from liewegas/wip-pg
osd: initial minimal efforts to clean up PG interface
2017-10-08 21:47:49 -05:00
Sage Weil
6f72b404f4 Merge pull request #18144 from hjwsm1989/wip-21425-part2
qa/suites/rest/basic/tasks/rest_test: whiltelist OSD_DOWN

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-10-08 21:47:02 -05:00
Sage Weil
b4e4ad3b85 Merge pull request #18122 from liewegas/wip-monthrash-peering
qa/suites/rados/monthrash: tolerate PG_AVAILABILITY during mon thrashing
2017-10-06 15:32:45 -05:00
Sage Weil
b6a5c09dba ceph-objectstore-tool: remove rm-past-intervals op
The OSD doesn't rebuild this on demand anymore.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-06 13:08:18 -05:00
Sage Weil
886606bfd7 qa/standalone/scrub/osd-scrub-repair.sh: drop omap_digest flag
This is no longer set if we are backed by bluestore, which we are by
default.  See be078c8b7b

Signed-off-by: Sage Weil <sage@redhat.com>
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-10-06 19:25:40 +08:00
huangjun
838e12cfc8 qa/suites/rest/basic/tasks/rest_test: whiltelist OSD_DOWN
Fixes: http://tracker.ceph.com/issues/21425
Signed-off-by: huangjun <huangjun@xsky.com>
2017-10-06 11:59:50 +08:00
Sage Weil
eaa350be95 Merge pull request #18094 from xiexingguo/wip-tracker-21618
qa/standalone/scrub/osd-scrub-repair.sh: add extents flag into object_info_t

Reviewed-by: Sage Weil <sage@redhat.com>
2017-10-05 11:14:01 -05:00
Sage Weil
f26c3c79a6 Merge pull request #18079 from liewegas/wip-21618
qa/standalone/scrub/osd-scrub-repair: no -y to diff

Reviewed-by: David Zafman <dzafman@redhat.com>
2017-10-04 22:05:20 -05:00
Sage Weil
9c1d621083 qa/suites/rados/monthrash: tolerate PG_AVAILABILITY during mon thrashing
Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-04 21:26:56 -05:00
Mykola Golub
8152fecd37 Merge pull request #18097 from dillaman/wip-21663
qa/workunits/rbd: fixed variable name for resync image id

Reviewed-by: Mykola Golub <to.my.trociny@gmail.com>
2017-10-03 23:39:54 +03:00
Sage Weil
15b63d6795 qa/standalone/scrub/osd-scrub-repair: no -y to diff
With -y you can't see the entire line when it is long, which is
needed to identify the diff failure in
http://tracker.ceph.com/issues/21618

Instead, let the interactive user specify the option if they want it.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-03 14:35:35 -05:00
Yuri Weinstein
7b1c77a643 Merge pull request #18078 from liewegas/wip-21614
qa/suites/rados/singleton/all/recover-preemption: handle slow starting osd

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-10-03 12:33:29 -07:00
Jason Dillaman
22816ed3f4 qa/workunits/rbd: fixed variable name for resync image id
Fixes: http://tracker.ceph.com/issues/21663
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-10-03 13:25:33 -04:00
Josh Durgin
4570075984 Merge pull request #17576 from ceph/wip-rm-1-minsize
qa/tests/rados: Remove unsupported 2-size-1-min-size config

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-10-03 09:45:45 -07:00
xie xingguo
2470ab4aba qa/standalone/scrub/osd-scrub-repair.sh: add extents flag into object_info_t
Introduced-by: https://github.com/ceph/ceph/pull/15199
Fixes: http://tracker.ceph.com/issues/21618
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2017-10-03 21:14:53 +08:00
Sage Weil
76d84ac194 qa/suites/rados/singleton/all/recover-preemption: handle slow starting osd
The OSD may not be marked up yet; set the config via the admin
socket.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-02 07:20:57 -05:00
Jason Dillaman
a2702d70dc Merge pull request #18067 from trociny/wip-qa-pg_num
qa/workunits/rbd: pool create may fail for small cluster

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-10-02 08:20:09 -04:00
John Spray
47bfe6cf17 Merge pull request #17735 from jcsp/wip-mgr-perf-interface
mgr: common interface for TSDB modules

Reviewed-by: My Do <mhdo@umich.edu>
Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2017-10-02 11:12:35 +01:00
Patrick Donnelly
183646c919
qa: remove use of obsolete mds commands
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-10-01 17:22:36 -07:00
Mykola Golub
fcaffe7e01 qa/workunits/rbd: pool create may fail for small cluster
ceph osd pool create test 100
Error ERANGE:  pg_num 100 size 3 would mean 648 total pgs, which exceeds max 600 (mon_max_pg_per_osd 200 * num_in_osds 3)

Signed-off-by: Mykola Golub <to.my.trociny@gmail.com>
2017-10-01 16:36:17 +02:00
Mykola Golub
0c9c894acf Merge pull request #18026 from dillaman/wip-21535
rbd-mirror: sync image metadata when transfering remote image

Reviewed-by: Mykola Golub <to.my.trociny@gmail.com>
2017-10-01 12:27:26 +03:00
Patrick Donnelly
534c30aca4
Merge PR #18041 into master
* refs/remotes/upstream/pull/18041/head:
	qa: relax cap expected value check
2017-09-30 17:43:56 -07:00
Patrick Donnelly
e48683e35d
Merge PR #17935 into master
* refs/remotes/upstream/pull/17935/head:
	ceph_volume_client: fix setting caps for IDs

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-30 17:43:52 -07:00
Kefu Chai
3dfe209499 Merge pull request #17955 from asomers/bin_bash2
test: fix bash path in shebangs (part 2)

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-09-30 12:13:35 +08:00
Jason Dillaman
ae1530bbfb Merge pull request #17971 from idryomov/wip-krbd-exclude-shared-298
qa/suites/krbd: exclude shared/298

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-09-29 16:07:59 -04:00
David Zafman
2f466f8b26 Merge pull request #17920 from dzafman/wip-21382
Erasure code recovery should send additional reads if necessary

Fixes: http://tracker.ceph.com/issues/21382

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-09-29 09:04:43 -07:00
Patrick Donnelly
b37c7f7db7
qa: relax cap expected value check
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-09-29 08:48:14 -07:00
Ramana Raja
baf3b88800 ceph_volume_client: fix setting caps for IDs
... that have empty OSD and MDS caps. Don't add a ',' at the
start of OSD and MDS caps.

Fixes: http://tracker.ceph.com/issues/21501
Signed-off-by: Ramana Raja <rraja@redhat.com>
2017-09-29 17:06:05 +05:30
David Zafman
1235810c2a osd: Allow recovery to send additional reads
For now it doesn't include non-acting OSDs
Added test for this case

Signed-off-by: David Zafman <dzafman@redhat.com>
2017-09-28 23:31:18 -07:00