Commit Graph

4756 Commits

Author SHA1 Message Date
xie xingguo
6a8aedc107 qa: add new test case for pulling error
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-04-04 11:04:43 +08:00
David Zafman
11f072fee1 Add checking of num_shards_repaired in osd stats
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-04-04 11:04:42 +08:00
Sage Weil
53a036cc32 Merge PR #27249 into master
* refs/pull/27249/head:
	qa/suites/rados/thrash-old-clients: fix up v1/v2 options

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-03-29 17:58:53 -05:00
Yuri Weinstein
a016563804 qa/tests: added nautilus-x run
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2019-03-28 13:03:24 -07:00
Sage Weil
e58bf34386 qa/suites/rados/thrash-old-clients: fix up v1/v2 options
- don't specify ceph.py options in the ceph.py
- instead, specify them in the per-version facet

Note that we don't currently have a way to do v2 only for the mon IPs, so
in the v2only cases, we are still binding the mons to v1.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-28 14:06:58 -05:00
Lenz Grimmer
96a65fbfb7
Merge pull request #26914 from votdev/issue_38331
mgr/dashboard: Add separate option to config SSL port

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2019-03-28 10:55:27 +01:00
Yuri Weinstein
47d69860bf qa/tests: added nautilus-p2p to cron
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2019-03-27 09:26:10 -07:00
Yuri Weinstein
f71b36e28a
Merge pull request #27165 from yuriw/wip-yuriw-crontab-master_2
qa/tests: reduced number of jobs for `kcephfs`
2019-03-27 09:19:56 -07:00
Sage Weil
420edba243 Merge PR #27169 into master
* refs/pull/27169/head:
	common/config: parse --default-$option as a default value

Reviewed-by: Sébastien Han <seb@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-03-27 09:48:33 -05:00
Sage Weil
64addc597a Merge PR #27187 into master
* refs/pull/27187/head:
	mgr/dashboard/qa: Increase timeout for test_disable (tasks.mgr.dashboard.test_mgr_module.MgrModuleTelemetryTest)

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
2019-03-27 09:43:21 -05:00
David Zafman
33357740f5
Merge pull request #26942 from dzafman/wip-38616
Feature: Improvements to auto repair

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-03-26 16:48:09 -07:00
Sage Weil
1dad9d2fe1 Merge PR #27155 into master
* refs/pull/27155/head:
	mon/ConfigMonitor: make 'config reset' idempotent

Reviewed-by: Sage Weil <sage@redhat.com>
2019-03-26 11:35:45 -05:00
Sage Weil
fdd2000631 common/config: parse --default-$option as a default value
Sometimes it is useful to specify an alternative default value for an
option via the command line such that it has a lower priority than the
mon config database, config file, the rest of the command line, or the
environment.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-26 11:00:27 -05:00
Volker Theile
91fe2f0799 mgr/dashboard/qa: Increase timeout for test_disable (tasks.mgr.dashboard.test_mgr_module.MgrModuleTelemetryTest)
Fixes: https://tracker.ceph.com/issues/38820

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-03-26 15:43:54 +01:00
Ilya Dryomov
0ba913ba2c
Merge pull request #27192 from idryomov/wip-krbd-discard-fixups
qa/workunits/rbd: fixups for the new krbd discard behavior

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-03-26 14:09:41 +01:00
Lenz Grimmer
7c1f1efeb5
Merge pull request #26925 from votdev/issue_38265
mgr/dashboard/qa: Improve tasks.mgr.test_dashboard.TestDashboard.test_standby

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2019-03-26 13:44:19 +01:00
Jason Dillaman
6e32108204
Merge pull request #27137 from trociny/wip-qa-rbd-qos
qa: add RBD QOS functional test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-03-26 08:21:18 -04:00
David Zafman
57abdb11fa osd, test: Add num_shards_repaired to osd_stat_t for pushes with repair set 3(3)
Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-25 16:03:36 -07:00
David Zafman
d2ca3d2feb osd: Track num_objects_repaired in pg stats 2(3)
Leave repair pg state on until recovery finishes or a new scrub starts

Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-25 16:03:36 -07:00
Ilya Dryomov
f3f757aaa3 qa/workunits/rbd: fixups for the new krbd discard behavior
With discard_granularity set to alloc_size, we no longer get object
size alignment from blk_bio_discard_split().

This assumption is pretty deeply ingrained in krbd_data_pool.sh, so
make it explicit.  For krbd_fallocate.sh, just fix the expectation.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2019-03-25 22:06:14 +01:00
Ilya Dryomov
673673f53d qa: krbd_fallocate.sh: zero can be munged to truncate
The test case is issuing discards that span two objects: the tail of
the first object is truncated, the head of the second object is zeroed.
These discards aren't serial, so there is a race:

  discard i ~ i + 1: truncate i, zero i + 1
  discard i + 1 ~ i + 2: truncate i + 1, zero i + 2

can be executed as

  truncate i + 1, zero i + 2, truncate i, zero i + 1

For object i + 1, the sequence ends up being truncate tail, then zero
head.  This zero op is munged to truncate on the OSD, resulting in size
0 instead of OBJECT_SIZE / 2.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2019-03-25 22:06:14 +01:00
Casey Bodley
9fdcaae19a
Merge pull request #26100 from ceph/wip-s3a-hadoop-ver
qa/tests: update s3a hadoop versions used for test

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2019-03-25 13:22:39 -04:00
Yuri Weinstein
c480715253 qa/tests: reduced number of jobs for kcephfs suite
now we get 502 jobs and run 3 times a week
this change will generate/run 44 jobs every day

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2019-03-25 10:02:24 -07:00
xie xingguo
794e3dfc80 mon/ConfigMonitor: make 'config reset' idempotent
This partially revert 1bc9c86d08.
It's generally not a good idea if a command is not idempotent.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-25 20:35:42 +08:00
Xie Xingguo
c9dbaf279e
Merge pull request #27090 from xiexingguo/wip-config-mon-core
mon/ConfigMonitor: make 'num' of 'config reset' optional; allow target version 0

Reviewed-by: Sage Weil <sage@redhat.com>
2019-03-25 13:28:41 +08:00
xie xingguo
1bc9c86d08 mon/ConfigMonitor: make 'num' of 'config reset' command optional
This way it can be used to fast cancel/undo the last command.
Also make the tip message a litter bit nicer..

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-25 09:41:05 +08:00
Sage Weil
2d920de190 Merge PR #27147 into master
* refs/pull/27147/head:
	qa/workunits/mon/config.sh: s|bin/ceph|ceph|

Reviewed-by: Sage Weil <sage@redhat.com>
2019-03-24 10:25:45 -05:00
Sage Weil
25ae57b651 Merge PR #27138 into master
* refs/pull/27138/head:
	qa/suites/upgrade/nautilus-x: misc updates

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2019-03-23 15:09:29 -05:00
David Zafman
2202e5d0b1 test, osd: Improvements to auto_repair 1(3)
Allow auto_repair for replicated bluestore pools
Regular scrub within auto repair parameters will trigger deep scrub
New state failed_repair if PG repair attempt could not fix everything
Set failed_repair if not possible to repair anything

Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-23 09:52:40 -07:00
David Zafman
315d324889 test: osd-scrub-repair.sh: use corrupt_and_repair_lrc for lrc tests
Fix for argument handling of create_ec_pool()
Always pass a value for allow_overwrites for consistency

Caused by: 3ca750d41d

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-23 09:52:40 -07:00
Kefu Chai
b8082ebc56 qa/workunits/mon/config.sh: s|bin/ceph|ceph|
so we can verify that ceph is able to reject wrong debug levels.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-03-23 22:43:08 +08:00
Neha Ojha
6931552f47
Merge pull request #27140 from neha-ojha/wip-fix-cron
qa/crontab/teuthology-cronjobs: fix suite-branch

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2019-03-22 16:33:09 -07:00
Yuri Weinstein
bb257f10f7
Merge pull request #27129 from yuriw/wip-yuriw-subsets-master
qa/tests: added the subset clause for nautilus branch
2019-03-22 15:45:50 -07:00
Neha Ojha
a21e183834 qa/crontab/teuthology-cronjobs: fix suite-branch
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-03-22 15:29:04 -07:00
Sage Weil
f3499b10b3 qa/suites/upgrade/nautilus-x: misc updates
- start with msgr2 enabled (defaults)
- run nautilus branch for workunits
- drop msgr2 enable step at teh end
- add octopus placeholder (although it is empty for now)

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-22 16:38:27 -05:00
Sage Weil
d3856ea21c Merge PR #27112 into master
* refs/pull/27112/head:
	qa/suites: do not test luminous-x upgrade path

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-03-22 16:26:53 -05:00
Neha Ojha
abdfb8b286
Merge pull request #27083 from neha-ojha/wip-38635
qa/objectstore: test with reduced value of osd_memory_target

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Mark Nelson <mnelson@redhat.com>
2019-03-22 10:07:50 -07:00
Yuri Weinstein
2aec1a824c qa/tests: added the subset clause for nuatilis branch
--subset x/2999 should generate ~250 jobs as result

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2019-03-22 09:39:38 -07:00
Mykola Golub
1a2243403d qa/suites/rbd: enable QOS functional test
Signed-off-by: Mykola Golub <mgolub@suse.com>
2019-03-22 14:59:01 +00:00
Mykola Golub
63f3d7e377 qa/workunits/rbd: add QOS functional test
Signed-off-by: Mykola Golub <mgolub@suse.com>
2019-03-22 14:58:22 +00:00
Kefu Chai
1231a306ec qa/suites: do not test luminous-x upgrade path
in this change,

* suites/rados/upgrade: luminous-x-singleton => mimic-x-singleton
* suites/upgrade: luminous-x => nautilus-x

we support upgrade from n to n+2 release. otherwise monitor refuses to
do so:

mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous)
which is >2 releases older than me 15 (octopus), stopping.

Fixes: https://tracker.ceph.com/issues/38845
Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-03-22 22:39:42 +08:00
Sage Weil
be1187575b Merge PR #27021 into master
* refs/pull/27021/head:
	msg: remove XioMessenger
	qa/suites/rados/thrash-old-clients: add nautilus
	qa/suites/rados/thrash-old-clients: add mimic v1 variant
	qa/suites/rados/thrash-old-clients: add mimic
	qa/suites/rados/thrash-old-clients: collapse msgr and client choice
	qa: remove simplemessenger tests
	ceph_test_msgr: remove simple
	msg: remove SimpleMessenger

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-22 04:42:30 -05:00
Kefu Chai
32df73f9f2
Merge pull request #26940 from xiexingguo/wip-monc-add-con
mon/MonClient: weight-based mon selection

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-22 15:43:04 +08:00
Kefu Chai
f2b3bfa3aa
Merge pull request #26955 from liewegas/wip-slow-add
crush: various fixes for weight-sets, the osd_crush_update_weight_set option, and tests

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-22 15:42:13 +08:00
Volker Theile
c29e587007 mgr/dashboard/qa: Improve tasks.mgr.test_dashboard.TestDashboard.test_standby
The changes of this PR were done while trying to fix the failing test. The problem has been solved by another PR, but the changes are worth to be integrated because they help debugging and an additional test has been added (check if previously active manager is listed as standby).

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-03-21 15:26:52 +01:00
Sebastian Wagner
cee38745ea
Merge pull request #26404 from sebastian-philipp/orchestrator-error-handling
mgr/orchestrator: Add error handling to interface

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Tim Serong <tserong@suse.com>
2019-03-21 12:30:13 +01:00
Sage Weil
cc9a9142fd msg: remove XioMessenger
There is a lot of good stuff going on here, but nobody is investing in xio
and it is not expected to be the path forward for RDMA.  If that ever
changes, we can resurrect the code.  Until then, let's clean up the tree
and reduce friction for changes going forward.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-21 00:57:33 -05:00
Neha Ojha
1432f5544f qa/objectstore: test with reduced value of osd_memory_target
This addresses Part 1 of https://tracker.ceph.com/issues/38635.

Fixes: https://tracker.ceph.com/issues/38635
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-03-20 21:37:06 -07:00
Yuri Weinstein
33d929373e
Merge pull request #27057 from yuriw/wip-yuriw-crontab
qa/tests: reduced frequency for luminous and mimic runs...
2019-03-20 16:36:54 -07:00
Yuri Weinstein
afb47c6264 qa/tests: reduced frequency for luminous and mimic runs
added all suites for nauitlus

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2019-03-20 13:26:58 -07:00