Commit Graph

4190 Commits

Author SHA1 Message Date
Sage Weil
98fc7ebc99 Merge PR #24184 into master
* refs/pull/24184/head:
	mgr/DaemonServer: remove any upmaps on merging PGs
	mgr/DaemonServer: prevent merge if either pg is remapped|upmap
	mgr/DaemonServer: move pending merge check for more consistent code
	qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
	mgr/DaemonServer: add option to bypass careful throttling for thrasher
	PendingReleaseNotes: note about mgr/balancer/max_misplaced change
	mgr/DaemonServer: remove stale/misleading check
	mgr/DaemonServer: throttle pgp_num changes based on misplaced %
	mgr/DaemonServer: block pg_num decrease(merge) until pgp_num is reduced
	mgr/DaemonServer: adjust_pgs(): cosmetic change to debug output
	mon/PGMap: add get_recovery_stats()
	mgr/balancer: mgr/balancer/max_misplaced -> pg_max_misplaced
	pybind/mgr/mgr_module: add get_option()
	mgr/DaemonServer: allow pg_num increases that abort pending merges
	mon/OSDMonitor: resent pre-nautilus client ops on aborted merge
	mon/OSDMonitor: make pgp_num track pg_num more consistently

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-20 16:40:22 -05:00
Sage Weil
86ae8fb6b8 qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
Thrash such that we still exercise the careful throttling in the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Sage Weil
70ec5bda23 mgr/DaemonServer: add option to bypass careful throttling for thrasher
Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Patrick Donnelly
b1dca00f90
Merge PR #24503 into master
* refs/pull/24503/head:
	qa: increase timeout for cleanup

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2018-10-17 10:18:50 -07:00
Patrick Donnelly
6345c3f80d
Merge PR #24455 into master
* refs/pull/24455/head:
	qa: use timeout for fs asok operations

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2018-10-17 10:17:29 -07:00
Lenz Grimmer
a848953f28
Merge pull request #24617 from p-na/fix-python3-issue
mgr/dashboard: Fix Python3 issue

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2018-10-17 18:06:58 +02:00
Sage Weil
f8b6cbda34 Merge PR #24359 into master
* refs/pull/24359/head:
	qa/tests: update ansible version to 2.6 for master branch testing.
	qa/tests: use lvm as default for ceph-ansible testing, this should also work with raw devices

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2018-10-17 10:06:42 -05:00
Sage Weil
54d539d79a Merge PR #24603 into master
* refs/pull/24603/head:
	crush: get "ceph osd crush class create/rm" back

Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-17 10:06:26 -05:00
Patrick Nawracay
5c0c122597 mgr/dashboard: Fix Python3 issue
Which results in a 500 error when trying to access the `Performance
Counter` tab on the OSD list.

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-17 14:02:12 +02:00
Sage Weil
05faeb4d12 Merge PR #24579 into master
* refs/pull/24579/head:
	qa/osd: fixup osd-rep-recov-eio.sh fails to parse pg dump

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-16 07:17:59 -05:00
John Spray
e6a26aeff7
Merge pull request #24597 from batrick/i36450
qa: fix run call args

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
2018-10-16 13:09:08 +01:00
Patrick Donnelly
6a4cc58a9d
Merge PR #24292 into master
* refs/pull/24292/head:
	qa: add test for rctime on root inode
	mds: set rctime on new system inode
	mds: small refactor

Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-10-15 21:31:04 -07:00
xie xingguo
d7ff33e9fd crush: get "ceph osd crush class create/rm" back
This reverts a27fd9d25c and
b863883ca7.

Quote form Sébastien Han:
> IIRC at some point, we were able to create a device class from the CLI.
Now it seems that the device class gets created when at least one OSD
of a particular class starts.
In ceph-ansible, we create pools after the initial monitors are up and
we want to assign a device crush class on some of them.
That's not possible at the moment since there no device class available yet.
Also, someone might want to create its own device class.
Something as crazy as running Filestore with a tmpfs osd store and
might want to isolate them.
I know it's a very limited use case, but still, it could be desired.

See also https://www.spinics.net/lists/ceph-devel/msg41152.html

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-16 08:45:49 +08:00
Patrick Donnelly
d491227956
qa: fix run call args
Fixes: http://tracker.ceph.com/issues/36450
Introduced-by: 95746ecce9
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-15 14:45:18 -07:00
huanwen ren
f1219d716d qa/osd: fixup osd-rep-recov-eio.sh fails to parse pg dump
Fixes: http://tracker.ceph.com/issues/36418
Signed-off-by: huanwen ren <ren.huanwen@zte.com.cn>
2018-10-16 02:18:22 +08:00
Sage Weil
7ac6ab4b2f Merge PR #24494 into master
* refs/pull/24494/head:
	ceph-kvstore-tool: rename repair -> destructive-repair

Reviewed-by: Neha Ojha <nojha@redhat.com>
2018-10-14 13:11:11 -05:00
Sage Weil
8cc6369511 ceph-kvstore-tool: rename repair -> destructive-repair
This is shown to corrupt otherwise healthy rocksdb databases.  Rename to
make it clear that it is generally not safe to run and shoud only be used
as a last resort.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-14 11:41:24 -05:00
Mykola Golub
1d92788f71
Merge pull request #24563 from dillaman/wip-36410
test: move OpenStack devstack test to rocky release

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-10-14 10:40:05 +03:00
Sage Weil
9db328f2ab Merge PR #24204 into master
* refs/pull/24204/head:
	qa/suites/rgw/tempest: valgrind on centos only

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2018-10-12 16:29:44 -05:00
Jason Dillaman
1e3dc02604 qa/tasks/workunit: use suite branch/SHA1 when cloning workunits
Right now it's using the Ceph branch/SHA1 but it's using the suite
Git URL.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-12 12:41:58 -04:00
Jason Dillaman
2a1dca3fca qa/workunits/rbd: switch devstack to rocky branch and tempest to 19.0.0 tag
Fixes: http://tracker.ceph.com/issues/36410
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-12 08:40:44 -04:00
Jason Dillaman
ba32395165
Merge pull request #24550 from dillaman/wip-36409
qa/workunits: replace 'realpath' with 'readlink -f' in fsstress.sh

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-12 08:39:37 -04:00
Jason Dillaman
c1f950236b qa/tasks: qemu task now uses a relative path in suite repo for test
This makes it easier to re-run tests against a suite branch without
requiring a full ceph-ci build and repo.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 20:23:39 -04:00
Jason Dillaman
ddb7e5eb80 qa/workunits: replace 'realpath' with 'readlink -f' in fsstress.sh
Fixes: http://tracker.ceph.com/issues/36409
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 15:25:20 -04:00
Jason Dillaman
c60ffac61e
Merge pull request #24428 from trociny/wip-rbd-config-global
rbd: add 'config global' command to get/store overrides in mon config db

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 10:50:33 -04:00
Patrick Donnelly
d03a8302b5
Merge PR #23069 into master
* refs/pull/23069/head:
	tests/libcephfs: add simple reclaim test
	mds: check auth name before reclaiming session
	mds: reclaim session before allowing mds to become active
	mds: allow client to specify its session timeout
	mds: initial code for client states reclaim

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-10 15:29:41 -07:00
Mykola Golub
c552f166c3
Merge pull request #24476 from dillaman/wip-35944
librbd: add new configuration option to always move deleted items to the trash

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-10-10 18:40:14 +03:00
Jason Dillaman
a6a2f395b7
Merge pull request #24512 from trociny/wip-qa-namespace-create_args
qa/workunits/rbd: force v2 image format for namespace test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-10 10:42:08 -04:00
Mykola Golub
b3e4d43c15 rbd: add 'config global' command to get/store overrides in mon config db
Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-10 17:31:32 +03:00
Jason Dillaman
97950fec27 qa/workunits/rbd: test new move-to-trash on image remove config override
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-10 09:08:03 -04:00
John Spray
67d147c00d
Merge pull request #23622 from renhwztetecs/renhw-wip-25103
mgr: fixup pgs show in unknown state

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 13:28:33 +01:00
John Spray
5a39f0a9aa
Merge pull request #23970 from noahdesu/mgr-always-on-mods-v2
mgr: always on modules v2

Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 12:17:41 +01:00
John Spray
17778d217c
Merge pull request #24239 from votdev/python_ceph_audit_log
mgr: Add ability to trigger a cluster/audit log message from Python

Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 11:47:49 +01:00
Mykola Golub
24e1a6bd47 qa/workunits/rbd: force v2 image format for namespace test
While here, fix test_locking too.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-10 10:50:30 +03:00
Patrick Donnelly
dc6e7eadf8
qa: increase timeout for cleanup
Apparently 15m is not long enough for some workunits like fsstress.

Fixes: http://tracker.ceph.com/issues/36365

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-09 13:31:26 -07:00
Ricardo Marques
13af4b52f7
Merge pull request #21614 from Devp00l/pool-creation-form
mgr/dashboard: Pool management

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-10-09 17:29:09 +01:00
Stephan Müller
8e56421bcd mgr/dashboard: Update PG update test
It is now commented out like it was before,
but I've added a comment what happened during this test with the QA
system. The problem was that even with only a increase of 1 PG the QA
cluster went into a cluster warning state and did not recover in time.
The QA coverage timeout is 2 minutes.

I could not reproduce this behavior with a local cluster, but I've
added a loop to wait until pgp and pg number are equal and the cluster
is in a healthy state again. This can take locally about 5 seconds.
The internal loop has a timeout of 3 minutes.

Fixes: https://tracker.ceph.com/issues/36362
Signed-off-by: Stephan Müller <smueller@suse.com>
2018-10-09 15:56:54 +02:00
Stephan Müller
c98c8055c2 mgr/dashboard: Unset compression arguments for pools
The dashboard backend can now unset all set compression arguments if the
compression mode is switched to 'unset'. In the case of 'unset' Ceph
itself will only delete the 'compression_mode' argument, not all other
set arguments. The other arguments that should be removed, too, are
added to the update arguments in order to delete all set arguments.

Fixes: https://tracker.ceph.com/issues/36355
Signed-off-by: Stephan Müller <smueller@suse.com>
2018-10-09 15:56:47 +02:00
Tatjana Dehler
18d91e2419 mgr/dashboard: add backend method to create config options
Fixes: http://tracker.ceph.com/issues/24455

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2018-10-09 14:50:00 +02:00
Tatjana Dehler
2a630be872 mgr/dashboard: refactor '_get_mon_allow_pool_delete_config'
Refactor '_get_mon_allow_pool_delete_config' method to be a little bit
more general. The method can now be used to get the value of every
config option known to the cluster.

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2018-10-09 14:47:24 +02:00
Jason Dillaman
82a98163af
Merge pull request #24457 from trociny/wip-qa-cli-namespace
qa/workunits/rbd: fix cli generic namespace test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-08 15:15:30 -04:00
Patrick Nawracay
a4956b40b2 mgr/dashboard: Fix errors when clicking on new OSD
Fixes: http://tracker.ceph.com/issues/36245

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-08 10:45:58 +02:00
Patrick Nawracay
0f0de0ce2a mgr/dashboard: Add type hints to QA helper
Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-08 10:42:14 +02:00
Mykola Golub
f2d72d46d4 qa/workunits/rbd: fix cli generic namespace test
expect_fail incorrectly unset '-e' option and if a consequent test
failed it did not abort the execution. And two typos in the namespace
tests were not detected due to this.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-07 15:09:25 +03:00
Patrick Donnelly
296d89b046
qa: use timeout for fs asok operations
Otherwise a bug preventing an asok operation from completing will cause the
entire job to fail.

Fixes: http://tracker.ceph.com/issues/36335

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-06 10:32:50 -07:00
Jeff Layton
9bca82d00e tests/libcephfs: add simple reclaim test
Signed-off-by: Jeff Layton <jlayton@redhat.com>
2018-10-05 10:18:28 -04:00
Volker Theile
34525ba3af Relocate cluster_log(). Only active modules can use it.
Signed-off-by: Volker Theile <vtheile@suse.com>
2018-10-05 14:46:58 +02:00
Volker Theile
95746ecce9 mgr: Add ability to trigger a cluster/audit log message from Python
Fixes: https://tracker.ceph.com/issues/36194

Signed-off-by: Volker Theile <vtheile@suse.com>
2018-10-04 13:33:18 +02:00
Ricardo Dias
43a72f0e49
Merge pull request #24350 from ricardoasmarques/wip-json-decode
mgr/dashboard: Check content-type before decode json response

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Ricardo Dias <rdias@suse.com>
2018-10-04 10:08:37 +01:00
Patrick Donnelly
ddbc3016a5
qa: add test for rctime on root inode
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-03 10:10:47 -07:00