4373 Commits

Author SHA1 Message Date
Ricardo Dias
2f5e7c3392
mgr/dashboard: backend: JWT based authentication
Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-10-29 15:47:14 +00:00
Sage Weil
c40685ebdd Merge PR #24787 into master
* refs/pull/24787/head:
	Merge PR #24796 into nautilus
	osd: fix heartbeat_reset unlock
	Merge PR #24780 into nautilus
	Merge PR #24761 into nautilus
	Merge PR #24651 into nautilus
	osd: fix race between op_wq and context_queue
	test: Make sure kill_daemons failure will be easy to find
	test: Add flush_pg_stats to make test more deterministic
2018-10-29 08:36:34 -05:00
Igor Fedotov
5d38f8b49b qa/standtalone/osd-bluefs-volume-ops: remove redundant code.
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
2018-10-29 16:30:36 +03:00
Ganesh Maharaj Mahalingam
625868974b fix python collections module warning for v3.7 and above
Python 3.7 now shows a warning as below.

/usr/bin/ceph:128: DeprecationWarning: Using or importing the ABCs from
'collections' instead of from 'collections.abc' is deprecated, and in
3.8 it will stop working
  import rados

This patch addresses that particular issue.

Signed-off-by: Ganesh Maharaj Mahalingam <ganesh.mahalingam@intel.com>
2018-10-28 23:32:22 -07:00
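The warning quoted in commit 625868974b above is usually avoided by importing the abstract base classes from `collections.abc`. The patch itself is not shown here, so the following is only a minimal sketch of that common pattern; the helper `is_mapping` is a hypothetical name used for illustration.

```python
# Prefer collections.abc (available since Python 3.3); fall back to the
# old location only on interpreters that lack it (Python 2).
try:
    from collections.abc import Mapping
except ImportError:
    from collections import Mapping


def is_mapping(obj):
    """Hypothetical helper: True if obj behaves like a read-only dict."""
    return isinstance(obj, Mapping)


if __name__ == "__main__":
    print(is_mapping({"a": 1}))
    print(is_mapping([1, 2, 3]))
```

The try/except keeps one code path working on both old and new interpreters without triggering the DeprecationWarning on Python 3.7.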
Sage Weil
143f601155 Merge PR #24651 into nautilus
* refs/pull/24651/head:
	test: Make sure kill_daemons failure will be easy to find
	test: Add flush_pg_stats to make test more deterministic

Reviewed-by: Neha Ojha <nojha@redhat.com>
2018-10-26 21:07:09 -05:00
Xie Xingguo
e6f9241aeb
Merge pull request #24657 from xiexingguo/wip-rm-device-class-fix
mon/OSDMonitor: two "ceph osd crush class rm" fixes

Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-27 09:49:57 +08:00
Sage Weil
80bb0664d6 Merge PR #24713 into master
* refs/pull/24713/head:
	mon: drop repeated 'goodchars' and add osd crush ls testcase

Reviewed-by: João Eduardo Luís <joao@suse.de>
2018-10-26 09:18:13 -05:00
John Spray
c5fd31dfcc
Merge pull request #24767 from votdev/issue_36581
mgr/dashboard/qa: Fix various vstart_runner.py issues

Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-26 15:09:59 +01:00
Lenz Grimmer
0c84be2306
Merge pull request #24727 from zmc/wip-dashboard-gzip
mgr/dashboard: Enable gzip compression

Reviewed-by: Ricardo Dias <rdias@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2018-10-26 11:05:27 +02:00
Volker Theile
00e3022710 mgr/dashboard/qa: CephfsTest - admin_socket() got an unexpected keyword argument 'timeout'
Adapt method arguments of LocalRemote::run() according to teuthology.orchestra.run.run() (see https://github.com/ceph/teuthology/blob/master/teuthology/orchestra/run.py#L364) to be able to run QA tests locally in a vstart cluster.

Fixes: http://tracker.ceph.com/issues/36581
Signed-off-by: Volker Theile <vtheile@suse.com>
2018-10-26 09:59:11 +02:00
Zack Cerza
03de8f9557 mgr/dashboard: Enable gzip compression
This is related to http://tracker.ceph.com/issues/36453. It is far from
a complete solution, but seems like a positive move.

I tested this change by first disabling my browser cache, and then used
the /docs endpoint to query /api/dashboard/health. Before compression:
  Content-Length: 60748
  Time: 615ms
After:
  Content-Length: 7505
  Time: 92ms

Then, I logged into the dashboard as normal and reloaded the page once I
was in. Some values for the reload operation before compression:
  Total page load time: 58.48s
  vendor.js Content-Length: 6486025
  vendor.js time: 48.09s
After:
  Total page load time: 14.55s
  vendor.js Content-Length: 1143178
  vendor.js time: 4.50s

Signed-off-by: Zack Cerza <zack@redhat.com>
2018-10-24 16:04:37 -06:00
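The before/after figures reported in commit 03de8f9557 above can be turned into relative savings with a few lines of arithmetic. This sketch only reuses the numbers quoted in the commit message; the names `MEASUREMENTS`, `percent_reduction` and `summarize` are illustrative and not part of the dashboard code.

```python
# (before, after) pairs copied from the commit message above.
MEASUREMENTS = {
    "health Content-Length (bytes)": (60748, 7505),
    "health time (ms)": (615, 92),
    "page load time (s)": (58.48, 14.55),
    "vendor.js Content-Length (bytes)": (6486025, 1143178),
    "vendor.js time (s)": (48.09, 4.50),
}


def percent_reduction(before, after):
    """Return how much smaller `after` is than `before`, in percent."""
    return 100.0 * (before - after) / before


def summarize(measurements):
    """Map each label to its percent reduction, rounded to one decimal."""
    return {
        label: round(percent_reduction(before, after), 1)
        for label, (before, after) in measurements.items()
    }


if __name__ == "__main__":
    for label, pct in summarize(MEASUREMENTS).items():
        print("{}: {}% smaller".format(label, pct))
```

By this calculation the health response body shrinks by about 88% and vendor.js by about 82%.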
Zack Cerza
bd09bc0462 DashboardTestCase: add assertHeaders()
Signed-off-by: Zack Cerza <zack@redhat.com>
2018-10-24 16:04:37 -06:00
Zack Cerza
b134972035 LocalCephManager.admin_socket: add timeout kwarg
This fixes "TypeError: admin_socket() got an unexpected keyword argument
'timeout'". The value is never used.

Signed-off-by: Zack Cerza <zack@redhat.com>
2018-10-24 16:04:37 -06:00
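Commit b134972035 above describes accepting a `timeout` keyword purely so that callers stop raising `TypeError`. The real method signature is not shown in this log, so the class and parameters below are a hypothetical stand-in that only illustrates the idea of an accepted-but-unused keyword argument.

```python
class LocalManagerSketch(object):
    """Hypothetical stand-in for a local test manager; not the real class."""

    def admin_socket(self, daemon_type, daemon_id, command, timeout=None):
        # `timeout` exists only for signature compatibility with callers
        # that pass it; as the commit message says, the value is never used.
        return (daemon_type, daemon_id, tuple(command))


if __name__ == "__main__":
    manager = LocalManagerSketch()
    # Both calls succeed and behave identically.
    print(manager.admin_socket("mds", "a", ["status"]))
    print(manager.admin_socket("mds", "a", ["status"], timeout=30))
```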
Jason Dillaman
484dc12089 qa/tasks/qemu: use unique clone directory to avoid race with workunit
If there is a workunit task associated with the same client, the two
tasks will attempt to clone the suite repo to the same directory.
Worse, if it's parallel tasks, the two clones will clobber each
other.

Fixes: http://tracker.ceph.com/issues/36542
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 5d56014c61)
2018-10-24 10:30:43 -04:00
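The race described in commit 484dc12089 above comes from two tasks deriving the same clone path for one client. The actual directory layout is not shown in this log; the sketch below uses hypothetical function names and path formats to show why adding a task-specific component removes the collision.

```python
import os


def shared_clone_dir(testdir, client):
    """Path that depends only on the client, so two tasks collide."""
    return os.path.join(testdir, "clone.{}".format(client))


def unique_clone_dir(testdir, task, client):
    """Path that also includes the task name, so tasks never collide."""
    return os.path.join(testdir, "{}_clone.{}".format(task, client))


if __name__ == "__main__":
    testdir, client = "/tmp/testdir", "client.0"
    print(shared_clone_dir(testdir, client))
    print(unique_clone_dir(testdir, "qemu", client))
    print(unique_clone_dir(testdir, "workunit", client))
```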
Patrick Donnelly
60c63f71f6
Merge PR #24533 into master
* refs/pull/24533/head:
	qa: add timeouts for remote ops for client mounts

Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-10-23 14:46:34 -07:00
xie xingguo
5bcac35213 mon/OSDMonitor: do not remove device class still referenced by ec-profiles
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-23 21:17:56 +08:00
xie xingguo
4bc54587a1 mon/OSDMonitor: make "ceph osd crush class rm" idempotent
Removing a non-existent device class should be generally okay.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-23 21:17:56 +08:00
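Idempotency, as used in commit 4bc54587a1 above, means that repeating the removal gives the same successful result as doing it once. The monitor code is C++ and is not shown here; this Python sketch with a hypothetical `remove_device_class` function only illustrates the property.

```python
def remove_device_class(classes, name):
    """Remove `name` from the set `classes`; succeed even if it is absent.

    Returns 0 in both cases, mimicking a command that reports success
    whether or not the class existed.
    """
    classes.discard(name)  # discard() does not raise for missing items
    return 0


if __name__ == "__main__":
    classes = {"hdd", "ssd"}
    print(remove_device_class(classes, "ssd"), sorted(classes))
    # A second removal of the same class is still a success.
    print(remove_device_class(classes, "ssd"), sorted(classes))
```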
Yan Jun
1e98c72dfc mon: drop repeated 'goodchars' and add osd crush ls testcase
Signed-off-by: Yan Jun <yan.jun8@zte.com.cn>
2018-10-23 16:32:45 +08:00
Mykola Golub
5dd0599bdf
Merge pull request #24696 from dillaman/wip-36542
qa/tasks/qemu: use unique clone directory to avoid race with workunit

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-10-23 09:41:43 +03:00
Sage Weil
a350131d16 Merge PR #24698 into master
* refs/pull/24698/head:
	Merge PR #24697 into nautilus
	ceph_test_msgr: fix authorizer behavior
	Merge pull request #24667 from liewegas/wip-ec-thrash-full
	Merge PR #24689 into nautilus
	qa/suites/rados/thrash-erasure-code*/thrashers/*: less likely resv rejection injection

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-10-22 22:46:23 -05:00
Jason Dillaman
5d56014c61 qa/tasks/qemu: use unique clone directory to avoid race with workunit
If there is a workunit task associated with the same client, the two
tasks will attempt to clone the suite repo to the same directory.
Worse, if it's parallel tasks, the two clones will clobber each
other.

Fixes: http://tracker.ceph.com/issues/36542
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-22 10:44:40 -04:00
Josh Durgin
36ca230776
Merge pull request #24667 from liewegas/wip-ec-thrash-full
qa/suites/rados/thrash-erasure-code*/thrashers/*: less likely resv rejection injection

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-10-22 07:39:26 -07:00
Kefu Chai
4af71e7c00
Merge pull request #23103 from ifed01/wip-ifed-bluefs-migrate
os/bluestore: allow ceph-bluestore-tool to coalesce, add and migrate BlueFS backing volumes

Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-22 22:33:08 +08:00
Sage Weil
ae583f5dde Merge PR #24689 into master
* refs/pull/24689/head:
	qa/tasks/ceph_manager: fix get_stuck_pgs from pg dump change
	Merge PR #24625 into nautilus
	qa/suites/rados/mgr/tasks/module_selftest: whitelist 'foo bar security'

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-10-22 09:19:46 -05:00
liuchang0812
7c008d279e tool/ceph_objectstore_tool: add new op that reset last_complete to last_update
Fixes: http://tracker.ceph.com/issues/19382

Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
2018-10-22 11:03:06 +08:00
Sage Weil
b678356594 qa/tasks/ceph_manager: fix get_stuck_pgs from pg dump change
Fixes 95b7d2340c

Fixes: http://tracker.ceph.com/issues/36485
Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-21 10:52:38 -05:00
Sage Weil
98fc7ebc99 Merge PR #24184 into master
* refs/pull/24184/head:
	mgr/DaemonServer: remove any upmaps on merging PGs
	mgr/DaemonServer: prevent merge if either pg is remapped|upmap
	mgr/DaemonServer: move pending merge check for more consistent code
	qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
	mgr/DaemonServer: add option to bypass careful throttling for thrasher
	PendingReleaseNotes: note about mgr/balancer/max_misplaced change
	mgr/DaemonServer: remove stale/misleading check
	mgr/DaemonServer: throttle pgp_num changes based on misplaced %
	mgr/DaemonServer: block pg_num decrease(merge) until pgp_num is reduced
	mgr/DaemonServer: adjust_pgs(): cosmetic change to debug output
	mon/PGMap: add get_recovery_stats()
	mgr/balancer: mgr/balancer/max_misplaced -> pg_max_misplaced
	pybind/mgr/mgr_module: add get_option()
	mgr/DaemonServer: allow pg_num increases that abort pending merges
	mon/OSDMonitor: resent pre-nautilus client ops on aborted merge
	mon/OSDMonitor: make pgp_num track pg_num more consistently

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-20 16:40:22 -05:00
Sage Weil
86ae8fb6b8 qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
Thrash such that we still exercise the careful throttling in the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Sage Weil
70ec5bda23 mgr/DaemonServer: add option to bypass careful throttling for thrasher
Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Sage Weil
c22d5a0fce qa/suites/rados/thrash-erasure-code*/thrashers/*: less likely resv rejection injection
For EC pools we have a lot of shards, and 30% probability on each one
means we are very likely to repeatedly fail backfill reservations, long
enough that teuthology gives up waiting.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-18 17:17:41 -05:00
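The reasoning in commit c22d5a0fce above can be checked with a short calculation. This sketch assumes that each shard's reservation is rejected independently with the same probability; the shard count of six is an illustrative choice, not a value taken from the commit.

```python
def p_any_rejection(p_reject, shards):
    """Probability that at least one of `shards` independent
    reservation attempts is rejected."""
    return 1.0 - (1.0 - p_reject) ** shards


if __name__ == "__main__":
    # Compare a single shard with a wider erasure-coded layout.
    for shards in (1, 3, 6):
        print(shards, round(p_any_rejection(0.3, shards), 3))
```

Under these assumptions, a 30% per-shard rejection rate makes a six-shard reservation fail about 88% of the time, which is consistent with the commit's concern about repeated failures.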
David Zafman
da3c556aa2 test: Make sure kill_daemons failure will be easy to find
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-10-17 16:54:45 -07:00
David Zafman
b33edbc4f6 test: Add flush_pg_stats to make test more deterministic
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-10-17 16:54:45 -07:00
Igor Fedotov
02b5768a4f tests: add qa test case for bluefs volume coalescence
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
2018-10-17 22:39:27 +03:00
Patrick Donnelly
092801ae34
qa: add timeouts for remote ops for client mounts
Fixes: https://tracker.ceph.com/issues/36390

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-17 10:39:13 -07:00
Patrick Donnelly
b1dca00f90
Merge PR #24503 into master
* refs/pull/24503/head:
	qa: increase timeout for cleanup

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2018-10-17 10:18:50 -07:00
Patrick Donnelly
6345c3f80d
Merge PR #24455 into master
* refs/pull/24455/head:
	qa: use timeout for fs asok operations

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2018-10-17 10:17:29 -07:00
Lenz Grimmer
a848953f28
Merge pull request #24617 from p-na/fix-python3-issue
mgr/dashboard: Fix Python3 issue

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2018-10-17 18:06:58 +02:00
Sage Weil
f8b6cbda34 Merge PR #24359 into master
* refs/pull/24359/head:
	qa/tests: update ansible version to 2.6 for master branch testing.
	qa/tests: use lvm as default for ceph-ansible testing, this should also work with raw devices

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2018-10-17 10:06:42 -05:00
Sage Weil
54d539d79a Merge PR #24603 into master
* refs/pull/24603/head:
	crush: get "ceph osd crush class create/rm" back

Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-17 10:06:26 -05:00
Sage Weil
b833d35d9b qa/suites/rados/mgr/tasks/module_selftest: whitelist 'foo bar security'
Avoid failures like

"2018-10-16 20:36:00.437153 mgr.y (mgr.25609) 6 : cluster [SEC] foo bar security" in cluster log

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-17 07:09:15 -05:00
Patrick Nawracay
5c0c122597 mgr/dashboard: Fix Python3 issue
Which results in a 500 error when trying to access the `Performance
Counter` tab on the OSD list.

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-17 14:02:12 +02:00
Sage Weil
05faeb4d12 Merge PR #24579 into master
* refs/pull/24579/head:
	qa/osd: fixup osd-rep-recov-eio.sh fails to parse pg dump

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-10-16 07:17:59 -05:00
John Spray
e6a26aeff7
Merge pull request #24597 from batrick/i36450
qa: fix run call args

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
2018-10-16 13:09:08 +01:00
Patrick Donnelly
6a4cc58a9d
Merge PR #24292 into master
* refs/pull/24292/head:
	qa: add test for rctime on root inode
	mds: set rctime on new system inode
	mds: small refactor

Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-10-15 21:31:04 -07:00
xie xingguo
d7ff33e9fd crush: get "ceph osd crush class create/rm" back
This reverts a27fd9d25c and
b863883ca7.

Quote from Sébastien Han:
> IIRC at some point, we were able to create a device class from the CLI.
Now it seems that the device class gets created when at least one OSD
of a particular class starts.
In ceph-ansible, we create pools after the initial monitors are up and
we want to assign a device crush class on some of them.
That's not possible at the moment since there no device class available yet.
Also, someone might want to create its own device class.
Something as crazy as running Filestore with a tmpfs osd store and
might want to isolate them.
I know it's a very limited use case, but still, it could be desired.

See also https://www.spinics.net/lists/ceph-devel/msg41152.html

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-16 08:45:49 +08:00
Patrick Donnelly
d491227956
qa: fix run call args
Fixes: http://tracker.ceph.com/issues/36450
Introduced-by: 95746ecce9
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-15 14:45:18 -07:00
huanwen ren
f1219d716d qa/osd: fixup osd-rep-recov-eio.sh fails to parse pg dump
Fixes: http://tracker.ceph.com/issues/36418
Signed-off-by: huanwen ren <ren.huanwen@zte.com.cn>
2018-10-16 02:18:22 +08:00
Sage Weil
7ac6ab4b2f Merge PR #24494 into master
* refs/pull/24494/head:
	ceph-kvstore-tool: rename repair -> destructive-repair

Reviewed-by: Neha Ojha <nojha@redhat.com>
2018-10-14 13:11:11 -05:00
Sage Weil
8cc6369511 ceph-kvstore-tool: rename repair -> destructive-repair
This is shown to corrupt otherwise healthy rocksdb databases.  Rename to
make it clear that it is generally not safe to run and should only be used
as a last resort.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-14 11:41:24 -05:00
Mykola Golub
1d92788f71
Merge pull request #24563 from dillaman/wip-36410
test: move OpenStack devstack test to rocky release

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-10-14 10:40:05 +03:00
Sage Weil
9db328f2ab Merge PR #24204 into master
* refs/pull/24204/head:
	qa/suites/rgw/tempest: valgrind on centos only

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2018-10-12 16:29:44 -05:00
Jason Dillaman
1e3dc02604 qa/tasks/workunit: use suite branch/SHA1 when cloning workunits
Right now it's using the Ceph branch/SHA1 but it's using the suite
Git URL.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-12 12:41:58 -04:00
Jason Dillaman
2a1dca3fca qa/workunits/rbd: switch devstack to rocky branch and tempest to 19.0.0 tag
Fixes: http://tracker.ceph.com/issues/36410
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-12 08:40:44 -04:00
Jason Dillaman
ba32395165
Merge pull request #24550 from dillaman/wip-36409
qa/workunits: replace 'realpath' with 'readlink -f' in fsstress.sh

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-12 08:39:37 -04:00
Jason Dillaman
c1f950236b qa/tasks: qemu task now uses a relative path in suite repo for test
This makes it easier to re-run tests against a suite branch without
requiring a full ceph-ci build and repo.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 20:23:39 -04:00
Jason Dillaman
ddb7e5eb80 qa/workunits: replace 'realpath' with 'readlink -f' in fsstress.sh
Fixes: http://tracker.ceph.com/issues/36409
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 15:25:20 -04:00
Jason Dillaman
c60ffac61e
Merge pull request #24428 from trociny/wip-rbd-config-global
rbd: add 'config global' command to get/store overrides in mon config db

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 10:50:33 -04:00
Patrick Donnelly
d03a8302b5
Merge PR #23069 into master
* refs/pull/23069/head:
	tests/libcephfs: add simple reclaim test
	mds: check auth name before reclaiming session
	mds: reclaim session before allowing mds to become active
	mds: allow client to specify its session timeout
	mds: initial code for client states reclaim

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-10 15:29:41 -07:00
Mykola Golub
c552f166c3
Merge pull request #24476 from dillaman/wip-35944
librbd: add new configuration option to always move deleted items to the trash

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-10-10 18:40:14 +03:00
Jason Dillaman
a6a2f395b7
Merge pull request #24512 from trociny/wip-qa-namespace-create_args
qa/workunits/rbd: force v2 image format for namespace test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-10 10:42:08 -04:00
Mykola Golub
b3e4d43c15 rbd: add 'config global' command to get/store overrides in mon config db
Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-10 17:31:32 +03:00
Jason Dillaman
97950fec27 qa/workunits/rbd: test new move-to-trash on image remove config override
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-10 09:08:03 -04:00
John Spray
67d147c00d
Merge pull request #23622 from renhwztetecs/renhw-wip-25103
mgr: fixup pgs show in unknown state

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 13:28:33 +01:00
John Spray
5a39f0a9aa
Merge pull request #23970 from noahdesu/mgr-always-on-mods-v2
mgr: always on modules v2

Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 12:17:41 +01:00
John Spray
17778d217c
Merge pull request #24239 from votdev/python_ceph_audit_log
mgr: Add ability to trigger a cluster/audit log message from Python

Reviewed-by: John Spray <john.spray@redhat.com>
2018-10-10 11:47:49 +01:00
Mykola Golub
24e1a6bd47 qa/workunits/rbd: force v2 image format for namespace test
While here, fix test_locking too.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-10 10:50:30 +03:00
Patrick Donnelly
dc6e7eadf8
qa: increase timeout for cleanup
Apparently 15m is not long enough for some workunits like fsstress.

Fixes: http://tracker.ceph.com/issues/36365

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-09 13:31:26 -07:00
Ricardo Marques
13af4b52f7
Merge pull request #21614 from Devp00l/pool-creation-form
mgr/dashboard: Pool management

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-10-09 17:29:09 +01:00
Stephan Müller
8e56421bcd mgr/dashboard: Update PG update test
It is now commented out like it was before,
but I've added a comment on what happened during this test with the QA
system. The problem was that even with an increase of only 1 PG the QA
cluster went into a cluster warning state and did not recover in time.
The QA coverage timeout is 2 minutes.

I could not reproduce this behavior with a local cluster, but I've
added a loop to wait until pgp and pg number are equal and the cluster
is in a healthy state again. Locally, this can take about 5 seconds.
The internal loop has a timeout of 3 minutes.

Fixes: https://tracker.ceph.com/issues/36362
Signed-off-by: Stephan Müller <smueller@suse.com>
2018-10-09 15:56:54 +02:00
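The wait loop described in commit 8e56421bcd above (poll until a condition holds, give up after 3 minutes) follows a common pattern. The test code itself is not shown in this log, so `wait_until` and its parameters are hypothetical; only the 180 second default mirrors the timeout mentioned in the message.

```python
import time


def wait_until(predicate, timeout=180.0, interval=1.0):
    """Poll `predicate` until it returns True or `timeout` seconds pass.

    Returns True on success and False if the deadline is reached first.
    """
    deadline = time.monotonic() + timeout
    while True:
        if predicate():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


if __name__ == "__main__":
    state = {"pg_num": 9, "pgp_num": 8}

    def pgs_settled():
        # Pretend the cluster catches up on the second check.
        settled = state["pg_num"] == state["pgp_num"]
        state["pgp_num"] = state["pg_num"]
        return settled

    print(wait_until(pgs_settled, timeout=5.0, interval=0.1))
```

Checking the predicate before the deadline test guarantees at least one evaluation even with a zero timeout.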
Stephan Müller
c98c8055c2 mgr/dashboard: Unset compression arguments for pools
The dashboard backend can now unset all set compression arguments if the
compression mode is switched to 'unset'. In the case of 'unset' Ceph
itself will only delete the 'compression_mode' argument, not all other
set arguments. The other arguments that should be removed, too, are
added to the update arguments in order to delete all set arguments.

Fixes: https://tracker.ceph.com/issues/36355
Signed-off-by: Stephan Müller <smueller@suse.com>
2018-10-09 15:56:47 +02:00
Tatjana Dehler
18d91e2419 mgr/dashboard: add backend method to create config options
Fixes: http://tracker.ceph.com/issues/24455

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2018-10-09 14:50:00 +02:00
Tatjana Dehler
2a630be872 mgr/dashboard: refactor '_get_mon_allow_pool_delete_config'
Refactor '_get_mon_allow_pool_delete_config' method to be a little bit
more general. The method can now be used to get the value of every
config option known to the cluster.

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2018-10-09 14:47:24 +02:00
Jason Dillaman
82a98163af
Merge pull request #24457 from trociny/wip-qa-cli-namespace
qa/workunits/rbd: fix cli generic namespace test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-08 15:15:30 -04:00
Patrick Nawracay
a4956b40b2 mgr/dashboard: Fix errors when clicking on new OSD
Fixes: http://tracker.ceph.com/issues/36245

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-08 10:45:58 +02:00
Patrick Nawracay
0f0de0ce2a mgr/dashboard: Add type hints to QA helper
Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-10-08 10:42:14 +02:00
Mykola Golub
f2d72d46d4 qa/workunits/rbd: fix cli generic namespace test
expect_fail incorrectly unset the '-e' option, so if a subsequent test
failed it did not abort the execution. As a result, two typos in the
namespace tests went undetected.

Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-10-07 15:09:25 +03:00
Patrick Donnelly
296d89b046
qa: use timeout for fs asok operations
Otherwise a bug preventing an asok operation from completing will cause the
entire job to fail.

Fixes: http://tracker.ceph.com/issues/36335

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-06 10:32:50 -07:00
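Commit 296d89b046 above bounds how long an operation may run so that a hung command fails fast instead of stalling the whole job. The QA framework's own mechanism is not shown in this log; the sketch below uses the standard library's `subprocess.run` timeout as a stand-in, and `run_with_timeout` is a hypothetical helper.

```python
import subprocess
import sys


def run_with_timeout(args, timeout):
    """Run a command; return its exit code, or None if it timed out."""
    try:
        proc = subprocess.run(
            args,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising TimeoutExpired.
        return None
    return proc.returncode


if __name__ == "__main__":
    quick = [sys.executable, "-c", "pass"]
    stuck = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_with_timeout(quick, timeout=30))
    print(run_with_timeout(stuck, timeout=0.5))
```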
Jeff Layton
9bca82d00e tests/libcephfs: add simple reclaim test
Signed-off-by: Jeff Layton <jlayton@redhat.com>
2018-10-05 10:18:28 -04:00
Volker Theile
34525ba3af Relocate cluster_log(). Only active modules can use it.
Signed-off-by: Volker Theile <vtheile@suse.com>
2018-10-05 14:46:58 +02:00
Volker Theile
95746ecce9 mgr: Add ability to trigger a cluster/audit log message from Python
Fixes: https://tracker.ceph.com/issues/36194

Signed-off-by: Volker Theile <vtheile@suse.com>
2018-10-04 13:33:18 +02:00
Ricardo Dias
43a72f0e49
Merge pull request #24350 from ricardoasmarques/wip-json-decode
mgr/dashboard: Check content-type before decode json response

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Ricardo Dias <rdias@suse.com>
2018-10-04 10:08:37 +01:00
Patrick Donnelly
ddbc3016a5
qa: add test for rctime on root inode
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-03 10:10:47 -07:00
Nathan Cutler
b6c13e8ee6 qa/distros: add openSUSE Leap 42.3 and 15.0
References: https://tracker.ceph.com/issues/35927
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-10-02 21:36:49 +02:00
Ilya Dryomov
ccf5f07a0d
Merge pull request #24354 from idryomov/wip-krbd-kfsx-fancy-striping
qa/suites/krbd: more fsx tests

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-02 09:32:57 +02:00
Kefu Chai
7e90989214
Merge pull request #24291 from tchaikov/wip-23964
erasure-code: add clay codes

Reviewed-by: Myna Vajha <mynaramana@gmail.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2018-10-02 10:48:21 +08:00
Vasu Kulkarni
4c371daf99 qa/tests: update ansible version to 2.6 for master branch testing.
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2018-10-01 15:10:49 -07:00
Noah Watkins
e2341ad331 qa/mgr: avoid enabling always on modules
When enabling a module, attempt to determine if it is an always-on module
and, if it is, return without waiting on the active manager daemon to
restart, which it won't do for an always-on module.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-10-01 14:34:27 -07:00
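The control flow described in commit e2341ad331 above can be sketched as an early return. The real QA helper and the way it detects always-on modules are not shown in this log, so every name below (`enable_module`, `always_on`, `enable`, `wait_for_restart`) is a hypothetical placeholder.

```python
def enable_module(name, always_on, enable, wait_for_restart):
    """Enable a manager module unless it is always on.

    Returns False when nothing was done (the module is always on, so no
    restart will happen), True after enabling and waiting for the restart.
    """
    if name in always_on:
        return False
    enable(name)
    wait_for_restart()
    return True


if __name__ == "__main__":
    events = []
    always_on = {"balancer"}  # illustrative value only
    for module in ("balancer", "selftest"):
        did_enable = enable_module(
            module,
            always_on,
            enable=lambda n: events.append("enable " + n),
            wait_for_restart=lambda: events.append("wait"),
        )
        print(module, did_enable)
    print(events)
```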
Noah Watkins
bca519a1be qa/mgr: do not disable always on module
Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-10-01 14:06:42 -07:00
Dan Mick
38de3d568b
Merge pull request #24340 from tchaikov/wip-36260
qa/workunits/mon/test_mon_config_key.py: bump up the size limit
2018-10-01 13:04:58 -07:00
Patrick Donnelly
bae625e666
Merge PR #21566 into master
* refs/pull/21566/head:
	test: add test for mds drop cache command
	mds: command to trim mds cache and client caps
	mds: implement journal flush as asynchronous context execution
	mds: cleanup some asok commands

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-10-01 12:48:16 -07:00
Vasu Kulkarni
b0fc647e3f qa/tests: use lvm as default for ceph-ansible testing, this should also work with raw devices
and using the batch mode should auto select nvme/ssd for journal.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2018-10-01 12:05:50 -07:00
Patrick Donnelly
b07a184859
Merge PR #24044 into master
* refs/pull/24044/head:
	qa: tolerate failed rank while waiting for state

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2018-10-01 10:43:35 -07:00
Ilya Dryomov
870e42ac6a qa/suites/krbd: more fsx tests
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-10-01 16:48:47 +02:00
Ricardo Marques
731dcefbeb mgr/dashboard: Check content-type before decode json response
Signed-off-by: Ricardo Marques <rimarques@suse.com>
2018-10-01 11:47:38 +01:00
Patrick Donnelly
3a10d74f3a
qa: add timeout to cleaning up workunit sandbox
If there is a bug preventing rm from completing, the workunit will get stuck.

Fixes: http://tracker.ceph.com/issues/36184
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-30 18:12:19 -07:00
Patrick Donnelly
7a64eb9dfb
qa: add timeout to kclient umount
Otherwise QA sits forever waiting for the kclient to umount when there is a
problem.

Fixes: http://tracker.ceph.com/issues/36184
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-30 18:12:15 -07:00
Patrick Donnelly
d4b8f94cf8
qa: do not cleanup sandbox on error
Otherwise the command will hang if the mount is broken.

Fixes: http://tracker.ceph.com/issues/36184
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-30 18:12:11 -07:00
Patrick Donnelly
bdd2ddcfd8
qa: use default timeout in fs workunits
Six hours is unnecessarily long.

Fixes: http://tracker.ceph.com/issues/36184
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-30 18:10:05 -07:00
Kefu Chai
b35f6a533a qa/workunits/mon/test_mon_config_key.py: bump up the size limit
In b38b8e980c, we changed the upper limit on the size of a
`config key` value to 64k, so we need to update
the test accordingly.

Fixes: http://tracker.ceph.com/issues/36260
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-09-30 16:04:04 +08:00
Venky Shankar
43d1b8e174 test: add test for mds drop cache command
Fixes: http://tracker.ceph.com/issues/23362
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2018-09-28 00:07:59 -04:00
huanwen ren
e5096b41ca common: adapt to the new pg dump format.
Signed-off-by: huanwen ren <ren.huanwen@zte.com.cn>
2018-09-27 23:22:50 +08:00
huanwen ren
ed442447c0 qa: modify the format for add pgmap_ready.
Signed-off-by: huanwen ren <ren.huanwen@zte.com.cn>
2018-09-27 23:22:50 +08:00
Sage Weil
bc86b06a67 qa/suites/rados/thrash-erasure-code: add clay 4+2
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-27 18:56:21 +08:00
Lenz Grimmer
75c1d8eacd
Merge pull request #23351 from tspmelo/wip-rbd-trash
mgr/dashboard: Add support for RBD Trash

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-09-27 11:04:46 +02:00
Nathan Cutler
a38af571c2
Merge pull request #23131 from smithfarm/wip-15100
qa: add test that builds example librados programs

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Jesse Williamson <jwilliamson@suse.de>
2018-09-27 00:17:48 +02:00
Mykola Golub
78881934c0
Merge pull request #22043 from dillaman/wip-24139
rbd-mirror: bootstrap needs to handle local image id collision

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-09-26 22:49:02 +03:00
Casey Bodley
8622d8a241
Merge pull request #24011 from cbodley/wip-qa-rgw-multisite-speedup
qa/rgw: reduce number of multisite log shards

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2018-09-25 16:39:51 -04:00
Jason Dillaman
5c317aef31 qa/tasks/rbd_mirror_thrash: let daemon gracefully shut down if possible
Otherwise, try to capture a core dump to discover what was blocking the
shutdown process.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-25 16:08:59 -04:00
Jason Dillaman
dca9e3e252 qa/workunits/rbd: wait max 2 hrs for all stress images to sync
Sporadically the rbd-mirror fsx stress test would fail due to very
slow sync times due to overloaded clusters. Attempt to wait for all
images to be replicated before proceeding with the comparison.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-25 16:08:59 -04:00
Jason Dillaman
d04a7679c0 qa/workunits/rbd: exclude rbd-mirror sync-point snaps from comparison
This is a temporary workaround for tracker issue #36185

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-25 16:08:25 -04:00
Patrick Donnelly
7bcab914bb
Merge PR #24252 into master
* refs/pull/24252/head:
	qa: cleanup workunit dir for each unit
	qa: use sudo to cleanup workspace

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-25 07:36:16 -07:00
Tiago Melo
6bb951b1bf mgr/dashboard: Add RBD Trash endpoints
Fixes: http://tracker.ceph.com/issues/24272

Signed-off-by: Tiago Melo <tmelo@suse.com>
2018-09-25 14:02:58 +01:00
Jason Dillaman
27832e2781 qa/workunits/rbd: image compare should print byte offset of any deltas
This will assist in debugging any mirroring issues.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-25 08:45:08 -04:00
Nathan Cutler
c46c890d02 qa: add test that builds example librados programs
Fixes: http://tracker.ceph.com/issues/15100
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-09-25 13:18:04 +02:00
Patrick Donnelly
0dd623c8aa
Merge PR #23187 into master
* refs/pull/23187/head:
	test: make rank argument mandatory when running journal_tool
	cephfs-journal-tool: make "--rank" argument mandatory
	cephfs-journal-tool: pass local arg vector for Journal actions
	cephfs-journal-tool: dump to per rank output file wherever necessary

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-24 14:46:14 -07:00
Patrick Donnelly
51b27b7e3d
Merge PR #23530 into master
* refs/pull/23530/head:
	qa/vstart_runner: fix daemons list
	PendingReleaseNotes: note multifs support in libcephfs
	test/cephfs: add pybind test for mount_root
	pybind/cephfs: enable passing filesystem name to mount
	libcephfs: add ceph_select_filesystem
	common: add doc strings to client_mds_namespace
	client: allow passing fs name to mount()

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

Conflicts:
    PendingReleaseNotes
2018-09-24 14:42:13 -07:00
Patrick Donnelly
c0ba741a6e
Merge PR #24158 into master
* refs/pull/24158/head:
	cephfs: remove vestiges of mds deactivate

Reviewed-by: Douglas Fuller <dfuller@redhat.com>
2018-09-24 14:39:45 -07:00
Sage Weil
0ae5e9a480 Merge PR #24234 into master
* refs/pull/24234/head:
	qa/suites/rados/thrash-old-clients/workloads/rbd_cls.yaml: skip parents test

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-24 15:15:52 -05:00
Jason Dillaman
3e8f16b484 qa/suites/rbd: increase librbd debug level for mirror-thrash
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-24 15:12:26 -04:00
Patrick Donnelly
70844f3f55
qa: cleanup workunit dir for each unit
This was wrongly dropped and moved to the finalizer.

Introduced-by: de824f74dd
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-24 11:29:10 -07:00
Patrick Donnelly
1eaf78a754
qa: use sudo to cleanup workspace
Files in scratch_tmp may not be owned by ubuntu.

Fixes: http://tracker.ceph.com/issues/36165
Introduced-by: de824f74dd
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-24 11:02:49 -07:00
Sage Weil
757a94b980 qa/suites/rados/thrash-old-clients/workloads/rbd_cls.yaml: skip parents test
We can't (easily) build updated hammer packages, but all this sh script does
is run this one test binary with --gtest_filter arguments, so just do
it directly and skip the test explicitly here.  (Newer versions of the .sh
understand the environment variable but the hammer version does not.)

Fixes: http://tracker.ceph.com/issues/36104
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-24 10:57:40 -05:00
Jason Dillaman
762a7ae386
Merge pull request #23743 from trociny/wip-rbd-config-pool
librbd: pool and image level config overrides

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-24 10:19:01 -04:00
Lenz Grimmer
4d3f896c89
Merge pull request #23568 from rhcs-dashboard/wip-24573-landing-page
mgr/dashboard: New Landing Page

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Kanika Murarka <kmurarka@redhat.com>
Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-09-24 15:32:48 +02:00
Patrick Donnelly
9e9f3ead36
Merge PR #24157 into master
* refs/pull/24157/head:
	qa: cleanup parallel execution of fsstress

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-21 13:01:14 -07:00
Patrick Donnelly
de824f74dd
qa: cleanup parallel execution of fsstress
Two instances of fsstress clobber each other. Just build it in the local sandbox.

Fixes: http://tracker.ceph.com/issues/24177

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-21 11:34:20 -07:00
Laura Paduano
3db50d5aca
Merge pull request #24219 from p-na/fix-test-scrub
mgr/dashboard: Possible fix for some dashboard timing issues

Reviewed-by: Volker Theile <vtheile@suse.com>
Reviewed-by: Ricardo Dias <rdias@suse.com>
2018-09-21 17:24:32 +02:00
Patrick Nawracay
8daffe86a6 mgr/dashboard: Fix for some dashboard timing issues
Specifically fixes the recurring `test_osd.py` error in the
`test_scrub` method. This change should also prevent other issues of
the same kind, that is, issues which occur when tests do not
immediately result in a clean cluster status and aren't manually
programmed to wait for it.

Fixes: http://tracker.ceph.com/issues/36107

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-21 16:01:24 +02:00
Sage Weil
9bf7c810a7 Merge PR #23985 into master
* refs/pull/23985/head:
	ceph-objectstore-tool: add back pool dne check
	qa/suites/rados/singleton/reg11184: remove old test
	ceph-objectstore-tool: import pg at original epoch
	osd: handle null pg slot on startup
	ceph-objectstore-tool: drop support for ancient export files
	osd: avoid dropping osd_lock when pg osdmaps are not laggy
	qa/standalone/osd/pg-merge.sh: add merge vs pg import test

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-09-21 08:21:53 -05:00
Venky Shankar
f65193d955 test: make rank argument mandatory when running journal_tool
Also, fix a bunch of quirky journal_tool invocations that pass
"--rank" argument as the command argument rather than passing it
as function argument.
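
A minimal sketch of the idea, not the actual qa helper: if the rank is a required function argument, no caller can omit it, and the helper becomes the only place that builds the `--rank` option. The function name `journal_tool_argv`, the default file system name, and the `<fs_name>:<rank>` value format are assumptions made for illustration.

```python
from typing import List


def journal_tool_argv(args: List[str], rank: int, fs_name: str = "cephfs") -> List[str]:
    """Build a cephfs-journal-tool command line with a mandatory rank.

    Hypothetical helper: the name and the "<fs_name>:<rank>" format are
    assumptions for this sketch.
    """
    # Reject the old calling style where "--rank" was smuggled in via args.
    if any(a.startswith("--rank") for a in args):
        raise ValueError("pass the rank as the 'rank' argument, not inside 'args'")
    return ["cephfs-journal-tool", f"--rank={fs_name}:{rank}"] + list(args)


print(journal_tool_argv(["journal", "inspect"], rank=0))
```

Rejecting `--rank` inside `args` makes the "quirky" invocations fail loudly instead of silently producing a duplicated option.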

Fixes: https://tracker.ceph.com/issues/24780
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2018-09-21 06:09:39 -04:00
Kefu Chai
f3bc838894
Merge pull request #24139 from tchaikov/wip-fix-typos
*/: fix typos in docs,messages,logs,comments

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Alfredo Deza <adeza@redhat.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Ricardo Dias <rdias@suse.com>
2018-09-21 16:56:31 +08:00
Kefu Chai
4b0e2c8ed4 qa: fix typos
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-09-21 12:41:42 +08:00
Sage Weil
04e608e1a2 qa/suites/rgw/tempest: valgrind on centos only
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-20 13:00:47 -05:00
Sage Weil
4e5f2bb596 qa/suites/rados/singleton/reg11184: remove old test
This bug was about filtering missing and divergent when doing a partial
PG import.  We don't support partial PG imports any more, so this can
go away!

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-20 12:58:00 -05:00
Sage Weil
26cb966cab ceph-objectstore-tool: import pg at original epoch
- In the jewel era, we fast-forwarded the PG to the OSD's latest epoch
and cleared past_intervals.

- In mimic, as of 2347ecb961, we brought the
PG up to date while updating past_intervals.  (At the same time we removed
the OSD's parallel past_intervals regeneration.)

The problem is that the tool then has to reimplement the past_intervals
update logic, and *also* has to cope with splits and merges.  Splits are
somewhat easier (until now we enable partial import of a PG into a split
child), but merges are not so easy.

This patch changes it so we import the PG and leave the pg_epoch matching
the import file.  The OSD is then responsible for bringing it up to date
with the latest map, and dealing with any intervening splits or merges.

We also adjust the safety check to ensure that we don't collide with
any existing PG, either a child we eventually split into, or a parent
we eventually merge into.

Fixes: http://tracker.ceph.com/issues/35955
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-20 12:58:00 -05:00
Patrick Donnelly
3e3b69dbde
Merge PR #24156 into master
* refs/pull/24156/head:
	qa/tests: removed knfs suite

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-20 09:31:50 -07:00
Patrick Donnelly
a14716a533
Merge PR #24153 into master
* refs/pull/24153/head:
	qa/tests: removed knfs from crontab

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-20 09:31:15 -07:00
Patrick Donnelly
e606e21746
Merge PR #24185 into master
* refs/pull/24185/head:
	qa: install build dependencies for cfuse_workunit_kernel_untar_build.yaml

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-20 08:02:48 -07:00
Mykola Golub
ecdf8cd6b2 rbd: add config commands
Signed-off-by: Mykola Golub <mgolub@suse.com>
2018-09-20 10:35:18 +03:00
Patrick Donnelly
f113fa80a9
cephfs: remove vestiges of mds deactivate
Fixes: http://tracker.ceph.com/issues/24001

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-19 15:23:38 -07:00
Neha Ojha
38ef3da8d2 qa: install build dependencies for cfuse_workunit_kernel_untar_build.yaml
Fixes: https://tracker.ceph.com/issues/36076
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-09-19 15:22:16 -07:00
Mykola Golub
ce6e35a81e
Merge pull request #23662 from dillaman/wip-24412
librbd: support v2 cloning across namespaces

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: songweibin <song.weibin@zte.com.cn>
2018-09-19 21:20:58 +03:00
Casey Bodley
bd6d3f61e1
Merge pull request #24059 from cbodley/wip-rgw-opstate-rm
rgw, cls: remove cls_statelog and rgw opstate tracking

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2018-09-19 14:01:04 -04:00
Yuri Weinstein
eaca188733 qa/tests: removed knfs suite
Fixes: http://tracker.ceph.com/issues/36075
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-09-19 09:36:34 -07:00
Sage Weil
fe14970768 Merge PR #24143 into master
* refs/pull/24143/head:
	qa/workunits/cephtool/test_kvstore_tool.sh: run test in ., not /tmp

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2018-09-19 10:41:04 -05:00
Lenz Grimmer
01c2ea52c1
Merge pull request #24128 from ricardoasmarques/wip-optional-user-password
mgr/dashboard: User password should be optional

Reviewed-by: Ricardo Dias <rdias@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
2018-09-19 16:44:51 +02:00
Casey Bodley
d897b92878 osd: remove statelog from osd_class_load_list config
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-09-19 10:32:55 -04:00
Ricardo Marques
49ba4a7b84 mgr/dashboard: User password should be optional
Fixes: https://tracker.ceph.com/issues/36031

Signed-off-by: Ricardo Marques <rimarques@suse.com>
2018-09-19 13:49:22 +01:00
Jason Dillaman
d4c7ce3c0a qa/workunits/rbd: test clone v2 across namespaces
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-19 08:05:29 -04:00
alfonsomthd
f9fbc25f60 mgr/dashboard: New Landing Page
Updated integration tests to check data from new python code

Fixes: https://tracker.ceph.com/issues/24573

Signed-off-by: Alfonso Martínez <almartin@redhat.com>
2018-09-19 08:54:41 +02:00
Yuri Weinstein
765521b665 qa/tests: removed knfs from crontab
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-09-18 14:44:59 -07:00
Sage Weil
4d905d5d29 qa/workunits/mon/caps.sh: wait longer to time out
5s -> 5m to give us more leeway for when the mons are thrashing.

Also, *only* set this timeout when we expect a timeout.  If we don't,
wait forever.
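
The policy described above can be sketched in Python (the real workunit is a shell script, so this is only an illustration under that assumption). The helper name `run_check` and its parameters are hypothetical; the point is that a time limit is applied only when a hang is the expected outcome.

```python
import subprocess
import sys
from typing import List, Optional


def run_check(cmd: List[str], expect_timeout: bool, timeout_s: float = 300.0) -> str:
    """Run cmd; bound the wait only when a timeout is the expected result."""
    # timeout=None means subprocess.run waits indefinitely.
    limit: Optional[float] = timeout_s if expect_timeout else None
    try:
        subprocess.run(cmd, check=True, timeout=limit)
    except subprocess.TimeoutExpired:
        return "timed out"
    return "completed"


# A command that is expected to succeed gets no time limit at all.
print(run_check([sys.executable, "-c", "pass"], expect_timeout=False))
```

With this shape, a slow but eventually successful command is never misreported as a failure, while the expected-timeout case still finishes in bounded time.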

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-18 08:57:25 -05:00
Sage Weil
5940487be3 qa/workunits/cephtool/test_kvstore_tool.sh: run test in ., not /tmp
Notably, we can't make a bluestore store on a tmpfs.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-18 07:39:05 -05:00
Sage Weil
5cd067b16b Merge PR #24104 into master
* refs/pull/24104/head:
	mgr/diskprediction: add prototype diskprediction module

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-18 07:08:05 -05:00
John Spray
da5b25db64 qa/vstart_runner: fix daemons list
This was missing a cluster name prefix that
was added at some point, and consequently
calls to iter_daemons_of_role were returning
no daemons.

This was causing e.g. TestVolumeClient.test_data_isolated
to fail when run in vstart_runner.
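
A small sketch of how a missing prefix can make a lookup return nothing. The `DaemonRegistry` class and its key layout are hypothetical and only mimic the behaviour described here; only the name `iter_daemons_of_role` comes from the commit message.

```python
from typing import Dict, Iterator


class DaemonRegistry:
    """Toy registry whose lookups expect keys of the form '<cluster>.<role>'."""

    def __init__(self, cluster: str = "ceph") -> None:
        self.cluster = cluster
        self._by_role: Dict[str, Dict[str, object]] = {}

    def register(self, role_key: str, daemon_id: str, daemon: object) -> None:
        self._by_role.setdefault(role_key, {})[daemon_id] = daemon

    def iter_daemons_of_role(self, role: str) -> Iterator[object]:
        # The lookup always adds the cluster prefix.
        key = f"{self.cluster}.{role}"
        return iter(self._by_role.get(key, {}).values())


broken = DaemonRegistry()
broken.register("mds", "a", "mds.a")        # registered without the prefix
fixed = DaemonRegistry()
fixed.register("ceph.mds", "a", "mds.a")    # registered with the prefix

print(list(broken.iter_daemons_of_role("mds")))  # []
print(list(fixed.iter_daemons_of_role("mds")))   # ['mds.a']
```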

Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-17 14:00:36 -04:00
Sage Weil
da887c82ce qa/standalone/osd/pg-merge.sh: add merge vs pg import test
- You can't import the source half of a PG that's since merged.  Sorry!  We
could implement this later.
- You can import the target half, but the result will then be incomplete,
and you rely on backfill to clean it up.
- Map gaps don't affect this behavior.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-17 12:52:46 -05:00
Kefu Chai
338612ad88
Merge pull request #24088 from dzafman/wip-35982
qa/standalone: Standalone test corrections

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-09-17 22:35:43 +08:00
Rick Chen
4abb79f159 mgr/diskprediction: add prototype diskprediction module
This module is written by Rick Chen <rick.chen@prophetstor.com> and
provides both a built-in local predictor and a cloud mode that queries
a cloud service (provided by ProphetStor) to predict device failures.

Signed-off-by: Rick Chen <rick.chen@prophetstor.com>
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-17 08:20:57 -05:00
Kefu Chai
f46523e464
Merge pull request #23955 from wjwithagen/wjw-fix-ceph-helpers.sh
test: Start using GNU awk and fix archiving directory

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-09-17 15:44:06 +08:00
Patrick Donnelly
ea43ce7aae
Merge PR #21992 into master
* refs/pull/21992/head:
	qa: extend timeout for SessionMap flush

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-16 14:30:28 -07:00
Mykola Golub
a68be2b452
Merge pull request #24036 from dillaman/wip-24688
rbd-mirror: optionally extract peer secrets from config-key

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-09-16 11:53:04 +03:00
Cleber Rosa
f0f6e46084 qa/btrfs/test_rmdir_async_snap: remove binary file
On a quick look at the source code, I noticed this binary file, which
looks like it was committed by mistake.

Signed-off-by: Cleber Rosa <crosa@redhat.com>
2018-09-14 13:26:32 -04:00
David Zafman
ef6940fbb6 test: osd-backfill-stats.sh: Fix subtests to get primary which can change
Fixes: http://tracker.ceph.com/issues/35982

Signed-off-by: David Zafman <dzafman@redhat.com>
2018-09-13 13:19:23 -07:00
David Zafman
6d53e2c380 test: Fix for error message changed in ceph-objectstore-tool
Fixes: http://tracker.ceph.com/issues/35982

Caused by: 6bd682f53d

Signed-off-by: David Zafman <dzafman@redhat.com>
2018-09-13 13:19:11 -07:00
Mykola Golub
ad9fc49776
Merge pull request #23896 from Songweibin/wip-group-snap-rollback
rbd: add group snap rollback method

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-09-13 19:35:10 +03:00
Jason Dillaman
d3a1a831ac qa/workunits/rbd: new test for rbd-mirror peer config-key secrets
Fixes: http://tracker.ceph.com/issues/24688
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-13 09:58:34 -04:00
Kefu Chai
59e6fbbe70
Merge pull request #24024 from p-na/fix-flaky-qa-tests
mgr/dashboard: Fix flaky QA tests

Reviewed-by: Ricardo Dias <rdias@suse.com>
2018-09-13 21:52:35 +08:00
David Zafman
7f83a24553
Merge pull request #24018 from dzafman/wip-35912
qa/standalone: Minor test improvements

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-09-12 13:15:44 -07:00
Patrick Donnelly
644a535ee6
qa: extend timeout for SessionMap flush
Fixes: https://tracker.ceph.com/issues/24129

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-12 12:52:00 -07:00
Sage Weil
4591f134c8 Merge PR #24025 into master
* refs/pull/24025/head:
	mon/OSDMonitor: reject pg_num -> 0

Reviewed-by: huangjun <huangjun@xsky.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-09-12 11:51:58 -05:00
songweibin
dec2ea83cf tools/rbd: support group snap rollback CLI
Signed-off-by: songweibin <song.weibin@zte.com.cn>
2018-09-12 23:42:15 +08:00
Kefu Chai
1578875194
Merge pull request #24013 from dzafman/wip-35845
test: Use a grep pattern that works across releases

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-09-12 23:00:39 +08:00
Patrick Donnelly
e52976e64d
qa: tolerate failed rank while waiting for state
Fixes: http://tracker.ceph.com/issues/35828

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-12 06:37:13 -07:00
Patrick Nawracay
079cfeae4d mgr/dashboard: Cleanup
Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-12 10:16:54 +02:00
Patrick Nawracay
49ab1315a3 mgr/dashboard: Fix flaky QA tests
Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-12 10:16:54 +02:00
xie xingguo
6d8d20a55b mon/OSDMonitor: reject pg_num -> 0
Otherwise it will finally crash the mgr process:
```
-2> 2018-09-11 16:23:31.272 7ff521d59700 -1 /clove/xxG/ceph/src/osd/osd_types.cc: In function 'pg_t pg_t::get_parent() const'
 thread 7ff521d59700 time 2018-09-11 16:23:31.272096
/clove/xxG/ceph/src/osd/osd_types.cc: 587: FAILED ceph_assert(bits)
```
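
The kind of guard this commit describes can be sketched as follows. This is an illustration in Python, not the monitor's C++ code; the function name `check_pg_num` and the error text are assumptions.

```python
def check_pg_num(requested: int) -> int:
    """Reject a non-positive pg_num before it reaches any consumer.

    Hypothetical validation helper; the real check lives in the monitor.
    """
    if requested <= 0:
        raise ValueError(f"pg_num must be greater than 0, got {requested}")
    return requested


print(check_pg_num(8))
```

Validating at the point where the value is set keeps downstream code, such as the parent-PG computation in the trace above, from ever seeing an impossible value.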

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-09-12 08:58:10 +08:00
John Spray
0551d0e14b
Merge pull request #23497 from noahdesu/insights
mgr/insights: insights reporting module

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-11 15:56:10 +01:00
John Spray
b18bee1b19
Merge pull request #23146 from jcsp/wip-progress
mgr/progress: improve+test OSD out handling

Reviewed-by: Noah Watkins <nwatkins@redhat.com>
2018-09-11 15:37:51 +01:00
Kefu Chai
510d9e1345
Merge pull request #23723 from xiexingguo/wip-list-missing
osd/PrimaryLogPG: rename list_missing -> list_unfound command

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-09-11 20:25:21 +08:00
John Spray
46a74a07d9 qa: add task for progress module
Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
John Spray
09f5f9e1eb qa: add 4th OSD to mgr test cluster
This is useful for testing progress module.

Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
John Spray
5ecd69099d qa: add tests for progress module
Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
Noah Watkins
5c2615e3d7 qa/tasks/mgr: whitelist insights test health checks
these drive health history tracking tests.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-09-10 15:02:49 -07:00
David Zafman
050a4bb20e test: Allow directory specified to run-standalone.sh
Fixes: http://tracker.ceph.com/issues/35912

Example:
cd build
../qa/run-standalone.sh scrub

Signed-off-by: David Zafman <dzafman@redhat.com>
2018-09-10 12:23:11 -07:00
David Zafman
6e3f04365f test: Trap termination so we can capture logs on teuthology timeout
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-09-10 12:23:07 -07:00
vasukulkarni
10f1c4c9de
Merge pull request #23602 from smanjara/wip-test-netem
qa: Task to emulate network delay and packet drop between two given h…
2018-09-10 09:57:10 -07:00
David Zafman
dc80f8585a test: Use a grep pattern that works across releases
Fixes: http://tracker.ceph.com/issues/35845

Signed-off-by: David Zafman <dzafman@redhat.com>
2018-09-10 08:21:36 -07:00
Sage Weil
d71258495e
Merge pull request #23997 from batrick/multimds-qa-broken-symlink
qa: fix symlink
2018-09-10 09:26:12 -05:00
Sage Weil
4d2a73c7f1 Merge PR #23845 into master
* refs/pull/23845/head:
	osd/OSDMap: include age in up and in counts for ceph status
	mon/OSDMonitor: set new_last_{up,in}_change
	osd/OSDMap: store last_up_change and last_in_change
	mgr/MgrMap: include mgr age in map printer
	mon/MgrMap: track active_changed timestamp
	mon: include mon quorum age in status
	include/utime: add utimespan_str helper

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-10 07:45:58 -05:00
Patrick Donnelly
a45852f8fd
qa: fix symlink
Introduced-by: 6ac1882dc4

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-08 19:21:57 -07:00
Sage Weil
f47921f293 qa/standalone/osd/osd-backfill-stats: fixes
Grep from the primary's log, not every osd's log.

For the backfill_remapped task in particular, after the pg_temp change it
just so happens that the primary changes across the pool size change and
thus two different primaries do (some) backfill.  Fix that test to pass
the correct primary.

Other tests are unaffected as they do not (happen to) trigger a primary
change and already satisfied the (removed) check that only one OSD does
backfill.
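
A rough sketch of the difference between searching every OSD's log and searching only the primary's. The real test is a shell script; the function name `grep_primary_log`, the in-memory log layout, and the sample log lines are all hypothetical.

```python
import re
from typing import Dict, List


def grep_primary_log(logs: Dict[int, List[str]], primary: int, pattern: str) -> List[str]:
    """Return matching lines from the primary OSD's log only."""
    rx = re.compile(pattern)
    return [line for line in logs.get(primary, []) if rx.search(line)]


# Made-up log lines: two OSDs both did some backfill after a primary change.
logs = {
    0: ["pg 1.0 backfill objects=10", "pg 1.0 peering"],
    1: ["pg 1.0 backfill objects=3"],
}

print(grep_primary_log(logs, primary=0, pattern=r"backfill"))
```

Passing the primary explicitly means a test can name the correct OSD for each phase when the primary changes partway through.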

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 17:11:18 -05:00
Sage Weil
09ee3f3538 Merge PR #20469 into master
* refs/pull/20469/head:
	osd/PG: remove warn on delete+merge race
	osd: base project_pg_history on is_new_interval
	osd: make project_pg_history handle concurrent osdmap publish
	osd: handle pg delete vs merge race
	osd/PG: do not purge strays in premerge state
	doc/rados/operations/placement-groups: a few minor corrections
	doc/man/8/ceph: drop enumeration of pg states
	doc/dev/placement-groups: drop old 'splitting' reference
	osd: wait for laggy pgs without osd_lock in handle_osd_map
	osd: drain peering wq in start_boot, not _committed_maps
	osd: kick split children
	osd: no osd_lock for finish_splits
	osd/osd_types: remove is_split assert
	ceph-objectstore-tool: prevent import of pg that has since merged
	qa/suites: test pg merging
	qa/tasks/thrashosds: support merging pgs too
	mon/OSDMonitor: mon_inject_pg_merge_bounce_probability
	doc/rados/operations/placement-groups: update to describe pg_num reductions too
	doc/rados/operations: remove reference to lpgs
	osd: implement pg merge
	osd/PG: implement merge_from
	osdc/Objecter: resend ops on pg merge
	osd: collect and record pg_num changes by pool
	osd: make load_pgs remove message more accurate
	osd/osd_types: pg_t: add is_merge_target()
	osd/osd_types: pg_t::is_merge -> is_merge_source
	osd/osd_types: adding or substracting invalid stats -> invalid stats
	osd/PG: clear_ready_to_merge on_shutdown (or final merge source prep)
	osd: debug pending_creates_from_osd cleanup, don't use cbegin
	ceph-objectstore-tool: debug intervals update
	mgr/ClusterState: discard pg updates for pgs >= pg_num
	mon/OSDMonitor: fix long line
	mon/OSDMonitor: move pool created check into caller
	mon/OSDMonitor: adjust pgp_num_target down along with pg_num_target as needed
	mon/OSDMonitor: add mon_osd_max_initial_pgs to cap initial pool pgs
	osd/OSDMap: set pg[p]_num_target in build_simple*() methods
	mon/PGMap: adjust SMALLER_PGP_NUM warning to use *_target values
	mon/OSDMonitor: set CREATING flag for force-create-pg
	mon/OSDMonitor: start sending new-style pg_create2 messages
	mon/OSDMonitor: set last_force_resend_prenautilus for pg_num_pending changes
	osd: ignore pg creates when pool FLAG_CREATING is not set
	mgr: do not adjust pg_num until FLAG_CREATING removed from pool
	mon/OSDMonitor: add FLAG_CREATING on upgrade if pools still creating
	mon/OSDMonitor: prevent FLAG_CREATING from getting set pre-nautilus
	mon/OSDMonitor: disallow pg_num changes while CREATING flag is set
	mon/OSDMonitor: set POOL_CREATING flag until initial pool pgs are created
	osd/osd_types: add pg_pool_t FLAG_POOL_CREATING
	osd/osd_types: introduce last_force_resend_prenautilus
	osd/PGLog: merge_from helper
	osd: no cache agent or snap trimming during premerge
	osd: notify mon when pending PGs are ready to merge
	mgr: add simple controller to adjust pg[p]_num_actual
	mon/OSDMonitor: MOSDPGReadyToMerge to complete a pg_num change
	mon/OSDMonitor: allow pg_num to adjusted up or down via pg[p]_num_target
	osd/osd_types: make pg merge an interval boundary
	osd/osd_types: add pg_t::is_merge() method
	osd/osd_types: add pg_num_pending to pg_pool_t
	osd: allow multiple threads to block on wait_min_pg_epoch
	osd: restructure advance_pg() call mechanism
	mon/PGMap: prune merged pgs
	mon/PGMap: track pgs by state for each pool
	osd/SnapMapper: allow split_bits to decrease (merge)
	os/bluestore: fix osr_drain before merge
	os/bluestore: allow reuse of osr from existing collection
	os/filestore: (re)implement merge
	os/filestore: add _merge_collections post-check
	os: implement merge_collection
	os/ObjectStore: add merge_collection operation to Transaction
2018-09-07 15:55:21 -05:00
Ilya Dryomov
478aca82eb
Merge pull request #23976 from idryomov/wip-cram-git-clone
qa/tasks/cram: tasks now must live in the repository

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-07 19:57:42 +02:00
Sage Weil
6bd682f53d ceph-objectstore-tool: prevent import of pg that has since merged
We currently import a portion of the PG if it has split.  Merge is more
complicated, though, mainly because COT is operating in a mode where it
fast-forwards the PG to the latest OSDMap epoch, which means it has to
implement any transformations to the PG (split/merge) independently.
Avoid doing this for merge.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Sage Weil
44de03d5e6 qa/suites: test pg merging
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Sage Weil
0b59b7a688 qa/tasks/thrashosds: support merging pgs too
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Sage Weil
4fc02a7f48 osd/OSDMap: include age in up and in counts for ceph status
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 09:07:50 -05:00
vasukulkarni
93748a325c
Merge pull request #23944 from ceph/wip-s3a-update-mirror
qa/tasks: update mirror link for maven
2018-09-06 14:44:29 -07:00
Ilya Dryomov
592f566b4e qa/tasks/cram: tasks now must live in the repository
Commit 0d8887652d ("qa/tasks/cram: use suite_repo repository for all
cram jobs") removed hardcoded git.ceph.com links, but as it turned out
it is still used for nightlies.  There is no good way to accommodate
the different URL schemes, so let's get rid of URLs altogether.

Fixes: https://tracker.ceph.com/issues/27211
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-06 22:32:39 +02:00
Ilya Dryomov
e1c89b51c8 qa/tasks/workunit: factor out overrides and refspec logic
Allow for reuse in the cram task.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-06 22:31:55 +02:00