Commit Graph

901 Commits

Author SHA1 Message Date
Sage Weil
a350131d16 Merge PR #24698 into master
* refs/pull/24698/head:
	Merge PR #24697 into nautilus
	ceph_test_msgr: fix authorizer behavior
	Merge pull request #24667 from liewegas/wip-ec-thrash-full
	Merge PR #24689 into nautilus
	qa/suites/rados/thrash-erasure-code*/thrashers/*: less likely resv rejection injection

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-10-22 22:46:23 -05:00
Sage Weil
ae583f5dde Merge PR #24689 into master
* refs/pull/24689/head:
	qa/tasks/ceph_manager: fix get_stuck_pgs from pg dump change
	Merge PR #24625 into nautilus
	qa/suites/rados/mgr/tasks/module_selftest: whitelist 'foo bar security'

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-10-22 09:19:46 -05:00
Sage Weil
98fc7ebc99 Merge PR #24184 into master
* refs/pull/24184/head:
	mgr/DaemonServer: remove any upmaps on merging PGs
	mgr/DaemonServer: prevent merge if either pg is remapped|upmap
	mgr/DaemonServer: move pending merge check for more consistent code
	qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
	mgr/DaemonServer: add option to bypass careful throttling for thrasher
	PendingReleaseNotes: note about mgr/balancer/max_misplaced change
	mgr/DaemonServer: remove stale/misleading check
	mgr/DaemonServer: throttle pgp_num changes based on misplaced %
	mgr/DaemonServer: block pg_num decrease(merge) until pgp_num is reduced
	mgr/DaemonServer: adjust_pgs(): cosmetic change to debug output
	mon/PGMap: add get_recovery_stats()
	mgr/balancer: mgr/balancer/max_misplaced -> pg_max_misplaced
	pybind/mgr/mgr_module: add get_option()
	mgr/DaemonServer: allow pg_num increases that abort pending merges
	mon/OSDMonitor: resent pre-nautilus client ops on aborted merge
	mon/OSDMonitor: make pgp_num track pg_num more consistently

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2018-10-20 16:40:22 -05:00
Sage Weil
86ae8fb6b8 qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
Thrash such that we still exercise the careful throttling in the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Sage Weil
c22d5a0fce qa/suites/rados/thrash-erasure-code*/thrashers/*: less likely resv rejection injection
For EC pools we have a lot of shards, and a 30% probability on each one
means we are very likely to repeatedly fail backfill reservations, long
enough that teuthology gives up waiting.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-18 17:17:41 -05:00
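Note on c22d5a0fce: the arithmetic behind the message above can be sketched as follows. This is an illustrative calculation only, not code from the commit. It assumes each shard's reservation is rejected independently with the same probability and that a single rejection fails the whole attempt; the function name and the shard counts (3, and 6 as in a 4+2 profile) are hypothetical choices for the example.

```python
def p_any_rejection(p_reject: float, shards: int) -> float:
    """Chance that at least one of `shards` independent reservation
    requests is rejected, given a per-shard rejection probability."""
    # Assumption: rejections are independent and identically likely.
    return 1.0 - (1.0 - p_reject) ** shards


def expected_attempts(p_reject: float, shards: int) -> float:
    """Mean number of attempts until every shard accepts in one round
    (geometric distribution with success probability (1 - p)^shards)."""
    return 1.0 / ((1.0 - p_reject) ** shards)


if __name__ == "__main__":
    # Hypothetical shard counts: 3 (small pool) versus 6 (e.g. EC 4+2).
    for shards in (3, 6):
        print(shards,
              round(p_any_rejection(0.30, shards), 3),
              round(expected_attempts(0.30, shards), 1))
```

Under these assumptions the failure chance per attempt grows quickly with the number of shards, which is consistent with the commit lowering the injection probability for EC pools.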
Sage Weil
f8b6cbda34 Merge PR #24359 into master
* refs/pull/24359/head:
	qa/tests: update ansible version to 2.6 for master branch testing.
	qa/tests: use lvm as default for ceph-ansible testing, this should also work with raw devices

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2018-10-17 10:06:42 -05:00
Sage Weil
b833d35d9b qa/suites/rados/mgr/tasks/module_selftest: whitelist 'foo bar security'
Avoid failures like

"2018-10-16 20:36:00.437153 mgr.y (mgr.25609) 6 : cluster [SEC] foo bar security" in cluster log

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-17 07:09:15 -05:00
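Note on b833d35d9b: a minimal sketch of how a whitelist entry could suppress the cluster log line quoted above. It assumes whitelist entries are treated as regular expressions searched against each log line; that matching rule and the helper name is_whitelisted are assumptions made for illustration, not the test framework's actual implementation.

```python
import re


def is_whitelisted(line: str, whitelist: list[str]) -> bool:
    """Return True if any whitelist pattern matches somewhere in the line.

    Assumption: entries are regular expressions, matched with re.search.
    """
    return any(re.search(pattern, line) for pattern in whitelist)


if __name__ == "__main__":
    # The log line is copied from the commit message above.
    line = ("2018-10-16 20:36:00.437153 mgr.y (mgr.25609) 6 : "
            "cluster [SEC] foo bar security")
    print(is_whitelisted(line, ["foo bar security"]))
    print(is_whitelisted(line, ["some other warning"]))
```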
Sage Weil
9db328f2ab Merge PR #24204 into master
* refs/pull/24204/head:
	qa/suites/rgw/tempest: valgrind on centos only

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2018-10-12 16:29:44 -05:00
Jason Dillaman
c1f950236b qa/tasks: qemu task now uses a relative path in suite repo for test
This makes it easier to re-run tests against a suite branch without
requiring a full ceph-ci build and repo.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-10-11 20:23:39 -04:00
Ilya Dryomov
ccf5f07a0d
Merge pull request #24354 from idryomov/wip-krbd-kfsx-fancy-striping
qa/suites/krbd: more fsx tests

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-10-02 09:32:57 +02:00
Kefu Chai
7e90989214
Merge pull request #24291 from tchaikov/wip-23964
erasure-code: add clay codes

Reviewed-by: Myna Vajha <mynaramana@gmail.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2018-10-02 10:48:21 +08:00
Vasu Kulkarni
4c371daf99 qa/tests: update ansible version to 2.6 for master branch testing.
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2018-10-01 15:10:49 -07:00
Vasu Kulkarni
b0fc647e3f qa/tests: use lvm as default for ceph-ansible testing, this should also work with raw devices
and using the batch mode should auto select nvme/ssd for journal.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2018-10-01 12:05:50 -07:00
Ilya Dryomov
870e42ac6a qa/suites/krbd: more fsx tests
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-10-01 16:48:47 +02:00
Patrick Donnelly
bdd2ddcfd8
qa: use default timeout in fs workunits
Six hours is unnecessarily long.

Fixes: http://tracker.ceph.com/issues/36184
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-30 18:10:05 -07:00
Sage Weil
bc86b06a67 qa/suites/rados/thrash-erasure-code: add clay 4+2
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-27 18:56:21 +08:00
Nathan Cutler
a38af571c2
Merge pull request #23131 from smithfarm/wip-15100
qa: add test that builds example librados programs

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Jesse Williamson <jwilliamson@suse.de>
2018-09-27 00:17:48 +02:00
Mykola Golub
78881934c0
Merge pull request #22043 from dillaman/wip-24139
rbd-mirror: bootstrap needs to handle local image id collision

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-09-26 22:49:02 +03:00
Casey Bodley
8622d8a241
Merge pull request #24011 from cbodley/wip-qa-rgw-multisite-speedup
qa/rgw: reduce number of multisite log shards

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2018-09-25 16:39:51 -04:00
Nathan Cutler
c46c890d02 qa: add test that builds example librados programs
Fixes: http://tracker.ceph.com/issues/15100
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-09-25 13:18:04 +02:00
Jason Dillaman
3e8f16b484 qa/suites/rbd: increase librbd debug level for mirror-thrash
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-24 15:12:26 -04:00
Sage Weil
757a94b980 qa/suites/rados/thrash-old-clients/workloads/rbd_cls.yaml: skip parents test
We can't (easily) build updated hammer packages, but all this sh script does
is run this one test binary with --gtest_filter arguments, so just do
it directly and skip the test explicitly here.  (Newer versions of the .sh
understand the environment variable but the hammer version does not.)

Fixes: http://tracker.ceph.com/issues/36104
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-24 10:57:40 -05:00
Sage Weil
9bf7c810a7 Merge PR #23985 into master
* refs/pull/23985/head:
	ceph-objectstore-tool: add back pool dne check
	qa/suites/rados/singleton/reg11184: remove old test
	ceph-objectstore-tool: import pg at original epoch
	osd: handle null pg slot on startup
	ceph-objectstore-tool: drop support for ancient export files
	osd: avoid dropping osd_lock when pg osdmaps are not laggy
	qa/standalone/osd/pg-merge.sh: add merge vs pg import test

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-09-21 08:21:53 -05:00
Sage Weil
04e608e1a2 qa/suites/rgw/tempest: valgrind on centos only
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-20 13:00:47 -05:00
Sage Weil
4e5f2bb596 qa/suites/rados/singleton/reg11184: remove old test
This bug was about filtering missing and divergent when doing a partial
PG import.  We don't support partial PG imports any more, so this can
go away!

Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-20 12:58:00 -05:00
Patrick Donnelly
3e3b69dbde
Merge PR #24156 into master
* refs/pull/24156/head:
	qa/tests: removed knfs suite

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-20 09:31:50 -07:00
Patrick Donnelly
e606e21746
Merge PR #24185 into master
* refs/pull/24185/head:
	qa: install build dependencies for cfuse_workunit_kernel_untar_build.yaml

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-20 08:02:48 -07:00
Neha Ojha
38ef3da8d2 qa: install build dependencies for cfuse_workunit_kernel_untar_build.yaml
Fixes: https://tracker.ceph.com/issues/36076
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-09-19 15:22:16 -07:00
Yuri Weinstein
eaca188733 qa/tests: removed knfs suite
Fixes: http://tracker.ceph.com/issues/36075
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-09-19 09:36:34 -07:00
Casey Bodley
d897b92878 osd: remove statelog from osd_class_load_list config
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-09-19 10:32:55 -04:00
Jason Dillaman
d3a1a831ac qa/workunits/rbd: new test for rbd-mirror peer config-key secrets
Fixes: http://tracker.ceph.com/issues/24688
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-09-13 09:58:34 -04:00
John Spray
0551d0e14b
Merge pull request #23497 from noahdesu/insights
mgr/insights: insights reporting module

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-11 15:56:10 +01:00
John Spray
46a74a07d9 qa: add task for progress module
Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
John Spray
09f5f9e1eb qa: add 4th OSD to mgr test cluster
This is useful for testing progress module.

Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
Noah Watkins
5c2615e3d7 qa/tasks/mgr: whitelist insights test health checks
These drive health history tracking tests.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-09-10 15:02:49 -07:00
Patrick Donnelly
a45852f8fd
qa: fix symlink
Introduced-by: 6ac1882dc4

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-08 19:21:57 -07:00
Sage Weil
09ee3f3538 Merge PR #20469 into master
* refs/pull/20469/head:
	osd/PG: remove warn on delete+merge race
	osd: base project_pg_history on is_new_interval
	osd: make project_pg_history handle concurrent osdmap publish
	osd: handle pg delete vs merge race
	osd/PG: do not purge strays in premerge state
	doc/rados/operations/placement-groups: a few minor corrections
	doc/man/8/ceph: drop enumeration of pg states
	doc/dev/placement-groups: drop old 'splitting' reference
	osd: wait for laggy pgs without osd_lock in handle_osd_map
	osd: drain peering wq in start_boot, not _committed_maps
	osd: kick split children
	osd: no osd_lock for finish_splits
	osd/osd_types: remove is_split assert
	ceph-objectstore-tool: prevent import of pg that has since merged
	qa/suites: test pg merging
	qa/tasks/thrashosds: support merging pgs too
	mon/OSDMonitor: mon_inject_pg_merge_bounce_probability
	doc/rados/operations/placement-groups: update to describe pg_num reductions too
	doc/rados/operations: remove reference to lpgs
	osd: implement pg merge
	osd/PG: implement merge_from
	osdc/Objecter: resend ops on pg merge
	osd: collect and record pg_num changes by pool
	osd: make load_pgs remove message more accurate
	osd/osd_types: pg_t: add is_merge_target()
	osd/osd_types: pg_t::is_merge -> is_merge_source
	osd/osd_types: adding or substracting invalid stats -> invalid stats
	osd/PG: clear_ready_to_merge on_shutdown (or final merge source prep)
	osd: debug pending_creates_from_osd cleanup, don't use cbegin
	ceph-objectstore-tool: debug intervals update
	mgr/ClusterState: discard pg updates for pgs >= pg_num
	mon/OSDMonitor: fix long line
	mon/OSDMonitor: move pool created check into caller
	mon/OSDMonitor: adjust pgp_num_target down along with pg_num_target as needed
	mon/OSDMonitor: add mon_osd_max_initial_pgs to cap initial pool pgs
	osd/OSDMap: set pg[p]_num_target in build_simple*() methods
	mon/PGMap: adjust SMALLER_PGP_NUM warning to use *_target values
	mon/OSDMonitor: set CREATING flag for force-create-pg
	mon/OSDMonitor: start sending new-style pg_create2 messages
	mon/OSDMonitor: set last_force_resend_prenautilus for pg_num_pending changes
	osd: ignore pg creates when pool FLAG_CREATING is not set
	mgr: do not adjust pg_num until FLAG_CREATING removed from pool
	mon/OSDMonitor: add FLAG_CREATING on upgrade if pools still creating
	mon/OSDMonitor: prevent FLAG_CREATING from getting set pre-nautilus
	mon/OSDMonitor: disallow pg_num changes while CREATING flag is set
	mon/OSDMonitor: set POOL_CREATING flag until initial pool pgs are created
	osd/osd_types: add pg_pool_t FLAG_POOL_CREATING
	osd/osd_types: introduce last_force_resend_prenautilus
	osd/PGLog: merge_from helper
	osd: no cache agent or snap trimming during premerge
	osd: notify mon when pending PGs are ready to merge
	mgr: add simple controller to adjust pg[p]_num_actual
	mon/OSDMonitor: MOSDPGReadyToMerge to complete a pg_num change
	mon/OSDMonitor: allow pg_num to adjusted up or down via pg[p]_num_target
	osd/osd_types: make pg merge an interval boundary
	osd/osd_types: add pg_t::is_merge() method
	osd/osd_types: add pg_num_pending to pg_pool_t
	osd: allow multiple threads to block on wait_min_pg_epoch
	osd: restructure advance_pg() call mechanism
	mon/PGMap: prune merged pgs
	mon/PGMap: track pgs by state for each pool
	osd/SnapMapper: allow split_bits to decrease (merge)
	os/bluestore: fix osr_drain before merge
	os/bluestore: allow reuse of osr from existing collection
	os/filestore: (re)implement merge
	os/filestore: add _merge_collections post-check
	os: implement merge_collection
	os/ObjectStore: add merge_collection operation to Transaction
2018-09-07 15:55:21 -05:00
Ilya Dryomov
478aca82eb
Merge pull request #23976 from idryomov/wip-cram-git-clone
qa/tasks/cram: tasks now must live in the repository

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-07 19:57:42 +02:00
Sage Weil
44de03d5e6 qa/suites: test pg merging
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Ilya Dryomov
592f566b4e qa/tasks/cram: tasks now must live in the repository
Commit 0d8887652d ("qa/tasks/cram: use suite_repo repository for all
cram jobs") removed hardcoded git.ceph.com links, but as it turned out
it is still used for nightlies.  There is no good way to accommodate
the different URL schemes, so let's get rid of URLs altogether.

Fixes: https://tracker.ceph.com/issues/27211
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-06 22:32:39 +02:00
Patrick Donnelly
6ac1882dc4
qa: automate distro/kernel matrix for kclient
It's no longer necessary to pass `-k testing` to teuthology-suite. We're also
now regularly testing RHEL 7.5 kernel in upstream testing.

This work is prep for eventually integrating kclient into fs.

Fixes: http://tracker.ceph.com/issues/26995

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-06 10:23:59 -07:00
Casey Bodley
2629231ca1 qa/rgw: reduce number of multisite log shards
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-09-05 14:45:04 -04:00
Ilya Dryomov
a0df578139
Merge pull request #23905 from idryomov/wip-cram-suite-repo
qa/tasks/cram: use suite_repo repository for all cram jobs

Reviewed-by: Nathan Cutler <ncutler@suse.com>
2018-09-04 14:27:28 +02:00
Lenz Grimmer
82412896ff
Merge pull request #23491 from p-na/per-osd-settings
mgr/dashboard: Add support for managing individual OSD settings in the backend

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-09-04 12:01:19 +02:00
Ilya Dryomov
0d8887652d qa/tasks/cram: use suite_repo repository for all cram jobs
Currently git.ceph.com is hardcoded for all cram jobs.  Testing
modifications is a pain: one needs to push to either ceph/ceph.git or
ceph/ceph-ci.git (depending on where the ceph branch is at, triggering
unnecessary builds in the latter case) and wait for the mirror to sync.
Runs scheduled against branches in developer's forks fail.

Move away from git.ceph.com to allow mixing branches and repositories,
similar to workunits.

Fixes: https://tracker.ceph.com/issues/27211
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-03 22:07:20 +02:00
Patrick Nawracay
e71466cc49 mgr/dashboard: Add support for managing individual OSD settings (backend)
Add options to mark OSDs in/out/down/reweight/lost/remove/destroy/create

Fixes: http://tracker.ceph.com/issues/24270

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-03 12:51:04 +02:00
Sage Weil
88df536908 Merge PR #23540 into master
* refs/pull/23540/head:
	include/ceph_fs: rename old auid field
	PendingReleaseNotes: note about auid support removal
	radosgw-admin: remove -a --auth-uid arg
	rgw: remove auid member from RGWUserInfo
	auth: remove auid member from EntityAuth
	osd: remove auid session member
	mon: remove auid session member
	doc/dev/cephx_protocol: drop auid reference
	auth: remove auid args from handle_request and verify_authorizer
	mon/OSDMonitor: remove 'osd pool {get,set} <name> auid ...'
	mon/OSDMonitor: remove auid arg for 'osd lspools' and deprecate
	osd/OSDCap: remove auid from grammar
	osd/OSDCap: remove auid from is_capable() etc args
	auth: clean up cap parse error messages
	mon/AuthMonitor: raise health warning on invalid caps
	mon/AuthMonitor: drop ancient auth inc encoding compat
	messages/MPoolOp: drop auid member
	osdc/Objecter: drop change_pool_auid
	pybind/rados: drop auid arg to pool_create
	pybind/rados: drop change_auid
	rados: drop mkpool, rmpool commands
	rados: remove 'chown' command
	librados: deprecate calls that take auid
	librados: mark all auid calls deprecated
	mon/OSDMonitor: drop variable pool auid for prepare_new_pool
	mon/OSDMonitor: remove pool auid change support
	osdc/Objecter: do not pass auid to create_pool
	ceph-authtool: remove auid options
	qa/workunits/cephtool: remove auid tests

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
2018-09-01 15:53:31 -05:00
Sage Weil
35820f4b88 mon/AuthMonitor: raise health warning on invalid caps
Raise a health warning if we have invalid (unparsable) caps in the auth
database.  Include a simple test.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-08-31 15:54:58 -05:00
Ilya Dryomov
a4df8c3562 qa: rbd_workunit_kernel_untar_build: install build dependencies
Commit f0fe0936e6 ("qa: use recent kernel to kernel build testing")
bumped the kernel to 4.17.

Fixes: http://tracker.ceph.com/issues/35074
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-08-31 14:31:03 +02:00
Patrick Donnelly
3aa392ca73
Merge PR #23439 into master
* refs/pull/23439/head:
	qa: whitelist cap revoke warning
	doc: document cap revoke non-responders client eviction
	test: validate client eviction for cap revoke non-responders
	mds: add counter for tracking cap non-responding clients
	mds: evict clients that do not respond to cap revoke by MDS
	mds: pass timeout argument for fetching late clients

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-08-25 13:04:58 -07:00