869 Commits

Author SHA1 Message Date
John Spray
0551d0e14b
Merge pull request #23497 from noahdesu/insights
mgr/insights: insights reporting module

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-11 15:56:10 +01:00
John Spray
46a74a07d9 qa: add task for progress module
Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
John Spray
09f5f9e1eb qa: add 4th OSD to mgr test cluster
This is useful for testing progress module.

Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
Noah Watkins
5c2615e3d7 qa/tasks/mgr: whitelist insights test health checks
these drive health history tracking tests.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-09-10 15:02:49 -07:00
Patrick Donnelly
a45852f8fd
qa: fix symlink
Introduced-by: 6ac1882dc4

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-08 19:21:57 -07:00
Sage Weil
09ee3f3538 Merge PR #20469 into master
* refs/pull/20469/head:
	osd/PG: remove warn on delete+merge race
	osd: base project_pg_history on is_new_interval
	osd: make project_pg_history handle concurrent osdmap publish
	osd: handle pg delete vs merge race
	osd/PG: do not purge strays in premerge state
	doc/rados/operations/placement-groups: a few minor corrections
	doc/man/8/ceph: drop enumeration of pg states
	doc/dev/placement-groups: drop old 'splitting' reference
	osd: wait for laggy pgs without osd_lock in handle_osd_map
	osd: drain peering wq in start_boot, not _committed_maps
	osd: kick split children
	osd: no osd_lock for finish_splits
	osd/osd_types: remove is_split assert
	ceph-objectstore-tool: prevent import of pg that has since merged
	qa/suites: test pg merging
	qa/tasks/thrashosds: support merging pgs too
	mon/OSDMonitor: mon_inject_pg_merge_bounce_probability
	doc/rados/operations/placement-groups: update to describe pg_num reductions too
	doc/rados/operations: remove reference to lpgs
	osd: implement pg merge
	osd/PG: implement merge_from
	osdc/Objecter: resend ops on pg merge
	osd: collect and record pg_num changes by pool
	osd: make load_pgs remove message more accurate
	osd/osd_types: pg_t: add is_merge_target()
	osd/osd_types: pg_t::is_merge -> is_merge_source
	osd/osd_types: adding or substracting invalid stats -> invalid stats
	osd/PG: clear_ready_to_merge on_shutdown (or final merge source prep)
	osd: debug pending_creates_from_osd cleanup, don't use cbegin
	ceph-objectstore-tool: debug intervals update
	mgr/ClusterState: discard pg updates for pgs >= pg_num
	mon/OSDMonitor: fix long line
	mon/OSDMonitor: move pool created check into caller
	mon/OSDMonitor: adjust pgp_num_target down along with pg_num_target as needed
	mon/OSDMonitor: add mon_osd_max_initial_pgs to cap initial pool pgs
	osd/OSDMap: set pg[p]_num_target in build_simple*() methods
	mon/PGMap: adjust SMALLER_PGP_NUM warning to use *_target values
	mon/OSDMonitor: set CREATING flag for force-create-pg
	mon/OSDMonitor: start sending new-style pg_create2 messages
	mon/OSDMonitor: set last_force_resend_prenautilus for pg_num_pending changes
	osd: ignore pg creates when pool FLAG_CREATING is not set
	mgr: do not adjust pg_num until FLAG_CREATING removed from pool
	mon/OSDMonitor: add FLAG_CREATING on upgrade if pools still creating
	mon/OSDMonitor: prevent FLAG_CREATING from getting set pre-nautilus
	mon/OSDMonitor: disallow pg_num changes while CREATING flag is set
	mon/OSDMonitor: set POOL_CREATING flag until initial pool pgs are created
	osd/osd_types: add pg_pool_t FLAG_POOL_CREATING
	osd/osd_types: introduce last_force_resend_prenautilus
	osd/PGLog: merge_from helper
	osd: no cache agent or snap trimming during premerge
	osd: notify mon when pending PGs are ready to merge
	mgr: add simple controller to adjust pg[p]_num_actual
	mon/OSDMonitor: MOSDPGReadyToMerge to complete a pg_num change
	mon/OSDMonitor: allow pg_num to adjusted up or down via pg[p]_num_target
	osd/osd_types: make pg merge an interval boundary
	osd/osd_types: add pg_t::is_merge() method
	osd/osd_types: add pg_num_pending to pg_pool_t
	osd: allow multiple threads to block on wait_min_pg_epoch
	osd: restructure advance_pg() call mechanism
	mon/PGMap: prune merged pgs
	mon/PGMap: track pgs by state for each pool
	osd/SnapMapper: allow split_bits to decrease (merge)
	os/bluestore: fix osr_drain before merge
	os/bluestore: allow reuse of osr from existing collection
	os/filestore: (re)implement merge
	os/filestore: add _merge_collections post-check
	os: implement merge_collection
	os/ObjectStore: add merge_collection operation to Transaction
2018-09-07 15:55:21 -05:00
Ilya Dryomov
478aca82eb
Merge pull request #23976 from idryomov/wip-cram-git-clone
qa/tasks/cram: tasks now must live in the repository

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-09-07 19:57:42 +02:00
Sage Weil
44de03d5e6 qa/suites: test pg merging
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Ilya Dryomov
592f566b4e qa/tasks/cram: tasks now must live in the repository
Commit 0d8887652d ("qa/tasks/cram: use suite_repo repository for all
cram jobs") removed hardcoded git.ceph.com links, but as it turned out
it is still used for nightlies.  There is no good way to accommodate
the different URL schemes, so let's get rid of URLs altogether.

Fixes: https://tracker.ceph.com/issues/27211
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-06 22:32:39 +02:00
Patrick Donnelly
6ac1882dc4
qa: automate distro/kernel matrix for kclient
It's no longer necessary to pass `-k testing` to teuthology-suite. We're also
now regularly testing RHEL 7.5 kernel in upstream testing.

This work is prep for eventually integrating kclient into fs.

Fixes: http://tracker.ceph.com/issues/26995

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-09-06 10:23:59 -07:00
Ilya Dryomov
a0df578139
Merge pull request #23905 from idryomov/wip-cram-suite-repo
qa/tasks/cram: use suite_repo repository for all cram jobs

Reviewed-by: Nathan Cutler <ncutler@suse.com>
2018-09-04 14:27:28 +02:00
Lenz Grimmer
82412896ff
Merge pull request #23491 from p-na/per-osd-settings
mgr/dashboard: Add support for managing individual OSD settings in the backend

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2018-09-04 12:01:19 +02:00
Ilya Dryomov
0d8887652d qa/tasks/cram: use suite_repo repository for all cram jobs
Currently git.ceph.com is hardcoded for all cram jobs.  Testing
modifications is a pain: one needs to push to either ceph/ceph.git or
ceph/ceph-ci.git (depending on where the ceph branch is at, triggering
unnecessary builds in the latter case) and wait for the mirror to sync.
Runs scheduled against branches in developer's forks fail.

Move away from git.ceph.com to allow mixing branches and repositories,
similar to workunits.

Fixes: https://tracker.ceph.com/issues/27211
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-09-03 22:07:20 +02:00
Patrick Nawracay
e71466cc49 mgr/dashboard: Add support for managing individual OSD settings (backend)
Add options to mark OSDs in/out/down/reweight/lost/remove/destroy/create

Fixes: http://tracker.ceph.com/issues/24270

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-03 12:51:04 +02:00
Sage Weil
88df536908 Merge PR #23540 into master
* refs/pull/23540/head:
	include/ceph_fs: rename old auid field
	PendingReleaseNotes: note about auid support removal
	radosgw-admin: remove -a --auth-uid arg
	rgw: remove auid member from RGWUserInfo
	auth: remove auid member from EntityAuth
	osd: remove auid session member
	mon: remove auid session member
	doc/dev/cephx_protocol: drop auid reference
	auth: remove auid args from handle_request and verify_authorizer
	mon/OSDMonitor: remove 'osd pool {get,set} <name> auid ...'
	mon/OSDMonitor: remove auid arg for 'osd lspools' and deprecate
	osd/OSDCap: remove auid from grammar
	osd/OSDCap: remove auid from is_capable() etc args
	auth: clean up cap parse error messages
	mon/AuthMonitor: raise health warning on invalid caps
	mon/AuthMonitor: drop ancient auth inc encoding compat
	messages/MPoolOp: drop auid member
	osdc/Objecter: drop change_pool_auid
	pybind/rados: drop auid arg to pool_create
	pybind/rados: drop change_auid
	rados: drop mkpool, rmpool commands
	rados: remove 'chown' command
	librados: deprecate calls that take auid
	librados: mark all auid calls deprecated
	mon/OSDMonitor: drop variable pool auid for prepare_new_pool
	mon/OSDMonitor: remove pool auid change support
	osdc/Objecter: do not pass auid to create_pool
	ceph-authtool: remove auid options
	qa/workunits/cephtool: remove auid tests

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
2018-09-01 15:53:31 -05:00
Sage Weil
35820f4b88 mon/AuthMonitor: raise health warning on invalid caps
Raise a health warning if we have invalid (unparsable) caps in the auth
database.  Include a simple test.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-08-31 15:54:58 -05:00
Ilya Dryomov
a4df8c3562 qa: rbd_workunit_kernel_untar_build: install build dependencies
Commit f0fe0936e6 ("qa: use recent kernel to kernel build testing")
bumped the kernel to 4.17.

Fixes: http://tracker.ceph.com/issues/35074
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2018-08-31 14:31:03 +02:00
Patrick Donnelly
3aa392ca73
Merge PR #23439 into master
* refs/pull/23439/head:
	qa: whitelist cap revoke warning
	doc: document cap revoke non-responders client eviction
	test: validate client eviction for cap revoke non-responders
	mds: add counter for tracking cap non-responding clients
	mds: evict clients that do not respond to cap revoke by MDS
	mds: pass timeout argument for fetching late clients

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
2018-08-25 13:04:58 -07:00
Patrick Donnelly
4367de377e
qa: whitelist cap revoke warning
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-08-25 12:42:26 -07:00
Noah Watkins
a087e417d0 qa/tasks/mgr: whitelist insights warning
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2018-08-24 10:45:40 -07:00
David Zafman
687f63e599 test: Update tests for error message changes
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-08-23 11:09:22 -07:00
David Zafman
b40784290f qa: Add new message to whitelist for scrub/repair tests
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-08-23 11:09:22 -07:00
Patrick Donnelly
862b84c1e8
qa: use cephfs standard configs for kclient
Specifically, qa/cephfs/begin.yaml.

Fixes: http://tracker.ceph.com/issues/26967

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-08-20 13:58:50 -07:00
Noah Watkins
3d96b2faaa qa/tasks/mgr: add tests for insights module
Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-08-16 15:29:46 -07:00
Patrick Donnelly
0b5df6e00a
qa: add cephfs-shell skeleton test case
Right now just tests that "help" works.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-08-16 09:13:36 -07:00
Sage Weil
b0b616df7b Merge PR #23240 into master
* refs/pull/23240/head:
	qa/suites/rados, qa/workunits/rados: Add suite/workunit for ceph-crash
	add ceph-crash service
	common/options: enable mgr 'crash' module by default
	global/signal_handler: add 'done' file to signal crashdump is ready

Reviewed-by: Sage Weil <sage@redhat.com>
2018-08-14 20:31:29 -05:00
Dan Mick
298a1d92d2 qa/suites/rados, qa/workunits/rados: Add suite/workunit for ceph-crash
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2018-08-13 13:53:26 -07:00
Casey Bodley
7ad313768a qa/rgw: raise default curl timeout for multisite suite
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-08-10 14:50:52 -04:00
Sage Weil
b500359409 qa/rgw: vary omap osd-side limit
Signed-off-by: Sage Weil <sage@redhat.com>
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-08-10 14:50:52 -04:00
Patrick Donnelly
2ea86fa4d3
Merge PR #21948 into master
* refs/pull/21948/head:
	ceph_volume_client: use integer division for pg_num
	packages.yaml: install py3 compatible cephfs and rados bindings...
	qa: test py2 and py3 for volume client
	qa: make test_volume_client.py py3 compatible
	qa/ceph-volume: allow executing python pyloads using python3
	cephfs.pyx: add py3 compatibility
	ceph-volume-client: add py3 compatibility

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-08-01 21:42:27 -07:00
Casey Bodley
8d36655fcb
Merge pull request #23372 from cbodley/wip-25214
qa/rgw: override valgrind --max-threads for radosgw

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
2018-08-01 16:12:21 -04:00
Casey Bodley
e617990efe qa/rgw: override valgrind --max-threads for radosgw
radosgw now uses 512 frontend threads by default, and valgrind won't
start with its default --max-threads=500

Fixes: http://tracker.ceph.com/issues/25214

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-08-01 14:45:24 -04:00
Casey Bodley
ce45875d80
Merge pull request #23145 from cbodley/wip-qa-rgw-admin-rest
qa/rgw: add radosgw-admin-rest task to singleton suite

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2018-08-01 13:52:40 -04:00
Patrick Donnelly
a5e3b0512c
qa: test py2 and py3 for volume client
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-30 16:12:48 -07:00
Sage Weil
1ebafdb65f
Merge pull request #23292 from yuriw/wip-yuriw-25140-master
qa/tests: added 1st draft of mimic-x suite
2018-07-30 14:55:41 -05:00
Yuri Weinstein
baa4d0ea78
Merge pull request #23305 from smithfarm/wip-cleanup-upgrade
qa/upgrade: cleanup for nautilus

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-30 09:01:03 -07:00
Yuri Weinstein
e6f21c1aa3 qa/tests: added 1st draft of mimic-x suite
Fixes: https://tracker.ceph.com/issues/25140
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-07-30 08:41:18 -07:00
Nathan Cutler
17d9b5be4d qa/upgrade: cleanup for nautilus
Drop unused suites, which ATM means all of them except upgrade/luminous-x
which recently got a cleanup in https://github.com/ceph/ceph/pull/23162

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-29 19:56:53 +02:00
Casey Bodley
bfc892ab84 qa/rgw: add radosgw-admin-rest task to singleton suite
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-07-27 14:29:47 -04:00
Nathan Cutler
4432b16673 tests: upgrade/luminous-x: fix order of final-workload directory
This is just a cosmetic fix.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-21 20:30:38 +02:00
Nathan Cutler
2555524f1f tests: upgrade/luminous-x: whitelist REQUEST_SLOW for rados_mon_thrash
Fixes: http://tracker.ceph.com/issues/25051
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-21 11:31:20 +02:00
Sage Weil
e2465fdc22
Merge pull request #23151 from neha-ojha/wip-25008
qa/suites/powercycle: whitelist MDS_SLOW_REQUEST
2018-07-20 15:59:16 -05:00
Yuri Weinstein
6d77600fef
Merge pull request #23123 from yuriw/wip-yuriw-24926-master
qa/tests: Set ansible-version: 2.5
2018-07-20 13:04:36 -07:00
Neha Ojha
ac87fd946c qa/suites/powercycle: whitelist MDS_SLOW_REQUEST
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-07-20 12:50:02 -07:00
Casey Bodley
0b5bed7ecf
Merge pull request #22919 from liewegas/wip-rgw-cls-test
qa/suites/rgw/verify/tasks/cls_rgw: test cls_rgw

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2018-07-20 10:01:36 -04:00
Yuri Weinstein
f40ea9709f qa/tests: Set ansible-version: 2.5
Fixes http://tracker.ceph.com/issues/24926
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-07-18 17:28:34 -07:00
Patrick Donnelly
333444e8aa
Merge PR #21885 into master
* refs/pull/21885/head:
	qa: update cluster log health warning message
	qa: add tests for client features
	mds: evict clients that lack required features
	mds: cleanup MDSRank::evict_client
	mds: infer client version by client metadata and connection's features
	mds: introduce "ceph fs set <fs_name> min_compat_client <release_name>"
	mds: tell client why it's rejected
	mds: introduce cephfs' own feature bits
	mds: make Server::prepare_force_open_sessions() update client metadata

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-13 13:57:17 -07:00
Patrick Donnelly
3b7233a402
qa: use 4 mds cluster for multifs tests
Fixes: http://tracker.ceph.com/issues/24899

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-13 12:11:44 -07:00
Patrick Donnelly
87dcaab130
qa: update cluster log health warning message
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-13 12:06:11 -07:00
Patrick Donnelly
441a2730e5
qa: add tests for client features
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-12 10:12:12 -07:00