Commit Graph

738 Commits

Author SHA1 Message Date
Yuri Weinstein
b28ab5616d
Merge pull request #20678 from ceph/wip-s3a-fix
fix s3atests that are failing for sometime

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2018-05-01 09:28:24 -07:00
Mykola Golub
6b752a3859
Merge pull request #21697 from dillaman/wip-18753-1
rbd-mirror: additional thrasher testing

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-04-30 18:25:35 +03:00
Kefu Chai
770dbae2ca qa/suites/rados/thrash-old-clients: ms_type=simple
hammer does not support async messenger, so set ms_type to "simple" for
hammer client.

Fixes: http://tracker.ceph.com/issues/23922
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-30 21:40:53 +08:00
Josh Durgin
aef8d96def
Merge pull request #21710 from neha-ojha/wip-cosbench-workloads
qa: add cosbench workloads and override teuthology default settings

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 18:30:11 -07:00
Neha Ojha
1b6bf0d423 qa/suites/perf-basic: add cosbench write workload
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 16:15:30 -07:00
Neha Ojha
f670f2d6d3 qa/suites/perf-basic: override default teuthology settings
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 16:07:11 -07:00
Neha Ojha
f01eb7adc6 qa/suites/rados/perf: add cosbench workloads
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 16:02:40 -07:00
Jason Dillaman
0cb3e64221
Merge pull request #21704 from liewegas/wip-rados-old-clients
qa/suites/rados/thrash-old-clients: add rbd tests

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2018-04-27 17:35:58 -04:00
Yuri Weinstein
2c98007a13
Merge pull request #21630 from neha-ojha/wip-rgw-pools
qa: modify rgw default pool names

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 14:21:52 -07:00
Sage Weil
9fbdca34bf qa/suites/rados/thrash-old-clients: add cls_rbd, rbd api tests
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-27 14:04:25 -05:00
Neha Ojha
33db6bdfdd qa/suites/rados/perf: override default teuthology settings
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-27 12:04:00 -07:00
Sage Weil
7cbaa62498 qa/suites/rados/thrash-old-clients: add hammer
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-27 11:40:53 -05:00
Sage Weil
41fae6e9c4
Merge pull request #21679 from liewegas/wip-rados-old-clients
qa/suites/rados/thrash-old-clients: do some thrashing with jewel and luminous clients

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-27 11:30:39 -05:00
Jason Dillaman
de4587012d qa/suites/rbd: rbd-mirror fsx stress test
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-04-27 08:34:42 -04:00
Sage Weil
2332e63e33
Merge pull request #21646 from liewegas/wip-rados-api-tests
qa/suites/rados/verify/tasks/rados_api_tests: whitelist OBJECT_MISPLACED
2018-04-26 12:39:23 -05:00
Kefu Chai
966c76330b qa: reduce "mon client hunt interval max multiple" to 2 for all clients
because with high failure rate, we need to connect to mon more
frequently if the connection fails.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-26 12:04:49 +08:00
Sage Weil
e331311b87 qa/suites/rados/verify/tasks/rados_api_tests: whitelist OBJECT_MISPLACED
The api tests do some splits, which can move data.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-25 10:33:52 -05:00
Mykola Golub
808503f4d9
Merge pull request #21541 from dillaman/wip-18753
qa: introduce rbd-mirror thrasher to existing tests

Reviewed-by: Mykola Golub <mgolub@suse.com>
2018-04-25 16:07:16 +03:00
Neha Ojha
8158a2c430 qa: add rgw index pool to rgw_snaps.yaml
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-24 15:22:18 -07:00
Sage Weil
27e91a99f5
Merge pull request #21273 from jdurgin/wip-23195
osd/ECBackend: only check required shards when finishing recovery reads

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-04-24 17:20:25 -05:00
Sage Weil
a43c7b3a23 qa/suites/rados/thrash-old-clients: do some thrashing with jewel and luminous clients
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-24 16:14:36 -05:00
Neha Ojha
48f4b8172a qa: correct default rgw pool name
Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-04-24 13:45:33 -07:00
Kefu Chai
cdcbd47e1e qa/suite: whitelist PG_AVAILABILITY in rados_api_tests.yaml
pg will be created when increasing pgp-num and pg-num. so at that
moment, PG_AVAILABILITY is reported. so whitelist it in all tests which
run rados/test.sh. that script exercises ceph_test_rados_api_list.

Fixes: http://tracker.ceph.com/issues/23763
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-24 10:16:12 +08:00
Jason Dillaman
2311eff622 qa/suites/rbd: thrash rbd-mirror daemon during tests
Fixes: http://tracker.ceph.com/issues/18753
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2018-04-23 16:02:59 -04:00
Ricardo Dias
7309be145b
qa/suites/rados/mgr/tasks/dashboard: add MDS_(ALL_DOWN|UP_LESS_THAN_MAX) to whitelist
Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-04-21 15:58:23 +01:00
Josh Durgin
234d652317 qa/suites/rados: add coverage for osd_recovery_max_single_start > 1
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2018-04-20 19:42:15 -04:00
Patrick Donnelly
bb95292213
Merge PR #16608 into master
* refs/pull/16608/head:
	qa: whitelist mds down wrn during cephfs testing
	mds: add config to disable fragmentation
	qa: add max_mds thrash test
	qa: mds_thrash updates for new max_mds behavior
	doc: update upgrade procedure and release notes
	qa: add test for cluster resizing
	qa: remove use of mds deactivate
	cephfs: add new down/joinable fs flags
	mds: evict all clients if last mds shutting down
	cephfs: deprecate ceph mds deactivate
	cephfs: kill allow_dirfrags
	cephfs: Kill allow_multimds
	cephfs: Change behavior of cluster_down flag
	mon/FSCommands: Set extra MDS to standby
	cephfs: Health check changes
	mon/MDSMonitor: Remove command support for legacy syntax

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-04-18 14:36:54 -07:00
Patrick Donnelly
88922255e9
qa: add test for cluster resizing
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-04-17 11:26:56 -07:00
Kefu Chai
d55724ec4a
Merge pull request #21331 from liewegas/wip-slow-ops
qa/suites/rados/monthrash: whitelist SLOW_OPS

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-04-13 17:49:30 +08:00
Sage Weil
2f00b8f990 Merge PR #21302 into wip-sage-testing-20180410.153118
* refs/pull/21302/head:
	qa/suites/rados/singleton/all/random-eio: whitelist eio error message

Reviewed-by: David Zafman <dzafman@redhat.com>
2018-04-10 10:31:26 -05:00
Sage Weil
c365614292 qa/suites/rados/monthrash: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-10 08:56:39 -05:00
Kefu Chai
acc08559ce qa/suites: whitelist SLOW_OPS
Fixes: http://tracker.ceph.com/issues/23495
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-10 19:25:47 +08:00
Sage Weil
35c14a0162 qa/suites/rados/singleton/all/random-eio: whitelist eio error message
"cluster [ERR] 2.1 shard 1: soid 2:8007ad8d:::benchmark_data_smithi115_12935_object2439:head candidate had a read error"

is normal when we're injecting EIO.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-09 07:40:23 -05:00
Brad Hubbard
eeeed6497b qa/suites/rados: Disable scrub backoff
A long run of lost coin flips can lead to a timeout in
test_large_omap_detection.py.

Fixes: http://tracker.ceph.com/issues/23578

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2018-04-09 17:21:01 +10:00
Sage Weil
d7bca5d19f Merge PR #20512 into wip-sage-testing-20180407.022843
* refs/pull/20512/head:
	osd: osd_mon_report_interval_min -> osd_mon_report_interval, kill _max

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-04-06 21:29:32 -05:00
Gregory Farnum
6d2e4c9b7b
Merge pull request #19973 from liewegas/wip-peering-fast-dispatch
osd: fast dispatch of peering events and pg_map + osd sharded wq refactor

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2018-04-06 11:48:11 -07:00
Sage Weil
577737d007 osd: osd_mon_report_interval_min -> osd_mon_report_interval, kill _max
The _max isn't used.  Drop the _min suffix.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-06 11:00:14 -05:00
Sage Weil
8f8f65594b
Merge pull request #21250 from liewegas/wip-recovery-slow-ops
qa/suites/rados/singleton/all/recovery_preemption: whitelist SLOW_OPS
2018-04-06 10:59:23 -05:00
Sage Weil
b235a3f62a qa/suites/rados/singleton/all/ec-lost-unfound: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-06 10:38:45 -05:00
Joao Eduardo Luis
940dd941ef
Merge pull request #19331 from jecluis/wip-mon-osdmap-prune
mon: osdmap prune

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-04-06 15:22:28 +01:00
Joao Eduardo Luis
3997eed4db qa: enable mon osdmap pruning on 'rados/' suites
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2018-04-06 04:18:23 +01:00
Joao Eduardo Luis
65e7e4a38f qa: enable mon osdmap pruning on rados/monthrash
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2018-04-06 04:18:23 +01:00
Casey Bodley
35fd4050b7 qa/rgw: verify suite tests beast with ssl
instead of adding special frontend configs like civetweb_ssl.yaml and
beast_ssl.yaml, added a new proto/ subdirectory for http.yaml and
https.yaml

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-04-05 15:30:22 -04:00
Sage Weil
3b3c32f643 qa/suites/rados/singleton/all/recovery_preemption: whitelist SLOW_OPS
Recovery and peering can be slow enough with all the logging enabled to
trigger a slow ops warning.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 16:24:31 -05:00
vasukulkarni
806072a494
Merge pull request #21146 from zmc/wip-cephmetrics
qa/suites/cephmetrics: Updates for new version
2018-04-04 08:28:57 -07:00
Sage Weil
26f00dd67c qa/suites: mon warn on pool no app = false for api tests
Among other things, the list.cc tests set pg_num which waits for cluster
healthy.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:58 -05:00
Sage Weil
c2cce3bc88 qa/suites/rados/basic/tasks/rados_api_tests: debug ms = 1
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:58 -05:00
Sage Weil
29a885c915 qa/suites/rados/singleton/all/recovery_preemption: make test more reliable
A 30 second run did only 7000 ops, which means ~50 log entires per pg...
not enough to trigger backfill.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Sage Weil
c3589df320 qa/suites/rados/singleton/all/mon-seesaw: whitelist PG_AVAILABILITY
The seesaw might delay pg creation by more than 60s.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Patrick Donnelly
ebd0fcd067
Merge PR #16779 into master
* refs/pull/16779/head:
	mds: cleanup MDCache::open_snaprealms()
	mds: make sure snaptable version > 0
	mds: don't consider CEPH_INO_LOST_AND_FOUND as base inode
	mds: replace MAX() with std::max()
	tools/cephfs: make cephfs-data-scan create snaprealm for base inodes
	qa/cephfs: don't run TestSnapshots.test_kill_mdstable on kclient
	qa/cephfs: adjust check of 'cephfs-table-tool all show snap' output
	mds: don't warn unconnected snaplrealms in cluster log
	mds: update CInode/CDentry's first according to global snapshot seq
	qa/cephfs: add tests for snapclient cache
	qa/cephfs: add tests for snaptable transaction
	mds: add asok command that dumps cached snap infos
	qa/cephfs: add tests for multimds snapshot
	client: don't mark snap directory complete when its dirstat is empty
	qa/workunits/snaps: add snaprealm split test
	mds: make sure mds has uptodate mdsmap before checking 'allows_snaps'
	client: fix incorrect snaprealm when adding caps
	qa/workunits/snaps: add hardlink snapshot test
	mds: add incompat feature and bump protocol for snapshot changes
	mds: detach inode with single hardlink from global snaprealm
	mds: record hardlink snaps in inode's snaprealm
	mds: attach inode with multiple hardlinks to dummy global snaprealm
	mds: cleanup rename code
	mds: ensure xlocker has uptodate lock state
	mds: simplify SnapRealm::build_snap_{set,trace}
	mds: record global last_created/last_destroyed in snaptable
	mds: pop projected snaprealm before inode's parent changes
	mds: keep isnap lock in sync state
	mds: handle mksnap vs resolve_snapname race
	mds: cleanup snaprealm past parents open check
	mds: rollback snaprealms when rolling back slave request
	mds: send updated snaprealms along with slave requests
	mds: explict notification for snap update
	mds: send snap related messages centrally during mds recovery
	mds: synchronize snaptable caches when mds recovers
	mds: introduce MDCache::maybe_finish_slave_resolve()
	mds: notify all mds about prepared snaptable update
	mds: record snaps in old snaprealm when moving inode into new snaprealm
	mds: cache snaptable in snapclient
	mds: recover snaptable client when mds enters resolve state

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-04-03 15:41:19 -07:00