100 Commits

Author SHA1 Message Date
Ronen Friedman
d232c4e8d8 qa/suites/rados: add simultaneous scrubs (multiple options) to the thrasher
Setting osd-max-scrubs to either 2 or 3.

Triggered by https://tracker.ceph.com/issues/50346

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2021-06-24 18:53:50 +03:00
Josh Durgin
0e273e6760
Merge pull request #40593 from ideepika/wip-new-testing-params
qa/config/rados: add dispatch delay testing params

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Sam Just <sjust@redhat.com>
2021-04-28 11:26:58 -07:00
Deepika Upadhyay
b2c2a4326c qa/config/rados: add dispatch delay testing params
These parameters have caught previously undetected bugs such as
https://tracker.ceph.com/issues/48417; adopting them will help
prevent more such hard-to-debug bugs.

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
2021-04-19 12:28:18 +05:30
Sage Weil
babbab14f4 qa/suites/rados/thrash: shorten radosbench
This is the longest of the thrash workloads; reducing it will bring
this test in line with the others (<= 45 min).

Signed-off-by: Sage Weil <sage@newdream.net>
2021-04-13 17:42:21 -05:00
Kefu Chai
caf364db5e
Merge pull request #38120 from kiizawa/wip-cls-remote-read
osd: allow remote read by calling cls method from within cls context

Reviewed-by: Samuel Just <sjust@redhat.com>
2021-04-12 16:42:52 +08:00
Ken Iizawa
7d0ea1b861 qa: allow all the tasks that run rados/test.sh to load object classes for testing remote-reads functionality
Fixes: https://tracker.ceph.com/issues/48182
Signed-off-by: Ken Iizawa <iizawa.ken@fujitsu.com>
2021-04-06 19:57:14 +09:00
myoungwon oh
b5f5649ecd qa: fix typo to call rollback op
Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
2021-04-01 18:44:37 +09:00
myoungwon oh
dfce69e995 qa: add dedup-io-snaps test
Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
2021-03-29 17:10:25 +09:00
myoungwon oh
9e8601ddf5 qa: add dedup-io-mixed test
Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
2021-03-29 17:09:49 +09:00
Kefu Chai
181dc1a43f
Merge pull request #39757 from aclamk/wip-qa-test-bluestore-reshard
qa: Add bluestore resharding test

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-03-17 22:41:34 +08:00
Adam Kupczyk
a84820b743 qa: Add bluestore resharding test
Signed-off-by: Adam Kupczyk <akupczyk@redhat.com>
2021-03-10 10:21:09 +01:00
Jason Dillaman
103243725f qa/suites: move RADOS tests to use new debug log objectstores
This will retain the debug log settings for all RADOS suites
that were previously symlinked to the 'objectstore'
directory. The next commit will reduce the debug log level
for the original 'objectstore' directory for the remainder
of tests.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2021-03-03 14:47:59 -05:00
Greg Farnum
9506d09e3b Merge remote-tracking branch 'origin/master' into wip-stretch-mode
Conflicts:
	src/include/ceph_features.h

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2020-09-15 02:25:07 +00:00
Greg Farnum
d02625331c Merge remote-tracking branch 'origin/master' into wip-stretch-mode
2020-09-14 02:32:19 +00:00
Samuel Just
4a73fd2251
Merge pull request #35899 from myoungwon/wip-fix-manifest-tier
osd, test: refactoring manifest-tier

Reviewed-by: Samuel Just <sjust@redhat.com>
2020-09-01 12:49:57 -07:00
myoungwon oh
7217a6c00e osd, test: remove all things related to manifest_flush
Remove manifest_flush() because of a plan to rework it on top of
CDC-based flush.

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsumg.com>
2020-08-26 21:05:14 +09:00
Neha Ojha
c0a358670a qa/suites/rados/thrash: log-whitelist -> log-ignorelist
Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-08-24 19:53:08 +00:00
Sage Weil
2ee9365d0b qa: log-whitelist -> log-ignorelist
Signed-off-by: Sage Weil <sage@newdream.net>
2020-08-24 19:53:08 +00:00
Neha Ojha
f9399b508c qa/suites/rados/thrash/crc-failures: randomly inject bad incremental osdmap crc
Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-07-28 23:15:42 +02:00
Greg Farnum
39d71f7841 test: add a mon_election directory to the rados and upgrade suites
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2020-07-08 04:26:03 +00:00
Josh Durgin
beaa4b04bc
Merge pull request #34880 from neha-ojha/wip-balancer-cleanup
qa/suites/rados/thrash*: remove upmap.yaml and off.yaml

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2020-05-01 13:15:58 -07:00
Neha Ojha
d46ea00320 qa/suites/rados/thrash*: add on.yaml
To test with balancer in upmap mode, which is the default.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-05-01 18:49:36 +00:00
Neha Ojha
cb5611e0d5 qa/suites/rados/thrash*: remove upmap.yaml and off.yaml
The balancer is always on, so we don't need upmap.yaml.
off.yaml doesn't make sense anymore.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-05-01 18:31:01 +00:00
Neha Ojha
4a6361d93e qa/suites/rados/thrash/workloads: use default mon osd initial require min compat client
Signed-off-by: Neha Ojha <nojha@redhat.com>
2020-04-24 18:29:06 +00:00
Samuel Just
ceb8e457e7 suites/rados/thrash: add rados-high-concurrency
Signed-off-by: Samuel Just <sjust@redhat.com>
2020-03-19 08:15:02 +08:00
Sage Weil
e04a08069f qa/suites/rados/thrash: debug monc
Hunting https://tracker.ceph.com/issues/43825

Signed-off-by: Sage Weil <sage@redhat.com>
2020-01-26 11:13:20 -06:00
Sage Weil
2ba0ff7117 qa/suites/rados/thrash[-erasure-code]: add misc -{localized,balanced}.yaml jobs
Signed-off-by: Sage Weil <sage@redhat.com>
2019-12-19 17:35:36 -08:00
Sage Weil
71d74aa8c6 qa: more tries for mon tell when injecting msgr failures
With failure injection, the default of 2 tries isn't quite enough.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-10-11 14:16:42 -05:00
David Zafman
fdf93add0b
Merge pull request #30714 from dzafman/wip-41743
test: Ignore OSD_SLOW_PING_TIME* if injecting socket failures

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-10-04 18:28:48 -07:00
Neha Ojha
375217a3d7 qa/suites/rados: add more-async-partial-recovery.yaml in the thrash suite
This will increase both async and partial recovery.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-10-03 11:15:51 -07:00
Neha Ojha
e953719f2c qa/suites/rados: add more-partial-recovery.yaml in the thrash suite
Increasing osd_object_clean_region_max_num_intervals to track more
clean regions, resulting in more partial recovery.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-10-03 11:15:51 -07:00
Neha Ojha
a2eb45507a qa/suites/rados: add more-async-recovery.yaml in the thrash suite
Reducing osd_async_recovery_min_cost to 1 should allow more
asynchronous recovery to happen.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-10-03 11:15:51 -07:00
David Zafman
ded58ef91d test: Ignore OSD_SLOW_PING_TIME* if injecting socket failures
Fixes: https://tracker.ceph.com/issues/41743

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-10-03 09:09:10 -07:00
Kefu Chai
10af76ab1b
Merge pull request #28778 from myoungwon/wip-chunk-flush-op
src/osd: add tier-flush op

Reviewed-by: Sage Weil <sage@redhat.com>
2019-07-24 09:56:47 +08:00
myoungwon oh
f1acdf3f2e src/test, qa/suite: add tier_flush test
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2019-07-12 22:23:22 +09:00
myoungwon oh
c1b69f48d2 src/test, qa/suites/rados/thrash: add dedup test
Signed-off-by: Myoungwon Oh <omwmw@sk.com>
2019-07-11 20:28:55 +09:00
Sage Weil
afd0b508c2 qa/suites/rados/thrash: force normal pg log length with cache tiering
When we are doing cache tiering, we are more sensitive to short PG logs
because the dup op entries are not perfectly promoted from the base to
the cache.

See:
 http://tracker.ceph.com/issues/38358
 http://tracker.ceph.com/issues/24320

This works around the problem by not testing short pg logs in combination
with cache tiering.  This works because the short_pg_log.yaml fragment
sets the short log in the [global] section but the cache workloads override
it (back to a large/default value) in the [osd] section.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-19 11:15:25 -05:00
Josh Durgin
d45f18119b qa/suites: remove mon kv backend options
rocksdb is the default and leveldb is not recommended at this point, so drop it.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2019-02-08 16:58:44 -05:00
Sage Weil
ee59743a1a qa/suites/rados/workloads/rados_api_tests.yaml: debug mgrc = 20 on mon
Seeing some hangs when the mon is forwarding mgr commands (pg deep-scrub)
to the mgr.  This is a buggy test (it should send it to the mgr directly)
but it is helpful to verify the mon forwarding behavior works.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-07 12:10:34 -06:00
Neha Ojha
4ef94e89c8 qa/suites/rados/thrash: change crush_tunables to jewel in rados_api_tests
Fixes: http://tracker.ceph.com/issues/38042
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-24 16:54:29 -08:00
xie xingguo
c7356c66b0 mgr/balancer: blame if upmap won't actually work
With automatic balancing on, and if mode is set to upmap,
balancer will fail silently if min_compat_client is lower than
luminous.
You can't figure that out unless you take a closer look at the
mgr log, which is super annoying.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-01-16 09:19:54 +08:00
Sage Weil
d518eb6cac qa/msgr: move msgr facet into generic re-usable dir
Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-03 11:17:38 -06:00
David Zafman
02964703de
Merge pull request #24749 from dzafman/wip-36474
Add support for osd_delete_sleep configuration value

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-11-07 14:05:55 -08:00
David Zafman
3f621a1190 test: Set any value for osd_delete_sleep to guarantee we are testing it even on SSD
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-10-30 11:42:05 -07:00
Sage Weil
86ae8fb6b8 qa/suites/rados/thrash*/thrashers/careful.yaml: thrash with mgr controller
Thrash such that we still exercise the careful throttling in the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-20 15:21:58 -05:00
Sage Weil
44de03d5e6 qa/suites: test pg merging
Signed-off-by: Sage Weil <sage@redhat.com>
2018-09-07 12:09:05 -05:00
Patrick Donnelly
b39f9d06dc
qa: fix symlinks indirectly pointing at qa to .qa
Building on the previous commit.

Command used:

$ find suites/ -type l -and -not -name .qa -execdir ~/fix.sh {} \;

fix.sh:
    #!/bin/bash

    link="$(readlink "$1")"

    echo $link
    dirlink="$(dirname "$link")"
    baselink="$(basename "$link")"

    while true; do
        echo $dirlink
        if [ "$dirlink" -ef ~/ceph/qa ]; then
            ln -nsf ".qa/$baselink" "$1"
            exit
        else
            baselink="$(basename "$dirlink")/$baselink"
            dirlink="$(dirname "$dirlink")"
            if [ "$dirlink" -ef . ]; then
                break
            fi
        fi
    done

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:48:38 -07:00
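
The fix.sh script in the commit above relies on the shell's `-ef` test, which is true when two paths resolve to the same file (same device and inode). The following is a minimal sketch of that comparison in a scratch directory; the directory names `qa` and `alias` are hypothetical and stand in for the real checkout, and nothing here is taken from the commit itself.

```shell
#!/bin/bash
set -euo pipefail

# Scratch directory; "qa" and "alias" are illustrative names only.
tmp="$(mktemp -d)"
mkdir "$tmp/qa"
ln -s qa "$tmp/alias"

# -ef follows symlinks, so a link and its target compare as the same file.
if [ "$tmp/alias" -ef "$tmp/qa" ]; then same=yes; else same=no; fi

# The parent of the resolved link is the scratch directory, not qa.
if [ "$tmp/alias/.." -ef "$tmp/qa" ]; then parent_same=yes; else parent_same=no; fi

echo "alias vs qa: $same"
echo "alias/.. vs qa: $parent_same"
```

This is why the loop in fix.sh can walk `dirname` upward and stop as soon as the directory part of the link refers to the qa directory, however the original link spelled the path.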
Patrick Donnelly
716db6e2fd
qa: add .qa helper link
This utilizes the recent feature in teuthology [1] to skip hidden files in
suites when building the job matrix.

The idea of this change is to enable referring to the top-level qa directory in a
position-independent way such that copies of a suite to another location do not
break any symlinks.

[1] https://github.com/ceph/teuthology/pull/1185

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:33:48 -07:00
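
The position-independence described in the commit above can be illustrated with a small scratch tree. This is a sketch under assumptions: it supposes that the top-level `.qa` link points at the qa directory itself and that every directory below carries a `.qa` link to its parent's `.qa`. The directory and file names (`objectstore`, `bluestore.yaml`, `thrash-copy`) are hypothetical and chosen only for the demonstration.

```shell
#!/bin/bash
set -euo pipefail

# Scratch tree standing in for a qa checkout (all paths are hypothetical).
root="$(mktemp -d)"
mkdir -p "$root/qa/objectstore" "$root/qa/suites/rados/thrash"
echo "bluestore" > "$root/qa/objectstore/bluestore.yaml"

# Assumed layout: the top-level helper points at qa itself,
# and each level below chains up to its parent's helper.
ln -s . "$root/qa/.qa"
ln -s ../.qa "$root/qa/suites/.qa"
ln -s ../.qa "$root/qa/suites/rados/.qa"
ln -s ../.qa "$root/qa/suites/rados/thrash/.qa"

# A suite link written relative to the local .qa helper.
ln -s .qa/objectstore "$root/qa/suites/rados/thrash/objectstore"

# Copy the suite to a different depth; symlinks are copied as links.
cp -a "$root/qa/suites/rados/thrash" "$root/qa/suites/thrash-copy"

orig_content="$(cat "$root/qa/suites/rados/thrash/objectstore/bluestore.yaml")"
copy_content="$(cat "$root/qa/suites/thrash-copy/objectstore/bluestore.yaml")"
copy_link="$(readlink "$root/qa/suites/thrash-copy/objectstore")"
echo "original: $orig_content, copy: $copy_content, link text: $copy_link"
```

A link written as `../../../objectstore` would resolve correctly only at the original depth; the `.qa`-relative link keeps the same text in both locations and still reaches the top-level directory.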
Yuri Weinstein
9f2c485942 tests/qa: adding rados/.. dirs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-05-11 14:03:15 -07:00
Sage Weil
27e91a99f5
Merge pull request #21273 from jdurgin/wip-23195
osd/ECBackend: only check required shards when finishing recovery reads

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-04-24 17:20:25 -05:00