Commit Graph

61 Commits

Author SHA1 Message Date
Patrick Donnelly
b39f9d06dc
qa: fix symlinks indirectly pointing at qa to .qa
Building on the previous commit.

Command used:

$ find suites/ -type l -and -not -name .qa -execdir ~/fix.sh {} \;

fix.sh:
    #!/bin/bash

    link="$(readlink "$1")"

    echo $link
    dirlink="$(dirname "$link")"
    baselink="$(basename "$link")"

    while true; do
        echo $dirlink
        if [ "$dirlink" -ef ~/ceph/qa ]; then
            ln -nsf ".qa/$baselink" "$1"
            exit
        else
            baselink="$(basename "$dirlink")/$baselink"
            dirlink="$(dirname "$dirlink")"
            if [ "$dirlink" -ef . ]; then
                break
            fi
        fi
    done

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:48:38 -07:00
Patrick Donnelly
716db6e2fd
qa: add .qa helper link
This utilizes the recent feature in teuthology [1] to skip hidden files in
suites when building the job matrix.

Idea of this change is to enable referring to the top-level qa directory in a
position-independent way such that copies of a suite to another location do not
break any symlinks.

[1] https://github.com/ceph/teuthology/pull/1185

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:33:48 -07:00
David Zafman
918921ab2f test: Need to escape parens in log-whitelist for grep
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-05-21 09:47:59 -07:00
Yuri Weinstein
9f2c485942 tests/qa: adding rados/.. dirs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-05-11 14:03:15 -07:00
Kefu Chai
966c76330b qa: reduce "mon client hunt interval max multiple" to 2 for all clients
because with high failure rate, we need to connect to mon more
frequently if the connection fails.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-04-26 12:04:49 +08:00
Sage Weil
35c14a0162 qa/suites/rados/singleton/all/random-eio: whitelist eio error message
"cluster [ERR] 2.1 shard 1: soid 2:8007ad8d:::benchmark_data_smithi115_12935_object2439:head candidate had a read error"

is normal when we're injecting EIO.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-09 07:40:23 -05:00
Gregory Farnum
6d2e4c9b7b
Merge pull request #19973 from liewegas/wip-peering-fast-dispatch
osd: fast dispatch of peering events and pg_map + osd sharded wq refactor

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2018-04-06 11:48:11 -07:00
Sage Weil
b235a3f62a qa/suites/rados/singleton/all/ec-lost-unfound: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-06 10:38:45 -05:00
Sage Weil
3b3c32f643 qa/suites/rados/singleton/all/recovery_preemption: whitelist SLOW_OPS
Recovery and peering can be slow enough with all the logging enabled to
trigger a slow ops warning.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 16:24:31 -05:00
Sage Weil
29a885c915 qa/suites/rados/singleton/all/recovery_preemption: make test more reliable
A 30 second run did only 7000 ops, which means ~50 log entires per pg...
not enough to trigger backfill.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Sage Weil
c3589df320 qa/suites/rados/singleton/all/mon-seesaw: whitelist PG_AVAILABILITY
The seesaw might delay pg creation by more than 60s.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-04-04 08:26:57 -05:00
Sage Weil
c77e19c9f2 qa: test config CLI interface
Signed-off-by: Sage Weil <sage@redhat.com>
2018-03-06 14:44:49 -06:00
Nathan Cutler
b69530e647 tests: rados suite: drop rest-api test cases
Fixes: http://tracker.ceph.com/issues/21264
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-03-06 06:58:59 +01:00
Sage Weil
c2d28e2750
Merge pull request #18971 from liewegas/wip-pg-scrub-preempt
osd/PG: allow scrub preemption

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-01-18 12:37:48 -06:00
Sage Weil
5ac3bfa34c qa/suites/rados/singleton/all/divergent_priors*: unsquelch osd debug
Signed-off-by: Sage Weil <sage@redhat.com>
2018-01-16 21:52:09 -06:00
David Zafman
c77941f593 qa: Ignore degraded PGs when injecting random eio errors
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-01-14 18:17:23 -08:00
Sage Weil
f33ab7e03a Merge remote-tracking branch 'gh/mimic-dev1' 2017-12-20 15:08:30 -06:00
Kefu Chai
6b3d0f61f9 qa: decrease the msg_inject_socket_failures from 1/500 to 1/1000
Fixes: http://tracker.ceph.com/issues/22093
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-12-15 14:21:43 +08:00
John Spray
91655239fa
Merge pull request #19114 from tchaikov/wip-rm-request-slow
mgr/PGMap: drop REQUEST_{SLOW,STUCK} HEALTH_WARNs

Reviewed-by: John Spray <john.spray@redhat.com>
2017-12-13 11:46:34 +00:00
Sage Weil
0e81fa9f39 qa/suites/rados/singleton/all/threash-eio: whitelist OBJECT_*
Signed-off-by: Sage Weil <sage@redhat.com>
2017-12-04 21:16:44 -06:00
Kefu Chai
52bb6e8c6e
Merge pull request #19242 from liewegas/wip-21147
qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-12-02 14:14:29 +08:00
Sage Weil
431d1482ff qa/suites/rados/thrash: extend mgr beacon grace when many msgr failures injected
Fixes: http://tracker.ceph.com/issues/21147
Signed-off-by: Sage Weil <sage@redhat.com>
2017-11-29 10:29:52 -06:00
Kefu Chai
0e987665fe qa/suites/rados/singleton: more whitelist
* SLOW_OPS is normal in a cluster with flattering OSDs
* so is OBJECT_MISPLACED.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-29 13:53:44 +08:00
Kefu Chai
f5f2ced624 mgr/PGMap: drop REQUEST_{SLOW,STUCK} HEALTH_WARNs in mimic
SLOW_OPS unifies both of them since mimic

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 17:41:47 +08:00
Kefu Chai
4a1f2a5c78 qa: silence SLOW_OPS,PENDING_CREATING_PGS warnings
this is an intermediate step to deprecate REQUEST_SLOW warnings.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-11-23 13:59:42 +08:00
Kefu Chai
b4ffd03ca3
Merge pull request #18283 from tchaikov/wip-laggy-mgr
qa: reduce mon-client-hunt-interval-max-multiple to 2

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-10-31 19:03:13 +08:00
Kefu Chai
4c7df944c7 osd: add max-pg-per-osd limit
osd will refused to create new pgs, until its pg number is lower
than the max-pg-per-osd upper bound setting.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-17 23:08:40 +08:00
Kefu Chai
358f53cbc4 qa: reduce mon-client-hunt-interval-max-multiple to 2
monclient could experience frequent connection issues if "ms inject
socket failures" is enabled with high frequency. so reduce this number
to relieve the symptom.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-10-13 12:06:18 +08:00
Sage Weil
0a886bc9f3 qa/suites/rados/singleton/all/mon-seesaw: whitelist MON_DOWN
Mgr can get marked down when mon weirdness is happening.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-11 08:24:50 -05:00
Sage Weil
76d84ac194 qa/suites/rados/singleton/all/recover-preemption: handle slow starting osd
The OSD may not be marked up yet; set the config via the admin
socket.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-10-02 07:20:57 -05:00
Sage Weil
d7b29acb19 qa/suites/rados/singleton/all/recovery-preemption: add test
This mirrors what I was testing locally.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-09-28 13:48:14 -04:00
Sage Weil
39e5efbad2 qa/suites/rados/singleton/diverget_priors*: broaden whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-30 15:13:08 -04:00
Kefu Chai
1ff1f836da Merge pull request #16722 from tchaikov/wip-qa-fixes
qa/suites: escape the parenthesis of the whitelist text

Reviewed-by: Sage Weil <sage@redhat.com>
2017-08-02 13:00:01 +08:00
Kefu Chai
a70be4e00c qa/suites: more whitelisting
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-02 10:00:57 +08:00
Kefu Chai
d12c51ca91 qa/suites: escape the parenthesis of the whitelist text
so we can avoid the warnings like

grep: Unmatched ( or \(

because we pass the whitelisted string to `egrep -v "$1"` directly.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-01 21:54:44 +08:00
John Spray
ac2b9d63ca qa: include config help in admin socket test
Signed-off-by: John Spray <john.spray@redhat.com>
2017-08-01 13:38:40 +01:00
Sage Weil
c3c2b31c87 Merge pull request #16568 from liewegas/wip-application-warn
qa,doc: document and fix tests for pool application warnings
2017-07-28 09:00:46 -05:00
Sage Weil
326019a466 qa/suites/rados: whitelist various tests
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-25 22:29:07 -04:00
Sage Weil
2ef8614f67 qa/suites/rados/singleton/all/erasure-code-nonregression: fix typo
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-25 22:26:43 -04:00
Sage Weil
71ea171604 qa: move ceph-helpers and misc src/test/*.sh tests to qa/standalone
- stop running via make check
- add teuthology yamls to run them
- disable ceph_objecstore_tool.py for now (too slow for make check, and
we can't use vstart in teuthology via a package install)
- drop cephtool tests since those are already covered by other teuthology
tests
- leave a handful of (fast!) ceph-helpers tests for make check for minimal
integration tests.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-24 22:11:49 -04:00
John Spray
343e1a4281 qa: update whitelist for "wrongly marked me down"
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-24 14:54:46 +01:00
Sage Weil
cb084a55f6 Merge pull request #16453 from liewegas/wip-workloadgen
crush: enforce buckets-before-rules rule

Reviewed-by: Joao Eduardo Luis <joao@suse.de>
2017-07-21 11:01:22 -05:00
Joao Eduardo Luis
6f6fbe7870 qa: flush out monc's dropped msgs on msgr failure injection
We have a few open tickets regarding the mgr being down during suites
involving messenger failure injection. There are a few suspicions that
this may be related with the monclient, but we'll need more logs to
validate those suspicions and, more, to validate we're actually fixing
the issue.

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
2017-07-21 15:29:21 +01:00
Sage Weil
2e8413dede qa: remove workloadgen test
The CRUSH rule creation is busted (rules and buckets out of order), but
after I fix that it doesn't seem to run right anyway.  Remove it.
We get the mon thrasher coverage from rados/monthrash already; I don't
think this is adding meaningful coverage for the amount of effort it takes
to maintain.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 18:06:50 -04:00
Jason Dillaman
fa90be842e test: enable pool applications for new pools
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-19 13:13:01 -04:00
Sage Weil
4fcfb8ca9b qa/suites/rados/singleton/all/reg11184: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 18:39:24 -04:00
Sage Weil
93de19adcf qa: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
63f97ddcf6 qa/suites/rados: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
e30b32bca4 qa/suites/rados/singleton/all/mon-auth-caps: more osds so we can go clean
and scrub

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-07 17:39:22 -04:00
Kefu Chai
04e0ef541d Merge pull request #15754 from tchaikov/wip-test-auth-caps
qa/suites: add test exercising workunits/mon/auth_caps.sh

Reviewed By: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Joao Eduardo Luis <joao@suse.de>
2017-07-05 15:05:21 +08:00