Commit Graph

69 Commits

Author SHA1 Message Date
Kamoltat
5f33f2f6e0 mgr/test_progress.py: Delay recover in test_progress
Changes some the tests in teuthology to make
the test more deterministic.
Using:

`ceph osd set norecover` and
`ceph osd set nobackfill` when marking osds in
or out. As this will delay the recovery and make
sure it the test cases get the chance to check
that there is actually events poping up in
the progress module.

took out test_osd_cannot_recover from
tasks/mgr/test_progress.py since it is no longer
a relevant test case since recovery will get
triggered regardless if pg is unmoved.

Ignoring `OSDMAP_FLAGS` in teuthology
because we are using norecover and nobackfill
to delay the recovery process, therefore, it
will create a health warning and fails the
teuthology test.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2021-07-13 19:33:20 +00:00
Sage Weil
3edc04a46b qa/suites/rados/mgr: whitelist module crash during selftest
One of the selftests triggers an exception from serve().

Signed-off-by: Sage Weil <sage@newdream.net>
2021-06-25 13:48:45 -04:00
Sridhar Seshasayee
cc1fc98ea4 qa/suites/rados/mgr/tasks/progress: use high_recovery_ops for faster recovery
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2021-05-06 17:54:38 +00:00
Kefu Chai
39b2b5edc0 qa/tasks/mgr: skip test_diskprediction_local on python>=3.8
query the python version before trying to test diskprediction_local

Fixes: https://tracker.ceph.com/issues/50196
Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-04-07 21:27:44 +08:00
Jason Dillaman
103243725f qa/suites: move RADOS tests to use new debug log objectstores
This will retain the debug log settings for all RADOS suites
that were previously symlinked to the 'objectstore'
directory. The next commit will reduce the debug log level
for the original 'objectstore' directory for the remainder
of tests.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2021-03-03 14:47:59 -05:00
Greg Farnum
d02625331c Merge remote-tracking branch 'origin/master' into wip-stretch-mode 2020-09-14 02:32:19 +00:00
Sage Weil
2ee9365d0b qa: log-whitelist -> log-ignorelist
Signed-off-by: Sage Weil <sage@newdream.net>
2020-08-24 19:53:08 +00:00
Greg Farnum
39d71f7841 test: add a mon_election directory to the rados and upgrade suites
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2020-07-08 04:26:03 +00:00
Venky Shankar
60b8f7a46b Revert "Revert "qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted""
This reverts commit a7994a0fdd.

Failed attempt at solving the issue is in PR #33272. Until we
find a clean solution for this, whiltelisting the warning is
probably the best thing for now.

Fixes: http://tracker.ceph.com/issues/43943
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2020-06-11 00:09:31 -04:00
Kefu Chai
83c632099b mgr/telegraf: catch FileNotFoundError exception
in tasks/module_selftest.yaml, `TestModuleSelftest.test_telegraf()` is
called. but we fail to prepare a unix domain socket to which the telegraf
module can send stats. and telegraf module does not catch
FileNotFoundError exception, so the exception is populated to ceph-mgr
and is found by the test, hence the test is marked a failure whenever
telegraf is tested.

in this change,

* catch this exception, so it won't be caught by ceph-mgr
* whitelist the error message, so the test can pass

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-04-08 21:07:07 +08:00
Sage Weil
ddb8ab3dd3 qa/suites/cephadm: move orchestrator_cli test into rados/cephadm
Signed-off-by: Sage Weil <sage@redhat.com>
2020-03-02 10:52:24 -06:00
Sage Weil
74cf76e9e9 qa/suites/rados: move cephadm_orchestrator to el8
The python3-remoto dependency does not exist on 18.04 (or any ubuntu or
debian AFAICS).

Signed-off-by: Sage Weil <sage@redhat.com>
2019-12-23 14:24:58 -06:00
Sage Weil
cd1c05acbb mgr/ssh -> mgr/cephadm
Signed-off-by: Sage Weil <sage@redhat.com>
2019-12-11 19:14:24 -06:00
Patrick Donnelly
19a08227fb
Merge PR #30890 into master
* refs/pull/30890/head:
	mgr: invoke plugin shutdown on SIG{TERM,INT} signals.
	mgr/volumes: guard volume delete by waiting for pending ops
	mgr/volumes: cleanup libcephfs handles when stopping
	Revert "qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted"

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2019-11-08 10:43:46 -08:00
Sebastian Wagner
157fb06fac mgr/orchestrator: check for DEVICE_{IDENT|FAULT}_ON
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-11-05 13:02:29 +01:00
Venky Shankar
a7994a0fdd Revert "qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted"
This reverts commit 0060f1c5b8.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2019-10-24 03:34:44 -04:00
Nathan Cutler
493ee6d78f qa: enable dashboard tests to be run with "--suite rados/dashboard"
This moves dashboard.yaml from rados/mgr into a new, separate rados/dashboard
suite. The common elements it uses are moved from rados/mgr into qa/ and
replaced with symlinks.

Fixes: https://tracker.ceph.com/issues/41820
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2019-10-21 12:31:51 +02:00
Tatjana Dehler
a747efad29 mgr/dashboard: comment questionable suites out
We're currently facing some issues with our integration
tests. Because of that we agreed on commenting questionable
suites out to be able to run all other suites on open pull
requests.

'test_health' and 'test_perf_counters' are commented out
because they led to issues in relation to
https://tracker.ceph.com/issues/41538
As soon as the issue has been fixed, we need to re-add
these two suites again.

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2019-10-11 10:58:41 +02:00
Tatjana Dehler
9eb64d758f mgr/dashboard: sort QA suites alphabetically
Sort QA suites alphabetically to recognize a missing
suite faster.

Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2019-10-11 10:56:45 +02:00
Sage Weil
0c0093e450 Merge PR #29034 into master
* refs/pull/29034/head:
	doc/mgr/crash: document missing commands, options
	qa/suites/rados/singleton/all/test-crash: whitelist RECENT_CRASH
	qa/suites/rados/mgr/tasks/insights: whitelist RECENT_CRASH
	qa/tasks/mgr/test_insights: crash module now rejects bad crash reports
	mgr/telemetry: fix remote into crash do_ls()
	mgr/crash: don't make these methods static
	mgr/BaseMgrModule: handle unicode health detail strings
	mgr/crash: verify timestamp is valid
	qa/suites/mgr: whitelist RECENT_CRASH
	mgr/crash: remove unused var
	mgr/crash: remove unused import 'six'
	qa/workunits/rados/test_crash: health check
	mgr/crash: improve validation on post
	mgr/crash: automatically prune old crashes after a year
	mgr/crash: raise RECENT_CRASH warning for recent (new) crashes
	mgr/crash: add 'crash ls-new'
	mgr/crash: add option and serve infra
	mgr/crash: keep copy of crashes in memory
	mgr/pg_autoscaler: adjust style to match built-in tables
	mgr/crash: make 'crash ls' a nice table with a NEW column
	mgr/crash: nicely format 'crash info' output
	mgr/crash: add 'crash archive <id>', 'crash archive-all' commands

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-07-24 17:20:21 -05:00
Sage Weil
31b5c43806 Merge PR #29114 into master
* refs/pull/29114/head:
	mgr/dashboard: qa: whitelist client eviction warning

Reviewed-by: Laura Paduano <lpaduano@suse.com>
2019-07-23 09:09:15 -05:00
Sage Weil
209ce4ec68 qa/suites/rados/mgr/tasks/insights: whitelist RECENT_CRASH
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-22 13:42:06 -05:00
Sage Weil
0060f1c5b8 qa/suites/rados/mgr/tasks/module_selftest: whitelist mgr client getting backlisted
The mgr's libcephfs client gets evicted after the mgr fails over.
Whitelist the message.

Fixes: http://tracker.ceph.com/issues/40867
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-22 10:34:24 -05:00
Sage Weil
e2ff58ba16 qa/suites/mgr: whitelist RECENT_CRASH
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-19 14:45:47 -05:00
Ricardo Dias
658bd35b96
mgr/dashboard: qa: whitelist client eviction warning
This warning is caused by the recent changes to the volumes
module that cache the CephFS handles.
Commit 5c41e949af

Signed-off-by: Ricardo Dias <rdias@suse.com>
2019-07-18 15:44:04 +01:00
Lenz Grimmer
72ef9b58ec
Merge pull request #26510 from callithea/wip-add-ganesha-qa-tests
mgr/dashboard: Added NFS Ganesha suite to QA tests

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2019-02-22 10:28:41 +01:00
Sage Weil
d198831c44 Merge PR #26262 into master
* refs/pull/26262/head:
	mgr/orch/ssh: add qa test
	qa: install/exclude ceph-mgr-ssh package
	debian: package ssh orchestrator
	spec.ceph: package ssh orchestrator
	ssh/orch: add ssh orchestrator
2019-02-20 13:05:11 -06:00
Noah Watkins
06422208ec mgr/orch/ssh: add qa test
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-20 09:45:48 -08:00
Laura Paduano
bccf11f7ef mgr/dashboard: Added NFS Ganesha suite to QA tests
Signed-off-by: Laura Paduano <lpaduano@suse.com>
2019-02-20 10:27:33 +01:00
Patrick Nawracay
cfbefbf169 mgr/dashboard: Add support for managing RBD QoS
Fixes: http://tracker.ceph.com/issues/36191

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2019-02-19 11:08:59 +01:00
Volker Theile
b9177e43c0 mgr/dashboard: Add UI to configure the telemetry mgr plugin
Fixes: tracker.ceph.com/issues/36488

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-02-12 16:36:56 +01:00
Sebastian Wagner
933b2cfc28 mgr/orchestrator: Add test orchestrator
1. To be able to run the cli without an external orchestrator.
2. Run the CLI in Teuthology.

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2018-12-20 10:56:49 +01:00
Stephan Müller
19b039c28e mgr/dashboard/qa: Fix ECP creation test
The current solution fails on our CI-system as some outputs can have
more values and some parameters like 'w' can vary in different
environments.

As this was only tested before in a vstart cluster environment it
worked.

Through this commit only the given attributes we know to be there,
will be tested.

Fixes: https://tracker.ceph.com/issues/37275
Signed-off-by: Stephan Müller <smueller@suse.com>
2018-12-10 12:37:03 +01:00
Zack Cerza
50b7d42fe5 mgr/dashboard: Replace dashboard service
This splits out the collection of health and log data from the
/api/dashboard/health controller into /api/health/{full,minimal} and
/api/logs/all.

/health/full contains all the data (minus logs) that /dashboard/health
did, whereas /health/minimal contains only what is needed for the health
component to function. /logs/all contains exactly what the logs portion
of /dashboard/health did.

By using /health/minimal, on a vstart cluster we pull ~1.4KB of data
every 5s, where we used to pull ~6KB; those numbers would get larger
with larger clusters. Once we split out log data, that will drop to
~0.4KB.

Fixes: http://tracker.ceph.com/issues/36675

Signed-off-by: Zack Cerza <zack@redhat.com>
2018-11-27 16:08:53 -07:00
Tatjana Dehler
ef62c5eb4e mgr/dashboard/qa: add missing dashboard suites
Fixes: https://tracker.ceph.com/issues/37152
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2018-11-15 11:45:56 +01:00
Sage Weil
b833d35d9b qa/suites/rados/mgr/tasks/module_selftest: whitelist 'foo bar security'
Avoid failures like

"2018-10-16 20:36:00.437153 mgr.y (mgr.25609) 6 : cluster [SEC] foo bar security" in cluster log

Signed-off-by: Sage Weil <sage@redhat.com>
2018-10-17 07:09:15 -05:00
John Spray
0551d0e14b
Merge pull request #23497 from noahdesu/insights
mgr/insights: insights reporting module

Reviewed-by: John Spray <john.spray@redhat.com>
2018-09-11 15:56:10 +01:00
John Spray
46a74a07d9 qa: add task for progress module
Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
John Spray
09f5f9e1eb qa: add 4th OSD to mgr test cluster
This is useful for testing progress module.

Signed-off-by: John Spray <john.spray@redhat.com>
2018-09-11 11:21:35 +01:00
Noah Watkins
5c2615e3d7 qa/tasks/mgr: whitelist insights test health checks
these drive health history tracking tests.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-09-10 15:02:49 -07:00
Patrick Nawracay
e71466cc49 mgr/dashboard: Add support for managing individual OSD settings (backend)
Add options to mark OSDs in/out/down/reweight/lost/remove/destroy/create

Fixes: http://tracker.ceph.com/issues/24270

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-09-03 12:51:04 +02:00
Noah Watkins
a087e417d0 qa/tasks/mgr: whitelist insights warning
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2018-08-24 10:45:40 -07:00
Noah Watkins
3d96b2faaa qa/tasks/mgr: add tests for insights module
Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-08-16 15:29:46 -07:00
Dan Mick
85ab978567 qa/suites/rados/mgr: Add test_crash
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2018-06-29 14:52:17 -07:00
Patrick Donnelly
b39f9d06dc
qa: fix symlinks indirectly pointing at qa to .qa
Building on the previous commit.

Command used:

$ find suites/ -type l -and -not -name .qa -execdir ~/fix.sh {} \;

fix.sh:
    #!/bin/bash

    link="$(readlink "$1")"

    echo $link
    dirlink="$(dirname "$link")"
    baselink="$(basename "$link")"

    while true; do
        echo $dirlink
        if [ "$dirlink" -ef ~/ceph/qa ]; then
            ln -nsf ".qa/$baselink" "$1"
            exit
        else
            baselink="$(basename "$dirlink")/$baselink"
            dirlink="$(dirname "$dirlink")"
            if [ "$dirlink" -ef . ]; then
                break
            fi
        fi
    done

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:48:38 -07:00
Patrick Donnelly
716db6e2fd
qa: add .qa helper link
This utilizes the recent feature in teuthology [1] to skip hidden files in
suites when building the job matrix.

Idea of this change is to enable referring to the top-level qa directory in a
position-independent way such that copies of a suite to another location do not
break any symlinks.

[1] https://github.com/ceph/teuthology/pull/1185

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:33:48 -07:00
Patrick Nawracay
d7de1fa067 mgr/dashboard/backend: Enable get/set of cluster-wide OSD settings
Add ability to list, set and unset cluster-wide OSD flags.

Flags can be listed and changed through the `/api/osd/flags` API
resource. By using a GET request, the list is retrieved. By using a PUT
request, the flags are updated (all at once). Flags not contained in the
data of the PUT are removed, additional once are added. Note that the
PUT requests require a JSON body with the data contained as value of the
'flags' key like so:

    {"flags": ["flag1", "flag2", ...]}

Fixes: http://tracker.ceph.com/issues/24056

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2018-06-15 10:26:29 +02:00
Yuri Weinstein
9f2c485942 tests/qa: adding rados/.. dirs
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-05-11 14:03:15 -07:00
Ricardo Dias
7309be145b
qa/suites/rados/mgr/tasks/dashboard: add MDS_(ALL_DOWN|UP_LESS_THAN_MAX) to whitelist
Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-04-21 15:58:23 +01:00
Ricardo Dias
7b774f1bb0
qa/suites/rados/mgr/tasks/dashboard: add fail_on_skip = false
This commit prevents dashboard API tests to be tagged as failed when
some test is decorated with @skip.

Signed-off-by: Ricardo Dias <rdias@suse.com>
2018-03-16 08:40:42 +00:00