Commit Graph

7631 Commits

Author SHA1 Message Date
Kefu Chai
d7f1674b4f
Merge pull request #38138 from batrick/centos-8.2
qa: update centos release to 8.2

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2020-11-27 14:18:29 +08:00
Kefu Chai
09f1e7ffff
Merge pull request #38241 from Rethan/update_url_from_master_to_lastest
doc/dev: use http://docs.ceph.com/en/latest/ instead of /docs/master/ for docs

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
2020-11-27 14:00:12 +08:00
haoyixing
0e7e036aa7 doc/dev: use http://docs.ceph.com/en/latest/ instead of /docs/master/ for docs
Several links under http://docs.ceph.com/docs/master/ were unable to access.
Change them to http://docs.ceph.com/en/lastest so we can access them directly.

Signed-off-by: haoyixing <haoyixing@kuaishou.com>
2020-11-24 12:49:47 +08:00
Ilya Dryomov
0e4bc27722 qa/tasks/ceph: resurrect log compression
Commit 9536625558 ("qa/tasks/ceph: use Cluster.sh() and Remote.sh()
when appropriate") dropped run.wait(), which waits for all given
processes to exit.  This resulted in errors like

  INFO:teuthology.orchestra.run.smithi107.stderr:tar: ./objectstore_tool..log: file changed as we read it
  INFO:teuthology.orchestra.run.smithi107.stderr:tar: ./ceph-client.admin.175125.log: File removed before we read it

as the task moved on to archiving semi-corrupted and uncompressed logs,
filling up the lab cluster.

Revert that hunk, as Cluster.sh() is useless here -- we don't need
stdout or stderr, but very much need parallel execution and wait for
the compression to finish.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2020-11-24 00:51:54 +01:00
Ernesto Puerta
e238c6f6e7
Merge pull request #37740 from s0nea/wip-dashboard-ceph-input-validation
mgr/dashboard: validate username while creation

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-11-20 17:45:44 +01:00
David Zafman
89af82bf4f
Merge pull request #38054 from dzafman/wip-test-fixes
test: Fix osd-scrub-test.sh and ver-health.sh tests

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-18 08:52:28 -08:00
rakeshgm
672c4844ef
Merge pull request #38132 from ceph/rhel_8_distros
qa/distros: add RHEL 8.3 and use RHEL 8.3 as default
2020-11-18 21:39:25 +05:30
Patrick Donnelly
cfabba6668
Merge PR #34552 into master
* refs/pull/34552/head:
	tests: add snap schedule tests
	snap-schedule: reject invalid retention specs
	mgr/snap_schedule: log snapshot name on creation
	pybind / snap_schedule: sanely dump schedules and retention specs
	pybind/snap_schedule: config option to allow minute granularity snaps
	pybind / snap_schedule: remove 'SNAP_SCHED_TESTING' check

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-18 07:54:15 -08:00
Neha Ojha
1523bf9bdb
Merge pull request #38107 from ceph/wip-mgr-progress-fix-48217
qa/mgr/test_progress: add _get_osd_in_out_events to account for osd marked in/out events

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-18 07:15:22 -08:00
Kefu Chai
739550f365
Merge pull request #38004 from tchaikov/wip-48058
qa/tasks/ceph: update_archive_setting() only if ctx.archive is valid

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-18 09:58:47 +08:00
Josh Durgin
c2b9152eac
Merge pull request #37488 from kamoltat/wip-mgr-progress-turn-off-option
mgr/progress: progress module on/off option

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2020-11-17 13:37:30 -08:00
Patrick Donnelly
e5170de73f
qa: update centos release to 8.2
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-17 08:18:18 -08:00
Patrick Donnelly
f6639c80ed
Merge PR #35749 into master
* refs/pull/35749/head:
	Update Release notes for multimds scrub
	qa/cephfs: log-ignorelist scrub errors
	qa/cephfs: Add more tests for multimds scrub
	qa/cephfs: add tests for multimds scrub
	qa/cephfs: update existing scrub test cases
	mds: don't skip validating disk state of symlink
	mds: abort/pause/resume scrubs in multiple mds
	mds: track scrub status in multiple mds
	mds: remove on_finish from {CInode,CDir}::scrub_info_t
	Continuation: don't delete self while there are in-processing stages
	mds: auth pin CInode when validating its disk state
	mds: rdlock file/nest lock when accumulating stats of subtree dirfrags
	mds: multiple mds scrub support
	include/frag: add encode/decode functions for fragset_t
	mds: remove object can't be scrubbed immediately from scrub stack
	mds: prevent dirfrag scrub/fragment from running at the same time
	mds: change scrub traverse from post-order to breadth-first search
	mds: make both CInode and CDir as entities of scrub
	mds: remove ScrubStack::scrubstack

Reviewed-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-17 07:33:20 -08:00
Venky Shankar
a8c8b3ade2 tests: add snap schedule tests
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2020-11-17 08:39:09 -05:00
rakeshgm
5eabe906e6 qa/distros: add RHEL 8.3 and use RHEL 8.3 as default
Signed-off-by: rakeshgm <rakeshgm@redhat.com>
2020-11-17 18:44:16 +05:30
Kamoltat
52fe9dbdae qa/mgr/test_progress: fix bug 48217
Fixes a failing test case regarding osd coming back
after being marked out. The old test case wasn't accounting
for a specific event, therefore this resulted in the failure.
The fix basically accounts for a specific event of osd being
marked in/out.

Fixes: https://tracker.ceph.com/issues/48217

Signed-off-by: Kamoltat <ksirivad@redhat.com>
2020-11-17 07:54:35 +00:00
David Zafman
38c3130654 test: Fix TEST_scrub_extended_sleep test (corrected test name)
Didn't really test extended sleep in original code:
Cause by: 3bfb5c2621

Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-16 18:30:14 -08:00
David Zafman
0a0ed890c2 test: Improve version checking test, to improve reliability
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-16 18:30:14 -08:00
Jason Dillaman
8d4a8735c3
Merge pull request #37939 from trociny/wip-rbd-nbd-wait-for-terminate
rbd-nbd: fixes and improvements for unmap/detach wait for process terminate

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2020-11-16 17:10:17 -05:00
Casey Bodley
c10a7240b6
Merge pull request #38105 from ofriedma/wip-barbican-qa-issue
qa/rgw: fix "cannot create secret" on barbican test

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2020-11-16 09:53:08 -05:00
Tatjana Dehler
fbf1c37220 mgr/dashboard: validate username while creation
When creating a user the username is not checked if
it's valid from the Ceph perspective (`CephString`).
The commit adds a decorator to check if the input
values in the API are valid from the Ceph perspective
by calling the `valid()` method of the Ceph-defined
datatypes (`ceph_argparse.py`).

Fixes: https://tracker.ceph.com/issues/46548
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2020-11-16 12:12:25 +01:00
Or Friedmann
7951391ce2 qa/rgw: fix "cannot create secret" on barbican test
fix "cannot create secret" on barbican test

Signed-off-by: Or Friedmann <ofriedma@redhat.com>

Fixes: https://tracker.ceph.com/issues/47799
2020-11-16 11:53:54 +02:00
Sridhar Seshasayee
b637909a91
Merge pull request #38007 from sseshasa/wip-qa-rados-perf-scheduler
qa/suite/rados: Introduce "scheduler" directory within rados perf suite.

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
2020-11-16 12:56:09 +05:30
Kamoltat
993bb02b30 mgr/progress: introduce turn off/on feature
progress module can be turned off/on by using
the commands: 'progress off' and 'progress on'

As well as refractoring teuthology test suite
to prevent future bugs that can possibly occur

fixes: https://tracker.ceph.com/issues/47238

Signed-off-by: kamoltat <ksirivad@redhat.com>
2020-11-16 03:46:42 +00:00
Yan, Zheng
c42570a329 qa/cephfs: log-ignorelist scrub errors
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:10:03 +08:00
Sidharth Anupkrishnan
ba9b85c481 qa/cephfs: Add more tests for multimds scrub
Signed-off-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
2020-11-16 09:02:18 +08:00
Yan, Zheng
904c959df4 qa/cephfs: add tests for multimds scrub
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:02:18 +08:00
Yan, Zheng
31deca1fec qa/cephfs: update existing scrub test cases
Now scrub is always async.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2020-11-16 09:02:18 +08:00
Patrick Donnelly
07cba31a03
Merge PR #34842 into master
* refs/pull/34842/head:
	qa/tasks/vstart_runner.py: disable the ns unsharing if using localhost

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-11-15 12:39:11 -08:00
Patrick Donnelly
3541c8de12
Merge PR #37982 into master
* refs/pull/37982/head:
	qa/cephfs: add code for when config is None in __init__

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
2020-11-15 12:37:02 -08:00
Patrick Donnelly
359c8b0ddc
Merge PR #38016 into master
* refs/pull/38016/head:
	qa: switch to 'osdop_read' instead of 'op_r'

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2020-11-15 12:35:58 -08:00
Kefu Chai
f47f13f446
Merge pull request #38038 from bk201/wip-48209
mgr/dashboard: fix cephadm e2e test failure on deleting OSDs

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
2020-11-16 01:29:46 +08:00
Kefu Chai
0463a774c9
Merge pull request #37908 from dzafman/wip-47930
test: Fix race in TEST_recovery_scrub test

Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-16 01:00:56 +08:00
Kefu Chai
f7962b1767
Merge pull request #37992 from rishabh-d-dave/dashboard-qa-skipTest
qa/mgr: mgr_test_case: raise SkipTest instead of calling skipTest()

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
2020-11-16 00:30:42 +08:00
Kefu Chai
9536625558 qa/tasks/ceph: use Cluster.sh() and Remote.sh() when appropriate
for better readability

Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-11-14 15:22:10 +08:00
Kefu Chai
de71f6b0a3 qa/tasks/ceph: update_archive_setting() only if ctx.archive is valid
When running teuthology interactively, ctx.archive might not be set.
If it's not set, there is no point trying to access files there.

Fixes: https://tracker.ceph.com/issues/48058

Signed-off-by: Marcus Watts <mwatts@redhat.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2020-11-14 15:22:00 +08:00
Neha Ojha
4a6f53afe6
Merge pull request #37902 from pdvian/wip-clog-health-detail
mon: Log "ceph health detail" periodically in cluster log

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2020-11-12 15:02:52 -08:00
kyr
6f75704a2e
Merge pull request #38029 from kshtsk/wip-cephadm-bootstrap-remote
qa/task/cephadm: run cephadm only on bootstrap_remote
2020-11-12 14:19:47 +01:00
Kiefer Chang
7df12da184
mgr/dashboard: fix cephadm e2e test failure on deleting OSDs
- Adapt the OSD ID column change in this PR:
  https://github.com/ceph/ceph/pull/37506
- Turn off Cypress retries because it makes the failure log misleading.

Fixes: https://tracker.ceph.com/issues/48209
Signed-off-by: Kiefer Chang <kiefer.chang@suse.com>
2020-11-12 17:20:48 +08:00
Kefu Chai
4dc60e9ece
Merge pull request #37922 from mgfritch/project-kubic
cephadm: install podman from the Kubic project

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2020-11-12 10:40:24 +08:00
Kefu Chai
dfe9fce746
Merge pull request #35865 from tchaikov/wip-crimson-recovery-qa
qa/suites: add crimson-rados suite

Reviewed-by: Samuel Just <sjust@redhat.com>
2020-11-12 09:52:33 +08:00
Xiubo Li
871ac6b376 qa/tasks/vstart_runner.py: disable the ns unsharing if using localhost
If the -l option is specified in :

$ MDS=3 MON=3 MGR=1 OSD=3 ../src/vstart.sh -n -l

The ceph cluster will use the localhost IP address as the mon host,
and then in the network namespace container the ceph-fuse daemon
couldn't connect to it.

Fixes: https://tracker.ceph.com/issues/45342
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-11-12 08:54:57 +08:00
Prashant D
f45712c190 mon: Log "ceph health detail" periodically in cluster log
change mon_health_to_clog_interval from 1_hr -> 10_min to
log health summary or detail more frequently.

Fixes: https://tracker.ceph.com/issues/48042

Signed-off-by: Prashant Dhange <pdhange@redhat.com>
2020-11-11 19:08:51 -05:00
David Zafman
870bde04a5 test: Changes based on code review comments
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:31:26 -08:00
David Zafman
93373746f5 osd test: Delay reporting until mon_warn_older_version_delay has passed
Move release notes description to 16.0.0 and update
Update documentation

Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:10:11 -08:00
David Zafman
9d988c3dbc test: Simple test case for version health warning
Signed-off-by: David Zafman <dzafman@redhat.com>
2020-11-11 15:10:11 -08:00
Kyr Shatskyy
f75b00984e qa/task/cephadm: run cephadm only on bootstrap_remote
Because not all the node in teuthology config might
have any cephadm related role we should not execute
cephadm on random node.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
2020-11-11 23:17:59 +01:00
Patrick Donnelly
d2769ff180
Merge PR #37978 into master
* refs/pull/37978/head:
	qa: fix broken links in multimds suite

Reviewed-by: Jeff Layton <jlayton@redhat.com>
2020-11-11 11:07:27 -08:00
Xiubo Li
4f1817aa8f qa: switch to 'osdop_read' instead of 'op_r'
The 'op_r' will just acount CEPH_OSD_FLAG_READ flag, which will
include some other none real data read opcodes, like the CEPH_OSD_OP_STAT.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2020-11-11 10:13:51 +08:00
David Zafman
9929588ce3
Merge pull request #37775 from dzafman/wip-upgrade-mimic-fix
test: Remove mimic-x* upgrade testing in master

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2020-11-10 16:27:43 -08:00