Jason Dillaman
2589f57ecd
Merge pull request #16656 from idryomov/wip-qa-newer-fio
...
qa/tasks/rbd_fio: bump default fio version to 2.21
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2017-08-01 10:14:46 -04:00
Patrick Donnelly
019f20ff98
Merge PR #16640 into master
...
* refs/remotes/upstream/pull/16640/head:
qa: fix wait for wrong health message
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-28 09:55:49 -07:00
Patrick Donnelly
6fc2ee383f
Merge PR #16413 into master
...
* refs/remotes/upstream/pull/16413/head:
qa/cephfs: lsof if umount fails
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-28 09:55:23 -07:00
Patrick Donnelly
ced01a2335
qa: fix wait for wrong health message
...
Fixes: http://tracker.ceph.com/issues/20805
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-27 14:40:05 -07:00
Sage Weil
41bcf2fee5
Merge pull request #16281 from badone/wip-PG-cluster-log-audit
...
osd: Log audit
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-27 16:25:30 -05:00
Ilya Dryomov
bd6e3e5f1f
qa/tasks/rbd_fio: bump default fio version to 2.21
...
I'm seeing sporadic single thread deadlocks on fio stat_mutex during krbd
thrash runs:
(gdb) info threads
Id Target Id Frame
* 1 Thread 0x7f89ee730740 (LWP 15604) 0x00007f89ed9f41bd in __lll_lock_wait () from /lib64/libpthread.so.0
(gdb) bt
#0 0x00007f89ed9f41bd in __lll_lock_wait () from /lib64/libpthread.so.0
#1 0x00007f89ed9f17b2 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
#2 0x00000000004429b9 in fio_mutex_down (mutex=0x7f89ee72d000) at mutex.c:170
#3 0x0000000000459704 in thread_main (data=<optimized out>) at backend.c:1639
#4 0x000000000045b013 in fork_main (offset=0, shmid=<optimized out>, sk_out=0x0) at backend.c:1778
#5 run_threads (sk_out=sk_out@entry=0x0) at backend.c:2195
#6 0x000000000045b47f in fio_backend (sk_out=sk_out@entry=0x0) at backend.c:2400
#7 0x000000000040cb0c in main (argc=2, argv=0x7fffad3e3888, envp=<optimized out>) at fio.c:63
(gdb) up 2
170 pthread_cond_wait(&mutex->cond, &mutex->lock);
(gdb) p mutex.lock.__data.__owner
$1 = 15604
Upgrading to 2.21 seems to make these go away.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-27 18:57:43 +02:00
Sage Weil
a40d94b163
qa/tasks/ceph: wait for pg stats to flush in healthy check
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:27 -04:00
Sage Weil
80978dea8a
qa/tasks/ceph_manager: wait_for_all_up -> wait_for_all_osds_up
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
Sage Weil
7648894e55
qa/tasks/ceph_manager: expose flush_all_pg_stats
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 12:10:26 -04:00
David Zafman
e92c953d7b
Merge pull request #16610 from dzafman/wip-fix-reg11184
...
test: reg11184 might not always find pg 2.0 prior to import
Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-26 11:42:15 -07:00
Sage Weil
59a3a4a40e
Merge pull request #16559 from hjwsm1989/dump-stuck
...
qa/tasks/dump_stuck: fix dump_stuck test bug
Reviewed-by: Sage Weil <sage@redhat.com>
2017-07-26 11:59:21 -05:00
David Zafman
7c43840399
test: reg11184 might not always find pg 2.0 prior to import
...
Signed-off-by: David Zafman <dzafman@redhat.com>
2017-07-26 09:46:15 -07:00
Sage Weil
56ffd7a727
Merge pull request #16571 from ceph/wip-cd-bluestore-2
...
qa/tasks/ceph-deploy: Fix bluestore options for ceph-deploy
Reviewed-by: Tamil Muthamizhan <tmuthami@redhat.com>
2017-07-26 11:43:50 -05:00
Brad Hubbard
f8acc53d82
osd: Log audit
...
Review current log messages for consistency, accuracy and necessesity as
part of usability initiative. First in a series.
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
2017-07-26 17:34:28 +10:00
Vasu Kulkarni
2fa0fae72f
Add option to specify bluestore/filestore options
...
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-07-25 12:26:03 -07:00
huangjun
daf8efee32
qa/tasks/dump_stuck: fix dump_stuck test bug
...
Test cluster with 2 osds, stop osd.0, if osd.1
report the pg stats during pg peering, mon will
record pg state to 'peering',then stop osd.1,
finally the pg state will stuck in 'stale+peering',
which is unexpected.
Let's wait_for_active() after stop osd.0.
Signed-off-by: huangjun <huangjun@xsky.com>
2017-07-25 11:14:07 +00:00
Sage Weil
2e5955212d
qa/tasks/workunit: allow alt basedir
...
Instead of 'qa/workunits' allow something like 'qa/standalone'.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-24 15:44:51 -04:00
Sage Weil
02c2e853d3
Merge pull request #16509 from liewegas/wip-rgw-wait
...
qa/suits/rados/basic/tasks/rgw_snaps: wait for pools to be created
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-24 11:55:54 -05:00
Sage Weil
29549e6834
Merge pull request #13723 from ovh/bp-forced-recovery
...
osd/PG: make prioritized recovery possible
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-24 09:01:03 -05:00
Sage Weil
ecd1193ab9
qa/suites/rados/basic/tasks/rgw_snaps: wait for pools to be be created
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-22 18:54:46 -04:00
Patrick Donnelly
9506789ce1
Merge PR 16379 into master
...
* refs/remotes/upstream/pull/16379/head:
qa: fix MDS_CLIENT_RECALL copy error
Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-21 13:23:07 -07:00
Patrick Donnelly
23e3d40751
Merge PR 16226 into master
...
* refs/remotes/upstream/pull/16226/head:
qa: wait for OSDMap to propagate for snap purge
Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-21 13:22:47 -07:00
Kefu Chai
4599eb7963
Merge pull request #16454 from liewegas/wip-fix-ceph-scrub
...
qa/tasks/ceph_manager: wait for osd to start after objectstore-tool sequence
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-21 19:31:19 +08:00
Sage Weil
59e3827be7
qa/tasks/reg11184: import run
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 17:42:59 -04:00
Sage Weil
27e8d75f61
Merge pull request #16429 from liewegas/wip-jewel-x
...
qa/suites/upgrade/jewel-x: misc fixes for new health checks
2017-07-20 10:47:05 -05:00
Sage Weil
3de9f22ce0
Merge pull request #16423 from liewegas/wip-ls
...
mon: '* list' -> '* ls'
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-20 10:43:34 -05:00
Sage Weil
583a38bca2
qa/tasks/ceph_manager: wait for osd to start after objectstore-tool sequence
...
Fixes: http://tracker.ceph.com/issues/20705
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-20 11:41:36 -04:00
Kefu Chai
a1d16185a2
qa/tasks/reg11184: use literal 'foo' instead pool_name
...
Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-07-20 21:35:41 +08:00
Ilya Dryomov
67db89f6c2
Merge pull request #16428 from idryomov/wip-krbd-luminous-thrash
...
qa: thrash tests for backoff and upmap
Reviewed-by: Vasu Kulkarni <vasu@redhat.com>
2017-07-20 11:28:22 +02:00
Piotr Dałek
b0134cc7a8
qa: add force/cancel recovery/backfill to QA testing
...
This randomly issues pg force-recovery/force-backfill and
pg cancel-force-recovery/cancel-force-backfill during QA
testing. Disabled for upgrades from hammer, jewel and kraken.
Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
2017-07-20 09:35:55 +02:00
Jason Dillaman
836ab7ad95
test: skip pool application metadata tests if OSDs not at min luminous
...
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-19 13:13:01 -04:00
Sage Weil
572a942f8f
mon: 'auth list' -> 'auth ls'
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-19 12:33:14 -04:00
Ilya Dryomov
682c5a42e1
qa/tasks/rbd_fio: dump fio options before starting
...
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-19 12:18:16 +02:00
Ilya Dryomov
03f69b3275
qa/tasks/rbd_fio: support libaio engine
...
Want to set iodepth and do direct AIO.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-19 12:18:16 +02:00
Yan, Zheng
b49d6d8ead
qa/cephfs: lsof if umount fails
...
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-07-19 15:32:37 +08:00
Patrick Donnelly
f8e0571982
qa: fix MDS_CLIENT_RECALL copy error
...
Fixes: http://tracker.ceph.com/issues/20682
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-18 16:06:20 -07:00
Patrick Donnelly
5b1a229fca
Merge PR 16200 into master
...
* refs/remotes/upstream/pull/16200/head:
qa: thrash max_mds and deactivate ranks
Reviewed-by: Zheng Yan <zyan@redhat.com>
2017-07-17 22:33:34 -07:00
Patrick Donnelly
b94d1dc385
Merge PR 16288 into master
...
* refs/remotes/upstream/pull/16288/head:
qa/cephfs: don't use int() to convert string of float point number
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-17 22:31:05 -07:00
Sage Weil
dfc9c36606
fix ceph.py
2017-07-17 16:27:13 -04:00
Sage Weil
c66da972df
qa/tasks/ceph.py: create osds in order
...
We aren't passing id to legacy 'osd create', which means we have to go
in order!
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-17 15:22:47 -04:00
Ilya Dryomov
0f75d79c34
qa/tasks/rbd_fio: use teuthology.packaging for handling packages
...
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2017-07-17 15:32:51 +02:00
Jason Dillaman
4fa1918717
qa/tasks: rbd-mirror daemon not properly run in foreground mode
...
Fixes: http://tracker.ceph.com/issues/20630
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-14 10:32:28 -04:00
Sage Weil
56e2965502
qa/tasks/ceph_manager: wait longer for pg stats to flush
...
An ill-timed mgr restart could blow the current 15s wait.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-13 12:13:45 -04:00
Yan, Zheng
e4844706b0
qa/cephfs: don't use int() to convert string of float point number
...
Fixes: http://tracker.ceph.com/issues/20582
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-07-13 15:55:22 +08:00
Sage Weil
fd39700d55
qa/tasks/ceph: wait for osds to come up before creating pool
...
Avoid health warnings.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
25717f7e84
qa/tasks/ceph_test_case.py: update health check helpers
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
93de19adcf
qa: whitelist health warnings
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
63f97ddcf6
qa/suites/rados: whitelist health warnings
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b25ddafa16
qa/tasks/mon_clock_skew_check: vastly simplify
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
83aaa75991
qa/tasks/ceph: stop logging health on shutdown
...
Don't log health during actual teardown or we'll see
various scary messages unrelated to our test run.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00