Commit Graph

242 Commits

Author SHA1 Message Date
Sage Weil
e7006d06fb qa/tasks/ceph: explicitly add osds to crush map for upgrades
Before kraken, ceph-osd didn't add itself to crush... ceph-osd-prestart.sh
did it.  And ceph.py doesn't use that.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-27 12:52:35 -04:00
Sage Weil
2c8904506a qa/tasks/ceph: create osds before starting
This ensure they are in the OSDMap before the CRUSH
map, which avoids the OSD_ORPHAN health warning.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-22 13:45:27 -04:00
Sage Weil
0eed85515a qa/tasks/ceph: do not pregenerate osdmap
This is useless, and further leads to a situation
where we have a populated CRUSH map before any OSDs,
triggering the OSD_ORPHAN error.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-22 13:45:26 -04:00
Sage Weil
04969eff23 qa/tasks/resolve_stuck_peering: start osd at end
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 14:28:28 -04:00
Sage Weil
cc902a1f6b qa/tasks/ceph: osd_scrub_pgs: reissue scrub requests in loop
The scrub commands are not reliable: if the OSD doesn't happen to
be connected at the time the command is issued it may not get
delivered.  Re-request scrubs for each PG that has not yet been
scrubbed so that we don't wait forever when the original request
is dropped.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 12:00:12 -04:00
Sage Weil
32361a798f qa/tasks/ceph: osd_scrub_pgs: tolerate down osd at initial scrub time
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-19 12:00:12 -04:00
Sage Weil
bdf40c546d Merge pull request #15717 from liewegas/wip-20326
qa/tasks/ceph.py: tolerate active+clean+something

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-06-16 16:12:20 -05:00
Sage Weil
1565b86dc0 qa/tasks/ceph.py: tolerate active+clean+something
where something is, say, snaptrim.  or maybe scrubbing.
or whatever.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 22:29:28 -04:00
Sage Weil
f870cc5f28 qa/tasks/thrashosds: wait before wait_for_recovery
Make sure OSDs are up *and* they have flushed their PG stats before
waiting for recovery to ensure that we do not see a stale 'clean' state.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 12:14:24 -04:00
Sage Weil
200abcee6d qa/tasks/ceph: raise exception if scrubs time out
Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00
Sage Weil
0d80c88667 qa/tasks/ceph: raise an exception if pgs are not clean
If this happens the preceding test should have cleaned
up (e.g., ceph.healthy:).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00
Sage Weil
6fa9d32407 qa/tasks/ceph: osd_scrub_pgs: try a bit longer
I just saw a test fail that was still waiting for
scrubs to complete.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-15 11:23:18 -04:00
John Spray
18fbf24c7a Merge pull request #15308 from jcsp/wip-19706
mon: don't kill MDSs unless some beacons are getting through

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-06-15 10:50:44 -04:00
John Spray
4a1fe14bc6 Merge pull request #15411 from jcsp/wip-fs-suite
qa: misc cephfs test improvements

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2017-06-15 10:50:07 -04:00
Sage Weil
554cf8394a Merge pull request #15073 from liewegas/wip-mgr-stats
mon,mgr: extricate PGmap from monitor

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-06-04 13:36:01 -05:00
Kefu Chai
e8b23d6852 qa/tasks: add a blacklist for flush_pg_stats()
so we don't wait for marked out osds.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-06-02 13:06:50 -04:00
Sage Weil
ab1b78ae00 qa/tasks: use new reliable flush_pg_stats helper
The helper gets a sequence number from the osd (or osds), and then
polls the mon until that seq is reflected there.

This is overkill in some cases, since many tests only require that the
stats be reflected on the mgr (not the mon), but waiting for it to also
reach the mon is sufficient!

Signed-off-by: Sage Weil <sage@redhat.com>
2017-06-02 13:02:45 -04:00
Yehuda Sadeh
ea911b7f48 Merge pull request #14351 from yehudasa/wip-rgw-mdsearch
rgw: metadata search part 2

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
2017-06-02 09:16:07 -07:00
Yehuda Sadeh
6594d972f2 qa/tasks/rgw_multisite.py: adjust zone init
zone is now a ZoneConn object. Also, change import to make it relative
so that qa task can locate it.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2017-06-01 13:32:00 -07:00
John Spray
7e1be30b9a qa: clean up test_exports.py
Mainly just using the setfattr helper
instead of run_shell.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-06-01 07:18:03 -04:00
John Spray
6ef30d1ed3 qa: explicitly set up standby replay in test_journal_migration
Previously this relied on being run in a special cluster configuration
that set up standby replay daemons.  This change will allow it
to live alongside all the 'normal' functional tests.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-06-01 07:18:03 -04:00
John Spray
01c46bf832 Merge pull request #15205 from batrick/i20039
mds: check export pin during replay

Reviewed-by: Yan, Zheng <zyan@redhat.com>
2017-06-01 11:23:02 +01:00
John Spray
3326321858 qa: fix daemon restart between tests
Previously, calling mds_stop without mds_fail meant
that if the filesystem creation was not quick, then
we would see those daemons go laggy.  This starts
to trigger failures now that we have cluster log
messages that fire when a daemon gets failed out
due to being laggy.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-31 18:00:43 -04:00
Yehuda Sadeh
760c5e4f86 Merge pull request #15184 from cbodley/wip-qa-rgw-cleanup
qa/rgw: remove apache/fastcgi and radosgw-agent tests

Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2017-05-30 13:09:31 -07:00
Patrick Donnelly
76335b0e0f
qa: improve debug message for subtree wait
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-05-30 09:08:27 -07:00
Sage Weil
8554158574 Merge pull request #15325 from liewegas/wip-redirect
osd,librados: add manifest, redirect

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-05-29 14:48:33 -05:00
Sage Weil
a9a728fe4d Merge pull request #15296 from liewegas/wip-fix-at-end
qa/tasks/repair_test: unset flags we set

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-05-27 22:11:31 -05:00
Kefu Chai
8abc6e1bea qa/tasks/rebuild_mondb: update to address ceph-mgr changes
- revive ceph-mgr after updating the keyring cap
- grant "mgr:allow *" to client.admin
- minor refactors

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-05-28 09:59:50 +08:00
Sage Weil
a4247dd594 Merge branch 'wip-extensible_tier-redirect' of git://github.com/myoungwon/ceph into wip-redirect 2017-05-26 22:50:14 -04:00
Sage Weil
d292b5419f qa/tasks/repair_test: unset flags we set
In particular, noscrub and nodeepscrub leave a health
warning, which prevents shutdown with at-end.yaml.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-05-25 18:05:42 -04:00
John Spray
f80e0973f5 Merge pull request #15062 from ukernel/wip-19912
qa/tasks/cephfs: use getattr to guarantee inode is in client cache

Reviewed-by: John Spray <john.spray@redhat.com>
2017-05-25 18:44:54 +01:00
Sage Weil
5d80c74e63 Merge pull request #15252 from liewegas/wip-cleanup-tell
qa/tasks/ceph_manager: 'ceph $service tell ...' is obsolete

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-05-24 17:05:32 -05:00
John Spray
ef9d555916 Merge pull request #15105 from ukernel/wip-19892
qa/cephfs: disable mds_bal_frag for TestStrays.test_purge_queue_op_rate

Reviewed-by: John Spray <john.spray@redhat.com>
2017-05-24 16:41:45 +01:00
John Spray
ee75318807 Merge pull request #15122 from batrick/test-fragment-error
qa: fix float parse error in test_fragment

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-05-24 16:40:50 +01:00
Sage Weil
5ab996ab3c qa/tasks/ceph_manager: 'ceph $service tell ...' is obsolete
This died forever ago; no need for the fallback here.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-05-23 22:53:53 -04:00
John Spray
3913ed0ba6 qa: refine assert_session_count (don't count killing)
Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-23 05:22:18 -04:00
John Spray
ee2683c804 qa: update TestVolumeClient for new blacklisting
Blacklisted clients will now proactively fail
outstanding operations, rather than blocking.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-23 05:22:18 -04:00
John Spray
ab8e328c80 qa: clean up whitespace in test_misc.py
Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-23 05:22:18 -04:00
John Spray
c91ccac6f6 qa: remove outdated TODO in TestVolumeClient
Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-23 05:22:17 -04:00
John Spray
47a9c9ba67 qa: add test_filelock_eviction
To check that eviction is releasing flocks.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-05-23 05:22:17 -04:00
Casey Bodley
8c74c8a639 qa/rgw: remove apache/fastcgi
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 16:05:36 -04:00
Casey Bodley
0fb3e76eae qa/rgw: more cleanup in rgw.py
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:37 -04:00
Casey Bodley
c8d8b9cae1 qa/rgw: remove unused helpers in util/rgw.py
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:37 -04:00
Casey Bodley
a05b3bb409 qa/rgw: remove radosgw_agent task
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:37 -04:00
Casey Bodley
762e15fbb3 qa/rgw: remove radosgw-agent config from s3tests task
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:37 -04:00
Casey Bodley
9d82486d0e qa/rgw: remove radosgw-agent tests from radosgw_admin task
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:37 -04:00
Casey Bodley
898ab4bb0f qa/rgw: remove multisite configuration from rgw task
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-19 15:53:36 -04:00
Casey Bodley
cff53b246f Merge pull request #14688 from cbodley/wip-rgw-multi-suite
qa/rgw: add multisite suite to configure and run multisite tests

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-05-19 14:30:57 -04:00
Sage Weil
590fd5362a Merge pull request #15071 from cbodley/wip-qa-dnsmasq
qa: add task for dnsmasq configuration

Reviewed-by: Vasu Kulkarni <vasu@redhat.com>
Reviewed-by: Abhishek Lekshmanan <abhishek.lekshmanan@gmail.com>
2017-05-19 13:25:12 -05:00
Casey Bodley
de836ee684 qa/rgw: add test config to rgw_multisite_tests task
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-05-18 13:38:44 -04:00