Commit Graph

469 Commits

Author SHA1 Message Date
John Spray
7ad8ac63fc tasks/cephfs: clean up core on deliberate crash
To avoid internal.coredump task synthesizing a failure
during teardown from the core we left behind.

Fixes: #10949
Signed-off-by: John Spray <john.spray@redhat.com>
2015-03-04 19:13:43 +00:00
Andrew Schoen
d2b52d977b Control the nfs service correctly on rpm-based systems
This fixes issue #11020

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit 2e4d884fda)
2015-03-04 10:10:01 -06:00
Andrew Schoen
7402d5f1a9 Fixes #11013, use time.sleep instead of manager.sleep which isn't there.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit 811c02e392)
2015-03-04 10:09:21 -06:00
Andrew Schoen
2e4d884fda Control the nfs service correctly on rpm-based systems
This fixes issue #11020

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-03-04 08:57:03 -06:00
Andrew Schoen
811c02e392 Fixes #11013, use time.sleep instead of manager.sleep which isn't there.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-03-03 16:42:59 -06:00
John Spray
1647453d66 Merge pull request #356 from ceph/wip-10948
tasks/cephfs: don't run iptables in parallel

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-03-03 20:25:09 +00:00
John Spray
386b6feaaf Merge pull request #358 from ceph/wip-10712
tasks: fix intermittent failure in TestFlush

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-03-03 11:42:09 +00:00
Andrew Schoen
cdaa11d427 Use an NFS mount instead of virtio-9p in the qemu task
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit 93f2bea353)
2015-03-02 15:45:10 -08:00
Greg Farnum
0c9fb0f7be workunit: include /usr/sbin in the PATH for all commands
Fixes: #10600

Backport: hammer, giant, firefly, dumpling

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
(cherry picked from commit 0e53f5f38b)
2015-03-02 11:29:24 -08:00
Alfredo Deza
d25aa7bdc1 rename test to tests, follows best practices for tests dirs
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 7b2062602f)
2015-03-02 11:20:35 -08:00
Alfredo Deza
b0ac3516e3 create a new get_acl helper to ensure proper xml is compared
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 9b6ff63873)
2015-03-02 11:20:23 -08:00
Alfredo Deza
20a1ae72ef tests for the new acl helper in radosgw_admin
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 3801ac9357)
2015-03-02 11:20:11 -08:00
John Spray
d20d9238cd tasks: fix intermittent failure in TestFlush
This was an overly strict success condition: the
flush operation doesn't promise to leave you an empty
journal, it promises that anything in the journal
before the flush will be flushed.

Fixes: #10712
Signed-off-by: John Spray <john.spray@redhat.com>
2015-03-02 12:53:01 +00:00
John Spray
5b2fd3d614 tasks/cephfs: don't run iptables in parallel
Where multiple MDSs were on the same node, trying
to concurrently update their firewall state was
causing an exception because the iptables command
errors out if another instance is already running.

Fixes: #10948
Signed-off-by: John Spray <john.spray@redhat.com>
2015-03-02 12:41:13 +00:00
John Spray
fc4e90930b tasks/cephfs: fix/improve fuse force umount (again)
teuthology helpfully escapes things for us so
the \; didn't need the backslash.  The logic
was still falling over in some cases too.

Additionally, make the FUSE /sys/ abort operation
more surgical by working out the connection name
of our own mount during mount().

Signed-off-by: John Spray <john.spray@redhat.com>
2015-02-26 23:08:33 +00:00
Alfredo Deza
a8a29760f0 remove unused import from repair_test
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 15:04:02 -05:00
Alfredo Deza
9157622ef7 remove unused variable assignment
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 9df27f7a10)
2015-02-26 15:02:07 -05:00
Alfredo Deza
3f3ce53fde remove unused import from fuse
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 14:55:02 -05:00
Alfredo Deza
35224e3dda remove uneeded variable assignment for call
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 0a65e90db2)
2015-02-26 14:54:14 -05:00
Alfredo Deza
976773a5d0 remove unused import in calamari_setup
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit d22cd6917b)
2015-02-26 14:53:35 -05:00
Alfredo Deza
e8e1e7ee5e remove unused imports from populate_rbd
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 959c491462)
2015-02-26 14:52:58 -05:00
Alfredo Deza
28d3075266 remove unused imports from peering_speed
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 6d45352782)
2015-02-26 14:52:40 -05:00
Alfredo Deza
5018e91c1e remove redefined StringIO import
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 3a18cb29a8)
2015-02-26 14:48:53 -05:00
Alfredo Deza
4ed442e44c stdin is no longer a kwarg
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 49a61dc2d2)
2015-02-26 14:48:40 -05:00
Alfredo Deza
33f7982480 add the log object to ceph_manager
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit f7c1ca4a1e)
2015-02-26 14:48:30 -05:00
Alfredo Deza
9cdd278d08 add missing mon object to osd_failsafe
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 1673be45c0)
2015-02-26 14:47:35 -05:00
Alfredo Deza
744bc3802e fix undefined var in divergent_priors
Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 26c5a82af9cf4efd3601b6a32aa5444f61fa9f81)
2015-02-26 14:47:08 -05:00
Alfredo Deza
0a65e90db2 remove uneeded variable assignment for call
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:35:31 -05:00
Alfredo Deza
3a18cb29a8 remove redefined StringIO import
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:35:07 -05:00
Alfredo Deza
49a61dc2d2 stdin is no longer a kwarg
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:34:21 -05:00
Alfredo Deza
f7c1ca4a1e add the log object to ceph_manager
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:33:47 -05:00
Alfredo Deza
920552a266 remove unused import from repair test
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:32:09 -05:00
Alfredo Deza
1673be45c0 add missing mon object to osd_failsafe
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:31:31 -05:00
Alfredo Deza
959c491462 remove unused imports from populate_rbd
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:27:11 -05:00
Alfredo Deza
6d45352782 remove unused imports from peering_speed
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:26:42 -05:00
Alfredo Deza
9df27f7a10 remove unused variable assignment
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:26:15 -05:00
Alfredo Deza
d22cd6917b remove unused import in calamari_setup
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:25:12 -05:00
Alfredo Deza
26c5a82af9 fix undefined var in divergent_priors
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:24:41 -05:00
Alfredo Deza
e4ba81700b fix lint errors on ceph_fuse
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-26 11:20:19 -05:00
Sage Weil
55c11a3b41 ceph: ugh fix syntax
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 1922c61bbf1165e23650e74767ac4d5f63becdbc)
2015-02-25 11:38:12 -08:00
Sage Weil
1922c61bbf ceph: ugh fix syntax
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-25 11:37:44 -08:00
Yuri Weinstein
fea2e22774 Fixes #10869, added {role} to the dir name.
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>

Fixed typo

Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
(cherry picked from commit 9942440ddf9f6847e76de3b49f1ad0d98c62a3d0)
2015-02-25 11:30:59 -08:00
Sage Weil
5be8b8ef17 ceph: fix ps axuf lsof line
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 18307be0ca97f7221d6202884f8a68a4b26c1dbf)
2015-02-25 11:05:42 -08:00
Sage Weil
18307be0ca ceph: fix ps axuf lsof line
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-25 11:05:13 -08:00
Kefu Chai
64de3cd574 Thrasher: log backtrace of thrown exception
* add a wrapper to log uncaught exception to self.logger, greenlet also
  prints the backtrace and exception to stderr, but teuthology.log does
  not capture stderr. so we need to catch them by ourselves to reveal
  more info to root-cause this issue.
* log uncaught exception thrown by Thrasher.do_thrash() to self.log.

See: #10630
Signed-off-by: Kefu Chai <kchai@redhat.com>
2015-02-25 16:10:52 +08:00
Andrew Schoen
93f2bea353 Use an NFS mount instead of virtio-9p in the qemu task
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-02-24 12:47:58 -06:00
Sage Weil
1ee621683a ceph: ps axf too before lsof
Specifically, I want to know *who* is running the ceph-osd that is
holding the files open.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit a68281e1476e6af38237e1d1031dd7bd0980ef9f)
2015-02-24 06:59:55 -08:00
Sage Weil
a68281e147 ceph: ps axf too before lsof
Specifically, I want to know *who* is running the ceph-osd that is
holding the files open.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-24 06:59:16 -08:00
Greg Farnum
0e53f5f38b workunit: include /usr/sbin in the PATH for all commands
Fixes: #10600

Backport: hammer, giant, firefly, dumpling

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-02-23 21:05:04 -08:00
Sage Weil
508270be76 Merge pull request #313 from armaseg/master
teuthology: some suites still use http://ceph.newdream.net
2015-02-23 21:01:45 -08:00
Sage Weil
0365f8b68d ceph: lsof if umount fails
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 077e917af1394e5999293031afcc476b2bdce1e4)
2015-02-23 17:01:00 -08:00
Sage Weil
97fb9d8ed6 ceph: archive logs at very end
This ensures that we still gather the logs even if the other nested tasks
throw an exception in the finally block.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit ca09683f5fc1a6067c524c4034c27ab4a26e11f3)
2015-02-23 17:00:57 -08:00
Sage Weil
077e917af1 ceph: lsof if umount fails
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-23 13:52:48 -08:00
Sage Weil
ca09683f5f ceph: archive logs at very end
This ensures that we still gather the logs even if the other nested tasks
throw an exception in the finally block.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-23 13:52:48 -08:00
Yuri Weinstein
2df632e54d Fixes #10869, added {role} to the dir name.
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>

Fixed typo

Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2015-02-13 09:09:36 -08:00
David Zafman
ca04dc65c1 Thrasher: Fix log message
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-02-12 11:54:59 -08:00
Alfredo Deza
7b2062602f rename test to tests, follows best practices for tests dirs
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-11 11:25:45 -05:00
Alfredo Deza
9b6ff63873 create a new get_acl helper to ensure proper xml is compared
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-11 11:24:44 -05:00
Alfredo Deza
3801ac9357 tests for the new acl helper in radosgw_admin
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-02-11 11:14:59 -05:00
John Spray
03b0e1064c tasks/cephfs: fix fuse force unmount
This was broken in the case of multiple
mounts, and in the case of stuck mounts.

Fixes: #10702
Signed-off-by: John Spray <john.spray@redhat.com>
2015-02-10 14:58:23 +00:00
Dan Mick
4b6fa9dbb4 tasks/calamari_setup: ice_setup 0.2.0 has added another prompt
Add an extra '\n' to the interaction string to answer it

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-02-06 17:33:53 -08:00
Zack Cerza
51f75079e2 Merge pull request #322 from ceph/wip-10753
Add labels to a couple common task failures
2015-02-04 15:02:10 -07:00
Andrew Schoen
a965d6ca14 Add labels to a couple common task failures
The labels will help us quickly determine what the command was trying to
accomplish in log output.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-02-04 15:42:40 -06:00
John Spray
8ed4f120d6 Merge pull request #316 from ceph/wip-active-asok-10361
mds_scrub_checks: only direct ops at the active MDS

Reviewed-by: John Spray <john.spray@redhat.com>
2015-01-28 13:59:48 +00:00
Sage Weil
a76f5ce01b Merge pull request #296 from majianpeng/master
rados.py: Add write_fadvise_dontneed field
2015-01-27 12:56:31 -08:00
Sage Weil
4c22dac311 Merge pull request #302 from ceph/wip-crush-tunables
ceph: set crush tunables
2015-01-27 11:26:10 -08:00
Greg Farnum
8d2715eb45 mds_scrub_checks: only direct ops at the active MDS
Change the config option from mds_id to mds_rank to reflect the
fact that it's the rank we want to make use of (and will continue
to want when we're doing stuff like force exporting from one rank
to another).

Fixes: #10361

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-01-27 11:17:04 -08:00
Josh Durgin
333b4445df Merge remote-tracking branch 'origin/next' 2015-01-27 09:37:19 -08:00
Sage Weil
9e9d7c91eb ceph: set crush tunables
Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-26 13:42:56 -08:00
Zack Cerza
695f6c22f0 Merge pull request #314 from ceph/wip-wait-for-failure
ceph: test pg removal interruption; add ceph daemon failur einjection support
2015-01-26 12:26:14 -07:00
Gregory Farnum
7e5619dc2c Merge pull request #303 from ceph/wip-10388
tasks/mds_flush: use new stray/purge perfcounters

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-01-23 17:10:06 -08:00
Greg Farnum
41a99f58cc mds_flush: fix the flush test output expectations
Fixes: #10620

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-01-23 15:24:02 -08:00
Sage Weil
cb74060cd4 ceph: no need to stop() before restart()
restart() will stop if the daemon is running.  This will get rid of the
spurious error

2015-01-23 15:19:36,828.828 ERROR:tasks.ceph.osd.0:tried to stop a non-running daemon

when the daemon isn't already running.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-23 15:21:31 -08:00
Sage Weil
83dd3d58a1 ceph: add wait_for_failure command
Expect and then ignore a daemon failure.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-23 15:20:50 -08:00
Armando Segnini
9ace238e87 teuthology: some suites still use http://ceph.newdream.net
This probably redirects to http://ceph.com but ceph.newdream.net still appears in some places

http://tracker.ceph.com/issues/9922 Fixes: #9922

Signed-off-by: Armando Segnini  <armando.segnini@telecom-bretagne.eu>
2015-01-23 11:13:11 +01:00
David Zafman
e5876126a9 Formatting changes 2015-01-21 09:08:44 -08:00
David Zafman
14e27650a6 Fix wrapped line 2015-01-21 09:08:44 -08:00
David Zafman
bff2358373 Fix objectstore name changed to keyvaluestore
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
David Zafman
b239dadc02 ceph_manager: Fix Thrasher kill_osd() to ignore pool going away
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
David Zafman
4eda2967a4 ceph_manager: Thrasher fixes to run() calls to set check_status = False
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
David Zafman
e0b0d41e4c ceph_manager: Re-enable Thrasher ceph-objectstore-tool testing
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
David Zafman
4a195c78c1 ceph_manager: In Thrasher randomly remove past intervals
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
David Zafman
f9e088fcbc ceph_manager: In Thrasher disable ceph_objectstore_tool tests if old release missing command
Require ceph-objectstore-tool to be available on all OSD nodes
Log a message when tool is not available

Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-21 09:08:44 -08:00
John Spray
a6b8e823f4 tasks/mds_flush: use new stray/purge perfcounters
Where previously we only tracked RADOS-level delete
ops during deletion, now also verify that they
correspond to the right number of MDS-level purge
operations.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-19 22:00:53 +00:00
John Spray
2c1cef6b5d tasks/mds_journal_repair: add test_table_tool
Explicit test of the tool's various modes, to
complement the places it's used in other tests.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-19 19:36:34 +00:00
John Spray
bf13a4e062 tasks/mds_journal_repair: create new test
This tests the new #9883 repair functionality
where we selectively scrape dentries out of
the journal while the MDS is offline.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-19 19:36:34 +00:00
Alfredo Deza
a6bca74722 Merge pull request #298 from ceph/wip-bi-sharding-3
Wip bi sharding 3
2015-01-19 12:17:27 -05:00
Andrew Schoen
b1d9f602c4 Merge pull request #288 from dmick/master
Tweaks to calamari-setup
2015-01-15 16:01:30 -06:00
Yehuda Sadeh
3d420ff1dd radosgw_admin: don't use boto for testing 301 responses
boto follows up on the redirects. Switched to using httplib2 for sending
this request.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2015-01-14 15:10:23 -08:00
Loic Dachary
74bef7f488 Merge pull request #291 from dachary/wip-10018-scrub
add erasure code repair / scrub tests (take 2)

Reviewed-by: David Zafman <dzafman@redhat.com>
2015-01-14 09:29:19 +01:00
Jianpeng Ma
717dffe7c2 rados.py: Add write_fadvise_dontneed field
Signed-off-by: Jianpeng Ma <jianpeng.ma@intel.com>
2015-01-14 11:36:25 +08:00
Loic Dachary
e580eca3e3 repair_test: hinfo_key error injection
Add a function dedicated to erasure coded pools tests, similar to
repair_test_1. Add a corrupter that removes the hinfo_key from the object.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 12:31:15 -08:00
Loic Dachary
ae68658339 ceph_manager: add ceph-objectstore-tool wrapper
Add the CephManager.objectstore_tool method to encapsulate a call to
ceph-objectstore-tool. The wrapper can convert an object name into the
PG id and figure out the primary OSD. The designated OSD is stopped
before running the command and restarted afterwards.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 12:31:15 -08:00
Loic Dachary
0465c0c53f ceph_manager: add object helpers
Add CephManager.methods to get information regarding the pools and PGs
used to store a given object.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 12:31:15 -08:00
Loic Dachary
f3bd336dbe repair_test: rework indentation to make it pep8 clean
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 12:31:15 -08:00
Loic Dachary
f08d95a5ea repair_test: rework to remove levels of indirection
The commit is large but does not introduce any semantic change and
consists primarily in code moving around, re-indented and removed.

Replace functions generating functions by functions and sequentially
iterating over a list of functions with a sequential call to the
functions.

Replace the setup/teardown with an equivalent using a with
statement and the ceph_manager.pool method.

Replace inline code with a call to ceph_manager.wait_for_all_up

It makes it easier to modify the tests, for instance to create erasure
coded pools and tests specific to them.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 12:31:15 -08:00
Loic Dachary
bd4926c4ac ceph_manager: rework indentation to make it pep8 clean
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 20:22:33 +01:00
Loic Dachary
50e7e6df2b ceph_manager: fix docstring typo
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 20:18:25 +01:00
Loic Dachary
7234481bae ceph_manager: define pool context
To create a pool before running a code bloc and remove it after.

   with manager.pool("mypool"):
        mytest..

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 20:18:25 +01:00
David Zafman
7029dba08a Revert "add erasure code repair / scrub tests" 2015-01-13 11:12:25 -08:00
Warren Usui
f36b171816 Merge pull request #286 from ceph/wip-10261-wusui
Calamari_setup now allows > 1 mon.
2015-01-13 10:57:31 -08:00
Loic Dachary
01174518b8 repair_test: hinfo_key error injection
Add a function dedicated to erasure coded pools tests, similar to
repair_test_1. Add a corrupter that removes the hinfo_key from the object.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 15:07:47 +01:00
Loic Dachary
236049b27d ceph_manager: add ceph-objectstore-tool wrapper
Add the CephManager.objectstore_tool method to encapsulate a call to
ceph-objectstore-tool. The wrapper can convert an object name into the
PG id and figure out the primary OSD. The designated OSD is stopped
before running the command and restarted afterwards.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 15:07:43 +01:00
Loic Dachary
372665d1ec ceph_manager: add object helpers
Add CephManager.methods to get information regarding the pools and PGs
used to store a given object.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:42:19 +01:00
Loic Dachary
f3a91d2ecd repair_test: rework indentation to make it pep8 clean
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:42:15 +01:00
Loic Dachary
9203259ec3 repair_test: rework to remove levels of indirection
The commit is large but does not introduce any semantic change and
consists primarily in code moving around, re-indented and removed.

Replace functions generating functions by functions and sequentially
iterating over a list of functions with a sequential call to the
functions.

Replace the setup/teardown with an equivalent using a with
statement and the ceph_manager.pool method.

Replace inline code with a call to ceph_manager.wait_for_all_up

It makes it easier to modify the tests, for instance to create erasure
coded pools and tests specific to them.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:37:35 +01:00
Loic Dachary
764f6159ba ceph_manager: rework indentation to make it pep8 clean
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:37:35 +01:00
Loic Dachary
1f14626b04 ceph_manager: fix docstring typo
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:37:35 +01:00
Loic Dachary
666aecc121 ceph_manager: define pool context
To create a pool before running a code bloc and remove it after.

   with manager.pool("mypool"):
        mytest..

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
e2494c5050 Fix objectstore name changed to keyvaluestore
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
7d4bb5529c ceph_manager: Fix Thrasher kill_osd() to ignore pool going away
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
cf8bc063d5 ceph_manager: Thrasher fixes to run() calls to set check_status = False
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
1903b7bee0 ceph_manager: Re-enable Thrasher ceph-objectstore-tool testing
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
b6f7b17b5c ceph_manager: In Thrasher randomly remove past intervals
Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
David Zafman
0df5e84b69 ceph_manager: In Thrasher disable ceph_objectstore_tool tests if old release missing command
Require ceph-objectstore-tool to be available on all OSD nodes
Log a message when tool is not available

Signed-off-by: David Zafman <dzafman@redhat.com>
2015-01-13 14:37:35 +01:00
Dan Mick
4c27048b12 tasks/calamari_setup: build ice-tools virtualenv if building iceball
ice-tools needs a virtualenv populated to properly run to build
an iceball; add the commands to do that.  Also remove the built
iceball when the task exits.

Fixes: #10523
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-01-12 19:56:58 -08:00
Dan Mick
bbe22d2fde tasks/calamari_setup: remove HTTP-downloaded iceball, if any
Fixes: #10522
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-01-12 19:56:58 -08:00
Dan Mick
4377f00bac tasks/calamari_setup: use ice_version to completely specify iceball
Previously, the task would search for the lexicographically-greatest
filename matching ICE-*.tar.gz; now it builds a specific name
ICE-{ice_version}-{ice_distro}.tar.gz

Fixes: #10521
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-01-12 19:56:58 -08:00
Warren Usui
1f557c566c Calamari_setup now allows > 1 mon.
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Fixes: #10261
2015-01-09 16:10:11 -08:00
John Spray
b1e35a4237 tasks/mds_auto_repair: remove unneeded log settings
The small segments and small segment limit
were used when doing a hacky flush by doing
IO and waiting: now that we have the explicit
'flush journal' asok in use, we can just use
a normal journal configuration.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:53 +00:00
John Spray
997c0c3d62 tasks/cephfs: remove config from Filesystem cons
This was only used in get_first_mon, which doesn't actually
need the parameter itself.  Makes it easier to casually
use Filesystem from any place with a ctx to hand.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:53 +00:00
John Spray
4ff4819573 tasks/cephfs: umount unused clients before running
When unused clients were mounted during an fs new,
they would end up in a state where they stalled
on subsequent attempts to umount them (ceph-fuse
stalls on exit if it can't terminate its mds_session)

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:52 +00:00
John Spray
8630c4ab3b tasks/cephfs: move ROOT_INO into filesystem module
...so that other tests can get at it.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:52 +00:00
John Spray
5e3f5766d0 tasks/cephfs: be more precise in set_clients_block
Instead of blocking the whole port range (which
might make OSDs running on that node collateral
damage), read the MDS's port out of the MDS map
and just block that.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:52 +00:00
John Spray
951d6c297d tasks/cephfs: use set_conf() from test case class
...because this is the one that will store up
changes to roll back during teardown.

Doing this makes it easy to run lots of test cases
togeher in a single teuthology run, raher than
setting up/tearing down the ceph cluster for each
on.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:52 +00:00
John Spray
cfbc277c1f tasks/cephfs: move common setUp/tearDown to parent
Now that we have more of these cases, there was lots
of duplication in setup and teardown.  For some tests
the "reset everything" setup/teardown is overkill,
but it's harmless.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:51 +00:00
John Spray
6d79e3918b tasks/cephfs: move wait_until helpers to testcase
...so that other test cases can use them

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:08:09 +00:00
John Spray
213641f315 tasks/ceph: update mds auth caps to 'allow *'
Since the new 'tell' for the MDS was introduced,
caps have to have the '*' to permit running remote
administrative commands.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:07:34 +00:00
Andrew Schoen
5644a320a5 Use qemu-kvm in the qemu task for rpm based systems.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-01-08 14:31:26 -06:00
Gregory Farnum
574be5ba55 Merge pull request #280 from ceph/wip-10387-cleanup
tasks/mds_flush: remove #10387 workaround

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-01-07 20:05:07 -08:00
Zack Cerza
31ff241f5a Inherit teuthology's log level
Signed-off-by: Zack Cerza <zack@redhat.com>
2015-01-07 09:53:38 -07:00
John Spray
f6bcbe3701 tasks/mds_flush: remove #10387 workaround
Now that #10387 is fixed in master, we can tighten
up this test to ensure that the expected deletions
are happening.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-07 14:31:47 +00:00
Sage Weil
3960530b7d Revert "Move output in task/s3readwrite"
This reverts commit 26a33c3a5a.

This is tryign to create the archive dir on the remote host:

2014-12-29T12:15:30.213 INFO:teuthology.orchestra.run.plana31:Running: 'mkdir -p /var/lib/teuthworker/archive/sage-2014-12-29_11:40:52-rgw-next---basic-multi/683052'
2014-12-29T12:15:30.231 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/src/teuthology_master/teuthology/contextutil.py", line 28, in nested
    vars.append(enter())
  File "/usr/lib/python2.7/contextlib.py", line 17, in __enter__
    return self.gen.next()
  File "/var/lib/teuthworker/src/ceph-qa-suite_next/tasks/s3readwrite.py", line 241, in run_tests
    ctx.cluster.only(client).run(args=['mkdir', '-p', archive_dir])
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/cluster.py", line 64, in run
    return [remote.run(**kwargs) for remote in remotes]
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/remote.py", line 128, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/run.py", line 368, in run
    r.wait()
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/run.py", line 106, in wait
    exitstatus=status, node=self.hostname)
CommandFailedError: Command failed on plana31 with status 1: 'mkdir -p /var/lib/teuthworker/archive/sage-2014-12-29_11:40:52-rgw-next---basic-multi/683052'

...but it should only be on the local host.
2014-12-29 12:39:26 -08:00
Sage Weil
0220cbb717 Revert "tasks/ceph_manager: dump raw cluster command at debug level"
This reverts commit b0ebeba79c.
2014-12-23 12:57:18 -08:00
Sage Weil
b0ebeba79c tasks/ceph_manager: dump raw cluster command at debug level
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-23 10:35:55 -08:00
John Spray
7696afd3fa tasks: create mds_flush task
This tests:
 * The new 'flush journal' asok command
 * That the resulting on disk structures are as expected
 * That cephfs-journal-tool is happy with the result

Fixes: #9881

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-19 13:40:53 +00:00
David Zafman
8a2c32875c Merge pull request #270 from ceph/wip-cot-rename
ceph_objectstore_tool: Changes for rename of binary ceph-objectstore-too...

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-18 16:39:21 -08:00
Dan Mick
efa6c36f70 Add calamari_nosetests task.
Includes all accumulated review commentary and final 'yum clean all' fix.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2014-12-18 15:47:36 -08:00
Dan Mick
1bba98bda2 Remove obsolete tasks/calamari
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2014-12-18 15:47:36 -08:00
Loic Dachary
75287f0868 ceph_objectstore_tool: fix pep8 errors
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-12-19 00:43:48 +01:00
Loic Dachary
3c2aaa9c35 ceph_objectstore_tool: --op list now prints [pg,object]
The format of the output of --op list was changed to include the PG to
which the object belong. It simplifies the loop in
ceph_objectstore_tool.py.

http://tracker.ceph.com/issues/10376 Fixes: #10376

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-12-19 00:43:35 +01:00
John Spray
849e259b96 tasks: add mds_full task
This is for exercising the CephFS behaviour
when the cluster becomes full.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:31 +00:00
John Spray
f9609c4688 tasks/cephfs: let get_config use non-mon services
Previously was always using the default values of things
so querying mon instead of the appropriate service
worked fine.  However, for things we might want to
update on a per-test basis we need to go ask the
correct service what the setting really is.

Needed for osd_mon_report_interval_max in the ENOSPC
testing.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:30 +00:00
John Spray
8fa6b154e3 tasks/cephfs: enable osd epoch get from mounts
Using new hooks for #7317

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:30 +00:00
John Spray
f94ec69fe9 tasks/mds_client_limits: fix initial rm -rf
This was previously a no-op because it was trying
to delete "*" instead of *

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:30 +00:00
John Spray
9c55b81cc2 tasks/kclient: add 'debug' option
Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:30 +00:00
John Spray
dedcc6483a tasks/cephfs: enable kclient for mds_* tasks
Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-18 12:45:30 +00:00
David Zafman
eec179c687 ceph_objectstore_tool: Changes for rename of binary ceph-objectstore-tool
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-12-17 17:10:32 -08:00
John Spray
b4214f95f4 Merge pull request #257 from ceph/wip-fs-creation
Fix PG counts and refactor FS creation during setup

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-17 11:37:09 +00:00
Yehuda Sadeh
47d56d779e radosgw-admin: wait before trimming usage
Fixes: #9892

Need to wait through the usage interval before trimming usage, otherwise we might not
remove all pending usage info.

Backport: dumpling, firefly, giant

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit dd09ecbfab8a659f3faaf879a52849caab5e8e8e)
2014-12-16 16:15:43 -05:00
Gregory Farnum
d562886f31 Merge pull request #260 from ceph/wip-mds-readonly
tasks: add test case for readonly MDS

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2014-12-10 20:39:52 -08:00
Sage Weil
0f9192f25d tasks/watch_notify_same_primary: fix test for new watch/notify
It now checks for 'notify1' and 'notify2' strings, allowing it to work
on both old and new versions of rados watch command.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 08:36:18 -08:00
Yan, Zheng
09b82e2bd9 tasks: add test case for readonly MDS
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-12-10 09:48:02 +08:00
Josh Durgin
f3b3d75542 radosgw_admin: test full sync using existing tests
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2014-12-05 21:04:08 -08:00
Zack Cerza
a45663c543 Merge pull request #212 from ceph/wip-9117-wusui
Move output in task/s3readwrite
2014-12-03 14:17:26 -07:00
John Spray
8fc3550002 tasks/ceph: move FS creation inside Filesystem
Leave the legacy handling out in cephfs_setup, move
the filesystem creation stuff into Filesystem.  I
anticipate this being the right place for it if/when
we have tests that want to do 'fs rm' 'fs new' type
cycles within themselves.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-02 13:31:57 +00:00
John Spray
89beed1b0a tasks/ceph: conservative PG count in FS pools
This was tripping over the recent commit 42c85e80
in Ceph master, which tightens the limits on
acceptable PG counts per OSD, and was making
teuthology runs fail due to never going clean.

Rather than put in a new hardcoded count, infer
it from config.  Move some code around so that
the ceph task can get at a Filesystem object
to use in FS setup (this already has conf-getting
methods).

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-02 13:30:06 +00:00
Yan, Zheng
d761831133 tasks: add mds_auto_repair
New CephFS tests for MDS's auto repair functions. (So far the only
test case is verify/repair backtrace on fetch dirfrag)

Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-12-02 16:26:51 +08:00
Warren Usui
26a33c3a5a Move output in task/s3readwrite
The s3readwrite.py task formerly wrote too much output while excuting.
It now saves the data on the local machine in either the archive
directory or in /tmp if no archive directory is specified.
The new file contains a client name and timestamp in its name.
Once all processing has completed, that file is saved locally.

Fixes: 9117
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-12-01 11:01:26 -08:00
David Zafman
c59af44460 Merge pull request #244 from dachary/wip-objectstore-tool
add erasure coded pool tests to objectstore tool

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 15:40:54 -08:00
David Zafman
6923e68dac ceph_objectstore_tool: minor fixes
Remove development comment
nullfd isn't used anymore
s/log.errors/log.error/

Signed-off-by: David Zafman <dzafman@redhat.com>
2014-11-20 22:38:26 +01:00
David Zafman
a09d3a389d ceph_objectstore_tool: fix flake8 issues
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-11-20 22:38:26 +01:00
David Zafman
f10edda4bc ceph_objectstore_tool: add assert so that teuthology knows when this test fails
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
9a089ef369 ceph_objectstore_tool: skip unused osds
When the number of objects is low, some OSDs may not be used at all.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
be39bd5d84 ceph_objectstore_tool: run tests on erasure coded pools
Create an erasure coded pool and run tests on it. The list of PGs is
adapted to contain the shard id.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
fbb33f3577 ceph_objectstore_tool: only run get/set bytes for replicated pools
It needs to be adapted to run with erasure coded pools.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
2b57f4c59c ceph_objectstore_tool: use CephManager::get_pool_dump
Instead of custom made get_pool_id function.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
d0caf5cdff ceph_objectstore_tool: add pgnum option
Instead of hardcoding 12 use a configuration option that defaults to
12. It is handy during development to lower the number to 4 and speed up
the test cycle.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
c9e6edebd9 ceph_objectstore_tool: encapsulate into a function
Move code out of the task into function. Also remove the "REP" specifics
from helper functions that could also be used for erasure coded pools.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
d591226373 ceph_objectstore_tool: add hinfo_key tests for erasure coded objects
When an hinfo_key attribute is found, assume an erasure coded object and
verify set-attr/get-attr works as expected by removing its content and
restoring it.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
9ef8887a3b ceph_objectstore_tool: keep all json object representation
For erasure coded pools to be tested, the json object representation
must be preserved for all PG because they are all different. The
internal representation is changed from

   db[name]["pgid"] = pg
   db[name]["json"] = objjson

to a per pg map:

   db[name].setdefault("pg2json", {})[pg] = objjson

and the rest of the code is modified to adapt accordingly.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
b925be13bc ceph_manager: add pool type constants
The osd dump command displays pool types using numerics instead of
symbolic names. Create constants in the CephManager class to use instead
of numbers.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
fd1ec39d38 ceph_manager: fix typo in get_pgid docstring
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
2abb9f9087 ceph_manager: factorize with get_pool_dump
Replace code that duplicates get_osd_dump_json with a call to the
function.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:25 +01:00
Loic Dachary
6dac43fa46 ceph_manager: factorize with get_osd_dump_json
Replace code that duplicates get_osd_dump_json with a call to the
function.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-20 22:38:17 +01:00
Loic Dachary
5fc42b98ab ceph_manager: add get_pool_dump to return the json pool dump
Get the part of ceph osd dump specific to the pool given in argument.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-19 23:19:13 +01:00
Loic Dachary
f353c150a8 ceph_manager: add get_osd_dump_json to get the full output
Converts ceph osd dump_json into a python object.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-11-19 23:19:13 +01:00
Warren Usui
852fe69af4 Further changes for calamari_setup.py
Made suggestions from code reviews.
Added no_epel option.
Merged Dan Mick's changes that add the ability to get
iceballs from http URL.
Remove duplicate assignment and added some log.debugs

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-11-18 19:06:24 -08:00
Warren Usui
357fd22f04 Add calamari_setup
Calamari_setup can be used to set up a calamari gui for manual testing,
or be run in a suite to test the calamari setup and calamari ceph
installation code.

Fixes: 9759
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-11-18 19:06:24 -08:00
Loic Dachary
e2d6ce7e9d Merge pull request #234 from ceph/wip-dzaddscrub
Add scrub_test and repair_test to rados basic suite

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-11-11 03:42:33 +01:00
Sage Weil
36055e143c Merge remote-tracking branch 'gh/next' 2014-11-09 20:48:45 -08:00
Sage Weil
5f19ef7116 tasks/radosbench: no log to stderr
Signed-off-by: Sage Weil <sage@redhat.com>
2014-11-09 20:48:35 -08:00
Gregory Farnum
e8cd3f10d6 Merge pull request #221 from ceph/wip-forward-scrub
Wip forward scrub

Reviewed-by: John Spray <john.spray@redhat.com>
2014-11-07 16:16:38 -08:00
Greg Farnum
6c26c073de mds_scrub_checks: Run scrub and flush commands against the MDS.
We mostly do a variety of successful ones, but we also corrupt the store
using the rados tool and make sure we get the expected error codes. Includes
a yaml fragment so the task gets run as part of the fs/basic suite.

Signed-off-by: Greg Farnum <greg@inktank.com>
2014-11-07 13:06:13 -08:00
David Zafman
74e776139b repair_test: Wait for OSDs to come up before proceeding with test
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-11-06 22:23:34 -08:00
Yan, Zheng
edb780a3c5 tasks/cephfs/mount: use seperate for testing flock and posix lock
Old version libfuse treats both flock and posix lock requests as posix
lock request. This is a workaround for the bug.

Fixes: #9995
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-11-07 09:07:27 +08:00
Sage Weil
8a18d8baaf Merge remote-tracking branch 'gh/giant' into m
Conflicts:
	tasks/ceph_manager.py
2014-10-29 14:31:26 -07:00
Yehuda Sadeh
dd6194f637 Merge branch 'wip-apache-worker' 2014-10-23 16:05:44 -07:00
Yehuda Sadeh
c3b53c3265 apache: switch to use the apache worker mpm
Fixes: #9169

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2014-10-23 16:05:03 -07:00
Yehuda Sadeh
35c9cae84c apache: change template to load mpm worker module
in apache 2.4

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2014-10-23 16:04:58 -07:00
David Zafman
4ddadf0698 Thrasher: Disable ceph_objectstore_tool tests if old release missing command
Leaving disabled until merge of import/export fixes

Fixes: #9805

Signed-off-by: David Zafman <dzafman@redhat.com>
2014-10-22 23:30:29 -07:00
David Zafman
523cb63b5f ceph_manager: ceph_objectstore_tool testing off by default
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-10-22 10:32:46 -07:00
Zack Cerza
1b8d31986a Smarter s3tests branch selection
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-10-22 09:12:43 -06:00
Sage Weil
ecfcb2e04c Merge pull request #189 from ceph/wip-apache-max-requests
apache: set MaxRequestsPerChild to 0
2014-10-21 10:57:29 -07:00
Yehuda Sadeh
1fd89f4e43 apache: switch to use the apache worker mpm
Fixes: #9169

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit c3b53c3265)
2014-10-23 16:08:22 -07:00
Yehuda Sadeh
14b5a9afdd apache: change template to load mpm worker module
in apache 2.4

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit 35c9cae84c)
2014-10-23 16:08:14 -07:00
David Zafman
a295c18a80 Thrasher: Disable ceph_objectstore_tool tests if old release missing command
Don't need to explicitly turn off the test during some upgrades
Leaving disabled until merge of import/export fixes

Fixes: #9805

Signed-off-by: David Zafman <dzafman@redhat.com>
2014-10-22 19:21:04 -07:00
Sage Weil
7e41c93ed8 tasks/thrashosds: support overrides
e.g.,

overrides:
  thrashosds:
    thrash_primary_affinity: false
...
tasks:
- install:
- ceph:
- thrashosds:
- workunit:
...

Needed for #9865

Signed-off-by: Sage Weil <sage@redhat.com>
2014-10-22 11:19:01 -07:00
David Zafman
bdbcf760d9 ceph_manager: ceph_objectstore_tool testing off by default
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-10-22 10:34:26 -07:00
Zack Cerza
01b556afc1 Smarter s3tests branch selection
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
(cherry picked from commit 1b8d31986a)
2014-10-22 09:13:11 -06:00
Loic Dachary
e48a0a3924 erasure-code: unfound test needs a non empty file
Other rados put will fail as follows

$ touch /tmp/bar
$ ./rados -p rbd put existing_3 /tmp/bar
$ ./rados -p rbd put existing_3 /tmp/bar
WARNING: could not create object: existing_3
error putting rbd/existing_3: (17) File exists

it should be considered a bug in the rados command line but needs to be
addressed separately.

http://tracker.ceph.com/issues/9387 Fixes: #9387

Signed-off-by: Loic Dachary loic-201408@dachary.org
2014-10-20 14:41:10 -07:00
Gregory Farnum
28761d8bfd Merge pull request #181 from ceph/wip-client-flock
tasks/mds_client_recovery: file lock test

Reviewed-by: Greg Farnum <greg@inktank.com>
2014-10-16 06:57:54 -07:00
Yehuda Sadeh
f4432e6386 apache: set MaxRequestsPerChild to 0
Otherwise the default is 10k.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2014-10-14 14:17:41 -07:00
Yehuda Sadeh
8a87a08477 tasks/s3tests: add slow backend configurable
Adding this so that we can modify the clients' conf file as needed with slow backend.
This can be achieved by:

overrides:
  s3tests:
    slow_backend: true

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit 61409179df)
2014-10-14 11:36:24 -07:00
Yehuda Sadeh
61409179df tasks/s3tests: add slow backend configurable
Adding this so that we can modify the clients' conf file as needed with slow backend.
This can be achieved by:

overrides:
  s3tests:
    slow_backend: true

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2014-10-13 15:07:06 -07:00
Greg Farnum
4db95170e6 document 'command' requirements on admin_socket method
Signed-off-by: Greg Farnum <greg@inktank.com>
2014-10-13 12:37:52 -07:00
Yan, Zheng
88133719b7 tasks/mds_client_recovery: file lock test
check that file lock doesn't get lost after an MDS restart

Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-10-10 16:54:29 +08:00
John Spray
6ac9efef9c tasks/cephfs: say which test failed in exception
Example:
Was: 'Test failure'
Now: Test failure: test_full_caps (tasks.mds_full.TestClusterFull)

Signed-off-by: John Spray <john.spray@redhat.com>
2014-10-08 16:27:44 +01:00
Sage Weil
7ba50e0c89 tasks/ceph_manager: enable log for ceph_objectstore_tool
Signed-off-by: Sage Weil <sage@redhat.com>
2014-10-07 08:37:53 -07:00
Loic Dachary
c598b8e962 erasure-code: unfound test needs a non empty file
Other rados put will fail as follows

$ touch /tmp/bar
$ ./rados -p rbd put existing_3 /tmp/bar
$ ./rados -p rbd put existing_3 /tmp/bar
WARNING: could not create object: existing_3
error putting rbd/existing_3: (17) File exists

it should be considered a bug in the rados command line but needs to be
addressed separately.

http://tracker.ceph.com/issues/9387 Fixes: #9387

Signed-off-by: Loic Dachary loic-201408@dachary.org
2014-10-02 08:06:42 +02:00
John Spray
48a0b75928 Merge remote-tracking branch 'origin/giant' into wip-merge
Conflicts:
	erasure-code/ec-rados-default.yaml
	tasks/mds_client_limits.py
	tasks/mds_client_recovery.py
	tasks/mds_journal_migration.py
2014-10-01 18:17:01 +01:00
Yan, Zheng
ff03b46509 tasks/mds_client_recovery: client trim its cache on reconnect
make sure CephFS client trim its cache before reconnect to the MDS.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-09-29 20:59:43 +01:00
John Spray
c2d298a43c tasks: wait for mds active before mounting clients
To make the logs clearer when trying to work out
if/when something went wrong, rather than always
having client logs start with some failures.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-29 15:04:33 +01:00
John Spray
0073e25d77 tasks: rename FuseMount.get_client_id to get_global_id
'client_id' was ambiguous because in other places it
meant the '0' in client.0, whereas here it means
the runtime-generated global ID of the client.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-29 15:04:25 +01:00
John Spray
b77b3bec72 tasks: add mds_client_limits
New CephFS tests for the behaviour of the system while
enforcing its resource limits.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-29 15:04:18 +01:00
John Spray
1fa15011a3 tasks: generalise CephFSTestCase
Some of this stuff could be even more general for embedding
unittest-style suites, but for the moment let's keep the cephfs
stuff in a walled garden.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-29 15:04:10 +01:00
John Spray
b6ccf0d414 tasks: generalize config writing for Filesystem
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-29 15:03:17 +01:00
John Spray
8f49a7d86a tasks: wait for active after mds restart
May have been causing spurious failures on
trying to read session state after MDS restart (
session list isn't populated until recovery is
complete)

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-25 11:28:53 +01:00
tamil
a5a1cce3c7 included an option to ceph_objectstore_tool, whenever we have keyvaluestore_backend as a configurable parameter
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-09-23 07:50:49 -07:00
John Spray
d9ec7f2f7a tasks: wait for mds active before mounting clients
To make the logs clearer when trying to work out
if/when something went wrong, rather than always
having client logs start with some failures.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-19 14:16:14 +01:00
John Spray
3e07bd1aaa tasks: rename FuseMount.get_client_id to get_global_id
'client_id' was ambiguous because in other places it
meant the '0' in client.0, whereas here it means
the runtime-generated global ID of the client.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-19 14:16:13 +01:00
John Spray
7274289542 tasks: add mds_client_limits
New CephFS tests for the behaviour of the system while
enforcing its resource limits.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-19 14:15:41 +01:00
John Spray
d777d7123b tasks: generalise CephFSTestCase
Some of this stuff could be even more general for embedding
unittest-style suites, but for the moment let's keep the cephfs
stuff in a walled garden.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-19 14:13:53 +01:00
John Spray
6f36269d24 tasks: generalize config writing for Filesystem
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-19 14:13:51 +01:00
Gregory Farnum
278f4dc77a Merge pull request #143 from ceph/wip-migration-test
tasks: more substantial IO for journal migration

Reviewed-by: Greg Farnum <greg@inktank.com>
2014-09-18 15:03:35 -07:00
John Spray
65a4141e22 Merge remote-tracking branch 'origin/giant' 2014-09-17 13:50:55 +01:00
John Spray
7d086403d4 tasks: escaping '*' when deleting files
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-17 13:37:08 +01:00
John Spray
366ee00554 tasks: more substantial IO for journal migration
...so that there will at least be multiple segments
in the log during the rewrite.

Also make the test stricter by checking that
cephfs-journal-tool can happily read the resulting
journal.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-16 15:14:54 +01:00
John Spray
1d9101cf31 tasks: fix race in test_stale_caps
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-15 14:32:20 +01:00
John Spray
4daf2ddc39 tasks: typo in mds_client_recovery
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-15 13:44:27 +01:00
John Spray
bc257677de tasks: handle failure cleanly in test_stale_caps
Previously would fail because the cap waiter
completed too soon, without noticing that the
reason it completed quickly was because it failed.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-15 13:44:27 +01:00
Samuel Just
f6582f8961 tasks: add watch_notify_same_primary
Reproduces: #9220
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-09-09 15:31:13 -07:00
Samuel Just
79989de8b0 Merge pull request #112 from ceph/wip-8231-forreview
Wip 8231 forreview

Reviewed-by: Samuel Just <sam.just@inktank.com>
2014-09-02 13:43:38 -07:00
John Spray
bbf569de74 tasks: fix mount race in mds_client_recovery
Signed-off-by: John Spray <john.spray@redhat.com>
2014-09-01 16:38:25 +01:00
David Zafman
05eee9fa79 ceph_manager: Add test code to use export/import to move a pg
Check for more than 1 osd down and randomize on chance_move_pg (100%)
For now only export from older down osd to newly down osd to avoid missing map

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-08-30 16:20:22 -07:00
David Zafman
0cdf6e813d ceph_manager: Implement export/import when thrasher kills an osd
Use list-pgs to avoid races by seeing actual pgs present

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-08-30 16:20:22 -07:00
David Zafman
9ade22dd34 ceph_objectstore_tool: Add task for testing of tool of the same name
Based on ceph/src/test/ceph_objectstore_tool.py but only does
replicated pool testing and doesn't test argument validation.

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-08-30 16:20:22 -07:00
tamil
b3dfe47589 Added dmcrypt option and ability to choose same or different disk for ceph journal
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-08-28 18:21:30 -07:00
John Spray
322e2498de Merge pull request #101 from ceph/wip-7810
Wip 7810
2014-08-27 22:22:13 +01:00
Zack Cerza
7baeb8043c Merge pull request #105 from ceph/wip-boto
tasks/s3tests: push boto config with idle_timeout setting
2014-08-26 09:58:39 -06:00
Sage Weil
4f8436bf5d Merge pull request #106 from ceph/wip-9091-wusui
mplement ceph.created_pool

Reviewed-by: Sage Weil <sage@redhat.com>
2014-08-26 06:34:45 -07:00
Sage Weil
12a391ea01 thrashosds: increase osd revive timeout (75s -> 150s)
This is needed when running valgrind.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-25 08:52:02 -07:00
Warren Usui
0ec5bd1a63 mplement ceph.created_pool
ceph.created_pool allows the user (via yaml lines) to add pools
that the ceph_manager knows about.

Fixes: 9091
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-08-22 17:39:38 -07:00
Sage Weil
9d466aa110 tasks/s3tests: push boto config with idle_timeout setting
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-22 15:28:33 -07:00
John Spray
1855e094e5 suites/fs: add client recovery
Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00
John Spray
d001cc27bc tasks/mds_client_recovery: use existing clients
This will enable using .yaml changes to switch this
guy over to use kcephfs client once the teuthology
code around it supports all the same hooks as I've added
for fuse.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00
John Spray
bb52a9733a tasks/mds_client_recovery: network freeze test
This is about testing the CephFS client's handling
of losing connectivity to the MDS.

Fixes: #7810

Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00
John Spray
8211d83dde tasks/ceph_fuse: enable umounting from config
This is for any test config that needs to run
some workunit with clients unmounted.  It allows
you to go toggle the mountedness of a client as
you go up and down the stack list this:

- ceph-fuse:
    client.0:
        mounted: true
- workunit:
    clients:
        client.0:
        - fs/misc/trivial_sync.sh
- ceph-fuse:
    client.0:
        mounted:
            false

The initial use case for this is running the
cephfs_journal_tool_smoke.sh workunit, which
tests administrative operations that are meant
to be run on an unmounted filesystem.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00
John Spray
1e7bfb842a tasks/workunit: fix log message
Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00
John Spray
5c29ae6bd1 tasks/ceph: add ceph.stop task
So that we can explicitly stop daemons on demand.  Useful
for MDS tool tests that want the MDS daemons not to be running,
is this is more solid and explicit than doing e.g. "ceph mds
stop" from within workunits.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-21 23:09:00 +01:00