Commit Graph

1004 Commits

Author SHA1 Message Date
Sam Lang
3390cc30a6 Scripts to use pyflakes to check python syntax.
pyflakes runs a basic syntax checker against python code.
The added check-syntax.sh script and Makefile run pyflakes
on the python code within the teuthology directory reporting
any syntax errors that are found.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-01-31 07:56:56 -06:00
Joao Eduardo Luis
a63fac32f8 task: mon_clock_skew_check: use absolute value when comparing mon_skew
The monitors may report either positive or negative clock skews, and by
not using an absolute value we were constantly ignoring reported negative
clock skews.

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-30 20:52:39 +00:00
Joao Eduardo Luis
89e09fa90c task: mon_clock_skew_check: mark as ran once if an expected skew was found
... even if we didn't get a clean/finished result from the monitors

This ought to significantly cut the waiting time if something else (or
someone else) is leaving the leader hanging thus unable to finish a given
timecheck round.

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-30 20:52:03 +00:00
Sage Weil
19f4273190 peer: fix filtering out of scrub from pg state 2013-01-29 14:04:09 -08:00
Sage Weil
e805b7d62e admin_socket: don't bother remote executing if there is no test 2013-01-29 03:45:45 -08:00
Samuel Just
e33b425db7 osd_recovery: use --no-cleanup for rados bench
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-28 20:22:33 -08:00
Samuel Just
1c31194920 osd_recovery: inject a recovery delay
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-28 20:22:33 -08:00
Sage Weil
3b27c9ecbc osd_backfill: --no-cleanup for rados bench 2013-01-28 19:53:34 -08:00
Josh Durgin
826e5860a0 cram: fix for runs with coverage enabled
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-01-28 14:54:49 -08:00
Sage Weil
b5f81636a2 osdthrasher: inject pause on a live (on in) osd 2013-01-26 13:13:08 -08:00
Joao Eduardo Luis
aa85d914c4 task: mon_clock_skew_check: increase timeout and kick it off only on stop
We were kicking-off the timeout as soon as we started; it's better however
to kick if off only when we are told to stop (as long as 'at-least-once'
is true).

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-25 17:56:09 -08:00
Joao Eduardo Luis
673101c72f task: mon_clock_skew_check: distinguish between on-going and finished check
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-25 17:56:05 -08:00
Samuel Just
3a5c70b89b ceph_manager: turn long stall injection off by default
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-24 17:31:38 -08:00
Sage Weil
006e70657d osd_recovery: fix up incomplete test
- stop rados bench from cleaning up
- flush pg stats
- fix sleep call

One or more of these helped fix this test, don't really care which.
2013-01-24 16:24:16 -08:00
Sage Weil
20af01f23b ceph_manager: fix get_num_active_recovered()
The states now have 'backfill' *or* 'recover' in them.
2013-01-24 16:23:33 -08:00
Sage Weil
b150e8e3f3 workunit: pass java path as env variable
The libcephfs-java test needs this.
2013-01-24 15:21:01 -08:00
Samuel Just
6a859bcd56 ceph_manager: use 80/70 as pause_long, pause_check_after defaults
OSD::op_tp suicides after 150.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-24 12:50:26 -08:00
Samuel Just
0f24dca2d7 ceph_manager: use do_rados for rmpool
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-24 10:08:44 -08:00
Sage Weil
9b56f3671a Merge remote-tracking branch 'gh/wip_heartbeat' 2013-01-23 18:43:02 -08:00
Samuel Just
ec5a14553f ceph_manager: default chance_down to 0.4
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-23 17:44:05 -08:00
Samuel Just
566ae5332e ceph_manager: add filestore and heartbeat stalls
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-23 17:40:40 -08:00
Sandon Van Ness
5d66c9ab01 Use ceph git repo instead of github.
This code change is so that instead of pulling the tarball of github
which can be unreliable at times it instead uses the ceph repo mirror
and serves as the same function. Now it is using git archive and no
longer uses wget. Because of this less tar-fu is needed to extract
the necessary files as it can be done directly through git archive.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-01-23 17:22:31 -08:00
David Zafman
e714c77812 osd: Testing of deep-scrub omap changes
Fix scrub_test.py and add omap corruption test

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
2013-01-22 15:48:45 -08:00
Joe Buck
b6e3edc6d8 test: create /tmp/cephtest/mnt.{id}
The workunit task assumes that a mount exists
at /tmp/cephtest/mnt.{id}
This patch creates the path if it doesn't
exist, enabling workunits to run in the absense
of kclient or ceph-fuse tasks.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by:  Sam Lang <sam.lang@inktank.com>
2013-01-22 13:09:46 -08:00
Joao Eduardo Luis
98cc1b835c task: mon_clock_skew_check: add option to run at least one timecheck
at-least-once          Runs at least once, even if we are told to stop.
                         (default: True)
  at-least-once-timeout  If we were told to stop but we are attempting to
                         run at least once, timeout after this many
                         seconds. (default: 300)

Fixes: #3854

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-21 21:10:45 -08:00
Sam Lang
53f22d9493 task/mds_thrasher: New task for thrashing the mds
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-18 15:48:52 -06:00
Alex Elder
dbc38eff62 rbd.py: update scratch and test image sizes
Test 167 was failing due to running out of space on the scratch
file system.  The test reserves 21MB in a file, and repeats 50
times.  It required just over 1GB, so I bumped the default size
for the testing device to 1200 MB.  I increased the test device
size as well.

This resolves http://tracker.newdream.net/issues/3864.

Signed-off-by: Alex Elder <elder@inktank.com>
2013-01-18 12:47:34 -06:00
Sage Weil
cd09be6ac8 ceph: pass ceph.conf to osdmaptool
This ensure it sees the chooseleaf option and generates the proper
CRUSH rules.
2013-01-17 12:27:17 -08:00
Loic Dachary
72db1a59cd When running teuthology with targets provisionned on OpenStack and kvm, the disks will show under /dev/vda, /dev/vdb etc. Add them to the list of devices to inspect and use for tests.
Signed-off-by: Loic Dachary <loic@dachary.org>
2013-01-16 20:48:15 -08:00
Josh Durgin
c8a9a9a888 Add cram task
This runs cram tests, which are an easy way to test output
stays consistent. We already use cram for basic cli tests with no cluster,
and now we can use it for whole system tests too.
2013-01-15 14:07:58 -08:00
Greg Farnum
71097b7b91 Revert "task/kclient: chmod root to 1777."
This reverts commit f17847e537. It had
a typo and we hopefully don't need it.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-01-14 16:14:08 -08:00
Sage Weil
92a9d9c229 ceph.conf: separate replicas across osds
ceph.git master now separates across crush hosts without this setting.
For teuthology clusters, we don't want that (unless the tests specifies
otherwise).
2013-01-13 22:52:00 -08:00
Sage Weil
9e4a3f0399 Merge remote-tracking branch 'gh/wip-3633' 2013-01-10 18:04:55 -08:00
Greg Farnum
f17847e537 task/kclient: chmod root to 1777.
Signed-off-by: Greg Farnum <greg@inktank.com>
2013-01-08 10:11:03 -08:00
Sam Lang
f8958463ae task/mpi: Allow working directory to be specified
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 10:07:42 -06:00
Sam Lang
1624812133 task: A task to setup mpi
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 09:34:03 -06:00
Sam Lang
e88c0fc81a task/ceph-fuse: chmod root to 1777
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 09:33:01 -06:00
Sam Lang
4ed20ae8fe task/pexec: Add barrier capability
This patch adds the ability to barrier between
parallel exec tasks so that all tasks will perform
the following step (after the barrier) at the same
time.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 09:32:08 -06:00
Sam Lang
3532008310 task/pexec: More fixes for all case, exec on hosts
We don't want to do an exec per role, but per-host.  We
were already doing an exec per host, but the names were confusing.
This fixes the names up and removes the role parameters.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 09:31:48 -06:00
Sam Lang
081a80f89c task/pexec: Fix when 'all' is used
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 09:29:34 -06:00
caleb miles
d44fb14762 radosgw-admin.py: Increase test coverage to current admin feature set.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-01-08 10:25:13 -05:00
Joao Eduardo Luis
620dd5511b task: mon_clock_skew_check.py: Check for clock skews on the monitors
Will run for as long as teuthology runs. By default, fails if any clock
skews higher than 0.05 seconds are detected, but will only fail when the
teuthology run finishes and after reporting a list of all the detected
skews.

Accepted options:

 interval     amount of seconds to wait in-between checks. (default: 30.0)
 max-skew     maximum skew, in seconds, that is considered tolerable
              before issuing a warning. (default: 0.05)
 expect-skew  'true' or 'false', to indicate whether to expect a skew
              during the run or not. If 'true', the test will fail if no
              skew is found, and succeed if a skew is indeed found; if
              'false', it's the other way around. (default: false)
 never-fail   Don't fail the run if a skew is detected and we weren't
              expecting it, or if no skew is detected and we were
              expecting it. (default: False)

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-04 18:16:58 +00:00
Joao Eduardo Luis
e88b909a1d task: ceph_manager: add 'get_mon_health' function
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-04 17:03:55 +00:00
Joe Buck
5ce47c2a99 ssh_keys.py: pull the keys out of targets entry
rather than the hosts known hosts file.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-01-03 15:04:55 -08:00
Sage Weil
9ca69e738a ceph: malloc check =3 means we hear on stderr too 2013-01-02 20:44:33 -08:00
Sage Weil
06310994df ceph: enable malloc debugging for ceph-osd 2013-01-02 12:31:54 -08:00
Joao Eduardo Luis
ed586c1bb0 task: ceph: don't wait for 'healthy' if 'wait-for-healthy' is false.
This new config option obviously defaults to 'true' in order to not only
maintain compatibility, but because it makes sense.

Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2012-12-31 16:11:50 +00:00
Sage Weil
bb4a2c558b rgw: enable logging in ceph.conf 2012-12-29 08:28:44 -08:00
Yehuda Sadeh
c02d34dce1 task/swift: change upstream repository url
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2012-12-21 10:20:02 -08:00
Samuel Just
f2dbe5edd7 CephManager: add ability to test split
Signed-off-by: Samuel Just <sam.just@inktank.com>
2012-12-11 15:11:06 -08:00