832 Commits

Author SHA1 Message Date
Josh Durgin
721280b7f3 task_util: move rados command here
Six copies are replaced with one, with an added option to check status
automatically. This should probably be used in a few places where the
return code is ignored.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Josh Durgin
6d2434b48f rgw: move common rgwadmin function to a new utility file
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Joe Buck
88cab4712f teuth: reworked rgw to support regions / zones
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Joe Buck
afd0d87eda teuthology: updating RGW task to support regions
Extending the rgw.py task to allow for regions and zones
to be specified in the YAML file.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Sage Weil
c1e0812b6a ceph: add wait_for_mon_quorum command
tasks:
...
- ceph.wait_for_mon_quorum: [a, b]
...

will block until the mon quorum consists of exactly [a, b].  This is
compared directly to the relevant field from 'ceph quorum_status'
which has the alphanumeric names only.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
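The quorum check described in c1e0812b6a can be sketched in Python as below. This is an illustration only, not the task's actual code: the helper names, the get_status callable, the timeout, and the order-insensitive comparison are all assumptions, and 'quorum_names' is assumed to be the field of the 'ceph quorum_status' JSON that holds the mon names.

```python
import json
import time


def quorum_matches(quorum_status_json, expected):
    """Return True when the quorum holds exactly the expected mon names."""
    status = json.loads(quorum_status_json)
    # 'quorum_names' is an assumed field name for the list of mon names.
    return sorted(status.get('quorum_names', [])) == sorted(expected)


def wait_for_mon_quorum(get_status, expected, interval=1.0, timeout=30.0):
    """Poll get_status() until the quorum matches or the timeout expires.

    get_status is a hypothetical callable returning the JSON text of
    'ceph quorum_status'; the timeout is an addition for this sketch.
    """
    deadline = time.monotonic() + timeout
    while True:
        if quorum_matches(get_status(), expected):
            return
        if time.monotonic() >= deadline:
            raise RuntimeError('quorum never became %r' % (expected,))
        time.sleep(interval)
```

Passing the status source in as a callable keeps the comparison logic testable without a running cluster.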
Sage Weil
68888862a1 sequential, parallel: allow entries to be references to top-level config
Often we want to build a test collection that substitutes different
sequences of tasks into a parallel/sequential construction.  However, the
yaml combination that happens when generating jobs is not smart enough to
substitute some fragment into a deeply-nested piece of yaml.

Instead, make these sequences top-level entries in the config dict, and
reference them.  For example:

tasks:
- install:
- ceph:
- parallel:
  - workload
  - upgrade-sequence
workload:
  workunit:
    - something
upgrade-sequence:
  install.restart: [osd.0, osd.1]

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
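The reference lookup described in 68888862a1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from the commit: resolve_entries is a hypothetical helper, and a bare string entry is assumed to always name a top-level key in the config dict.

```python
def resolve_entries(entries, top_level_config):
    """Expand string entries into the task dicts they name."""
    resolved = []
    for entry in entries:
        if isinstance(entry, str):
            # A bare string is treated as a key into the top-level config.
            if entry not in top_level_config:
                raise KeyError('no top-level config entry named %r' % entry)
            entry = top_level_config[entry]
        resolved.append(entry)
    return resolved


# Mirrors the YAML example from the commit message.
config = {
    'workload': {'workunit': ['something']},
    'upgrade-sequence': {'install.restart': ['osd.0', 'osd.1']},
}
tasks = resolve_entries(['workload', 'upgrade-sequence'], config)
```

Entries that are already dicts pass through unchanged, so inline task definitions and references can be mixed in one list.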
Samuel Just
77cae4bf35 thrashosds: add delay option after recovery
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-22 16:30:57 -07:00
Joao Eduardo Luis
e84c54addd task: mon_clock_skew_check: grab max-skew value from ceph-mon's config
Instead of relying on hardcoded values, obtain the max-skew default from
'ceph-mon --show-config-value mon_clock_drift_allowed' to match the mon's
expectation.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-07-22 10:02:22 -07:00
Sage Weil
222b296019 admin_socket: loop until the socket command succeeds
Sometimes the thing we're talking to is slow to start, or to register the
command we are running.  Loop in that case, at least for a while.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-22 10:02:14 -07:00
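The retry behaviour described in 222b296019 might look roughly like this. The function name, the attempt limit, and the delay are assumptions chosen for the sketch; the commit message only says the command is retried "at least for a while".

```python
import time


def run_until_success(command, max_tries=10, delay=1.0):
    """Call command() until it stops raising, up to max_tries attempts."""
    for attempt in range(1, max_tries + 1):
        try:
            return command()
        except Exception:
            if attempt == max_tries:
                raise  # give up and surface the last failure
            time.sleep(delay)
```

Re-raising on the final attempt keeps the original error visible instead of hiding it behind a generic timeout.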
Yehuda Sadeh
37a662442d apache.conf: load modules conditionally
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-20 11:30:24 -07:00
Sage Weil
494accb8e2 ceph-deploy: do not test mon destroy by default
2013-07-19 22:19:09 -07:00
Joao Eduardo Luis
06ad2d2cc9 task: mon_clock_skew_check: by default, use max skew from global config
If not defined, defaults to 0.05; if 'max-skew' is defined, however, it
must override whatever is in the config.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-07-19 21:54:34 -07:00
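One reading of the precedence in 06ad2d2cc9 is sketched below: a 'max-skew' set on the task wins, then the value from the global config, then the 0.05 fallback. The function name and arguments are hypothetical, and the exact ordering is an interpretation of the commit message rather than a copy of the task's code.

```python
DEFAULT_MAX_SKEW = 0.05  # fallback named in the commit message


def pick_max_skew(task_config, global_value=None):
    """Choose the allowed clock skew for the check."""
    if 'max-skew' in task_config:
        # An explicit task setting overrides everything else.
        return float(task_config['max-skew'])
    if global_value is not None:
        # Otherwise use the value read from the global config.
        return float(global_value)
    return DEFAULT_MAX_SKEW
```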
Joao Eduardo Luis
7ca59dfae1 task: mon_clock_skew_check: missing 'str'.format() key crashed the test
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-07-19 21:54:28 -07:00
Sage Weil
51c2963e53 mon_thrash: tolerate scrub command failure
We can get EBUSY if there is an election going on.
2013-07-19 21:53:33 -07:00
Josh Durgin
633a1881e4 Merge branch 'wip-centos-rgw'
2013-07-19 14:44:51 -07:00
Josh Durgin
f0123db7d9 rgw: turn off continue print on centos
Also note that a ceph.conf setting is needed for that too

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-19 14:42:38 -07:00
Josh Durgin
29e7db26eb install: remove ceph-release rpm file after it's used
No need for it anymore, it's just taking up space

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-19 14:36:18 -07:00
Sage Weil
d874b57ef2 sleep: new task
2013-07-19 11:26:58 -07:00
Sage Weil
4db5b930af ceph: do not ignore osd leaks
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-18 18:05:25 -07:00
Sage Weil
2173d33d89 ceph-deploy: support overrides
Something like

overrides:
  ceph-deploy:
    foo: bar

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-18 11:21:20 -07:00
Yehuda Sadeh
c395687fd9 radosgw-admin: adapt task to recent changes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-17 14:05:26 -07:00
Sage Weil
b15513fc7c workunit: set CEPH_CLI_TEST_DUP_COMMAND
This will make the CLI do every mon command twice and make sure they both
succeed.  This catches problems with mon command idempotency faster than
waiting for random failures to trigger.
2013-07-16 17:16:08 -07:00
tamil
51352677ab added conf section to ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-16 17:14:33 -07:00
Warren Usui
e3d9084cd9 Created tasktest to test sequential and parallel tasks.
Added sequential task and parallel task.
Changed _run_one_task to run_one_task (now called by new tasks too).

Fix #4969
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-15 19:08:23 -07:00
tamil
47696d2b92 calling mon destroy command after mds create
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-15 17:04:21 -07:00
Sage Weil
5d596a4679 ceph_manager: drop -t arg prefix for pg dump_stuck
This is no longer needed, and ugly to support.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-13 14:11:26 -07:00
Josh Durgin
c0a7808164 rgw: use different daemon name and apache module paths
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-10 18:00:25 -07:00
Josh Durgin
42b9ea95da misc: move system type checking to a generic location
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:52:00 -07:00
Josh Durgin
7845848c0f s3tests: fix client configurations that aren't dictionaries
They're always used as dictionaries later on.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:51:03 -07:00
tamil
3a1c316aac Add mon create and destroy with an optional argument mon_initial_members
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-09 11:12:29 -07:00
Sage Weil
a0457492cc mon_thrasher: add pause/unpause of mons to thrashing
This adds an additional element of lagginess to the cluster which should
cause mons to call new elections.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:39 -07:00
Sage Weil
fdebf40f34 daemon-helper: send arbitrary signals via stdin
Each byte written to stdin will be interpreted as a signal.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:26 -07:00
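The byte-per-signal idea in fdebf40f34 can be illustrated with the sketch below. It is not the daemon-helper source: forward_signals and the send callback are hypothetical names, and the sketch assumes each byte's integer value is the signal number to deliver.

```python
import io
import signal


def forward_signals(stream, send):
    """Read stream one byte at a time and pass each byte value to send()."""
    while True:
        byte = stream.read(1)
        if not byte:
            break  # EOF: the controlling side closed the pipe
        send(byte[0])  # the byte's integer value is taken as the signal number


# Demonstration with an in-memory stream instead of a real stdin pipe.
received = []
forward_signals(io.BytesIO(bytes([signal.SIGTERM, signal.SIGINT])),
                received.append)
```

In a real helper, send would presumably be something like a wrapper around os.kill aimed at the child process, and stream would be the binary stdin of the helper.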
Sage Weil
121b1b9f38 mon_thrash: optionally scrub after each iteration (default true)
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:24 -07:00
Sage Weil
9b22d38883 mon_thrash: fix more naming
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sage Weil
389f65e7a6 mon_thrash: use _ instead of - consistently
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sandon Van Ness
d54932cbc8 Fix VM issues.
Fix of #5494, although the description is bad. Instead of adding a wait,
the code used to detect whether the guest was back up is fixed. The
previous code appeared to assume only one machine and broke
when it was waiting for multiple machines if the guests did not
come up within 10 seconds of each other.

Make nuke not do the normal stuff if the machine is a VPS as we
just destroy them when they get unlocked.

Instead of getting downburst options from ~/.teuthology.yaml, get
them from the yaml given to teuthology for the test/task.

Fixed an error that would make all the default downburst values
not take effect if any of them were set via a yaml.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-03 19:07:35 -07:00
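The defaults problem mentioned in d54932cbc8 (all default downburst values being lost once any one was set in the yaml) is the kind of bug a per-key merge avoids. The sketch below is an assumption about the shape of the fix, not the actual change; merge_with_defaults and the option names 'cpus' and 'ram' are hypothetical.

```python
def merge_with_defaults(defaults, overrides):
    """Return defaults with any explicitly set options layered on top."""
    merged = dict(defaults)          # start from every default value
    merged.update(overrides or {})   # replace only the keys that were set
    return merged


# Hypothetical option names, for illustration only.
defaults = {'cpus': 1, 'ram': 512}
options = merge_with_defaults(defaults, {'ram': 1024})
```

Copying the defaults first means a yaml that sets one option leaves the other defaults in effect.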
Sage Weil
813d96ffdd ceph: don't check leaks on client.* (i.e., radosgw)
...until we fix them.  This way we can see other valgrind issues.
2013-07-03 09:59:21 -07:00
Sage Weil
93ae152384 radosgw-admin: add missing quote
2013-07-01 14:21:55 -07:00
Sage Weil
c0bf24d770 radosgw-admin: test 'bucket list' command (all buckets)
Verifies fix for #5455
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-01 14:21:48 -07:00
Sage Weil
140e63b413 ceph: disable logrotate
This screwed up the log archival step at the end, and generally makes a
mess of automated runs.

Fixes: #5451
2013-06-26 17:48:03 -07:00
Sage Weil
ee03afaa60 dump_stuck: fix test
The mon-osd-report-timeout setting shouldn't be there!  We will set the
other item explicitly, and remove both from the suite yaml.

Fixes: #5440
2013-06-25 12:49:07 -07:00
Sage Weil
a6b65fe81b Merge pull request #15 from ceph/wip-ulimits
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 16:18:36 -07:00
Sage Weil
3142252129 Merge pull request #16 from ceph/wip-5431
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 11:01:48 -07:00
Sage Weil
947eba1a60 rados: fix multiclient tests
Each client (not run) gets its own pool!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 20:44:38 -07:00
Sage Weil
e6e1df69ea dump_stuck: fix race with osd start
Occasionally we don't wait long enough for the osd to start and
mark itself up.  Keep trying until flush succeeds.

Fixes: #5431
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 16:21:59 -07:00
Sage Weil
13dbe9d1a5 enable-coredump -> adjust-ulimits
and set max_files to be big, too!
2013-06-23 09:15:28 -07:00
Sandon Van Ness
d39d648ead Merge remote-tracking branch 'remotes/origin/wip-sandon-cephdeploy'
2013-06-21 15:53:53 -07:00
Sandon Van Ness
f6270a7fb0 Wipe out existing id_rsa.pub and id_rsa before pushing ssh keys
A very simple change. Just touch a file first (to create it if it
doesn't yet exist so the delete doesn't error out) and then delete
it before pushing the keys to the file. This should keep the
id_rsa.pub and id_rsa files from getting messed up by previous
runs which were interrupted or failed (or if those files exist for
some reason). This appears to be what was causing breakage in the
ceph-deploy nightlies.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-21 14:45:36 -07:00
Chris Lee
70b5467122 Clean up nested-if logic
2013-06-20 13:42:33 -07:00
Samuel Just
3d9fa2253b task/peering_speed_test.py: add test which summarizes pg peering speed
Running this regularly may warn us about slow peering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-06-19 15:16:28 -07:00