Commit Graph

879 Commits

Author SHA1 Message Date
Joe Buck
3eb62d2a36 rgy.py: make log_(data|metadata) configurable
The log_data and log_metadata are made configurable
via the YAML file and default to false
(meaning neither data nor metadata operations are
logged).

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:11:58 -07:00
Josh Durgin
ef68b98822 radosgw-agent: rename task to match tool name
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:36 -07:00
Josh Durgin
189e04fa88 rgw_sync_agent: fix a bunch of issues
- Read ceph.conf from stored copy that includes overrides
- Get system users and keys from cluster instead of reading other
  tasks' yaml, which may not be complete.
- Put zone info extraction from the cluster into utility functions,
  since it'll be useful for other tests later.
- Work with more than one agent on a single host
- Accept more than one client to run, like almost every other task
- Rename target to dest for consistency with radosgw-agent
- Don't make everything one large function

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:36 -07:00
Josh Durgin
ad6a9e537f rgw: ignore exit status when removing base apache dir
subdirs have already been removed, and this only needs to succeed once
per host.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:35 -07:00
Greg Farnum
f930f39b18 rgw_sync_agent: add new rgw_sync_agent task
This pulls access data out of the rgw task and off disk,
and then downloads, sets up, and runs an rgw sync agent
in test mode.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-08-01 17:33:35 -07:00
Joe Buck
1552a4b97a rgw.py: check for empty client config
Fixes a bug where an rgw client without
a system user specified would cause teuthology
to error out.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
2013-08-01 10:28:26 -07:00
Joe Buck
7f76c1ced1 rgw.py: refactoring to separate user creation
By separating out the user creation from
generating the region/zone info, we can generate
users for RGW tests that run against the default
pools.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 16:02:42 -07:00
Joe Buck
8c8e9a1403 rgw.py: change --secret-key to --secret
A 'user create' call was being passed to radosgw-admin
with '--secret-key' instead of the valid '--secret'
which was causing a random secret to be generated,
which was causing subsequent tests to fail.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-31 14:18:20 -07:00
Josh Durgin
1f7127b243 s3/swift tests: call radosgw-admin as the right client
This allows the right region and zone info to be read from ceph.conf

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 13:32:58 -07:00
Josh Durgin
2f2108b3c0 rgw: fix dir creation and keyring
fastcgi_sock dir needs to exist before radosgw starts, and apache-execed radosgw needs an explicit keyring argument.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 12:47:48 -07:00
Sandon Van Ness
74b344e980 Reconnect after running chef task.
Just a simple change to reconnect to SSH after running
ceph-qa-chef to get around things like ulimit changes.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-30 17:26:59 -07:00
Sandon Van Ness
00c2ac5b2c Fix RHEL/centos ceph-deploy installs.
Use a hiphen as a diliminator instead of a dot.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-30 12:36:12 -07:00
Samuel Just
e1cd49bd45 ceph_manager: wait 20s before wait_for_clean in test_map_discontinuity
Fixes: #5806
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-30 10:47:22 -07:00
Warren Usui
bcf467c803 Remove now unnecessary try/except/raise that came about as a result of
the previous check-in.

Fixes: #5803
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-30 10:41:14 -07:00
Warren Usui
48a977e0f2 Always raise exception if yum install fails. This avoids later confusion when
packages are missing (the old code skipped 'Nothing to do' messages, but these
cases are still errors).

Fixes #5803
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Sandon Van Ness
2013-07-30 10:40:47 -07:00
Samuel Just
8284e19226 ceph_manager: wait for all_up in test_map_discontinuity
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-29 13:45:24 -07:00
Samuel Just
e703942ce4 ceph_manager: allow-experimental-feature now causes an EINVAL
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-29 12:12:24 -07:00
Yehuda Sadeh
9dac3fe809 s3tests: clone correct branch
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-29 09:09:51 -07:00
Sage Weil
0158f0db9d adjust-ulimits: 16k open files
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-28 15:09:43 -07:00
Sage Weil
496c6773ac ceph_manager, dump_stuck: fix injectargs args
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 17:56:08 -07:00
Josh Durgin
73adf33346 radosgw-admin: fix typo in import
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-27 12:30:52 -07:00
Sage Weil
45ce9b0b54 valgrind: suppress curl, gnutlss, libfcgi leaks
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 07:49:11 -07:00
Sage Weil
a039d4ada7 ceph_manager: don't mark out an out osd on kill_osd(..., mark_out=True)
Fixes

2013-07-27T02:02:23.879 INFO:teuthology.task.thrashosds.thrasher:Recovered, killing an osd
2013-07-27T02:02:23.879 INFO:teuthology.task.thrashosds.thrasher:Killing osd 2, live_osds are [3, 4, 0, 1, 5, 2]
2013-07-27T02:02:24.547 INFO:teuthology.task.thrashosds.thrasher:Removing osd 2, in_osds are: [4, 0, 1, 5]
...
ValueError: list.remove(x): x not in list

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 07:48:55 -07:00
Sage Weil
404f6d245b rest-api: rename
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-26 18:21:21 -07:00
Warren Usui
3f0340f45f Add rest_api, a new task that starts up /usr/bin/ceph-rest-api running
as a daemon.

Fixes: 5613
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-26 18:20:16 -07:00
Josh Durgin
f2cddda154 rgw: correct socket option name
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:18:24 -07:00
Josh Durgin
699d0a3b27 rgw: add rgw log socket to daemonized radosgw too
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:15:28 -07:00
Josh Durgin
65172a0938 rgw: pass socket path directly to radosgw
Only radosgw needs this option, and each one will be different, so
remove it from the ceph.conf template.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:13:19 -07:00
Samuel Just
a355d9f570 ceph_manager: add test_map_discontinuity to thrasher
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-26 10:40:58 -07:00
Greg Farnum
a0b51b1f31 samba: run lsof and fuser after shutdown
The clients are pretty regularly reporting busy on unmount when
samba runs above them. This will hopefully give us some info about why.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-07-26 09:39:26 -07:00
Sandon Van Ness
f19bcfc9df Merge branch 'master' of github.com:ceph/teuthology 2013-07-25 19:50:39 -07:00
Sandon Van Ness
41cafb1a10 Merge remote-tracking branch 'origin/wip-sandon-vm'
Conflicts:
	teuthology/lock.py
	teuthology/misc.py
	teuthology/task/install.py
2013-07-25 19:50:02 -07:00
tamil
aeb3586bfb added test_mon_destroy option to ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-25 17:12:52 -07:00
Sandon Van Ness
57933b6f92 Fix priority so it only is added once.
Needed for baremetal so the sed doesn't keep adding
the priority line in the repos file.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-25 14:24:43 -07:00
Sage Weil
77ee5a0714 ceph_manager: try both new and old tell mon.* syntax
So we can test old code with new teuthology (e.g., upgrade testing)

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-25 13:54:20 -07:00
Sage Weil
1fcf8e3fbb ceph: make restart take both a list of daemons and other arguments
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Tamil Muthamizhan <tamil@inktank.com>
2013-07-24 13:26:24 -07:00
Joe Buck
3c9382a7fa teuth: rgw task example config update
Due to bug #5716, pools need to start with a '.' at present.
Updating the examples to follow this convention.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 13:00:18 -07:00
Joe Buck
4fb5781afb teuth: fix issue in cleanup code
The post-yield code in create_dirs needed to
be tweaked to correctly delete the {tdir}/apache
directory (if it exists) on each client.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 13:00:10 -07:00
Sage Weil
75136a8322 ceph: wait-for-osds-up option for restart
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-24 10:25:10 -07:00
Sage Weil
13aca3b7ad ceph: wait_for_osds_up
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-24 10:25:10 -07:00
Josh Durgin
8ad065dc9c rgw: add multi-region and zone support
Take client<->zone/region and the associated pools from ceph.conf, so
we don't have to invent a new format to specify it.

General region info is added to a new configuration section in the rgw
task. Each client is assumed to be a different zone, and a system user
is created with the key specified in the yaml, so it can be passed to
later task configuration as well. This isn't strictly necessary, but
avoids having to lookup this info in later tasks through something
like radosgw-admin.

Ports are allocated automatically because there's no obvious mapping
from host to client in the task configuration. Later tests can get the
endpoints desired by reading the region map.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Josh Durgin
721280b7f3 task_util: move rados command here
Six copies are replaced with one, with an added option to check status
automatically. This should probably be used in a few places where the
return code is ignored.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Josh Durgin
6d2434b48f rgw: move common rgwadmin function to a new utility file
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Joe Buck
88cab4712f teuth: reworked rgw to support regions / zones
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Joe Buck
afd0d87eda teuthology: updating RGW task to support regions
Extending the rgw.py task to allow for regions and zones
to be specified in the YAML file.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Sandon Van Ness
55cc15fb24 Wait a little longer before recreating VMs
Just to allow for the create to still work incase the os
volume is fairly large (takes a while to resize) and in
case the host machine is bogged down due to disk I/O.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-23 15:52:32 -07:00
Sage Weil
c1e0812b6a ceph: add wait_for_mon_quorum command
tasks:
...
- ceph.wait_for_mon_quorum: [a, b]
...

will block until the mon quorum consists of exactly [a, b].  This is
compared directly to the relevant field from 'ceph quorum_status'
which has the alphanumeric names only.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
Sage Weil
68888862a1 sequential, parallel: allow entries to be references to top-level config
Often we want to build a test collection that substitutes different
sequences of tasks into a parallel/sequential construction.  However, the
yaml combination that happens when generating jobs is not smart enough to
substitute some fragment into a deeply-nested piece of yaml.

Instead, make these sequences top-level entries in the config dict, and
reference them.  For example:

tasks:
- install:
- ceph:
- parallel:
  - workload
  - upgrade-sequence
workload:
  workunit:
    - something
upgrade-sequence:
  install.restart: [osd.0, osd.1]

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
Samuel Just
77cae4bf35 thrashosds: add delay option after recovery
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-22 16:30:57 -07:00
Joao Eduardo Luis
e84c54addd task: mon_clock_skew_check: grab max-skew value from ceph-mon's config
Instead of relying on hardcoded values, obtain the max-skew default from
'ceph-mon --show-config-value mon_clock_drift_allowed' to match the mon's
expectation.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-07-22 10:02:22 -07:00