Commit Graph

997 Commits

Author SHA1 Message Date
Zack Cerza
cabb4142d1 Make this module pep8-clean. 2013-08-12 11:40:04 -05:00
Sage Weil
5aee4a8b6c Merge pull request #22 from ceph/rpm-upgrades
Fix rpm upgrades for nested branch configs
2013-08-10 11:35:05 -07:00
Zack Cerza
7c377cb77c Make _get_baseurlinfo_and_dist() provide the correct uri 2013-08-10 12:45:21 -05:00
Zack Cerza
251cbf7fd7 Merge pull request #21 from ceph/rpm-upgrades
Make install.upgrade work on RPM-based systems.
2013-08-09 16:26:39 -07:00
Zack Cerza
b78a6c779b Make install.upgrade work with CentOS/RHEL/Fedora. 2013-08-09 17:01:36 -05:00
Sage Weil
a0e74a9614 admin_socket: split command into multiple arguments
We were passing the command as a single string, but that does not work the
same way it used to.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-09 09:04:25 -07:00
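Note on a0e74a9614: the difference between passing a command as one string and as separate arguments can be sketched in Python. This is only an illustration; the helper name `build_admin_socket_args` and the socket path are hypothetical and are not taken from the task's code.

```python
import shlex


def build_admin_socket_args(socket_path, command):
    """Return an argv list with each word of `command` as its own argument.

    Hypothetical helper for illustration; not the teuthology implementation.
    """
    # shlex.split honours shell-style quoting, so "a 'b c'" yields ['a', 'b c'].
    return ['ceph', '--admin-daemon', socket_path] + shlex.split(command)


# The single string 'config get debug_osd' becomes three separate arguments.
args = build_admin_socket_args('/tmp/example.asok', 'config get debug_osd')
print(args)
```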
Alfredo Deza
6acee41c07 add a key for ceph-deploy-branch detection
Signed-off-by: Alfredo Deza <alfredo@deza.pe>
2013-08-06 15:20:56 -07:00
Joe Buck
70f4eeb970 s3readwrite.py: enable overrides
Enable s3readwrite task to have the branch to
download specified and for overrides to be
incorporated into the config at run-time.
Code based on the s3tests.py task.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-06 12:15:37 -07:00
Sage Weil
18580c1d9c Merge remote-tracking branch 'gh/next' 2013-08-04 14:54:26 -07:00
Sage Weil
89347539ad samba: wait for smbd process to exit
I saw

2013-08-03T12:56:26.641 DEBUG:teuthology.orchestra.run:Running [10.214.131.28]: 'sudo killall -9 smbd'
2013-08-03T12:56:26.727 DEBUG:teuthology.orchestra.run:Running [10.214.131.28]: 'sudo lsof /home/ubuntu/cephtest/93695/mnt.0'
2013-08-03T12:56:26.830 INFO:teuthology.orchestra.run.out:[10.214.131.28]: COMMAND   PID USER   FD   TYPE DEVICE SIZE/OFF NODE NAME
2013-08-03T12:56:26.830 INFO:teuthology.orchestra.run.out:[10.214.131.28]: smbd    12381 root  cwd    DIR    0,0        0    1 /home/ubuntu/cephtest/93695/mnt.0

which makes me think we just need to wait a moment before
attempting the umount?

Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-04 14:54:20 -07:00
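Note on 89347539ad: "wait a moment before attempting the umount" is commonly done by polling until a condition holds rather than sleeping for a fixed time. The sketch below is a generic helper under that assumption; `wait_for` and its parameters are hypothetical names, and the actual check (for example, whether smbd is still running) is left to the caller.

```python
import time


def wait_for(condition, timeout=10.0, interval=0.5, sleep=time.sleep):
    """Poll `condition` until it returns True or `timeout` seconds elapse.

    Returns True if the condition was met, False if the wait timed out.
    """
    waited = 0.0
    while True:
        if condition():
            return True
        if waited >= timeout:
            return False
        sleep(interval)
        waited += interval
```

A caller would pass a function that returns True once the process has exited, and attempt the unmount only when `wait_for` returns True.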
Joe Buck
4bd5b591d4 rgw.py: add None object check when parsing info
The rgw task was failing to check for a None object
when parsing user info in the case where there were
config options set for the client that did not include
user info (e.g. valgrind: ).
2013-08-03 17:16:52 -07:00
Sage Weil
98f35a5e49 rgw: do not ignore leaks
Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-02 12:00:18 -07:00
Joe Buck
3accf1c68a rgw.py: adding a safety check
Correcting a bug where specifying
an rgw server for a client but not specifying
a system user would throw an exception.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 21:05:15 -07:00
Josh Durgin
9110c7d8ca rgw: add log settings to example config
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:12:05 -07:00
Josh Durgin
1787a72ef0 radosgw-agent: get verbose logs from the agent
This makes it a lot easier to debug.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:12:03 -07:00
Josh Durgin
1ec9c1b4b8 radosgw-agent: move zone extraction to helper
This will be useful for other tasks.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:12:02 -07:00
Josh Durgin
fdde365858 rgw: don't make domain_root pool the same as zone root
These are supposed to be separate pools.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:12:01 -07:00
Josh Durgin
d79a2db578 rgw: create system users with the --system flag
Without this they have no special permissions.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:11:59 -07:00
Joe Buck
3eb62d2a36 rgw.py: make log_(data|metadata) configurable
The log_data and log_metadata are made configurable
via the YAML file and default to false
(meaning neither data nor metadata operations are
logged).

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 19:11:58 -07:00
Josh Durgin
ef68b98822 radosgw-agent: rename task to match tool name
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:36 -07:00
Josh Durgin
189e04fa88 rgw_sync_agent: fix a bunch of issues
- Read ceph.conf from stored copy that includes overrides
- Get system users and keys from cluster instead of reading other
  tasks' yaml, which may not be complete.
- Put zone info extraction from the cluster into utility functions,
  since it'll be useful for other tests later.
- Work with more than one agent on a single host
- Accept more than one client to run, like almost every other task
- Rename target to dest for consistency with radosgw-agent
- Don't make everything one large function

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:36 -07:00
Josh Durgin
ad6a9e537f rgw: ignore exit status when removing base apache dir
subdirs have already been removed, and this only needs to succeed once
per host.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-08-01 17:33:35 -07:00
Greg Farnum
f930f39b18 rgw_sync_agent: add new rgw_sync_agent task
This pulls access data out of the rgw task and off disk,
and then downloads, sets up, and runs an rgw sync agent
in test mode.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-08-01 17:33:35 -07:00
Joe Buck
1552a4b97a rgw.py: check for empty client config
Fixes a bug where an rgw client without
a system user specified would cause teuthology
to error out.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
2013-08-01 10:28:26 -07:00
Joe Buck
7f76c1ced1 rgw.py: refactoring to separate user creation
By separating out the user creation from
generating the region/zone info, we can generate
users for RGW tests that run against the default
pools.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 16:02:42 -07:00
Joe Buck
8c8e9a1403 rgw.py: change --secret-key to --secret
A 'user create' call was being passed to radosgw-admin
with '--secret-key' instead of the valid '--secret'
which was causing a random secret to be generated,
which was causing subsequent tests to fail.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-31 14:18:20 -07:00
Josh Durgin
1f7127b243 s3/swift tests: call radosgw-admin as the right client
This allows the right region and zone info to be read from ceph.conf

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 13:32:58 -07:00
Josh Durgin
2f2108b3c0 rgw: fix dir creation and keyring
fastcgi_sock dir needs to exist before radosgw starts, and apache-execed radosgw needs an explicit keyring argument.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-31 12:47:48 -07:00
Sandon Van Ness
74b344e980 Reconnect after running chef task.
Just a simple change to reconnect to SSH after running
ceph-qa-chef to get around things like ulimit changes.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-30 17:26:59 -07:00
Sandon Van Ness
00c2ac5b2c Fix RHEL/centos ceph-deploy installs.
Use a hyphen as a delimiter instead of a dot.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-30 12:36:12 -07:00
Samuel Just
e1cd49bd45 ceph_manager: wait 20s before wait_for_clean in test_map_discontinuity
Fixes: #5806
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-30 10:47:22 -07:00
Warren Usui
bcf467c803 Remove now unnecessary try/except/raise that came about as a result of
the previous check-in.

Fixes: #5803
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-30 10:41:14 -07:00
Warren Usui
48a977e0f2 Always raise exception if yum install fails. This avoids later confusion when
packages are missing (the old code skipped 'Nothing to do' messages, but these
cases are still errors).

Fixes #5803
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Sandon Van Ness
2013-07-30 10:40:47 -07:00
Samuel Just
8284e19226 ceph_manager: wait for all_up in test_map_discontinuity
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-29 13:45:24 -07:00
Samuel Just
e703942ce4 ceph_manager: allow-experimental-feature now causes an EINVAL
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-29 12:12:24 -07:00
Yehuda Sadeh
9dac3fe809 s3tests: clone correct branch
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-29 09:09:51 -07:00
Sage Weil
0158f0db9d adjust-ulimits: 16k open files
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-28 15:09:43 -07:00
Sage Weil
496c6773ac ceph_manager, dump_stuck: fix injectargs args
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 17:56:08 -07:00
Josh Durgin
73adf33346 radosgw-admin: fix typo in import
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-27 12:30:52 -07:00
Sage Weil
45ce9b0b54 valgrind: suppress curl, gnutls, libfcgi leaks
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 07:49:11 -07:00
Sage Weil
a039d4ada7 ceph_manager: don't mark out an out osd on kill_osd(..., mark_out=True)
Fixes

2013-07-27T02:02:23.879 INFO:teuthology.task.thrashosds.thrasher:Recovered, killing an osd
2013-07-27T02:02:23.879 INFO:teuthology.task.thrashosds.thrasher:Killing osd 2, live_osds are [3, 4, 0, 1, 5, 2]
2013-07-27T02:02:24.547 INFO:teuthology.task.thrashosds.thrasher:Removing osd 2, in_osds are: [4, 0, 1, 5]
...
ValueError: list.remove(x): x not in list

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-27 07:48:55 -07:00
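Note on a039d4ada7: the `ValueError` in the log above comes from calling `list.remove` for an OSD that is no longer in the list. A minimal sketch of the guard, assuming a plain list of OSD ids (the function name `mark_osd_out` is hypothetical):

```python
def mark_osd_out(in_osds, osd):
    """Remove `osd` from `in_osds` only if it is still marked in.

    Returns True if the OSD was removed, False if it was already out.
    """
    if osd in in_osds:
        in_osds.remove(osd)
        return True
    return False


# osd 2 is already out here, so nothing is removed and no error is raised.
in_osds = [4, 0, 1, 5]
print(mark_osd_out(in_osds, 2), in_osds)
```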
Sage Weil
404f6d245b rest-api: rename
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-26 18:21:21 -07:00
Warren Usui
3f0340f45f Add rest_api, a new task that starts up /usr/bin/ceph-rest-api running
as a daemon.

Fixes: 5613
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-26 18:20:16 -07:00
Josh Durgin
f2cddda154 rgw: correct socket option name
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:18:24 -07:00
Josh Durgin
699d0a3b27 rgw: add rgw log socket to daemonized radosgw too
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:15:28 -07:00
Josh Durgin
65172a0938 rgw: pass socket path directly to radosgw
Only radosgw needs this option, and each one will be different, so
remove it from the ceph.conf template.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-26 14:13:19 -07:00
Samuel Just
a355d9f570 ceph_manager: add test_map_discontinuity to thrasher
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-26 10:40:58 -07:00
Greg Farnum
a0b51b1f31 samba: run lsof and fuser after shutdown
The clients are pretty regularly reporting busy on unmount when
samba runs above them. This will hopefully give us some info about why.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-07-26 09:39:26 -07:00
Sandon Van Ness
f19bcfc9df Merge branch 'master' of github.com:ceph/teuthology 2013-07-25 19:50:39 -07:00
Sandon Van Ness
41cafb1a10 Merge remote-tracking branch 'origin/wip-sandon-vm'
Conflicts:
	teuthology/lock.py
	teuthology/misc.py
	teuthology/task/install.py
2013-07-25 19:50:02 -07:00
tamil
aeb3586bfb added test_mon_destroy option to ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-25 17:12:52 -07:00
Sandon Van Ness
57933b6f92 Fix priority so it only is added once.
Needed for baremetal so the sed doesn't keep adding
the priority line in the repos file.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-25 14:24:43 -07:00
Sage Weil
77ee5a0714 ceph_manager: try both new and old tell mon.* syntax
So we can test old code with new teuthology (e.g., upgrade testing)

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-25 13:54:20 -07:00
Sage Weil
1fcf8e3fbb ceph: make restart take both a list of daemons and other arguments
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Tamil Muthamizhan <tamil@inktank.com>
2013-07-24 13:26:24 -07:00
Joe Buck
3c9382a7fa teuth: rgw task example config update
Due to bug #5716, pools need to start with a '.' at present.
Updating the examples to follow this convention.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 13:00:18 -07:00
Joe Buck
4fb5781afb teuth: fix issue in cleanup code
The post-yield code in create_dirs needed to
be tweaked to correctly delete the {tdir}/apache
directory (if it exists) on each client.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 13:00:10 -07:00
Sage Weil
75136a8322 ceph: wait-for-osds-up option for restart
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-24 10:25:10 -07:00
Sage Weil
13aca3b7ad ceph: wait_for_osds_up
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-24 10:25:10 -07:00
Josh Durgin
8ad065dc9c rgw: add multi-region and zone support
Take client<->zone/region and the associated pools from ceph.conf, so
we don't have to invent a new format to specify it.

General region info is added to a new configuration section in the rgw
task. Each client is assumed to be a different zone, and a system user
is created with the key specified in the yaml, so it can be passed to
later task configuration as well. This isn't strictly necessary, but
avoids having to lookup this info in later tasks through something
like radosgw-admin.

Ports are allocated automatically because there's no obvious mapping
from host to client in the task configuration. Later tests can get the
endpoints desired by reading the region map.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Josh Durgin
721280b7f3 task_util: move rados command here
Six copies are replaced with one, with an added option to check status
automatically. This should probably be used in a few places where the
return code is ignored.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Josh Durgin
6d2434b48f rgw: move common rgwadmin function to a new utility file
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-24 09:59:50 -07:00
Joe Buck
88cab4712f teuth: reworked rgw to support regions / zones
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Joe Buck
afd0d87eda teuthology: updating RGW task to support regions
Extending the rgw.py task to allow for regions and zones
to be specified in the YAML file.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-07-24 09:59:50 -07:00
Sandon Van Ness
55cc15fb24 Wait a little longer before recreating VMs
Just to allow for the create to still work in case the OS
volume is fairly large (takes a while to resize) and in
case the host machine is bogged down due to disk I/O.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-23 15:52:32 -07:00
Sage Weil
c1e0812b6a ceph: add wait_for_mon_quorum command
tasks:
...
- ceph.wait_for_mon_quorum: [a, b]
...

will block until the mon quorum consists of exactly [a, b].  This is
compared directly to the relevant field from 'ceph quorum_status'
which has the alphanumeric names only.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
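Note on c1e0812b6a: the "exactly [a, b]" comparison can be sketched as below. This assumes the JSON output of 'ceph quorum_status' carries the mon names in a `quorum_names` field; that field name and the helper `quorum_matches` are assumptions for illustration, not taken from the task's code.

```python
import json


def quorum_matches(quorum_status_json, expected):
    """Return True if the reported quorum is exactly the `expected` mons."""
    status = json.loads(quorum_status_json)
    # 'quorum_names' is assumed to be the field holding the mon names.
    return sorted(status.get('quorum_names', [])) == sorted(expected)
```

A blocking wait would call this repeatedly on fresh command output until it returns True.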
Sage Weil
68888862a1 sequential, parallel: allow entries to be references to top-level config
Often we want to build a test collection that substitutes different
sequences of tasks into a parallel/sequential construction.  However, the
yaml combination that happens when generating jobs is not smart enough to
substitute some fragment into a deeply-nested piece of yaml.

Instead, make these sequences top-level entries in the config dict, and
reference them.  For example:

tasks:
- install:
- ceph:
- parallel:
  - workload
  - upgrade-sequence
workload:
  workunit:
    - something
upgrade-sequence:
  install.restart: [osd.0, osd.1]

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 15:38:29 -07:00
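Note on 68888862a1: the substitution described above, where a bare name inside a parallel/sequential list refers to a top-level config entry, might look roughly like this. The function `resolve_entries` is a hypothetical name and the code is a sketch of the idea, not the task's implementation.

```python
def resolve_entries(entries, top_level):
    """Replace bare-string entries with the top-level config they name."""
    resolved = []
    for entry in entries:
        if isinstance(entry, str):
            # A bare string names a top-level key; substitute its value.
            resolved.append(top_level[entry])
        else:
            # Inline task definitions are passed through unchanged.
            resolved.append(entry)
    return resolved


config = {
    'workload': {'workunit': ['something']},
    'upgrade-sequence': {'install.restart': ['osd.0', 'osd.1']},
}
print(resolve_entries(['workload', 'upgrade-sequence'], config))
```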
Samuel Just
77cae4bf35 thrashosds: add delay option after recovery
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-22 16:30:57 -07:00
Joao Eduardo Luis
e84c54addd task: mon_clock_skew_check: grab max-skew value from ceph-mon's config
Instead of relying on hardcoded values, obtain the max-skew default from
'ceph-mon --show-config-value mon_clock_drift_allowed' to match the mon's
expectation.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-07-22 10:02:22 -07:00
Sage Weil
222b296019 admin_socket: loop until the socket command succeeds
Sometimes the thing we're talking to is slow to start, or to register the
command we are running.  Loop in that case, at least for a while.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-22 10:02:14 -07:00
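Note on 222b296019: "loop until the command succeeds, at least for a while" is a bounded retry. A minimal sketch, assuming failure is signalled by an exception; the name `retry` and its defaults are hypothetical.

```python
import time


def retry(action, attempts=5, delay=1.0, sleep=time.sleep):
    """Call `action` until it succeeds or `attempts` tries are used up."""
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == attempts:
                raise  # give up after the final attempt
            sleep(delay)
```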
Yehuda Sadeh
37a662442d apache.conf: load modules conditionally
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-20 11:30:24 -07:00
Sage Weil
494accb8e2 ceph-deploy: do not test mon destroy by default 2013-07-19 22:19:09 -07:00
Joao Eduardo Luis
06ad2d2cc9 task: mon_clock_skew_check: by default, use max skew from global config
If not defined, defaults to 0.05;  if 'max-skew' however is defined, it
must override whatever is on the config.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-07-19 21:54:34 -07:00
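Note on 06ad2d2cc9: the precedence described above (task 'max-skew' first, then the global config value, then 0.05) can be sketched as follows. The function name and the way the global value is passed in are assumptions.

```python
DEFAULT_MAX_SKEW = 0.05


def effective_max_skew(task_config, global_value=None):
    """Pick the max skew: task 'max-skew', else global config, else 0.05."""
    if 'max-skew' in task_config:
        return float(task_config['max-skew'])
    if global_value is not None:
        return float(global_value)
    return DEFAULT_MAX_SKEW
```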
Joao Eduardo Luis
7ca59dfae1 task: mon_clock_skew_check: missing 'str'.format() key crashed the test
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-07-19 21:54:28 -07:00
Sage Weil
51c2963e53 mon_thrash: tolerate scrub command failure
We can get EBUSY if there is an election going on.
2013-07-19 21:53:33 -07:00
Josh Durgin
633a1881e4 Merge branch 'wip-centos-rgw' 2013-07-19 14:44:51 -07:00
Josh Durgin
f0123db7d9 rgw: turn off continue print on centos
Also note that a ceph.conf setting is needed for that too

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-19 14:42:38 -07:00
Josh Durgin
29e7db26eb install: remove ceph-release rpm file after it's used
No need for it anymore, it's just taking up space

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-19 14:36:18 -07:00
Sage Weil
d874b57ef2 sleep: new task 2013-07-19 11:26:58 -07:00
Sage Weil
4db5b930af ceph: do not ignore osd leaks
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-18 18:05:25 -07:00
Sage Weil
2173d33d89 ceph-deploy: support overrides
Something like

overrides:
  ceph-deploy:
    foo: bar

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-18 11:21:20 -07:00
Yehuda Sadeh
c395687fd9 radosgw-admin: adapt task to recent changes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-17 14:05:26 -07:00
Sage Weil
b15513fc7c workunit: set CEPH_CLI_TEST_DUP_COMMAND
This will make the CLI do every mon command twice and make sure they both
succeed.  This catches problems with mon command idempotency faster than
waiting for random failures to trigger.
2013-07-16 17:16:08 -07:00
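Note on b15513fc7c: setting the variable for a child process could be done along these lines. The value '1' is an assumption, since the log only names the variable; `workunit_env` is a hypothetical helper.

```python
import os


def workunit_env(base_env=None):
    """Return a copy of the environment with the dup-command flag set."""
    env = dict(os.environ if base_env is None else base_env)
    # The value '1' is an assumption; the commit message only names the variable.
    env['CEPH_CLI_TEST_DUP_COMMAND'] = '1'
    return env
```

The returned dict can be passed as the `env` argument of `subprocess.run` so that only the child process sees the setting.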
tamil
51352677ab added conf section to ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-16 17:14:33 -07:00
Warren Usui
e3d9084cd9 Created tasktest to test sequential and parallel tasks.
Added sequential task and parallel task.
Changed _run_one_task to run_one_task (now called by new tasks too).

Fix #4969
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-15 19:08:23 -07:00
tamil
47696d2b92 calling mon destroy command after mds create
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-15 17:04:21 -07:00
Sage Weil
5d596a4679 ceph_manager: drop -t arg prefix for pg dump_stuck
This is no longer needed, and ugly to support.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-13 14:11:26 -07:00
Sandon Van Ness
b9b9dd130e Remove btrfs-tools and xfsprogs install step.
We already install btrfs-tools and xfsprogs with ceph-qa-chef.
Doing it here was just causing problems on non-Ubuntu
distros, and I see no reason to keep it now.
2013-07-12 20:56:07 -07:00
Sandon Van Ness
db4ad507de Allow Overrides for ceph-deploy task
This is needed so we can set the ceph branch for ceph-deploy
to use via the main yaml which is created via the suite
scheduler.


Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:54:23 -07:00
Sandon Van Ness
bd9cf10ca0 Workaround repopriority of yum local repo.
Because of issues with package name differences, VPS machines are
set up to use repopriority, and our local repo (which has
some ceph/librados stuff in it) gets high priority, so
the ceph.repo that is created on the machine from
ceph-release basically gets ignored. This change makes
it so ceph.repo is the same priority level as our local
repo.


Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:24:19 -07:00
Sandon Van Ness
8b05c8cc7b Re-create guest if it doesn't come up right.
In some rare cases (mainly CentOS/RHEL), after creating the
guest with downburst it does not come up right. It
gets a kernel panic at boot. Usually just turning it off
and then back on again is enough, but to be on the safe
side I figured it should be re-created instead. This
ensures you don't get hung jobs from a guest that didn't
come up correctly.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:19:51 -07:00
Sandon Van Ness
f16ecb99c5 Use ceph.com mirror instead of github for ceph-qa-chef on VPS.
Github.com is unreliable so use the ceph mirror instead.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:13:47 -07:00
Josh Durgin
c0a7808164 rgw: use different daemon name and apache module paths
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-10 18:00:25 -07:00
Josh Durgin
42b9ea95da misc: move system type checking to a generic location
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:52:00 -07:00
Josh Durgin
7845848c0f s3tests: fix client configurations that aren't dictionaries
They're always used as dictionaries later on.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:51:03 -07:00
tamil
3a1c316aac Add mon create and destroy with an optional argument mon_initial_members
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-09 11:12:29 -07:00
Sage Weil
a0457492cc mon_thrasher: add pause/unpause of mons to thrashing
This adds an additional element of lagginess to the cluster which should
cause mons to call new elections.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:39 -07:00
Sage Weil
fdebf40f34 daemon-helper: send arbitrary signals via stdin
Each byte written to stdin will be interpreted as a signal.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:26 -07:00
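Note on fdebf40f34: "each byte written to stdin will be interpreted as a signal" implies a one-byte-per-signal encoding. The sketch below shows that encoding and decoding in isolation; the function names are hypothetical and this is not daemon-helper's code.

```python
import signal


def encode_signals(signals):
    """Pack signal numbers into bytes, one byte per signal."""
    return bytes(int(s) for s in signals)


def decode_signals(data):
    """Interpret each byte of `data` as a signal number."""
    return list(data)


payload = encode_signals([signal.SIGTERM, signal.SIGINT])
print(decode_signals(payload))
```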
Sage Weil
121b1b9f38 mon_thrash: optionally scrub after each iteration (default true)
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:24 -07:00
Sage Weil
9b22d38883 mon_thrash: fix more naming
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sage Weil
389f65e7a6 mon_thrash: use _ instead of - consistently
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sandon Van Ness
d54932cbc8 Fix VM issues.
Fix of #5494 although bad description. Instead of adding a wait
the code used to detect if the guest was back up is fixed. The
previous code appeared to assume only one machine and broke
when it was waiting for multiple machines if the guests did not
come up within 10 seconds of each other.

Make nuke not do the normal stuff if the machine is a VPS as we
just destroy them when they get unlocked.

Instead of getting downburst options from ~/.teuthology.yaml get
it from the yaml given to teuthology for the test/task instead.

Fixed an error that would make all the default downburst values
not take effect if any of them were set via a yaml.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-03 19:07:35 -07:00
Sage Weil
813d96ffdd ceph: don't check leaks on client.* (i.e., radosgw)
...until we fix them.  This way we can see other valgrind issues.
2013-07-03 09:59:21 -07:00
Sage Weil
93ae152384 radosgw-admin: add missing quote 2013-07-01 14:21:55 -07:00
Sage Weil
c0bf24d770 radosgw-admin: test 'bucket list' command (all buckets)
Verifies fix for #5455
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-01 14:21:48 -07:00
Sage Weil
140e63b413 ceph: disable logrotate
This screwed up the log archival step at the end, and generally makes a
mess of automated runs.

Fixes: #5451
2013-06-26 17:48:03 -07:00
Sage Weil
ee03afaa60 dump_stuck: fix test
The mon-osd-report-timeout setting shouldn't be there!  We will set the
other item explicitly, and remove both from the suite yaml.

Fixes: #5440
2013-06-25 12:49:07 -07:00
Sage Weil
a6b65fe81b Merge pull request #15 from ceph/wip-ulimits
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 16:18:36 -07:00
Sage Weil
3142252129 Merge pull request #16 from ceph/wip-5431
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 11:01:48 -07:00
Sage Weil
947eba1a60 rados: fix multiclient tests
Each client (not run) gets its own pool!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 20:44:38 -07:00
Sage Weil
e6e1df69ea dump_stuck: fix race with osd start
Occasionally we don't wait long enough for the osd to start and
mark itself up.  Keep trying until flush succeeds.

Fixes: #5431
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 16:21:59 -07:00
Sage Weil
13dbe9d1a5 enable-coredump -> adjust-ulimits
and set max_files to be big, too!
2013-06-23 09:15:28 -07:00
Sandon Van Ness
d39d648ead Merge remote-tracking branch 'remotes/origin/wip-sandon-cephdeploy' 2013-06-21 15:53:53 -07:00
Sandon Van Ness
f6270a7fb0 Wipe out existing id_rsa.pub and id_rsa before pushing ssh keys
A very simple change. Just touch a file first (to create it if it
doesn't yet exist so the delete doesn't error out) and then delete
it before pushing the keys to the file. This should avoid the
id_rsa.pub and id_rsa files from getting messed up due to previous
runs which were interrupted or failed (or if those files exist for
some reason). This appears to be what was causing breakage in the
ceph-deploy nightlies.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-21 14:45:36 -07:00
Chris Lee
70b5467122 Clean up nested-if logic 2013-06-20 13:42:33 -07:00
Samuel Just
3d9fa2253b task/peering_speed_test.py: add test which summarizes pg peering speed
Running this regularly may warn us about slow peering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-06-19 15:16:28 -07:00
Samuel Just
bb2cd9e375 task/: add args.py
The usage doc string for a task is tedious to write and
hard to keep reconciled with the code as defaults are changed.
args.py includes a helper to put it all in one place.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-06-19 15:16:28 -07:00
Warren Usui
778d930d08 Fix to ignore ssh-key checking if running on virtual machines or
if a line that reads 'sshkey: ignore' is in the yaml file.

Fix #5364
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-19 11:12:09 -07:00
Warren Usui
1441707222 Make reset of ssh key code conditional on being a virtual machine.
Add and use is_vm to determine if we are running on a virtual machine.

Fix #5364
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-19 11:12:05 -07:00
Sage Weil
3649e27770 admin_socket: fetch test from correct branch 2013-06-19 10:36:49 -07:00
Sage Weil
4455f75d17 valgrind: give up and ignore all leveldb leaks
Hopefully if it is our fault we will have our own struct wrapping the
leveldb resource that we leak.
2013-06-19 09:08:17 -07:00
Sandon Van Ness
32c8234725 Use authorized_keys2 instead of authorized_keys
Instead of going through the trouble of adding/removing lines
from authorized_keys, which has all our normal keys in it,
push keys to the unused authorized_keys2 file. This makes key
management significantly simpler, as that file can just be wiped
out each time instead of worrying about preserving contents.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-18 09:35:55 -07:00
Sandon Van Ness
0ff2e42e7c Use authorized_keys2 instead of authorized_keys
Instead of going through the trouble of adding/removing lines
from authorized_keys, which has all our normal keys in it,
push keys to the unused authorized_keys2 file. This makes key
management significantly simpler, as that file can just be wiped
out each time instead of worrying about preserving contents.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-17 16:24:37 -07:00
Sage Weil
ebd003bc82 valgrind: another leveldb leak 2013-06-17 13:53:10 -07:00
Sage Weil
7177d2e8fc ceph_manager: fix ceph tell mon.*
Need -- to make cli stop parsing (or quote the options).
Otherwise, the options will be parsed/applied to the cli's
librados instance.
2013-06-16 13:11:50 -07:00
Sage Weil
1ec8ba50fc no need for ceph --concise argument 2013-06-16 09:10:38 -07:00
Samuel Just
9cdc60af3f ceph_manager: use new ceph tell mon.* syntax
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-06-14 10:32:43 -07:00
Sage Weil
8730db1907 rados: fix up for parallel work
- use a separate pool for each client
- create pool at start, destroy pool at end
- use all clients, if not explicitly specified

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-13 22:48:16 -07:00
tamil
0e30386502 adding a newline to auth key data
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-13 17:13:09 -07:00
tamil
662fd9ecd8 Merge branch 'master' of github.com:ceph/teuthology 2013-06-13 16:27:05 -07:00
tamil
e1daa1693e modified ceph-deploy to throw appropriate exceptions
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-13 16:26:42 -07:00
Sage Weil
f005204fd9 stop stripping leading \n from osd commands
leaving them in for mon command, but not for any good reason.
2013-06-13 14:51:21 -07:00
Warren Usui
18f2fcd24f Merge branch 'wip-RhelFix-wusui' 2013-06-12 17:05:51 -07:00
Sage Weil
d396096703 valgrind: make leveldb thread suppression more general
The thread can get created from a range of callers; ignore them all.
2013-06-11 19:34:33 -07:00
Warren Usui
8f29b3f9f1 Use install -d for /var/log/ceph.
Additional fix needed for #4946

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-11 16:50:09 -07:00
Warren Usui
ae74abda26 Fix capitalization of CentOS
Fixes: #5313
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-11 14:14:07 -07:00
tamil
335185d309 added support for rhel
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-10 15:41:48 -07:00
Sage Weil
0911e5493b ceph: ignore ceph-osd leaks for now :( 2013-06-10 10:45:10 -07:00
Warren Usui
d034e17a11 Merge branch 'wip-teuthVm-wusui' 2013-06-10 09:46:42 -07:00
Sage Weil
c41e6ff5ba valgrind: glibc/boost_thread leak suppressions 2013-06-08 22:26:31 -07:00
Sage Weil
22532cc816 ceph_manager: drop -- before --format=json arg
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-07 21:58:41 -07:00
Sage Weil
028f1c709b valgrind: more leveldb whitelisting 2013-06-07 21:20:08 -07:00
Warren Usui
a4994e3bde Support added for running scheduled tasks on virtual machines.
This included:
    A) Changes made so that full path names on some files were used
       (scheduled tasks started in different home directories).
    B) Changes to ensure tasks come up on the beanstalkc queue properly.
    C) Finding and inserting the libvirt equivalent code for virtual machines
       in order to simulate ipmi actions.
    D) Fix host key code, report valgrind issue more clearly.
    E) Some message and downburst call changes.

    Fix #4988
    Fix #5122
    Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-07 19:32:15 -07:00
tamil
4eb8f8984a merged system_value for rpms
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-07 17:40:39 -07:00
tamil
754301bbec support install task for fedora
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-07 17:28:21 -07:00
Warren Usui
ed9af6cd97 Merge branch 'wip-RhelInstall-wusui' 2013-06-07 15:00:39 -07:00
Warren Usui
9d6141dd8e Add RHEL support to teuthology
Fix #4946
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-07 14:58:28 -07:00
Dan Mick
c95698045d task/install.py: extraneous subscript in upgrade() for only some remotes
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-06-06 15:42:27 -07:00
Dan Mick
188ae8904d ceph_manager: don't say you have no arguments and then list them
Calling ceph pg dump --format=json works better without -- before pg
(how did this work before?...)

Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-06-04 16:11:19 -07:00
Sage Weil
3ec763c420 ceph: fix valgrind grep output parsing
When you pass a single file to zgrep you don't get the filename prefix,
which confuses the split line a few lines down.
2013-06-04 09:07:53 -07:00
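Note on 3ec763c420: one way to tolerate grep output with or without the filename prefix is sketched below. It assumes the matched text itself contains no colon; `parse_grep_line` and `default_file` are hypothetical names, and this is not necessarily how the commit fixed the problem.

```python
def parse_grep_line(line, default_file):
    """Split 'file:match' output, tolerating a missing 'file:' prefix.

    Assumes the matched text itself contains no ':' character.
    """
    if ':' in line:
        file_name, kind = line.split(':', 1)
    else:
        # Single-file grep/zgrep omits the prefix; fall back to the known name.
        file_name, kind = default_file, line
    return file_name, kind
```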
Sage Weil
bd7292b529 ceph: debug valgrind error
File "/var/lib/teuthworker/teuthology-master/teuthology/task/ceph.py", line 215, in valgrind_post
    (file, kind) = line.split(':')
ValueError: need more than 1 value to unpack
2013-06-03 09:57:17 -07:00