Commit Graph

1320 Commits

Author SHA1 Message Date
Sage Weil
99c40128e4 lock: filter machine type for --list, --list-targets
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-17 20:54:16 -07:00
Sage Weil
608d8a201a lock: make --summary list all machines by default
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-17 20:54:15 -07:00
Sage Weil
1d16a9ba7f lock: drop machine-type default, but require for lock-many
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-17 20:54:08 -07:00
Samuel Just
9e91395a2e ceph.conf.template: enable osd debug verify stray on activate
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-17 18:14:58 -07:00
Yehuda Sadeh
c395687fd9 radosgw-admin: adapt task to recent changes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-07-17 14:05:26 -07:00
tamil
053ba3a242 Merge branch 'master' of github.com:ceph/teuthology 2013-07-16 17:41:57 -07:00
tamil
a24aaa41af added overrides for ceph-deploy
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-16 17:41:32 -07:00
Sage Weil
b15513fc7c workunit: set CEPH_CLI_TEST_DUP_COMMAND
This will make the CLI do every mon command twice and make sure they both
succeed.  This catches problems with mon command idempotency faster than
waiting for random failures trigger.
2013-07-16 17:16:08 -07:00
tamil
51352677ab added conf section to ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-16 17:14:33 -07:00
Warren Usui
e3d9084cd9 Created tasktest to test sequential and parallel tasks.
Added sequential task and parallel task.
Changed _run_one_task to run_one_task (now called by new tasks too).

Fix #4969
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-07-15 19:08:23 -07:00
tamil
47696d2b92 calling mon destroy command after mds create
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-15 17:04:21 -07:00
Sage Weil
5d596a4679 ceph_manager: drop -t arg prefix for pg dump_stuck
This is no longer needed, and ugly to support.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-13 14:11:26 -07:00
Sandon Van Ness
b9b9dd130e Remove btrfs-tools and xfsprogs install step.
We already install btrfs-tools and xfsprogs with ceph-qa-chef
Doing it here was just causing problems on non-ubuntu
distros and I really see no point for it to have it now.
2013-07-12 20:56:07 -07:00
Sandon Van Ness
db4ad507de Allow Overrides for ceph-deploy task
This is needed so we can set the ceph branch for ceph-deploy
to use via the main yaml which is created via the suite
scheduler.


Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:54:23 -07:00
Sandon Van Ness
bd9cf10ca0 Workaround repopriority of yum local repo.
Because of issues with package name differences vps are
setup to use repopriority and our local repo (which has
some ceph/librados stuff in it) gets high priority so
the ceph.repo that is created on the machine from
ceph-release basically gets ignored. This change makes
it so ceph.repo is the same priority level as our local
repo.


Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:24:19 -07:00
Sandon Van Ness
e3a0742156 Don't install kernels on non-ubuntu VPS
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:20:45 -07:00
Sandon Van Ness
8b05c8cc7b Re-create guest if it doesn't come up right.
In some rare cases (mainly centos/rhel after creating the
guest with downburst it does not come up right. It
gets a kernel panic at boot. Usually just turning it off
and then back on again is enough but to be on the safe
side I figured it should be re-created instead. This
insures you don't get hung jobs from a guest that didn't
come up correctly.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:19:51 -07:00
Sandon Van Ness
4f478ccf2c Add description option to lock.lock()
For some reason lock_many() has a description but lock()
does not. This was useful in my testing of unlocking and
re-locking VPS machines to destroy.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:16:28 -07:00
Sandon Van Ness
f16ecb99c5 Use ceph.com mirror instea of github for ceph-qa-chef on VPS.
Github.com is unreliable so use the ceph mirror instead.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:13:47 -07:00
Sandon Van Ness
4dbef128d7 List IP address in orchestra.run output.
Figuring out which machines output is coming from when things
are being executed on multiple machines can be a huge pain.
This prints the IP in the logs so you can easily see where one
machine stops and another begins.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:10:36 -07:00
Sandon Van Ness
d7a20ba293 Fix Missed parenthesis.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:08:06 -07:00
Sandon Van Ness
da7483a741 VM: Use mac addresses from DB instead of randomizing.
In order to make IP addresses less likely to change and to allow
a smaller DHCP pool to be used I generated static MAC addresses
for all the vpm entries in the DB. I also put the correct entries
for all the other types of machines as well for their primary
(eth0) mac address as well in order to keep things standardized
and so there is another location where we have this information.

Without this fix going through a few tests would exhaust the DHCP
pool which at the time was around 460 IP addresses for virtual
machines and has since been upped to ~690 IP addresses.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-12 20:05:39 -07:00
Sage Weil
0ee700830d ceph.conf: enable old message assert
If this triggers, the RECONNECT_SEQ feature is broken (and
maybe we've caught #5517).
2013-07-12 15:18:50 -07:00
Josh Durgin
c0a7808164 rgw: use different daemon name and apache module paths
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-10 18:00:25 -07:00
Josh Durgin
42b9ea95da misc: move system type checking to a generic location
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:52:00 -07:00
Josh Durgin
7845848c0f s3tests: fix client configurations that aren't dictionaries
They're always used as dictionaries later on.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-09 18:51:03 -07:00
tamil
3a1c316aac Add mon create and destroy with an optional argument mon_initial_members
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-07-09 11:12:29 -07:00
Sage Weil
b325d17030 lock: fix typo 2013-07-08 22:22:22 -07:00
Sandon Van Ness
d18fe740a2 VM: Use mac addresses from DB instead of randomizing.
In order to make IP addresses less likely to change and to allow
a smaller DHCP pool to be used I generated static MAC addresses
for all the vpm entries in the DB. I also put the correct entries
for all the other types of machines as well for their primary
(eth0) mac address as well in order to keep things standardized
and so there is another location where we have this information.

Without this fix going through a few tests would exhaust the DHCP
pool which at the time was around 460 IP addresses for virtual
machines and has since been upped to ~690 IP addresses.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-08 16:54:22 -07:00
Sage Weil
497c71e399 Merge pull request #17 from ceph/wip-mon-thrash
mon thrash improvements

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-07-08 10:40:27 -07:00
Sage Weil
a0457492cc mon_thrasher: add pause/unpause of mons to thrashing
This adds an additional element of laggyness to the cluster which should
cause mons to call new elections.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:39 -07:00
Sage Weil
fdebf40f34 daemon-helper: send arbitrary signals via stdin
Each byte written to stdin will be interpreted as a signal.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:26 -07:00
Sage Weil
121b1b9f38 mon_thrash: optionally scrub after each iteration (default true)
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 10:39:24 -07:00
Sage Weil
9b22d38883 mon_thrash: fix more naming
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sage Weil
389f65e7a6 mon_thrash: use _ instead of - consistently
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-05 14:30:55 -07:00
Sandon Van Ness
d54932cbc8 Fix VM issues.
Fix of #5494 although bad description. Instead of adding a wait
the code used to detect if the guest was back up is fixed. The
previous code appeared to assume only one machine and broke
when it was waiting for multiple machines if the guests did not
come up within 10 seconds of each other

Make nuke not do the normal stuff if the machine is a VPS as we
just destroy them when they get unlocked.

Instead of getting downburst options from ~/.teuthology.yaml get
it from the yaml given to teuthology for the test/task instead.

Fixed an error that would make all the default downburst values
not take effect if any of them were set via a yaml.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-03 19:07:35 -07:00
Sage Weil
813d96ffdd ceph: don't check leaks on client.* (i.e., radosgw)
...until we fix them.  This way we can see other valgrind issues.
2013-07-03 09:59:21 -07:00
Sage Weil
93ae152384 radosgw-admin: add missing quote 2013-07-01 14:21:55 -07:00
Sage Weil
c0bf24d770 radosgw-admin: test 'bucket list' command (all buckets)
Verifies fix for #5455
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-01 14:21:48 -07:00
Sandon Van Ness
c22b941ed9 Update keys if they have changed before locking
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-27 14:08:09 -07:00
Sage Weil
140e63b413 ceph: disable logrotate
This screwed up the log archival step at the end, and generally makes a
mess of automated runs.

Fixe: #5451
2013-06-26 17:48:03 -07:00
Sage Weil
ee03afaa60 dump_stuck: fix test
The mon-osd-report-timeout setting shouldn't be there!  We will set the
other item explicitly, and remove both from the suite yaml.

Fixes: #5440
2013-06-25 12:49:07 -07:00
Sage Weil
a6b65fe81b Merge pull request #15 from ceph/wip-ulimits
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 16:18:36 -07:00
Sage Weil
3142252129 Merge pull request #16 from ceph/wip-5431
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 11:01:48 -07:00
Sage Weil
947eba1a60 rados: fix multiclient tests
Each client (not run) gets its own pool!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 20:44:38 -07:00
Sage Weil
e6e1df69ea dump_stuck: fix race with osd start
Occasionally we don't wait long enough for the osd to start and
mark itself up.  Keep trying until flush succeeds.

Fixes: #5431
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 16:21:59 -07:00
Sage Weil
13dbe9d1a5 enable-coredump -> adjust-ulimits
and set max_files to be big, too!
2013-06-23 09:15:28 -07:00
Sandon Van Ness
d39d648ead Merge remote-tracking branch 'remotes/origin/wip-sandon-cephdeploy' 2013-06-21 15:53:53 -07:00
Sage Weil
a6142a64e0 Merge pull request #14 from clee/cleanup
Clean up nested-if logic

Reviewed-by: Sage Weil <sage@inktank.com>
2013-06-21 15:27:49 -07:00
Sandon Van Ness
f6270a7fb0 Wipe out existing id_rsa.pub and id_rsa before pushing ssh keys
A very simple change. Just touch a file first (to create it if it
doesn't yet exist so the delete doesn't error out) and then delete
it before pushing the keys to the file. This should avoid the
id_rsa.pub and id_rsa files from getting messed up due to previous
runs which were interrupted or failed (or if those files exist for
some reason). This appears to be what was causing breaking in the
ceph-deploy nightlies.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-21 14:45:36 -07:00