Commit Graph

1191 Commits

Author SHA1 Message Date
Sandon Van Ness
8b05c8cc7b Re-create guest if it doesn't come up right.
In some rare cases (mainly centos/rhel), after creating the
guest with downburst, it does not come up right. It
gets a kernel panic at boot. Usually just turning it off
and then back on again is enough, but to be on the safe
side I figured it should be re-created instead. This
ensures you don't get hung jobs from a guest that didn't
come up correctly.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:19:51 -07:00
Sandon Van Ness
4f478ccf2c Add description option to lock.lock()
For some reason lock_many() has a description but lock()
does not. This was useful in my testing of unlocking and
re-locking VPS machines to destroy.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:16:28 -07:00
Sandon Van Ness
f16ecb99c5 Use ceph.com mirror instead of github for ceph-qa-chef on VPS.
Github.com is unreliable so use the ceph mirror instead.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:13:47 -07:00
Sandon Van Ness
4dbef128d7 List IP address in orchestra.run output.
Figuring out which machines output is coming from when things
are being executed on multiple machines can be a huge pain.
This prints the IP in the logs so you can easily see where one
machine stops and another begins.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:10:36 -07:00
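
The idea behind 4dbef128d7 (tagging output with the machine's IP) can be sketched as below. This is a minimal illustration only: prefix_output and the "[ip] line" format are hypothetical and are not taken from orchestra.run.

```python
def prefix_output(host_ip, lines):
    """Tag each output line with the IP of the machine that produced it.

    Hypothetical helper; the real log format used by orchestra.run
    is not shown in the commit log.
    """
    return ["[{0}] {1}".format(host_ip, line.rstrip("\n")) for line in lines]


# Output gathered from two machines stays attributable after merging.
merged = (prefix_output("10.0.0.1", ["starting osd\n"])
          + prefix_output("10.0.0.2", ["starting mon\n"]))
for tagged in merged:
    print(tagged)
```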
Sandon Van Ness
d7a20ba293 Fix Missed parenthesis.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-07-12 20:08:06 -07:00
Sandon Van Ness
da7483a741 VM: Use mac addresses from DB instead of randomizing.
In order to make IP addresses less likely to change and to allow
a smaller DHCP pool to be used, I generated static MAC addresses
for all the vpm entries in the DB. I also put the correct primary
(eth0) MAC address in the DB for all the other types of machines,
in order to keep things standardized and so there is another
location where we have this information.

Without this fix going through a few tests would exhaust the DHCP
pool which at the time was around 460 IP addresses for virtual
machines and has since been upped to ~690 IP addresses.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-12 20:05:39 -07:00
Sandon Van Ness
d54932cbc8 Fix VM issues.
Fixes #5494, although the description there is poor. Instead of
adding a wait, the code that detects whether the guest is back up
is fixed. The previous code appeared to assume only one machine and
broke when it was waiting for multiple machines if the guests did
not come up within 10 seconds of each other.

Make nuke not do the normal stuff if the machine is a VPS as we
just destroy them when they get unlocked.

Get downburst options from the yaml given to teuthology for the
test/task instead of from ~/.teuthology.yaml.

Fixed an error that would make all the default downburst values
not take effect if any of them were set via a yaml.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-07-03 19:07:35 -07:00
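
The defaults bug described in d54932cbc8 (setting any one value via yaml discarded all the defaults) is the kind of problem a copy-then-update merge avoids. The sketch below assumes a flat dictionary of options; the key names and values in DEFAULTS are hypothetical and do not reflect the real downburst defaults.

```python
# Hypothetical defaults; the actual downburst values are not in the log.
DEFAULTS = {"disk-size": "30G", "ram": "4G", "cpus": 1}


def merge_downburst_config(overrides=None):
    """Return the defaults with any yaml-supplied overrides applied on top.

    Copying DEFAULTS first means an override for one key leaves the
    remaining defaults in effect and never mutates DEFAULTS itself.
    """
    config = dict(DEFAULTS)
    config.update(overrides or {})
    return config


# Overriding a single key keeps the other defaults.
print(merge_downburst_config({"ram": "8G"}))
```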
Sage Weil
813d96ffdd ceph: don't check leaks on client.* (i.e., radosgw)
...until we fix them.  This way we can see other valgrind issues.
2013-07-03 09:59:21 -07:00
Sage Weil
93ae152384 radosgw-admin: add missing quote
2013-07-01 14:21:55 -07:00
Sage Weil
c0bf24d770 radosgw-admin: test 'bucket list' command (all buckets)
Verifies fix for #5455
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-01 14:21:48 -07:00
Sandon Van Ness
c22b941ed9 Update keys if they have changed before locking
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-27 14:08:09 -07:00
Sage Weil
140e63b413 ceph: disable logrotate
This screwed up the log archival step at the end, and generally makes a
mess of automated runs.

Fixes: #5451
2013-06-26 17:48:03 -07:00
Sage Weil
ee03afaa60 dump_stuck: fix test
The mon-osd-report-timeout setting shouldn't be there!  We will set the
other item explicitly, and remove both from the suite yaml.

Fixes: #5440
2013-06-25 12:49:07 -07:00
Sage Weil
a6b65fe81b Merge pull request #15 from ceph/wip-ulimits
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 16:18:36 -07:00
Sage Weil
3142252129 Merge pull request #16 from ceph/wip-5431
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-06-24 11:01:48 -07:00
Sage Weil
947eba1a60 rados: fix multiclient tests
Each client (not run) gets its own pool!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 20:44:38 -07:00
Sage Weil
e6e1df69ea dump_stuck: fix race with osd start
Occasionally we don't wait long enough for the osd to start and
mark itself up.  Keep trying until flush succeeds.

Fixes: #5431
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-23 16:21:59 -07:00
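
The "keep trying until flush succeeds" approach in e6e1df69ea can be sketched as a generic retry loop. This is not the code from dump_stuck: retry_until_success is a hypothetical helper, and the attempt limit and delay are assumptions added so that the loop cannot spin forever.

```python
import time


def retry_until_success(action, attempts=10, delay=1.0, sleep=time.sleep):
    """Call action() until it stops raising, at most `attempts` times.

    The last failure is re-raised so that a daemon that never comes up
    still surfaces as an error instead of a hang.
    """
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == attempts:
                raise
            sleep(delay)
```

A caller would pass in whatever operation may race with daemon startup, for example a function that issues the flush and raises if the osd is not up yet.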
Sage Weil
13dbe9d1a5 enable-coredump -> adjust-ulimits
and set max_files to be big, too!
2013-06-23 09:15:28 -07:00
Sandon Van Ness
d39d648ead Merge remote-tracking branch 'remotes/origin/wip-sandon-cephdeploy'
2013-06-21 15:53:53 -07:00
Sage Weil
a6142a64e0 Merge pull request #14 from clee/cleanup
Clean up nested-if logic

Reviewed-by: Sage Weil <sage@inktank.com>
2013-06-21 15:27:49 -07:00
Sandon Van Ness
f6270a7fb0 Wipe out existing id_rsa.pub and id_rsa before pushing ssh keys
A very simple change. Just touch a file first (to create it if it
doesn't yet exist, so the delete doesn't error out) and then delete
it before pushing the keys to the file. This should keep the
id_rsa.pub and id_rsa files from getting messed up by previous
runs which were interrupted or failed (or if those files exist for
some other reason). This appears to be what was causing breakage in
the ceph-deploy nightlies.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-21 14:45:36 -07:00
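
The touch-then-delete pattern from f6270a7fb0 can be sketched locally as follows. The commit applies it to id_rsa and id_rsa.pub on the test machines; replace_key_file is a hypothetical helper that only illustrates the ordering of the steps on a local path.

```python
import os


def replace_key_file(path, contents):
    """Discard any stale file at `path`, then write fresh contents."""
    # "touch": create the file if it is missing so the removal below
    # cannot fail with a "no such file" error.
    with open(path, "a"):
        pass
    # Delete whatever an interrupted or failed earlier run left behind.
    os.remove(path)
    # Write the new key material to a clean file.
    with open(path, "w") as handle:
        handle.write(contents)
```

In pure Python the same effect could be had by catching the missing-file error on removal; the touch step mirrors the approach the commit message describes.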
Chris Lee
70b5467122 Clean up nested-if logic
2013-06-20 13:42:33 -07:00
Samuel Just
3d9fa2253b task/peering_speed_test.py: add test which summarizes pg peering speed
Running this regularly may warn us about slow peering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-06-19 15:16:28 -07:00
Samuel Just
bb2cd9e375 task/: add args.py
The usage doc string for a task is tedious to write and
hard to keep reconciled with the code as defaults are changed.
args.py includes a helper to put it all in one place.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-06-19 15:16:28 -07:00
Sage Weil
02aa7d99e9 schedule_suite.sh: specify admin_socket branch in overrides yaml
Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-19 13:35:50 -07:00
Warren Usui
a93b4679e8 Include MySQLdb
Fixes: #5120
2013-06-19 11:29:38 -07:00
Warren Usui
778d930d08 Fix to ignore ssh-key checking if running on virtual machines or
if a line that reads 'sshkey: ignore' is in the yaml file.

Fix #5364
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-19 11:12:09 -07:00
Warren Usui
1441707222 Make reset of ssh key code conditional on being a virtual machine.
Add and use is_vm to determine if we are running on a virtual machine.

Fix #5364
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-19 11:12:05 -07:00
Sage Weil
3649e27770 admin_socket: fetch test from correct branch
2013-06-19 10:36:49 -07:00
Sage Weil
4455f75d17 valgrind: give up and ignore all leveldb leaks
Hopefully if it is our fault we will have our own struct wrapping the
leveldb resource that we leak.
2013-06-19 09:08:17 -07:00
Sandon Van Ness
32c8234725 Use authorized_keys2 instead of authorized_keys
Rather than going through the trouble of adding/removing lines
from authorized_keys, which has all our normal keys in it, push
keys to the unused authorized_keys2 file. This makes key
management significantly simpler, as that file can just be wiped
out each time instead of worrying about preserving contents.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-18 09:35:55 -07:00
Sandon Van Ness
0ff2e42e7c Use authorized_keys2 instead of authorized_keys
Rather than going through the trouble of adding/removing lines
from authorized_keys, which has all our normal keys in it, push
keys to the unused authorized_keys2 file. This makes key
management significantly simpler, as that file can just be wiped
out each time instead of worrying about preserving contents.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-06-17 16:24:37 -07:00
Sage Weil
ebd003bc82 valgrind: another leveldb leak
2013-06-17 13:53:10 -07:00
Sage Weil
7887f84f9d misc: let clients use any pool
rados.py, for example, creates new pools for each instance.
2013-06-16 14:53:49 -07:00
Sage Weil
7177d2e8fc ceph_manager: fix ceph tell mon.*
Need -- to make cli stop parsing (or quote the options).
Otherwise, the options will be parsed/applied to the cli's
librados instance.
2013-06-16 13:11:50 -07:00
Sage Weil
1ec8ba50fc no need for ceph --concise argument
2013-06-16 09:10:38 -07:00
Samuel Just
9cdc60af3f ceph_manager: use new ceph tell mon.* syntax
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-06-14 10:32:43 -07:00
Sage Weil
8730db1907 rados: fix up for parallel work
- use a separate pool for each client
- create pool at start, destroy pool at end
- use all clients, if not explicitly specified

Signed-off-by: Sage Weil <sage@inktank.com>
2013-06-13 22:48:16 -07:00
tamil
0e30386502 adding a newline to auth key data
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-13 17:13:09 -07:00
tamil
662fd9ecd8 Merge branch 'master' of github.com:ceph/teuthology
2013-06-13 16:27:05 -07:00
tamil
e1daa1693e modified ceph-deploy to throw appropriate exceptions
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-13 16:26:42 -07:00
Sage Weil
f005204fd9 stop stripping leading \n from osd commands
leaving them in for mon command, but not for any good reason.
2013-06-13 14:51:21 -07:00
Warren Usui
18f2fcd24f Merge branch 'wip-RhelFix-wusui'
2013-06-12 17:05:51 -07:00
Sage Weil
d396096703 valgrind: make leveldb thread suppression more general
The thread can get created from a range of callers; ignore them all.
2013-06-11 19:34:33 -07:00
Warren Usui
8f29b3f9f1 Use install -d for /var/log/ceph.
Additional fix needed for #4946

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-11 16:50:09 -07:00
Warren Usui
ae74abda26 Fix capitalization of CentOS
Fixes: #5313
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-06-11 14:14:07 -07:00
tamil
335185d309 added support for rhel
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-06-10 15:41:48 -07:00
Dan Mick
a3c1122658 teuthology-lock --summary: allow --machine-type=all
Somehow this got lost; putting it back

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit e4eb4aa23b)
2013-06-10 14:10:52 -07:00
Sage Weil
0911e5493b ceph: ignore ceph-osd leaks for now :(
2013-06-10 10:45:10 -07:00
Warren Usui
d034e17a11 Merge branch 'wip-teuthVm-wusui'
2013-06-10 09:46:42 -07:00