2144 Commits

Author SHA1 Message Date
Zack Cerza
427bf93c88 Fix PEP-8 linter complaints
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-28 09:54:54 -06:00
Sage Weil
4c98d67639 Merge pull request #175 from ceph/wip-tcmalloc
valgrind: ignore tcmalloc uninitialized memory
2014-01-27 21:14:17 -08:00
Zack Cerza
e424d78c6b Be more verbose about log file locations
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-27 12:28:53 -06:00
Zack Cerza
50722a7d9c Symlink worker logs into job archive dir
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-24 10:19:43 -06:00
Alfredo Deza
60da6e0c6f Merge pull request #177 from ceph/wip-7213
use create-initial for deploying monitors
2014-01-24 05:54:37 -08:00
Alfredo Deza
cf250a8f4a use create-initial for deploying monitors
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-01-23 16:41:49 -05:00
Zack Cerza
e8bb1654b2 call wait() on the teuthology-results Popen object
This ought to fix the issue where zombie teuthology-results processes
stick around.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-17 10:05:21 -06:00
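
The fix described in e8bb1654b2 can be illustrated with a minimal sketch. It assumes only the standard library; the helper name run_and_reap and the trivial child command are hypothetical stand-ins, not the actual teuthology code. The point is that calling wait() on the Popen object collects the child's exit status, so the finished process does not remain as a zombie.

```python
import subprocess
import sys


def run_and_reap(args):
    """Start a child process and wait for it so it cannot linger as a zombie.

    Hypothetical helper for illustration; not part of teuthology.
    """
    proc = subprocess.Popen(args)
    # wait() blocks until the child exits and collects its exit status,
    # which lets the OS release the child's process table entry.
    return proc.wait()


if __name__ == "__main__":
    # A trivial child stands in for the real teuthology-results command.
    print(run_and_reap([sys.executable, "-c", "pass"]))
```

Without the wait() call, a child that exits while the parent keeps running stays in the process table until the parent reaps it or exits itself.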
Alfredo Deza
1ddc3fdb6a Merge pull request #176 from ceph/wip-7164
lock.py: request only rsa keys from ssh-keyscan
2014-01-16 13:00:37 -08:00
Dan Mick
381d4aa873 lock.py: request only rsa keys from ssh-keyscan
New versions of ssh-keyscan return two keytypes if possible; this
breaks the comparison of "number of lines of output from keyscan"
to "number of hosts we request keys from".  Fix by asking for only
one type of key (as older ssh-keyscans did).

Fixes: #7164
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-16 12:51:39 -08:00
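
The line-count comparison that 381d4aa873 describes might look roughly like the sketch below. The function names are hypothetical and this is not the code in lock.py; the only real interface assumed is the ssh-keyscan -t option, which restricts the key types requested. The check itself works on captured output, so it can be tried without network access.

```python
def keyscan_command(hostnames, key_type="rsa"):
    """Build an ssh-keyscan argument list that requests a single key type.

    Hypothetical helper; '-t' is the ssh-keyscan option that selects key types.
    """
    return ["ssh-keyscan", "-t", key_type] + list(hostnames)


def one_key_per_host(keyscan_output, hostnames):
    """Return True if the output has exactly one key line per requested host."""
    # Skip blank lines and '#' comment lines; count only key lines.
    key_lines = [
        line for line in keyscan_output.splitlines()
        if line.strip() and not line.startswith("#")
    ]
    return len(key_lines) == len(hostnames)


if __name__ == "__main__":
    hosts = ["a", "b"]
    print(keyscan_command(hosts))
    # Two key types per host doubles the line count and breaks the comparison.
    two_types = (
        "a ssh-rsa AAAA\na ecdsa-sha2-nistp256 CCCC\n"
        "b ssh-rsa BBBB\nb ecdsa-sha2-nistp256 DDDD\n"
    )
    print(one_key_per_host(two_types, hosts))
```

Requesting only one key type keeps the count of output lines equal to the count of hosts, which is the invariant the comparison relies on.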
Samuel Just
66312f7ea1 ceph_manager: in test_map_discontinuity, delay killing osd for 20s
This should give enough time for the replica to get the message
marking them clean.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-01-16 09:57:44 -08:00
Zack Cerza
53fc2d93dd Log a warning when killing long-running jobs.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-16 10:53:53 -06:00
Zack Cerza
769ef8a960 Kill jobs that run for over 3 days (configurable)
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-16 10:38:39 -06:00
Sage Weil
69d4962759 Merge pull request #174 from ceph/wip-7171
thrashosds: change min_in from 2 -> 3
2014-01-10 11:08:31 -08:00
Sage Weil
9a9ee610bd valgrind: ignore tcmalloc uninitialized memory
This is the main source of noise when running valgrind +
tcmalloc.  Apparently there are other issues, so I think we
still need the notcmalloc gitbuilder, but this gets us part of
the way.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-10 11:02:16 -08:00
Sage Weil
495f2163a8 thrashosds: change min_in from 2 -> 3
See #7171. In rare cases CRUSH can't handle it when only 2/6 of
the OSDs are marked in.  Avoid those situations for now.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-10 11:00:55 -08:00
Zack Cerza
0c339fe427 Merge pull request #169 from ceph/wip-utsrelease
kernel: use utsrelease string for need_to_install() purposes
2014-01-10 09:22:40 -08:00
Ilya Dryomov
bf481b9134 kernel: use utsrelease string for need_to_install() purposes
Currently, to see if a node has rebooted into the right kernel,
need_to_install() compares a given 40-char commit hash with a 7-char
commit hash abbreviation it pulls from the output of 'uname -r'.

gitbuilders can now export UTS_RELEASE kernel version string through
.../$SHA1/version file.  Use this string instead of the 40-char commit
hash and compare it with the output of 'uname -r' directly.  This saves
us the parsing exercise and, more importantly, makes it possible to
install clean tagged kernels using 'tag:' element, which wasn't
possible before because version string of such kernels doesn't have
a commit hash in it.

If the version file is unavailable, fall back to the existing way of doing
things.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-01-10 12:26:09 +02:00
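
The decision logic described in bf481b9134 could be sketched as follows. This is an illustration under stated assumptions, not the real need_to_install(): the signature is hypothetical, and the fallback assumes the abbreviated hash appears in the 'uname -r' output as "-g" followed by 7 hex digits, which the commit message does not specify.

```python
import re


def need_to_install(running_release, sha1, utsrelease=None):
    """Decide whether the wanted kernel still has to be installed.

    running_release: output of 'uname -r' on the node.
    sha1: the 40-char commit hash of the wanted kernel.
    utsrelease: contents of the gitbuilder version file, or None if unavailable.
    """
    if utsrelease is not None:
        # Preferred path: compare the full version strings directly.
        return running_release != utsrelease
    # Fallback: pull a 7-char hash abbreviation out of the release string.
    # ASSUMPTION: the abbreviation appears as '-g<7 hex digits>'.
    match = re.search(r"-g([0-9a-f]{7})", running_release)
    if match is None:
        return True
    return not sha1.startswith(match.group(1))


if __name__ == "__main__":
    # A clean tagged kernel has no hash in its release string,
    # so only the utsrelease comparison can recognise it.
    print(need_to_install("3.13.0", "0" * 40, utsrelease="3.13.0"))
```

The direct string comparison is what makes tagged kernels installable: their release string carries no commit hash, so the hash-based fallback would always report that an install is needed.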
Alfredo Deza
4c049b37e6 Merge pull request #173 from ceph/wip-ceph-deploy
fix ceph-deploy tests to use 2x replication
2014-01-09 05:43:04 -08:00
Sage Weil
1b4368b3cf schedule_suite: 2x replication for ceph-deploy
This avoids having to update all of the tests.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-08 17:35:10 -08:00
Zack Cerza
c4b9dae4f6 Merge pull request #172 from ceph/wip-teuthvmcon-wusui
Fix a bug where ctx.config['targets'] was looped through again
2014-01-07 15:09:39 -08:00
Warren Usui
f32adf055a Fix a bug where ctx.config['targets'] was looped through again
in connect().  The bug caused vm behavior to happen for a
target if any of the machines in the cluster was a vm.  The code
was also changed to set the key to none only if rsa or dsa keys
were used on a vm.

Fixes: 7113
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-01-07 14:22:57 -08:00
Alfredo Deza
0c9129bf43 error after 15 minutes of waiting for gatherkeys
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit ad18717505c407f2c8bb995af8fd516b61957661)
2014-01-06 15:25:33 -05:00
Zack Cerza
5c0496633c Enable reporting of entire runs as dead
2014-01-06 11:08:12 -06:00
Zack Cerza
3cffea4917 Re-raise exceptions caught in the watchdog
2014-01-03 15:45:18 -06:00
Zack Cerza
027929e0cb Use response.text if response.json is None
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 15:08:45 -06:00
Zack Cerza
f92174ff31 Strip stdout lines
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 15:01:31 -06:00
Zack Cerza
68b259fd00 Catch and log unhandled exceptions in the watchdog
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 14:56:46 -06:00
Zack Cerza
c6a9de0445 Add 'emperor' to list of branches with reporting
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 14:45:25 -06:00
Zack Cerza
eaa3e1240e Work around a change in pip 1.5 regarding wheels
The error message was "pip's wheel support requires setuptools >= 0.8
for dist-info support."

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 12:41:11 -06:00
Zack Cerza
d3afebe19c Be safer when calling ./bootstrap
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 11:55:13 -06:00
Sandon Van Ness
131b218465 Use CentOS Gitbuilder sha1 instead of Fedora for non-ubuntu.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2014-01-02 18:30:08 -08:00
Alfredo Deza
d88ec89a67 break out of the while loop after 15 minutes
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit bef6eb74dcaa37b70b1eab4d28bfa10abb0049d0)
Signed-off-by: Zack Cerza <zack@cerza.org>
2014-01-02 14:48:49 -06:00
Zack Cerza
b4f524ebe4 Sleep once outside of the watchdog loop
Hopefully this will prevent the double-posting of jobs.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-31 14:25:05 -06:00
Alfredo Deza
fc815f75fc Merge pull request #168 from ktdreyer/readme-formatting
format bullets in README
2013-12-31 05:53:51 -08:00
Ken Dreyer
3d895a0f25 format bullets in README
2013-12-30 20:37:53 -07:00
Zack Cerza
56c70e1145 Set the content-type in report_job()
2013-12-30 17:30:14 -06:00
Zack Cerza
ba8f99991a Split out ResultsSerializer.job_info()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-30 17:30:14 -06:00
Zack Cerza
e985ca2aab Port from httplib2 to requests module
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-30 17:30:14 -06:00
Sage Weil
6b87147590 valgrind.supp: ignore libnss3 leaks
These just started popping up when I updated the notcmalloc gitbuilder, probably
because of an updated libnss version.  Whitelist it!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-30 10:12:37 -06:00
Ilya Dryomov
d781348fd5 rbd: bump the default scratch size for xfstests to 10G
autobuild-ceph.git commit 53db7a34aba5 had silently changed the default
elevator from cfq to deadline, which made xfstests 167 very unhappy.
It looks like with deadline and noop elevators it requires a ~6G
scratch partition.  Bump the default scratch image size to 10G.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2013-12-23 19:54:11 +02:00
Sage Weil
c70e086b35 Revert "valgrind.supp: ignore libnss3 leaks"
This reverts commit 572dc88a7c.

This didn't occur on next; I think there may be a real leak on the ceph
side.
2013-12-22 09:50:12 -08:00
Sage Weil
572dc88a7c valgrind.supp: ignore libnss3 leaks
These just started popping up.  Probably because I gave the
gitbuilders a kick?

Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-21 22:21:49 -08:00
SandonV
e13e4562ef Merge pull request #166 from ceph/wip-lockspell-wusui
Fix spelling error in comment.
2013-12-20 12:48:42 -08:00
Warren Usui
1810781441 Fix spelling error in teuthology/task/locktest.py comment
2013-12-20 12:39:30 -08:00
Zack Cerza
f8ce69cb9b Add ability to mark jobs as 'dead'
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-20 10:01:03 -06:00
Zack Cerza
e312048ade Allow passing multiple job_ids
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:43:11 -06:00
Zack Cerza
220779c8c4 Implement single-job killing
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:32:41 -06:00
Zack Cerza
eeeb6267f9 For teuthology-kill, s/suite/run/
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:32:41 -06:00
SandonV
228358ff17 Merge pull request #165 from ceph/wip-7042-fix-wusui
Do not run local handling fix if local parameter is not found.
2013-12-19 14:27:16 -08:00
Warren Usui
37815b76d3 Do not run local handling fix if local parameter is not found.
Fixes: 7042
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-12-19 14:20:12 -08:00