Commit Graph

1923 Commits

Author SHA1 Message Date
Alfredo Deza
0c9129bf43 error after 15 minutes of waiting for gatherkeys
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit ad18717505c407f2c8bb995af8fd516b61957661)
2014-01-06 15:25:33 -05:00
Zack Cerza
5c0496633c Enable reporting of entire runs as dead 2014-01-06 11:08:12 -06:00
Zack Cerza
3cffea4917 Re-raise exceptions caught in the watchdog 2014-01-03 15:45:18 -06:00
Zack Cerza
027929e0cb Use response.text if response.json is None
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 15:08:45 -06:00
Zack Cerza
f92174ff31 Strip stdout lines
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 15:01:31 -06:00
Zack Cerza
68b259fd00 Catch and log unhandled exceptions in the watchdog
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 14:56:46 -06:00
Zack Cerza
c6a9de0445 Add 'emperor' to list of branches with reporting
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 14:45:25 -06:00
Zack Cerza
eaa3e1240e Work around a change in pip 1.5 regarding wheels
The error message was "pip's wheel support requires setuptools >= 0.8
for dist-info support."

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 12:41:11 -06:00
Zack Cerza
d3afebe19c Be safer when calling ./bootstrap
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-03 11:55:13 -06:00
Sandon Van Ness
131b218465 Use CentOS Gitbuilder sha1 instead of Fedora for non-ubuntu.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2014-01-02 18:30:08 -08:00
Alfredo Deza
d88ec89a67 break out of the while loop after 15 minutes
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit bef6eb74dcaa37b70b1eab4d28bfa10abb0049d0)
Signed-off-by: Zack Cerza <zack@cerza.org>
2014-01-02 14:48:49 -06:00
Zack Cerza
b4f524ebe4 Sleep once outside of the watchdog loop
Hopefully this will prevent the double-posting of jobs.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-31 14:25:05 -06:00
Alfredo Deza
fc815f75fc Merge pull request #168 from ktdreyer/readme-formatting
format bullets in README
2013-12-31 05:53:51 -08:00
Ken Dreyer
3d895a0f25 format bullets in README 2013-12-30 20:37:53 -07:00
Zack Cerza
56c70e1145 Set the content-type in report_job() 2013-12-30 17:30:14 -06:00
Zack Cerza
ba8f99991a Split out ResultsSerializer.job_info()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-30 17:30:14 -06:00
Zack Cerza
e985ca2aab Port from httplib2 to requests module
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-30 17:30:14 -06:00
Sage Weil
6b87147590 valgrind.supp: ignore libnss3 leaks
These just started popping up when I updated the notcmalloc gitbuilder, probably
because of an updated libnss version.   Whitelist it!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-30 10:12:37 -06:00
Ilya Dryomov
d781348fd5 rbd: bump the default scratch size for xfstests to 10G
autobuild-ceph.git commit 53db7a34aba5 had silently changed the default
elevator from cfq to deadline, which made xfstests 167 very unhappy.
It looks like with deadline and noop elevators it requires a ~6G
scratch partition.  Bump the default scratch image size to 10G.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2013-12-23 19:54:11 +02:00
Sage Weil
c70e086b35 Revert "valgrind.supp: ignore libnss3 leaks"
This reverts commit 572dc88a7c.

This didn't occur on next; I think there may be a real leak on the ceph
side.
2013-12-22 09:50:12 -08:00
Sage Weil
572dc88a7c valgrind.supp: ignore libnss3 leaks
These just started popping up.  Probably because I gave the
gitbuilders a kick?

Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-21 22:21:49 -08:00
SandonV
e13e4562ef Merge pull request #166 from ceph/wip-lockspell-wusui
Fix spelling error in comment.
2013-12-20 12:48:42 -08:00
Warren Usui
1810781441 Fix spelling error in teuthology/task/locktest.py comment 2013-12-20 12:39:30 -08:00
Zack Cerza
f8ce69cb9b Add ability to mark jobs as 'dead'
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-20 10:01:03 -06:00
Zack Cerza
e312048ade Allow passing multiple job_ids
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:43:11 -06:00
Zack Cerza
220779c8c4 Implement single-job killing
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:32:41 -06:00
Zack Cerza
eeeb6267f9 For teuthology-kill, s/suite/run/
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 16:32:41 -06:00
SandonV
228358ff17 Merge pull request #165 from ceph/wip-7042-fix-wusui
Do not run local handling fix if local parameter is not found.
2013-12-19 14:27:16 -08:00
Warren Usui
37815b76d3 Do not run local handling fix if local parameter is not found.
Fixes: 7042
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-12-19 14:20:12 -08:00
Zack Cerza
dbcef31ba0 Merge pull request #156 from ceph/teuthology-doc-hadoop-wusui
Added docstrings.  Cleaned up code (broke up long lines, removed unused
2013-12-19 09:27:14 -08:00
Zack Cerza
006c031182 Merge pull request #164 from ceph/wip-rados
rados: add in more (optional) op types
2013-12-19 09:24:21 -08:00
Zack Cerza
d70d1ad76f Merge pull request #160 from ceph/wip-fix-5149-wusui
Added handling of a 'local' option inside install.py which specifies
2013-12-19 09:23:36 -08:00
Zack Cerza
9a29c3ef71 Log calls to teuthology-report more verbosely
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 10:29:30 -06:00
Zack Cerza
b014c71829 Catch every exception here, for now.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-19 10:29:30 -06:00
Sandon Van Ness
031be56813 Use saucy gitbuilder for arm package checking.
Some-how missed it checks both sha1 and package version file
and package version was still the quantal gitbuilder which wont
work as the hardware is down.

This was causing scheduling failures.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-12-18 12:38:50 -08:00
Sage Weil
5320db57ce rados: add in more (optional) op types
Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-18 11:41:58 -08:00
Zack Cerza
a0eb1a8e8c Use shell=True to call teuthology-report
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-16 14:22:22 -06:00
Zack Cerza
c22ee528b7 Catch OSError if script isn't in $PATH
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-16 13:34:37 -06:00
Zack Cerza
420fff6207 Revert "Use path when calling teuthology-report. …"
This reverts commit e4b5ab811e.
2013-12-16 11:43:06 -06:00
Sandon Van Ness
e4b5ab811e Use path when calling teuthology-report. …
The 'teuthology-report' command is probably not going to exist
in $PATH so get the location of the running command and assume its
in the same path.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-12-14 07:14:51 -08:00
Zack Cerza
7cb815f792 Merge pull request #162 from jcsp/fsid-conf
Fix FSID not being set in ceph.conf
2013-12-13 09:25:30 -08:00
Zack Cerza
02e0a1e913 Merge pull request #161 from jcsp/ssh-config
Respect .ssh/config when opening SSH connections
2013-12-13 09:24:23 -08:00
Zack Cerza
2e2b8feba2 Skip the 'dead' report on old branches
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-13 10:48:52 -06:00
Sandon Van Ness
36c0344f98 Use saucy gitbuilder when grabbing sha1 for arm.
Old quantal gitbuilders are gone until hardware comes back. Use
the new saucy gitbuilders instead.
2013-12-12 16:04:38 -08:00
Zack Cerza
966dad544b Make sure to report all results.
If a just-finished job was using a teuthology branch not known to
contain the reporting feature, then report the job via the
teuthology-report script. Note that in some cases this will result in
double reporting but the extra load should be negligible.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-12 17:33:53 -06:00
Zack Cerza
7f135ec94a Enable reporting of single jobs
(also switch to docopt)

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-12 17:00:43 -06:00
Zack Cerza
3d23b9b205 Remove the child's stderr completely
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-12 15:45:58 -06:00
John Spray
9ff4d4a4e7 Fix FSID not being set in ceph.conf
Symptom was that 'ceph --admin-daemon... config get fsid'
returned zeros, while correct fsid was present in cluster maps.
Fix it by populating FSID in ceph.conf, after extracting it from
monmap.
2013-12-12 13:34:52 -08:00
Zack Cerza
625f479b68 When starting a job, tell paddles it's running
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2013-12-12 11:47:45 -06:00
Sandon Van Ness
a7f87f3a3a Longer timeout after sync/reboot.
With only a 5 second sleep via ssh and python it looks like a
race-condition was sometimes hitting where it would think
the machine is back up before the reboot command had completed.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
2013-12-11 18:07:43 -08:00