Commit Graph

1323 Commits

Author SHA1 Message Date
Warren Usui
fd750da993 Add changes to make teuthology suites work on vms.
Fixes: #4719
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Dan Mick <dan.mick@inktank.com>
2013-04-23 18:10:29 -07:00
Warren Usui
97c7798512 Merge branch 'wip-teuthologyfix4693-wusui' 2013-04-23 16:30:56 -07:00
Warren Usui
b7aaa1982c Check downburst paths. Display an appropriate error message if an
executable downburst cannot be found.

Fixes: #4693
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Dan Mick <dan.mick@inktank.com>
2013-04-23 16:28:38 -07:00
Sandon Van Ness
1435cb5442 Merge branch 'next' of github.com:ceph/teuthology into next 2013-04-23 11:23:36 -07:00
Sandon Van Ness
0b50cb5e84 Increase IPMI attempts to try to get around Flakey IPMI.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-04-23 11:22:52 -07:00
Sage Weil
7fbe467f2f ceph.conf: enable full debugging on the mon 2013-04-23 11:02:27 -07:00
Sandon Van Ness
98cc648ca2 Increase IPMI attempts to try to get around Flakey IPMI.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-04-23 10:35:37 -07:00
Sage Weil
86ad464f3c Merge branch 'next' 2013-04-22 13:01:11 -07:00
Sage Weil
48d89c616a ceph-deploy: fix stop command
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 13:01:02 -07:00
tamil
861ac497e5 added ceph.client.admin.keyring on the client to run rbd and rados tests
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-04-19 18:23:54 -07:00
tamil
2bbac6e482 added extra packages required by ceph-deploy for rbd and rados tests
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-04-19 18:13:01 -07:00
Sage Weil
decdeadfff Merge branch 'next' 2013-04-18 11:11:38 -07:00
Sage Weil
4efed08415 ceph-deploy: stop daemons, archive, then purge[data]
Purge removes logs, and we want to archive those, so explicitly shut down
all daemons before doing the archiving step.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-18 10:15:44 -07:00
Sage Weil
a3c48351a4 ceph.conf: lower mon disk avail warning threshold
Only wanr when we hit 90% instead of default 70%

Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit cf4bf09b2c)
2013-04-18 10:15:44 -07:00
Sam Lang
a629e9eec4 Merge branch 'next'
Conflicts:
	teuthology/lock.py
	teuthology/lockstatus.py
	teuthology/misc.py
	teuthology/task/install.py
2013-04-17 18:09:39 -05:00
Sam Lang
77cf9f4b68 misc: Fix for case status['description'] == None
Skip the machine that has a description, but the
value is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-04-17 17:43:14 -05:00
caleb miles
2bcbf1846a radosgw-admin-rest: Add task for RESTful admin api.
Signed-off-by caleb miles <caleb.miles@inktank.com>
2013-04-17 08:49:26 -07:00
caleb miles
8db7b08ee6 radosgw-admin-rest: Add task for RESTful admin api.
Signed-off-by caleb miles <caleb.miles@inktank.com>
2013-04-17 08:48:47 -07:00
Sam Lang
3f78cb0d3a misc: Check for 'None' string from yaml
The description attribute from the machines yaml returned by the
locker might be the string 'None'.  Need to explicitly check for
that to avoid using a test dir of /tmp/cephtest/None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-17 10:36:42 -05:00
Sam Lang
438410a97a lock: Fix import cycle breakage
fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>

Conflicts:
	teuthology/lock.py
2013-04-17 10:36:42 -05:00
Dan Mick
dc457093b7 Revert "Revert "Install.py: Prevent prompts from breaking apt""
This reverts commit 67a616a979.

Sigh.  As it turns out, /etc/default/grub being hacked also
causes the same problem.  I think there's a way to fix that cleanly
as well, but until then, replacing the "accept installed version"
hack here so jobs can run.
2013-04-17 10:34:20 -05:00
Dan Mick
1b65b8fe1a Revert "Install.py: Prevent prompts from breaking apt"
This reverts commit 5995ae7e78.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit c2b0828b19)
2013-04-17 10:34:20 -05:00
Sandon Van Ness
df3d70fd62 Install.py: Prevent prompts from breaking apt
Change apt commands to prevent prompts from coming up (forcing
non-interactive mode) so things like grub or other stuff doesn't
break teuthology runs.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-17 10:34:20 -05:00
Sam Lang
750c69b08c misc: Check for 'None' string from yaml
The description attribute from the machines yaml returned by the
locker might be the string 'None'.  Need to explicitly check for
that to avoid using a test dir of /tmp/cephtest/None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-17 10:30:57 -05:00
Sam Lang
1727d9b356 misc: Use pythonic 'is not None' for jobid case
The conditional 'if global_jobid:' evaluates to true
in some cases even when global_jobid is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-17 10:30:20 -05:00
Sam Lang
c1d47a2c63 misc: Fix name parsing
Use last two digits of year.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-17 10:30:02 -05:00
Sam Lang
b37f43db1b lock: Fix import cycle breakage
fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>

Conflicts:
	teuthology/lock.py
2013-04-17 10:28:55 -05:00
Sam Lang
72cbf1157a misc: Use job id and make short path for testdir
Nightlies run on teuthology currently use a testdir of
/home/ubuntu/cephtest, but this causes stale job errors occasionally
from the previous tests not getting properly cleaned up, which prevents
the nightlies from running successfully.

The misc.py get_testdir() function can specify a testdir that is
specific to the job, but previously the path was too long and would
cause separate job failures.

This patch does two things to resolve that.  First, it uses the job id
from the teuthology run if one exists.  This should be a relatively
short number that will identify the job run effectively.  Second,
if the job id isn't available, it creates a shortened form of the
job's name, for example the job name:

teuthology-2013-04-09_23:51:49-rgw-next-testing-basic

becomes:

te1304092351rntb

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-17 10:24:16 -05:00
Sage Weil
e8aa0d8bb8 ceph-deploy: purge before archiving
Purge will uninstall and (in so doing) stop the daemons. This avoids trying
to tar up the mon data or logs while they are being written to, which
avoids errors like

2013-04-16T20:21:47.103 INFO:teuthology.task.ceph-deploy:Archiving mon data...
2013-04-16T20:21:47.545 INFO:teuthology.orchestra.run.err:tar: ./ceph-mira089/store.db/000009.log: file changed as we read it

Also drop the unnecessary uninstall (it is implied by purge).

Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 4befae4fbe)
2013-04-16 20:51:31 -07:00
Sage Weil
4befae4fbe ceph-deploy: purge before archiving
Purge will uninstall and (in so doing) stop the daemons. This avoids trying
to tar up the mon data or logs while they are being written to, which
avoids errors like

2013-04-16T20:21:47.103 INFO:teuthology.task.ceph-deploy:Archiving mon data...
2013-04-16T20:21:47.545 INFO:teuthology.orchestra.run.err:tar: ./ceph-mira089/store.db/000009.log: file changed as we read it

Also drop the unnecessary uninstall (it is implied by purge).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-16 20:51:21 -07:00
Sage Weil
33a6693f45 scheduled_suite.sh: check clock skew at start and end of run
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 5c80201ec4)
2013-04-16 19:58:38 -07:00
Sam Lang
90e2a2ef02 misc: Fix close() call to pass in fd
fd is an int, we need to use os.close().

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-15 16:26:22 -05:00
Sam Lang
cde1429e13 misc: Fix bug in calling function remote_mktemp()
Changed the name of the function to get a remote temporary filename,
need to update all the locations where it gets called.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-15 16:16:34 -05:00
Sam Lang
fe513353c6 Merge pull request #11 from ceph/wip-4717
misc: Use tempfile.mkstemp() instead of tempnam
2013-04-15 11:44:59 -07:00
Sam Lang
3b0d91533e misc: Use tempfile.mkstemp() instead of tempnam
tempnam() is considered an unsafe security risk because the filename
generated is easy to guess and can be symlinked in advance.  Use
mkstemp() instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Joe Buck <jbbuck@gmail.com>
2013-04-15 13:43:20 -05:00
Dan Mick
f69ddafde7 Revert "Revert "Install.py: Prevent prompts from breaking apt""
This reverts commit 67a616a979.

Sigh.  As it turns out, /etc/default/grub being hacked also
causes the same problem.  I think there's a way to fix that cleanly
as well, but until then, replacing the "accept installed version"
hack here so jobs can run.
2013-04-15 11:24:31 -07:00
Sam Lang
50aaece463 misc: Use pythonic 'is not None' for jobid case
The conditional 'if global_jobid:' evaluates to true
in some cases even when global_jobid is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-13 10:12:45 -05:00
Sam Lang
715245e100 misc: Fix name parsing
Use last two digits of year.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-12 19:04:19 -05:00
Dan Mick
67a616a979 Revert "Install.py: Prevent prompts from breaking apt"
This reverts commit 5995ae7e78.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit c2b0828b19)
2013-04-12 15:49:24 -07:00
Dan Mick
52cdaae683 kernel.py: put submenu name in 01_ceph_kernel if necessary
We had been writing 01_ceph_kernel with the kernel title, and
relying on the fact that grub.cfg would never have submenus in it
(implemented by a hack to /etc/grub.d/10_linux which neutered its
submenu creation).  However, that hack was modifying a package file,
and got in the way of later apt commands.  Rather than doing it
that way, this divines the title of the submenu and sets the
default variable to "submenu>kernel", which works to select the
desired kernel.

It depends on there being only one level of submenu, and on the
format of the menuentry and submenu commands, dictated by grub2.
None of this is likely to work at all outside Ubuntu.

Fixes: #4496
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit 52aec32a7d)
2013-04-12 15:49:15 -07:00
Samuel Just
2c7b1f3d62 peer.py: we can't assume pg query state will match mon pg state
The pg state could easily have changed in the mean time,
for example, from recovery_wait to recovering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-12 15:02:16 -07:00
Sam Lang
35e6db72a1 Merge pull request #10 from ceph/wip-fix-importcycle
lock: Fix import cycle breakage

Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-04-12 13:06:55 -07:00
Dan Mick
c2b0828b19 Revert "Install.py: Prevent prompts from breaking apt"
This reverts commit 5995ae7e78.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-04-12 10:57:41 -07:00
Sam Lang
a6b84a5613 lock: Fix import cycle breakage
fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-12 12:57:33 -05:00
wusui
a246f1e8ad Merge pull request #9 from ceph/wip-short-testdir
misc: Use job id and make short path for testdir

Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-04-12 10:22:15 -07:00
Dan Mick
52aec32a7d kernel.py: put submenu name in 01_ceph_kernel if necessary
We had been writing 01_ceph_kernel with the kernel title, and
relying on the fact that grub.cfg would never have submenus in it
(implemented by a hack to /etc/grub.d/10_linux which neutered its
submenu creation).  However, that hack was modifying a package file,
and got in the way of later apt commands.  Rather than doing it
that way, this divines the title of the submenu and sets the
default variable to "submenu>kernel", which works to select the
desired kernel.

It depends on there being only one level of submenu, and on the
format of the menuentry and submenu commands, dictated by grub2.
None of this is likely to work at all outside Ubuntu.

Fixes: #4496
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-04-11 13:41:45 -07:00
Sam Lang
fa2049ffbb misc: Use job id and make short path for testdir
Nightlies run on teuthology currently use a testdir of
/home/ubuntu/cephtest, but this causes stale job errors occasionally
from the previous tests not getting properly cleaned up, which prevents
the nightlies from running successfully.

The misc.py get_testdir() function can specify a testdir that is
specific to the job, but previously the path was too long and would
cause separate job failures.

This patch does two things to resolve that.  First, it uses the job id
from the teuthology run if one exists.  This should be a relatively
short number that will identify the job run effectively.  Second,
if the job id isn't available, it creates a shortened form of the
job's name, for example the job name:

teuthology-2013-04-09_23:51:49-rgw-next-testing-basic

becomes:

te1304092351rntb

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-11 13:53:38 -05:00
Sandon Van Ness
e2938f4026 Fix for kdb: doesn't work on mira nodes
This is a fix for issue #4677 which was caused by kdb output being
hard-coded to ttyS1 which is fine for all our hardware except mira
machines. This change just checks to see if mira is in the host's
name and uses ttyS2 instead (simple fix).
2013-04-09 13:12:58 -07:00
Sandon Van Ness
9c9baef680 Fix: kdb: doesn't work on mira nodes
Change kernel.py to use ttyS2 for kdb output instead of ttyS1 when
the node is a mira machine. This is a fix for issue #4677
2013-04-09 13:09:39 -07:00
Joe Buck
826ad0d816 teuthology: fix for ssh-keys-task
Resolves an issue where we
were not properly escaping the generated
public key when doing matches against it.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewd-by: Sam Lang <sam.lang@inktank.com>
2013-04-08 11:54:55 -07:00