Commit Graph

1971 Commits

Author SHA1 Message Date
Warren Usui
567961b727 Readjust the indentation of mon_clock_skew_check.py and mon_thrash.py.
Added docstrings.

Fixes: 6537
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-13 20:36:08 -08:00
Zack Cerza
2b9a6bc035 Merge pull request #182 from ceph/wip-imports
Make libvirt import optional (OS X)
2014-02-12 10:46:35 -06:00
Alfredo Deza
d728b7ab6a Merge pull request #189 from ceph/wip-6536-wusui
Add docstrings to s3 related tasks.
2014-02-12 10:12:35 -05:00
Warren Usui
cee713ff6a Add docstrings to s3 related tasks.
Fixes: 6536
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-11 20:21:06 -08:00
Zack Cerza
51957433db Merge pull request #184 from ceph/wip-pg-scrub-from-ceph-wusui
Scrub osds when ceph task finishes
2014-02-10 18:25:14 -06:00
Warren Usui
ceffee40d4 Make sure osd_scrub_pgs gets called if healthy call fails.
Fixes: 7198
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-10 09:28:39 -08:00
Warren Usui
69f29d17f0 Move osd_scrub_pgs call to finally block of ceph main task.
Fixes: 7198
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-07 14:54:23 -08:00
Sage Weil
330612dc87 schedule_suite: be slightly more verbose
Signed-off-by: Sage Weil <sage@inktank.com>
2014-02-05 15:20:15 -08:00
Alfredo Deza
0d2cbe64f4 Merge pull request #185 from ceph/wip-7342
Enable killing runs that haven't even started yet
2014-02-05 14:46:14 -05:00
Zack Cerza
7759473e3e Enable killing runs that haven't even started yet
This requires passing --machine-type.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-02-05 13:43:34 -06:00
Sage Weil
c2bd8464d8 valgrind: fix msync suppression
Signed-off-by: Sage Weil <sage@inktank.com>
2014-02-05 11:26:41 -08:00
Warren Usui
2bcc60061c Implement the rest of the checks for automatic pg scrubbing.
osd_scrub_pgs now:
1.) Ensures that all pgs are clean and active.
2.) Performs the scrub of all OSDs.
3.) Waits until all pgs have been scrubbed or no progress in scrubbing has
    been made for two minutes.

These changes have been made on top of a previous check-in that was incomplete

Fixes: 7198
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-03 22:18:38 -08:00
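
Step 3 of the commit message above describes a wait loop that gives up when scrubbing stops making progress. A minimal sketch of that pattern follows. The names `wait_for_scrub` and `count_scrubbed`, the polling interval, and the injectable `clock` and `sleep` parameters are assumptions made for illustration; they are not taken from the teuthology source.

```python
import time


def wait_for_scrub(count_scrubbed, total, stall_timeout=120.0,
                   poll_interval=10.0, clock=time.monotonic, sleep=time.sleep):
    """Return True once `total` pgs are scrubbed, False if progress stalls.

    count_scrubbed is a hypothetical callable that reports how many pgs
    have been scrubbed so far.
    """
    best = count_scrubbed()
    last_progress = clock()
    while best < total:
        # Give up if nothing new has been scrubbed for stall_timeout seconds.
        if clock() - last_progress >= stall_timeout:
            return False
        sleep(poll_interval)
        current = count_scrubbed()
        if current > best:
            # Progress was made, so restart the stall timer.
            best = current
            last_progress = clock()
    return True
```

The stall timer restarts on every bit of progress, so a slow but advancing scrub is not cut off; only a scrub that stops advancing for the full two minutes ends the wait.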
Sage Weil
be3ea71cdb valgrind: another tcmalloc suppression
With this I can run a simple job with a normal build (not
notcmalloc) and valgrind does not complain.  Fingers crossed!

Signed-off-by: Sage Weil <sage@inktank.com>
2014-02-01 22:05:14 -08:00
Warren Usui
e8717163a6 Scrub osds when ceph task finishes
Fixes: 7198
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-01-31 15:29:02 -08:00
Zack Cerza
c1bc7fcc7f Merge pull request #180 from ceph/wip-7278
install.py: look for 'local' in config directly, don't warn if not found
2014-01-31 13:41:56 -08:00
Zack Cerza
8213ebbdca Merge pull request #183 from ceph/wip-enhance
Wip enhance
2014-01-31 13:41:21 -08:00
Dan Mick
a321b6e79c README.rst: describe new pp object and unlock_on_failure option.
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-31 13:30:08 -08:00
Zack Cerza
915cb5d4f1 Merge pull request #181 from ceph/wip-thrash-exception
Fix MDS wait when get_mds_status returns None
2014-01-31 13:12:43 -08:00
Dan Mick
1845a5ab68 interactive.py: add handy "pp" for prettyprinting at the prompt
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-31 13:07:51 -08:00
Dan Mick
77ec431105 internal.py: add global config key 'unlock_on_failure'
When testing new tasks, it's a drag to have to keep manually unlocking
the targets.  Default behavior is to keep them locked for any failure;
this allows overriding that choice for the case where the failure is
in debugging the task, where the state of the targets isn't important.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-31 13:07:45 -08:00
John Spray
b80e510257 Make libvirt import optional (OS X)
2014-01-31 20:53:45 +00:00
John Spray
aaf9b51c60 Fix MDS wait when get_mds_status returns None
2014-01-31 20:49:13 +00:00
Dan Mick
c489e4408b install.py: look for 'local' in config directly, don't warn if not found
Fixes: #7278
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-30 23:47:03 -08:00
Zack Cerza
de771ddd7a Update README.rst to reflect changes in libvirt
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-30 14:54:23 -06:00
Alfredo Deza
cc3956adf4 fix indentation level for task
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit 1397d29963b2135998ccaaa9a89c83e0bb9710f4)
2014-01-28 16:04:46 -05:00
Alfredo Deza
bdafbc0c56 make sure we are using a dict
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit fe4c97dc4f5f893df526f8694d4b9fafb064f9d3)
2014-01-28 16:04:25 -05:00
Zack Cerza
81d1405a15 Attempt to fix #7241
This involves moving everything in build_ceph_cluster() inside the try:
block, so if an exception fails the cleanup in the finally: block will
actually be executed.
2014-01-28 10:05:03 -06:00
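
The fix described above relies on a general property of `try`/`finally`: cleanup runs only for failures raised inside the `try:` block. The sketch below shows that property in isolation. `run_with_cleanup`, `steps`, and `cleanup` are hypothetical names and do not reflect the actual body of build_ceph_cluster().

```python
def run_with_cleanup(steps, cleanup):
    """Run each step; always call cleanup, even if a step raises.

    steps is a list of hypothetical setup callables. Because every step
    runs inside the try block, a failure in any of them still reaches
    the finally clause.
    """
    try:
        for step in steps:
            step()
    finally:
        cleanup()
```

Any setup work left above the `try:` line would skip the cleanup if it raised, which is the situation the commit message describes.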
Zack Cerza
427bf93c88 Fix PEP-8 linter complaints
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-28 09:54:54 -06:00
Sage Weil
4c98d67639 Merge pull request #175 from ceph/wip-tcmalloc
valgrind: ignore tcmalloc uninitialized memory
2014-01-27 21:14:17 -08:00
Zack Cerza
e424d78c6b Be more verbose about log file locations
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-27 12:28:53 -06:00
Zack Cerza
50722a7d9c Symlink worker logs into job archive dir
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-24 10:19:43 -06:00
Alfredo Deza
60da6e0c6f Merge pull request #177 from ceph/wip-7213
use create-initial for deploying monitors
2014-01-24 05:54:37 -08:00
Alfredo Deza
cf250a8f4a use create-initial for deploying monitors
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-01-23 16:41:49 -05:00
Zack Cerza
e8bb1654b2 call wait() on the teuthology-results Popen object
This ought to fix the issue where zombie teuthology-results processes
stick around.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-17 10:05:21 -06:00
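
For context on the commit above: on POSIX systems a child process that has exited stays in the process table as a zombie until its parent collects the exit status. A minimal sketch using the standard `subprocess` module is shown below. The helper name `run_and_reap` is hypothetical, and the real code may call wait() at a different point than immediately after launch.

```python
import subprocess
import sys


def run_and_reap(argv):
    """Start a child process and collect its exit status.

    Popen.wait() blocks until the child exits and reaps it, so no
    zombie entry is left behind.
    """
    proc = subprocess.Popen(argv)
    return proc.wait()


if __name__ == "__main__":
    # Example: run a trivial Python child and print its exit code.
    print(run_and_reap([sys.executable, "-c", "pass"]))
```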
Alfredo Deza
1ddc3fdb6a Merge pull request #176 from ceph/wip-7164
lock.py: request only rsa keys from ssh-keyscan
2014-01-16 13:00:37 -08:00
Dan Mick
381d4aa873 lock.py: request only rsa keys from ssh-keyscan
New versions of ssh-keyscan return two keytypes if possible; this
breaks the comparison of "number of lines of output from keyscan"
to "number of hosts we request keys from".  Fix by asking for only
one type of key (as older ssh-keyscans did).

Fixes: #7164
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2014-01-16 12:51:39 -08:00
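
The check described in the commit above compares the number of key lines returned with the number of hosts queried, which only holds if each host yields exactly one key. A rough sketch of that idea follows. `keyscan_argv` and `keys_match_hosts` are hypothetical helpers, not the functions in lock.py; the only real interface used is the `-t rsa` option of ssh-keyscan.

```python
def keyscan_argv(hosts):
    """Build an ssh-keyscan command line that requests only RSA keys."""
    return ["ssh-keyscan", "-t", "rsa"] + list(hosts)


def keys_match_hosts(output, hosts):
    """Return True if the scan output has exactly one key line per host."""
    lines = [line for line in output.splitlines() if line.strip()]
    return len(lines) == len(hosts)
```

With two key types per host the line count doubles and the comparison fails, which is the breakage the commit message reports.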
Samuel Just
66312f7ea1 ceph_manager: in test_map_discontinuity, delay killing osd for 20s
This should give enough time for the replica to get the message
marking them clean.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-01-16 09:57:44 -08:00
Zack Cerza
53fc2d93dd Log a warning when killing long-running jobs.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-16 10:53:53 -06:00
Zack Cerza
769ef8a960 Kill jobs that run for over 3 days (configurable)
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-01-16 10:38:39 -06:00
Sage Weil
69d4962759 Merge pull request #174 from ceph/wip-7171
thrashosds: change min_in from 2 -> 3
2014-01-10 11:08:31 -08:00
Sage Weil
9a9ee610bd valgrind: ignore tcmalloc uninitialized memory
This is the main source of noise when running valgrind +
tcmalloc.  Apparently there are other issues, so I think we
still need the notcmalloc gitbuilder, but this gets us part of
the way.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-10 11:02:16 -08:00
Sage Weil
495f2163a8 thrashosds: change min_in from 2 -> 3
See #7171. In rare cases CRUSH can't handle it when only 2/6 of
the OSDs are marked in.  Avoid those situations for now.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-10 11:00:55 -08:00
Zack Cerza
0c339fe427 Merge pull request #169 from ceph/wip-utsrelease
kernel: use utsrelease string for need_to_install() purposes
2014-01-10 09:22:40 -08:00
Ilya Dryomov
bf481b9134 kernel: use utsrelease string for need_to_install() purposes
Currently, to see if a node has rebooted into the right kernel,
need_to_install() compares a given 40-char commit hash with a 7-char
commit hash abbreviation it pulls from the output of 'uname -r'.

gitbuilders can now export UTS_RELEASE kernel version string through
.../$SHA1/version file.  Use this string instead of the 40-char commit
hash and compare it with the output of 'uname -r' directly.  This saves
us the parsing exercise and, more importantly, makes it possible to
install clean tagged kernels using 'tag:' element, which wasn't
possible before because version string of such kernels doesn't have
a commit hash in it.

If the version file is unavailable, fall back to the existing way of doing
things.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-01-10 12:26:09 +02:00
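
The decision logic described in the commit above can be sketched roughly as follows. This is an illustration under stated assumptions: `kernel_needs_install` and its parameters are hypothetical and do not match the real need_to_install() signature, and the fallback is simplified to a substring test on the 7-character hash abbreviation.

```python
def kernel_needs_install(running_release, wanted_sha1, wanted_release=None):
    """Decide whether the node still needs the requested kernel.

    running_release: output of `uname -r` on the node.
    wanted_sha1:     40-char commit hash of the requested kernel.
    wanted_release:  UTS_RELEASE string from the gitbuilder version file,
                     or None if that file is unavailable.
    """
    if wanted_release is not None:
        # Preferred path: compare the full version strings directly.
        return running_release.strip() != wanted_release.strip()
    # Fallback (simplified here): look for the abbreviated hash in the
    # running kernel's version string.
    return wanted_sha1[:7] not in running_release
```

Comparing full version strings also covers tagged kernels, whose version string carries no commit hash for the fallback to find.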
Alfredo Deza
4c049b37e6 Merge pull request #173 from ceph/wip-ceph-deploy
fix ceph-deploy tests to use 2x replication
2014-01-09 05:43:04 -08:00
Sage Weil
1b4368b3cf schedule_suite: 2x replication for ceph-deploy
This avoids having to update all of the tests.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-08 17:35:10 -08:00
Zack Cerza
c4b9dae4f6 Merge pull request #172 from ceph/wip-teuthvmcon-wusui
Fix a bug where ctx.config['targets'] was looped through again
2014-01-07 15:09:39 -08:00
Warren Usui
f32adf055a Fix a bug where ctx.config['targets'] was looped through again
in connect().  The bug caused vm behavior to happen for a
target if any of the machines in the cluster was a vm.  The code
was also changed to set the key to none only if rsa or dsa keys
were used on a vm.

Fixes: 7113
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-01-07 14:22:57 -08:00
Alfredo Deza
0c9129bf43 error after 15 minutes of waiting for gatherkeys
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit ad18717505c407f2c8bb995af8fd516b61957661)
2014-01-06 15:25:33 -05:00
Zack Cerza
5c0496633c Enable reporting of entire runs as dead
2014-01-06 11:08:12 -06:00