1003 Commits

Author SHA1 Message Date
Sage Weil
7620323559 Merge pull request #1196 from ceph/wip-dump-stuck
tasks/dump_stuck: fix failure

/home/teuthworker/archive/sage-dump-stuck/457624
2016-10-06 11:11:26 -05:00
Sage Weil
f0dffe567d tasks/dump_stuck: mark down osds one at a time
This forces them to be unclean, *then* stale.  This ensures
that after they are both down they are both *always* unclean,
whereas previously it would be possible for them to be only
stale and not unclean.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-10-05 15:30:23 -04:00
Sage Weil
5ef07c32f8 tasks/dump_stuck: fix unclean count
Signed-off-by: Sage Weil <sage@redhat.com>
2016-10-05 15:24:11 -04:00
Sage Weil
50f2022412 tasks/dump_stuck: more verbose
Signed-off-by: Sage Weil <sage@redhat.com>
2016-10-05 15:24:10 -04:00
David Zafman
c7bcb2bd54 multibench: Fixed so I can use this standalone
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-10-04 21:30:31 -07:00
David Zafman
60cdb05380 ceph_manager: do_pg_scrub() don't resubmit a request for 2 minutes
Add 6 minute fatal timeout
Warn when repair is being resubmitted because it causes races

Signed-off-by: David Zafman <dzafman@redhat.com>
2016-10-04 21:30:31 -07:00
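The timing rules named in 60cdb05380 (wait 2 minutes before resubmitting, give up after 6 minutes) can be sketched as a polling loop. This is only an illustration under stated assumptions, not the code in ceph_manager: `wait_for_scrub`, `submit`, `is_done`, and the `clock`/`sleep`/`poll` parameters are hypothetical names introduced here.

```python
import time

RESUBMIT_INTERVAL = 120  # seconds; the "2 minutes" from the commit subject
FATAL_TIMEOUT = 360      # seconds; the "6 minute fatal timeout"


def wait_for_scrub(submit, is_done, clock=time.monotonic, sleep=time.sleep,
                   poll=1.0):
    """Submit a request and poll until is_done() returns True.

    submit and is_done are hypothetical callables supplied by the caller.
    Returns how many times the request was submitted.
    """
    start = clock()
    last_submit = start
    submit()
    submissions = 1
    while not is_done():
        now = clock()
        if now - start >= FATAL_TIMEOUT:
            raise RuntimeError(
                "request did not complete within %d seconds" % FATAL_TIMEOUT)
        if now - last_submit >= RESUBMIT_INTERVAL:
            # Resubmit only after a full interval has passed since the
            # previous submission.
            submit()
            submissions += 1
            last_submit = now
        sleep(poll)
    return submissions
```

Passing the clock and sleep functions as parameters keeps the loop testable without real waiting.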
Patrick Donnelly
84dd1a6283 Merge remote-tracking branch 'upstream/pull/1176/head'
* upstream/pull/1176/head:
  tasks/cephfs: add default paths to vstart_runner

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2016-09-28 12:53:41 -04:00
John Spray
4d6953a46e tasks/cephfs: add default paths to vstart_runner
So that for folks with sources in typical locations
(or typical on my workstation at least!) invoking
vstart_runner is less of a mouthful.

Signed-off-by: John Spray <john.spray@redhat.com>
2016-09-28 16:38:30 +01:00
Kefu Chai
176811f8e1 Revert "ceph_manager: lower mon_reweight_min_bytes_per_osd to 10"
2016-09-26 23:15:46 +08:00
Gregory Farnum
652d9b65da Merge pull request #1185 from jcsp/wip-dirfrags-2
tasks/ceph: enable dirfrags in cephfs
2016-09-23 14:12:39 -07:00
John Spray
cc8198d8eb tasks/ceph: enable dirfrags in cephfs
Otherwise places we set mds_bal_frag have
no effect.

Signed-off-by: John Spray <john.spray@redhat.com>
2016-09-23 11:04:54 +01:00
Sage Weil
dccdb2eb0f tasks/ceph_manager: add bdev_inject_crash_probability support
Only do the failure injection 50% of the time; otherwise, just
kill as usual.

Signed-off-by: Sage Weil <sage@redhat.com>

# Conflicts:
#	tasks/ceph_manager.py
2016-09-21 17:07:28 -04:00
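The idea in dccdb2eb0f, injecting a failure only some of the time and otherwise killing as usual, might look like the sketch below. The config key is taken from the commit subject; the function name `choose_kill_method`, the return values, and the 0.0 default are assumptions made for illustration and do not reflect the actual ceph_manager code.

```python
import random


def choose_kill_method(config, rng=random.random):
    """Return 'inject' or 'kill' for one OSD shutdown.

    config is a plain dict; rng is any callable returning a float in [0, 1).
    Both the function and its return values are hypothetical.
    """
    probability = config.get('bdev_inject_crash_probability', 0.0)
    if rng() < probability:
        return 'inject'
    return 'kill'
```

With a probability of 0.5, as in the commit message, about half of the calls would choose injection.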
Sage Weil
f4de41c24f inject bdev failures when killing osd on bluestore
Signed-off-by: Sage Weil <sage@redhat.com>
2016-09-21 17:01:33 -04:00
Sage Weil
2ae53848cf tasks/ceph_manager: respect bdev_inject_crash in kill_osd()
Note that this will only work with bluestore osds.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-09-21 17:01:29 -04:00
Sage Weil
4936d40438 tasks/thrashosds: fix powercycle, bdev_inject_crash
The CephManager.kill_osd expects these in its config, not
Thrasher's.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-09-21 16:28:00 -04:00
Kefu Chai
affccb1a6c Merge pull request #1169 from ceph/wip-kefu-17179
tasks: add rebuild_mondb

Reviewed-by: Samuel Just <sjust@redhat.com>
2016-09-17 21:29:54 +08:00
Casey Bodley
f51a65523a Merge pull request #1136 from ceph/wip-16129
rgw: add sleep to let the sync agent init

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2016-09-16 10:43:52 -04:00
Kefu Chai
525d9899ce Merge pull request #1118 from ceph/wip-scrub
tasks/scrub_test: use fuse path (do not assume filestore!)

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-09-13 14:41:06 +08:00
Kefu Chai
3f0ab86d26 Merge pull request #1124 from ceph/wip-offline-split
ceph_manager: test offline split via ceph-objectstore-tool

Reviewed-by: Kefu Chai <kchai@redhat.com>
2016-09-11 11:33:03 +08:00
Sage Weil
a61dc2a1c1 tasks/scrub_test: use fuse path (do not assume filestore!)
Signed-off-by: Sage Weil <sage@redhat.com>
2016-09-08 10:13:28 -04:00
Kefu Chai
a192c786b3 tasks: add rebuild_mondb
* tasks/rebuild_mondb.py: this task
  1. removes all store.db on all monitors
  2. rebuild the store.db for the first mon
  3. start the first mon
  4. run mkfs on other mon
  5. and revive them
* suites/rados/singleton/all/rebuild-mon-db.yaml
  1. run rados/test.sh
  2. run rebuild_mondb task

Fixes: http://tracker.ceph.com/issues/17179
Signed-off-by: Kefu Chai <kchai@redhat.com>
2016-09-08 21:35:01 +08:00
Zack Cerza
eb3abfcd9d Merge pull request #1153 from ceph/wip-manager-construct
tasks/ceph: construct CephManager earlier
2016-09-06 14:35:27 -06:00
Kefu Chai
a053ce091e tasks/ceph_manager: allow ObjectStoreTool to not revive OSD after done with it
The rebuild_mondb task runs the OSDs with no monitor alive, so
self.manager.revive_osd() will always time out after calling cot.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2016-09-06 17:56:10 +08:00
Nathan Cutler
67de197ce5 buildpackages/make-rpm.sh: adjust Source0 spec file line on SUSE
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-09-05 21:42:53 +02:00
Nathan Cutler
29671063f9 buildpackages: use unadulterated make-dist
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-09-05 21:33:21 +02:00
Nathan Cutler
44d9acb677 buildpackages/make-rpm.sh: use /etc/os-release
The lsb_release binary is deprecated and requires installation of packages.

The /etc/os-release file is guaranteed to be present.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-09-05 21:33:21 +02:00
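The change in 44d9acb677 is to a shell script, but the idea of reading distribution details from /etc/os-release rather than calling lsb_release can be sketched in Python. The file holds shell-style KEY=value lines with optional quoting. `parse_os_release` is a hypothetical helper, not part of buildpackages.

```python
import shlex


def parse_os_release(text):
    """Parse the contents of an os-release file into a dict."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not KEY=value.
        if not line or line.startswith('#') or '=' not in line:
            continue
        key, _, raw = line.partition('=')
        # shlex removes shell-style quoting around the value.
        parts = shlex.split(raw)
        info[key] = parts[0] if parts else ''
    return info
```

On a real host the input would come from reading /etc/os-release; the parser takes text so it can be exercised without touching the filesystem.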
vasukulkarni
cda292c9d6 Merge pull request #1158 from ceph/wip-fio-unmap-devices
tasks/rbd_fio: unmap rbd devices on cleanup
2016-09-01 10:00:23 -07:00
Ilya Dryomov
15be2d29be tasks/rbd_fio: unmap rbd devices on cleanup
Not doing so leads to issues and can interfere with subsequent jobs.
One example is the invocation of vgs(8) during the initial test setup:
it will issue a read to the left-behind rbd device(s) whose backing
cluster is long gone, locking up the job.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2016-09-01 18:13:55 +02:00
Ilya Dryomov
ec97445740 tasks/rbd_fio: don't use sudo unnecessarily
Creating and cloning images doesn't require sudo.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2016-09-01 15:51:07 +02:00
Loic Dachary
2525fedc3d Merge pull request #1155 from SUSE/wip-suse-ceph-test
buildpackages: fix make-rpm.sh for SUSE

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2016-09-01 12:04:27 +02:00
Nathan Cutler
0b10284caa buildpackages: force ceph-test build on SUSE
The ceph-test package is required for teuthology. It is disabled to speed up
the build in OBS, but here we need it enabled unconditionally.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-09-01 10:33:17 +02:00
Ricardo Dias
494b623602 buildpackages: fix RPM generation script to support opensuse spec file
Signed-off-by: Ricardo Dias <rdias@suse.com>
(cherry picked from commit 4692f995ba6967796d5dd2f5fd85fde8705a1ccc)
2016-09-01 10:32:10 +02:00
Gregory Farnum
c2efbd39c1 Merge pull request #1112 from jcsp/wip-16909
tasks/cephfs: test standby replay teardown on rank stop

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2016-08-31 14:03:24 -07:00
John Spray
c444db12d4 tasks/ceph: construct CephManager earlier
Previously, if errors occurred during healthy(), then
the finally block would invoke osd_scrub_pgs, which relies
on CephManager being constructed, and it would die, hiding
the original exception.

Signed-off-by: John Spray <john.spray@redhat.com>
2016-08-31 15:32:33 +01:00
Loic Dachary
b4bb54e5df Merge pull request #1141 from rjfd/wip-test-buildpackages
buildpackages: make make-{deb,rpm}.sh aware of cmake

Reviewed-by: Loic Dachary <ldachary@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.cz>
2016-08-26 09:23:14 +02:00
Josh Durgin
f28f881bda ceph_manager: test offline split via ceph-objectstore-tool
When killing an osd, split all pools with a low threshold.
This will slow down tests, but should not impact correctness.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-08-26 00:28:32 +00:00
Loic Dachary
b9813e490e workunit: allow parallel git clone
When multiple clients run in parallel on the same machine and they
try to get workunits from a repository that is not github, they must
git clone in a directory that is suffixed as srcdir. Otherwise they
will conflict with each other.

Fixes: http://tracker.ceph.com/issues/17116

Signed-off-by: Loic Dachary <loic@dachary.org>
2016-08-25 11:34:46 +02:00
Zack Cerza
bf797f0b1f Update console checking, and DRY
Signed-off-by: Zack Cerza <zack@redhat.com>
2016-08-24 11:13:32 -06:00
Zack Cerza
821cf2caa6 Drop unnecessary console handling code
Remote.console is provided by teuthology now
http://tracker.ceph.com/issues/17124
Fixes: 17124

Signed-off-by: Zack Cerza <zack@redhat.com>
2016-08-24 11:11:25 -06:00
Nathan Cutler
65590aa039 buildpackages: make make-{deb,rpm}.sh aware of cmake
Fixes: http://tracker.ceph.com/issues/16819
Signed-off-by: Ricardo Dias <rdias@suse.com>
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2016-08-24 14:38:17 +01:00
Orit Wasserman
ad5a2a2f19 rgw: add debug info when comparing bucket metadata
Signed-off-by: Orit Wasserman <owasserm@redhat.com>
2016-08-24 13:22:34 +02:00
Samuel Just
6394a9e52f ceph_manager: do not use StringIO instance as default argument
Otherwise, all invocations share the *same* StringIO instance
leading to some very silly bugs.

Signed-off-by: Samuel Just <sjust@redhat.com>
2016-08-23 09:53:39 -07:00
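The pitfall behind 6394a9e52f is that Python evaluates a default argument once, when the function is defined, so a mutable default is shared by every call. A minimal sketch follows; `run_shared` and `run_fresh` are hypothetical names rather than functions from ceph_manager, and `io.StringIO` is used here so the example runs on Python 3.

```python
from io import StringIO


def run_shared(text, out=StringIO()):
    # Bug: the default StringIO is created once, at definition time, so
    # every call that omits `out` writes into the same buffer.
    out.write(text)
    return out.getvalue()


def run_fresh(text, out=None):
    # Fix: use None as the sentinel and create a new buffer per call.
    if out is None:
        out = StringIO()
    out.write(text)
    return out.getvalue()
```

Calling `run_shared("a")` and then `run_shared("b")` returns "a" and then "ab", while `run_fresh` returns "a" and then "b".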
Kefu Chai
5c4aeb10c4 ceph_manager: lower mon_reweight_min_bytes_per_osd to 10
otherwise monitor could reject the command:
```
Refusing to reweight: we only used 588084 kb used across all osds!
```
if the average used space is smaller than
`mon_reweight_min_bytes_per_osd`.

Fixes: http://tracker.ceph.com/issues/16805
Signed-off-by: Kefu Chai <kchai@redhat.com>
2016-08-23 02:18:35 +08:00
Samuel Just
3052452e64 Merge pull request #1122 from ceph/wip-fixbiglogs
Fix big logs in rados thrash runs

Reviewed-by: Samuel Just <sjust@redhat.com>
2016-08-19 07:52:53 -07:00
David Zafman
9d52d10898 revive_osd: Suppress output used to wait for OSD to be fully started
Signed-off-by: David Zafman <dzafman@redhat.com>
2016-08-18 21:10:54 -07:00
David Zafman
d762fa563e Thrashosds: Turn off output when doing dump_ops testing
Needed to add stdout option to pass down to run()

Signed-off-by: David Zafman <dzafman@redhat.com>
2016-08-18 09:02:14 -07:00
Casey Bodley
171eadd6a3 Merge pull request #1119 from ceph/wip-rgw-17025
rgw: data_extra_pool is unique per zone

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2016-08-15 09:38:10 -04:00
Orit Wasserman
c6cdd0905f rgw: data_extra_pool is unique per zone
Fixes: http://tracker.ceph.com/issues/17025
Signed-off-by: Orit Wasserman <owasserm@redhat.com>
2016-08-15 14:52:56 +02:00
John Spray
f23e15c9fa tasks/cephfs: test standby replay teardown on rank stop
Fixes: http://tracker.ceph.com/issues/16909
Signed-off-by: John Spray <john.spray@redhat.com>
2016-08-09 14:29:13 +01:00
Douglas Fuller
e01ff2e3c1 cephfs/vstart_runner: add -ww option to ps
vstart_runner can't find arguments to ceph daemons to identify them with
ps -x because commands are cut off at terminal width. Add -ww for wide
output.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
2016-08-09 06:19:14 -07:00