Commit Graph

656 Commits

Author SHA1 Message Date
Loic Dachary
8f9de175e0 ceph: log which ceph.conf file is written
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-10-21 15:49:48 +02:00
John Spray
4a83c56604 tasks/cephfs: include an empty dir in journal repair
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-19 17:57:53 +01:00
Zack Cerza
da22f74ad1 Merge pull request #632 from ceph/wip-rbd-dir-fix
use cephtest as base dir during testing
2015-10-14 17:28:14 -06:00
Vasu Kulkarni
bdf79881df Use cephtest as base dir for testing and let nuke cleanup in case of abrupt failures
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2015-10-14 17:27:00 -04:00
John Spray
176e9ef267 tasks/cephfs: add test_failover
A quick check that clients refuse to mount
when daemons are laggy, and while we're at it,
that the basics of failover work.  It's a trivial
test, but it's nice to have this kind of thing
so that we don't have to wait for weird thrasher
failures if something breaks.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-14 02:20:00 +01:00
John Spray
043049f6c3 tasks/cephfs: add a wait_for_daemons
...to avoid trying to mount a client
before the daemons are ready

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-14 02:16:13 +01:00
John Spray
267ef8bad7 Merge pull request #624 from ceph/wip-13437
Wip 13437 (tasks/cephfs: fix TestClientLimits.test_client_oldest_tid …)

Reviewed-by: John Spray <john.spray@redhat.com>
2015-10-13 22:50:52 +01:00
Loic Dachary
baa8ea327e buildpackages: implement notcmalloc flavor
http://tracker.ceph.com/issues/13454 Fixes: #13454

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-13 14:56:10 +02:00
Kefu Chai
8d1c15e9a5 Merge pull request #621 from dachary/wip-13449-admin-socket
tasks/admin_socket.py: wait 120 seconds instead of 60

Reviewed-by: Kefu Chai <kchai@redhat.com>
2015-10-13 17:04:55 +08:00
Loic Dachary
ccd1bc2c03 Merge pull request #619 from dachary/wip-buildpackages-submodules
buildpackages: get ceph submodules
2015-10-13 10:55:55 +02:00
Yan, Zheng
b03ac07980 tasks/cephfs: fix TestClientLimits.test_client_oldest_tid
To get the health warning, first we need to make sure requests are
added to session's completed request list. Then we need to send an
extra request to MDS to trigger the code that generates the warning.

Fixes: #13437
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2015-10-13 16:53:39 +08:00
Loic Dachary
c0828cae19 tasks/admin_socket.py: wait 120 seconds instead of 60
When running on virtual machines, it may take more than one minute for a
daemon to create the admin socket.

http://tracker.ceph.com/issues/13449 Fixes: #13449

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-13 10:25:18 +02:00
Sage Weil
2545265f17 Merge remote-tracking branch 'gh/infernalis' 2015-10-12 13:04:19 -04:00
Loic Dachary
81f5c3ad03 buildpackages: get ceph submodules
Prior to v0.80.9, autogen.sh did not get submodules. Copy/paste the
submodule initialization from newer autogen.sh in common.sh so that
v0.80.8 and below can be rebuilt from sources. It does not hurt to
update the submodules twice.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-12 18:46:38 +02:00
Yehuda Sadeh
d1ada2e8dd radosgw_admin: skip log objects that start with obj_delete_at_hint
when testing radosgw-admin log show --object, don't use these objects

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2015-10-12 09:13:25 -07:00
Loic Dachary
56a6e0c356 buildpackages: use _get_version instead of os_version
os_version is from the remote and will be 7.1.23 for CentOS 7
instead of the expected 7.0 for all 7.* CentOS.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-10 12:51:15 +02:00
Samuel Just
174a74105f radosbench: use pool config even if create_pool is false
Signed-off-by: Samuel Just <sjust@redhat.com>
2015-10-09 14:06:04 -07:00
Samuel Just
c0b0ec2831 divergent_priors2: give divergent time to come up
Fixes: #13322
Signed-off-by: Samuel Just <sjust@redhat.com>
2015-10-09 14:05:42 -07:00
Loic Dachary
0446645de8 Merge pull request #609 from dachary/wip-13376-buildpackages-upgrade
buildpackages: honour install priorities tags, branch, sha1
2015-10-09 16:37:40 +02:00
Loic Dachary
0045dfc8a5 buildpackages: enforce ceph-object-corpus canonical URL
Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-09 11:57:40 +02:00
Loic Dachary
4d89c9e4de buildpackages: walk the whole config tree to find sha1 to build
It is not enough to look for the first install task. In upgrade tests,
the install.upgrade task requires more packages to be built. In more
complicated tests using sequential and parallel tasks, the actual
install or install.upgrade task may be deeper in the config tree.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-09 10:05:58 +02:00
Loic Dachary
c103579a55 buildpackages: honour install priorities tags, branch, sha1
The install config may have contradicting tag/branch and sha1.  When
suite.py prepares the jobs, it always overrides the sha1 with whatever
default is provided on the command line with --distro and what is found
in the gitbuilder. If it turns out that the tag or the branch in the
install config task is about another sha1, it will override anyway.

Instead of obtaining the tag, branch and sha1 directly from the
packaging.GitbuilderProject object, compute them from the returned
uri_reference data member. The uri_reference is used by the install task
to fetch packages in the gitbuilders and this is what buildpackages
needs to build.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-09 10:03:35 +02:00
Loic Dachary
810caaf55f buildpackages: branches are found in refs/remotes/origin
Note in refs/heads which is populated for local branches only and not
after a fresh clone.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-09 10:01:34 +02:00
Loic Dachary
e54c916891 buildpackages: do not override the sha1
Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-09 10:01:05 +02:00
Loic Dachary
a630da7220 buildpackages: allow for concurrent build if != sha1
Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-08 12:16:30 +02:00
Loic Dachary
9db757b811 buildpackages: prefer GitbuilderProject.os_{type,version} over config.
The config['os_type'] and config['os_version'] are not always set for a given
job (for instance, in the rbd suite). When a suite runs, it relies on
default values, depending on the target Operating System and internal,
hard coded values associating ubuntu to 14.04 etc.

Instead of using config['os_{type,version}'] use the GitbuilderProject
equivalent which is set with the appropriate defaults.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-10-08 12:16:30 +02:00
Greg Farnum
94d54f7510 Merge remote-tracking branch 'origin/wip-vstart-runner' into infernalis
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-10-07 10:55:36 -07:00
Loic Dachary
a5f260ee92 Merge pull request #596 from ceph/wip-radosbench
tasks/radosbench: wait just a bit longer

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2015-10-05 01:57:04 +02:00
vasukulkarni
5f201625b1 Merge pull request #549 from ceph/wip-krbd-fio
Add rbd fio test to exercise block IO on all format/features
2015-10-02 11:54:26 -07:00
Vasu Kulkarni
971c5fc29c rbd io tests using fio, various fio options can be specified in yaml
test rbd or krbd using fio, can also run io on rbd clones if option is specified in yaml
various options like image-size, rbd format/features, fio io size, readwrite options can be provided in yaml.
check the docstring for exact usage.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2015-10-02 13:41:42 -04:00
John Spray
62247f2009 tasks/cephfs: fix FuseMount bin path in vstart
FuseMount only uses the prefix for finding the 'ceph'
executable, which is in ./ for either cmake or
authtools, not ./src for cmake like other binaries.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:18 +01:00
John Spray
0584b9c8dc tasks/cephfs: fix test_journal_migration
It was trying to get the output file from
a different remote than the one used to
run the journal tool.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:18 +01:00
John Spray
9eb65dd1be tasks/cephfs: move mds_scrub_checks
...into a CephFSTestCase.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:18 +01:00
John Spray
03ea0971cc tasks/cephfs: move journal migration test
...into a CephFSTestCase.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
176123caea tasks/cephfs: extend vstart_runner's ctx&run
Sufficiently to enable using workunits.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
99e3a400e1 tasks/cephfs: add --interactive for vstart runner
Just like interactive-on-error in teuthology.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
904fd76813 tasks/cephfs: fix FuseMount._asok_path
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
20e7d2af24 tasks/cephfs: remove a redundant sudo
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
4f8adf155b tasks/cephfs: raise error on non-string stdins
Shouldn't be any from the fs tests that get run

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
80a9efb73d tasks/cephfs: warn if vstart_runner can't import mods
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
06891416da tasks/cephfs: stop if needed binaries are absent
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
23ae9815a5 tasks/cephfs: add instructions to vstart_runner
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
cbb85720f3 tasks/cephfs: updates for cmake environ
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
8b846b9f2c tasks/cephfs: add vstart runner script
This is to allow running CephFSTestCase tests
against a vstart cluster, for much faster turnaround
during development than running teuthology against
built ceph packages.

Not everything will be runnable this way, but for
certain things like filesystem repair scenarios we
have everything we need within a vstart environment.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
efb62ae1e7 tasks/cephfs: mark some tests as @needs_trimming
So that we can drop these tests when not running
client as root.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
f8a4b08702 tasks/cephfs: add needs_trimming decorator
For tests to advertise that they need the client
to be able to trim its cache (i.e. currently that
means requiring run as root)

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:17 +01:00
John Spray
c0e90aa5a7 tasks/cephfs: make FuseMount.teardown safer
(don't assume fuse_daemon exists)

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
1ed6272d93 tasks/cephfs: mark some tests as @long_running
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
d4ecee82e4 tasks/cephfs: add @long_running decorator
A means for test cases to mark particular methods
as long running, so that the vstart runner can skip
them when running for developers.

This is not a scientific thing, anything that takes
more than about 2 minutes due to lots of iteration
or sleeps.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
9a1ca94edc tasks/cephfs: cluster_down before fs rm
In teuthology this isn't needed because we join the
mds child processes after killing them.  In vstart
we're killing them asynchronously, so be a bit more
careful to ensure they can't re-insert themselves
to the mdsmap between our calling fail and our calling
fs rm.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
684e0deb6f tasks/cephfs: split up TestClientRecovery
...into the part that requires a network-isolated
client and the part that doesn't.

This happens to also be the part that won't work with
vstart vs. the part that will.  teuthology yaml will
still pick up and run both parts.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
6f8a35bb42 tasks/cephfs: make memstore dependency declarative
...instead of checking for it procedurally during
TestClusterFull.setUp

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
98da1e0d84 tasks/cephfs: refine TestClientLimits.test_client_oldest_tid
* Instead of creating files in background, create
  them in foreground (simpler).
* Instead of creating max_request*2 files, just create
  max_requests plus a litle bit.
* Set max_requests to 1000 instead of 5000 to run a bit
  faster.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
37d55663f0 tasks/cephfs: fix race in TestStrays
We weren't waiting for export dir to complete (the asok
just starts the process).  This wasn't noticeable when running
remotely due to latency between the test runner and the MDS,
but it shows up when running against a local vstart cluster.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
be749f8fde tasks/cephfs: work around fuse weirdness
I am seeing a strange thing where it seems like sometimes
a ls of /sys/fs/fuse/connections is returning empty when
connections do exist.  It is pretty easy to make this
a non-issue by waiting for "more conns than we started with"
instead of "list of conns is different", so do that.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
e51cd6bee3 tasks/cephfs: prefix override in FuseMount
For asok and invokation of ceph CLI.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
7e6f8d4905 tasks/cephfs: use stdin/stdout in Filesystem._read_data_xattr
...so that it doesn't rely on a temp local file and using
run.Raw(">") in commands.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
48cf533f0a tasks/cephfs: handle endpoint not connected
Previously failure to stat mnt dir was interpreted
as being unmounted.  For "transport endpoint no connected"
error we do want to recognise that it is mounted, albeit
with no ceph-fuse process.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
120f9d882e tasks/cephfs: use Filesystem.rados instead of direct CLI
Because it's slicker and because I can wrap it for vstart.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
bab4c7aa71 tasks/cephfs: add Filesystem.exists
Use this during test setup to check whether
a filesystem is configured at all, before
trying to tear it down.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:16 +01:00
John Spray
e2f8b48d94 tasks/cephfs: add Filesystem._prefix
So that my vstart subclass can put ./ before
all the commands.

One could set $PATH, but I like to unambiguously point
it at the local built binaries in case someone also
has some systemwide packages.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:15 +01:00
John Spray
f58c119d90 tasks/cephfs: use raw_cluster_cmd instead of manual "ceph"
It's less repetitive, and I can override it for use with
vstart.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:15 +01:00
John Spray
99a8a123da tasks/cephfs: add Filesystem.delete_all
So that we can clear down filesysteems created by
someone other than ourselves (like vstart.sh)

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:15 +01:00
John Spray
ad562f4b79 tasks/cephfs: make Filesystem rados command overridable
Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:15 +01:00
John Spray
7884d5b555 tasks/cephfs: use raw_cluster_cmd in Filesystem
...instead of manually invoking 'ceph' -- so that
it's easier to override this for vstart.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-10-02 17:41:15 +01:00
Sage Weil
3e30d5dfdf tasks/radosbench: wait just a bit longer
A run failed due to thrashing.. missed by about 30s (the osd
eventually sent the last reply but we'd already timed out).

Signed-off-by: Sage Weil <sage@redhat.com>
2015-10-02 08:44:33 -04:00
Gregory Farnum
6add6ba2df Merge pull request #461 from ceph/wip-client-oldest-tid
cephfs/test_client_limits.py: update test_client_oldest_tid

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-10-01 22:37:15 -07:00
Loic Dachary
487ccb964a Merge pull request #592 from ceph/wip-13000-centos-activate
ceph-deploy: always zap disk before creating an osd
2015-10-01 08:49:20 +02:00
Josh Durgin
03b4a7957e Merge pull request #580 from dachary/wip-13250-ceph-deploy-branch
ceph-deploy: fix ceph-deploy-branch config options

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2015-09-30 19:45:30 -04:00
Sage Weil
f467a98a29 tasks/ceph_manager: %d -> %s
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-30 14:58:32 -04:00
Loic Dachary
01d48a270a ceph-deploy: always zap disk before creating an osd
The existing logic is to ceph-deploy osd create --zap-disk which will
zap the data device before preparing it. However it will not zap the
journal device (see http://tracker.ceph.com/issues/13291).

If ceph-deploy osd create fails, a fall back will zap both the data
device and the journal and try prepare again. This could work if
the device preparation and activation was synchronous and catch all
errors that could be caused by an unclean journal device. However,
the activation is asynchronous and it is entirely possible for a device
to be prepared successfully and fail to activate in the background.

The data and journal device are always zapped before calling ceph-deploy
osd create. The logic is simpler and the overhead is low.

http://tracker.ceph.com/issues/13000 Fixes: #13000

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-30 20:57:24 +02:00
Sage Weil
a53a80b9f0 tasks/ceph_manager: fix logging on failed pool property
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-30 09:24:38 -04:00
Samuel Just
4e9f1df514 rados: add test for 13234.yaml
Signed-off-by: Samuel Just <sjust@redhat.com>
2015-09-29 21:19:10 -07:00
Sage Weil
a4f9bdbe54 tasks/cephfs: mds allow
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-28 12:38:04 -04:00
Loic Dachary
56e95a0ebc Merge pull request #583 from dachary/wip-buildpackages-flock
buildpackages: flock must --close
2015-09-28 02:03:54 +02:00
Loic Dachary
05228e1e5c buildpackages: user-data must be per os-type/os-version
CentOS 6.5 needs to install a package and reboot to grow the root file
system. Instead of assuming a common user-data.txt file can fit all
Operating Systems, make one user data per os-type/os-version combination.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-27 23:42:58 +02:00
Loic Dachary
204e27fecd buildpackages: flock must --close
The process run by flock must not inherit the file descriptor because
this will cause the lock to be held forever should the command survive
the call to flock. This is for instance the case for the ssh-agent.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-26 15:23:03 +02:00
Loic Dachary
8bc89b6517 Merge pull request #581 from dachary/wip-11883-git-url
buildpackages: get Ceph git url via get_ceph_git_url()
2015-09-26 13:44:56 +02:00
Loic Dachary
78b2a9aa74 buildpackages: get Ceph git url via get_ceph_git_url()
Instead of relying on git_base_url, use the get_ceph_git_url() to obtain
the URL of the Ceph repository to use with git clone. This allows the
user to override it via the git_ceph_url configuration option and the
--git-ceph-url command line option to teuthology-openstack.

http://tracker.ceph.com/issues/11883 Refs: #11883

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-26 13:11:36 +02:00
Loic Dachary
fdcfb79b3c ceph-deploy: fix ceph-deploy-branch config options
The config paramter of download_ceph_deploy does not have a ceph-deploy
item, therefore the ceph-deploy-branch parameter is always assumed to be
master.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-26 02:16:46 +02:00
Sage Weil
ecc504fea0 tasks/divergent_priors2: wait for osd to start before tell
Otherwise we can get

2015-09-24T19:22:15.191 INFO:teuthology.orchestra.run.mira080.stderr:Error ENXIO: problem getting command descriptions from osd.1

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-24 23:34:17 -04:00
Sage Weil
0e2814d81e tasks/ceph_manager: ignore failure getting pg_num
Otherwise, we may fail while racing with a workload that deletes a pool:

2015-09-23T15:01:52.855 INFO:tasks.workunit.client.1.vpm128.stdout:[ RUN      ] LibRadosTwoPoolsPP.PromoteSnapTrimRace
2015-09-23T15:01:53.892 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .rgw pg_num'
2015-09-23T15:01:54.206 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .rgw.gc pg_num'
2015-09-23T15:01:54.462 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .users.uid pg_num'
2015-09-23T15:01:54.696 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .users.email pg_num'
2015-09-23T15:01:55.006 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .users pg_num'
2015-09-23T15:01:55.296 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .rgw.buckets.index pg_num'
2015-09-23T15:01:55.523 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .log pg_num'
2015-09-23T15:01:55.752 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .usage pg_num'
2015-09-23T15:01:56.188 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get .rgw.buckets.extra pg_num'
2015-09-23T15:01:56.625 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get test-rados-api-vpm128-17360-6 pg_num'
2015-09-23T15:01:56.928 INFO:teuthology.orchestra.run.vpm176:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd pool get test-rados-api-vpm128-17360-13 pg_num'
2015-09-23T15:01:57.193 INFO:teuthology.orchestra.run.vpm176.stderr:Error ENOENT: unrecognized pool 'test-rados-api-vpm128-17360-13'
2015-09-23T15:01:57.206 ERROR:teuthology.parallel:Exception in parallel execution
Traceback (most recent call last):
...

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-24 12:19:07 -04:00
Loic Dachary
7e3a271a89 buildpackages: make rpm packages
Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-23 21:11:26 +02:00
Gregory Farnum
e3c99476b0 Merge pull request #453 from ceph/wip-11835
tasks/cephfs: time out on ceph-fuses that don't die

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-09-21 16:09:48 -07:00
John Spray
eab70197a6 tasks/ceph: wait for MDS to be active when creating a cluster
This is the correct implementation of 685d76a77c,
merged while broken in ff1655cb57 and
reverted in 4cccde634f.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-09-21 16:05:51 -07:00
Sage Weil
caa9235857 Merge pull request #566 from ceph/wip-s3-branch-fix
Use old branches as is without 'ceph-' for firefly, hammer etc
2015-09-18 10:10:10 -04:00
Greg Farnum
4cccde634f Revert "Merge pull request #567 from ceph/ceph_fuse-timeout"
This reverts commit ff1655cb57, reversing
changes made to 2b25080d4f.

Since we haven't actually started the MDS daemons yet, this code is broken.

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-09-17 07:29:28 -07:00
Gregory Farnum
2f43e23fc6 Merge pull request #569 from dachary/wip-13031-packages
buildpackages: fix flake / pep8
2015-09-16 14:29:19 -07:00
Greg Farnum
f97fde6c26 tasks/cephfs: switch unmount timeout to 15 minutes
Our ffsb and fsync tests contain so many small writes at random offsets
that it can take >10 minutes to commit all of them to disk if we get
a slower OSD cluster. 15 minutes is still a plenty-fast timeout for
this stage compared to just hanging and losing the logs!

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-09-16 14:27:14 -07:00
Loic Dachary
e38a6d69df buildpackages: build the repository and the packages
http://tracker.ceph.com/issues/13031 Fixes: #13031

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-16 23:19:34 +02:00
Greg Farnum
685d76a77c ceph: wait for CephFS to be healthy before proceeding
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-09-16 13:53:36 -07:00
Vasu Kulkarni
08bae3b9d2 Use old branches as is without 'ceph-' for firefly, hammer etc
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2015-09-16 13:21:51 -04:00
Loic Dachary
8c618e5645 buildpackages: re-use config from install task
So that the packages built match what the install task expects.

http://tracker.ceph.com/issues/13031 Fixes: #13031

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-15 17:37:05 +02:00
Loic Dachary
d40d8ff501 buildpackages: pep8 compliance
http://tracker.ceph.com/issues/13031 Fixes: #13031

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-15 16:16:34 +02:00
Loic Dachary
0eeb441bb0 buildpackages: build ceph packages and upload them
Import install from task.

http://tracker.ceph.com/issues/13031 Fixes: #13031

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-15 16:13:09 +02:00
Loic Dachary
2b25080d4f buildpackages: build ceph packages and upload them
Build Ceph packages from source for the required revision, os_type,
os_version and architecture and upload them to the gitbuilder
repository.

http://tracker.ceph.com/issues/13031 Fixes: #13031

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-09-15 12:01:05 +02:00
Sage Weil
5af28cf746 tasks/radosbench: allow cleanup: false
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-11 16:52:41 -04:00
Zack Cerza
e9847570de Merge pull request #561 from ceph/wip-sudo
sudo ceph
2015-09-11 10:20:50 -06:00
Sage Weil
dad981d339 tasks: sudo ceph for cli
/var/run/ceph is 770.  This is mainly necessary for any
interaction with the daemon sockets, but it is what users do
and it may avoid log noise.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-11 12:15:01 -04:00
Sage Weil
9b3f36f91f ceph: add option to expect valgind errors and fail if there are none
See #10328
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-11 11:53:37 -04:00
Gregory Farnum
6ac2774823 Merge pull request #550 from ceph/wip-12806
tasks/qemu: avoid adding duplicated entries to /etc/exports

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-09-09 14:53:45 +01:00
Sage Weil
a328e3e60d tasks/ceph_manager: dump pgs when recover times out
It is really hard to map a stuck recovery back to the pgs that
are stuck.  This will make it easy.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-08 08:59:49 -04:00
Sage Weil
394dbc1890 Merge remote-tracking branch 'gh/wip-ceph-deploy' 2015-09-06 13:36:07 -04:00
Kefu Chai
ff4e2ad3dc Merge pull request #552 from ceph/wip-12878
tasks/scrub_test: sudo ls ...

Reviewed-by: Kefu Chai <kchai@redhat.com>
2015-09-06 18:26:56 +08:00
Sage Weil
6064443ad1 tasks/divergent_prior[2]: fix rados get syntax
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-05 09:25:24 -04:00
Sage Weil
7b3ea31efe tasks/reg11184.py: fix rados get syntax
no -o

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-05 09:23:54 -04:00
Sage Weil
26eb35d611 tasks/s3tests: use ceph-$foo branches
We need to be able to merge things into s3-tests master that
break rgw.  Create ceph-foo branches (ceph-master,
ceph-infernalis, etc.) and use those instead.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-04 14:42:21 -04:00
Sage Weil
d46c3a53ab tasks/scrub_test: sudo ls ...
/var/lib/ceph/osd/* is owned by ceph now; ubuntu user can't read
it.

Fixes: #12878
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-02 22:01:58 -04:00
Sage Weil
0553e50401 tasks/ceph_deploy: install ceph-test
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-02 18:45:33 -04:00
John Spray
8e275049a0 tasks/cephfs: fix mount_timeout config loading
Previously were defaulting to a string, which
always compared greater than elapsed, so never
timed out.

Fixes: #12820
Signed-off-by: John Spray <john.spray@redhat.com>
2015-09-01 11:31:23 +01:00
Sage Weil
f5865d32a8 tasks/ceph_deploy: work with systemd
This is ugly, but I'm not going to rewrite this right now.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-08-31 14:21:29 -04:00
Loic Dachary
fbaedbf9fa Merge pull request #546 from dachary/wip-11881-multipath
ceph-disk workunit
2015-08-31 17:30:05 +02:00
Yan, Zheng
7de534c1da tasks/qemu: avoid adding duplicated entries to /etc/exports
try deleting leftover entries for previous job

Fixes: #12806
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2015-08-31 22:27:38 +08:00
Sage Weil
33ad04c715 Merge remote-tracking branch 'gh/next' 2015-08-28 15:41:17 -04:00
John Spray
ffe79e9f31 tasks/cephfs: timeout on wait in test_network_death
So that we can get a failure instead of a hang in

Signed-off-by: John Spray <john.spray@redhat.com>
2015-08-28 13:57:23 +01:00
John Spray
a3f8a44d25 Merge remote-tracking branch 'origin/next' 2015-08-28 09:17:28 +01:00
Loic Dachary
d6c72c3a25 ceph-deploy: implement only_mon
If only_mon is true, do not attempt to deploy osds.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-08-27 23:23:53 +02:00
Loic Dachary
b181783166 ceph-deploy: implement keep_running
When keep_running is true, do not shutdown the cluster, leave it as it
is for other workunits or tasks to use. This effectively allows the
ceph-deploy task to be used as a helper to deploy clusters.

The call to build_ceph_cluster is simplified by giving it the whole
configuration dictionary instead of re-building one with selected arguments.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-08-27 23:20:25 +02:00
Loic Dachary
e5db657e86 ceph-deploy: ceph report on failure
When ceph-deploy fails, run ceph report to get more information about
the state of the cluster at the time of the failure.

Signed-off-by: Loic Dachary <loic@dachary.org>
2015-08-27 23:19:22 +02:00
John Spray
df8cecf479 tasks/cephfs: fix test_pool_perm teardown
Fixes: #12777
Signed-off-by: John Spray <john.spray@redhat.com>
2015-08-27 17:53:24 +01:00
Andrew Schoen
766da54dbc ceph_deploy: if a branch is given, use it when installing rgw
Fixes: http://tracker.ceph.com/issues/12758

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-08-24 10:09:24 -05:00
Yan, Zheng
753f93dde8 Merge pull request #526 from ceph/wip-stashed-layout
tasks/cephfs: implement TestDataScan.test_stashed_layout
2015-08-20 19:03:59 +08:00
Travis Rhoden
dceae70ece Merge pull request #514 from ceph/wip-cd-add-branch-install
- Use branch from config for ceph-deploy testing

Reviewed-by: Travis Rhoden <trhoden@redhat.com>
2015-08-14 11:24:08 -07:00
John Spray
c1ca95cd16 tasks/cephfs: implement TestDataScan.test_stashed_layout
This is for testing how cephfs-data-scan handles
the new 'layout' xattr when found during recovery.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-08-13 14:29:35 +01:00
Ken Dreyer
0cf220ad4f calamari_setup: install "cli" utils on Calamari node
In our RHCS 1.3 ceph-deploy docs, we tell users to run "ceph-deploy
install --cli" on their calamari admin node, but our smoke test wasn't
actually doing this.

See https://bugzilla.redhat.com/1252929 , "[Ubuntu 1.3.0] - ceph-deploy
install --no-adjust-repos --cli `hostname` is failing with a Traceback
error"
2015-08-12 14:52:37 -06:00
Dan Mick
532a4e4a8b calamari_setup: change to use ceph-deploy repo command
Fixes: #12643
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-08-11 16:14:38 -07:00
Ken Dreyer
63e6d87db4 calamari_setup: handle RHCS 1.2 (missing /mnt/MON)
In RHCS 1.2 we don't have a /mnt/MON directory. The intention of
35c6363a1e was to handle this condition,
but in 1.2, the non-zero return code makes Teuthology fail the whole
test.

We don't want *Teuthology* itself to act on the return code here; we
simply want to know what it was and structure the rest of the test
accordingly.
2015-08-06 07:58:35 -06:00
Alfredo Deza
af973259ef fix indentation error in docstring
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2015-08-06 08:41:17 -04:00
John Spray
9f530092e2 tasks: fix syntax error in ceph.py
From e195f9fa.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-08-06 10:25:03 +01:00
Andrew Schoen
fd39a7e0bf Merge pull request #520 from ceph/wip-logrotate-chcon
Set the SELinux context of the logrotate config
2015-08-05 16:30:57 -05:00
Zack Cerza
e195f9fa1c Set the SELinux context of the logrotate config
Signed-off-by: Zack Cerza <zack@redhat.com>
2015-08-05 10:08:35 -06:00
Sage Weil
c93fe1f1c6 tasks/ceph_manager: be silent about sending SIGHUPs
At the default interval this generates tons of log noise.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-08-04 13:08:04 -04:00
Zack Cerza
657dd9ecf0 Merge pull request #515 from ceph/wip-12476
thrashosds: send a random osd a signal.SIGHUP
2015-07-29 09:03:50 -06:00
Dan Mick
6726f4f1fa calamari_setup: only stop ceph on osd/mon machines
Fixes bogus failure at end of run on calamari server

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-07-28 16:12:09 -07:00
Dan Mick
30dbe93125 calamari_setup: hack for 1.3.0 on centos: install lttng from EPEL
lttng is not yet part of any private repo; since 1.3.0/CentOS is not
a supported product, just grab it from EPEL for this test.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-07-28 16:12:09 -07:00
Dan Mick
ac8b0e1abf calamari_setup: pep8 fix
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-07-28 16:12:09 -07:00
Dan Mick
17c3a94b5d calamari_setup: Add --no-adjust-repos to ceph-deploy install --<type>
Otherwise, ceph-deploy will install an apt source that points to
ceph.com, which will override the local ISO repos.

No --mon/--osd yet until 12147 is fixed

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2015-07-28 16:12:09 -07:00
Josh Durgin
0200811fc0 Merge pull request #446 from ceph/wip-krbd-unmap-suite
unmap subsuite for krbd

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2015-07-28 15:49:46 -07:00
Andrew Schoen
a3c9a763b1 ceph_manager: don't add an osd to live_osds until it's been revived
also waits to remove it from dead_osds. this fixes an issue where
do_sighup tries to send a signal to an osd that has not been revived
yet.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-07-28 17:05:31 -05:00
Andrew Schoen
b2187b79f3 thrashosds: adds a sighup_delay option, defaulted to 0.1
This will call Thrasher.do_sighup which picks a random osd and sends a
signal.SIGHUP to it, delaying for the value of sighup_delay between each
time it picks a new osd to signal.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-07-28 14:46:35 -05:00
Andrew Schoen
84d24038aa ceph_manager: adds a do_sighup method
This method runs in a separate greenlet than do_thrash and will pick a
random live osd to send a signal.SIGHUP to. There is a config option,
sighup_delay, which controls how long to delay between sending the
signals.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-07-28 14:46:12 -05:00
Andrew Schoen
ed73f67991 ceph_manager: adds a signal_osd method
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2015-07-28 14:13:30 -05:00
Vasu Kulkarni
c1116a0af8 - Use branch from config for ceph-deploy testing instead of released branch
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2015-07-28 13:38:47 -04:00
Travis Rhoden
ec1916af6f Merge pull request #509 from ceph/wip-12225
Don't pass host to mon create-initial

Reviewed-by: Travis Rhoden <trhoden@redhat.com>
2015-07-28 10:26:54 -07:00
Gregory Farnum
961701e3ed Merge pull request #512 from ceph/wip-pool-perm-fixes
Fixes to test_pool_perm

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2015-07-28 14:45:44 +01:00
John Spray
a13064bf21 Merge pull request #489 from dachary/wip-12320-ceph-fuse
tasks/ceph_fuse.py: virtual machines need flexible mount timeout

Reviewed-by: John Spray <john.spray@redhat.com>
2015-07-28 11:20:36 +01:00
John Spray
e33065f3c9 tasks/cephfs: reset auth caps in setUp
Signed-off-by: John Spray <john.spray@redhat.com>
2015-07-28 10:34:58 +01:00
John Spray
6864f882bc tasks/cephfs: don't hardcode data pool name in test_pool_perm
Signed-off-by: John Spray <john.spray@redhat.com>
2015-07-28 10:34:58 +01:00
John Spray
e3434b6d2f tasks/cephfs: test_pool_perm: reinstate proper caps on teardown
Signed-off-by: John Spray <john.spray@redhat.com>
2015-07-28 10:34:58 +01:00
John Spray
a0ccd69041 tasks/cephfs: don't hardcode client ID in test_pool_perm
Our teuthology yamls all happen to use "client.0" but
it's not guaranteed.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-07-28 10:34:58 +01:00