644 Commits

Author SHA1 Message Date
Sage Weil
015df934af mon/OSDMonitor: require force argument to split a cache pool
There are several perils when splitting a cache pool:

 - split invalidates pg stats, which disables the agent
 - a scrub must be manually triggered post-split to rebuild stats
 - the pool may fill the OSDs during that period.
 - or, the pool may end up beyond the 'full' mark, and once scrub does
   complete and the agent activates we may block IO for a long time while
   we catch up with flush/evict

Make it a bit harder for users to shoot themselves in the foot.

Fixes: #8043
Signed-off-by: Sage Weil <sage@inktank.com>
2014-04-15 13:57:21 -07:00
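The guard described in 015df934af can be pictured roughly as follows. This is a minimal Python sketch, not the OSDMonitor code: `check_pg_num_change`, its parameters, and the error type are hypothetical names chosen only for illustration.

```python
def check_pg_num_change(is_cache_pool, old_pg_num, new_pg_num, force=False):
    """Reject a split (pg_num increase) of a cache pool unless forced.

    All names here are hypothetical; the real check lives in the monitor.
    """
    is_split = new_pg_num > old_pg_num
    if is_split and is_cache_pool and not force:
        # Splitting invalidates pg stats and disables the agent until a
        # scrub rebuilds them, so make the caller opt in explicitly.
        raise PermissionError(
            "splitting a cache pool invalidates pg stats; "
            "pass force=True to proceed anyway"
        )
    return new_pg_num
```

Non-cache pools and forced requests pass through unchanged; only an unforced split of a cache pool is refused.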
Joao Eduardo Luis
e213ee1573 qa: workunits: mon: auth_caps.sh: test 'auth' caps requirements
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2014-04-07 18:30:56 +01:00
Ilya Dryomov
d323634024 qa: test_alloc_hint: set ec ruleset-failure-domain to osd
Create a custom profile with ruleset-failure-domain=osd.  (The default
ruleset-failure-domain=host won't do because this script assumes and
works only if all osds are on the same host.)  While at it, set k and m
explicitly to avoid troubles in the future.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-04-03 21:16:14 +04:00
Sage Weil
e44f85ce18 qa/workunits/cephtool/test.sh: test 'osd pg-temp ...'
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-31 14:12:51 -07:00
Sage Weil
8c761c4797 mon: make 'ceph osd erasure-code-profile set ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-31 10:01:43 -07:00
Sage Weil
7d321d8aa6 qa/workunits/rados/test_alloc_hint: fix erasure syntax
This changed recently.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-31 09:14:36 -07:00
Sage Weil
4f9f7f8789 qa/workunits/fs/misc/layout_vxattrs: ceph.file.layout is not listed
As of 08a3d6bd42.  A similar change was made
in the kernel.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-29 14:23:21 -07:00
Sage Weil
7cb1d3a43d qa/workunits/mon/pool_ops.sh: fix test
The pool create command doesn't take k/v pairs any more.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-27 12:57:40 -07:00
Sage Weil
233801c622 qa/workunits/mon/pool_ops.sh: use expect_false
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-27 12:56:44 -07:00
Warren Usui
2779e2ab03 Make sure s3_utilities are found.
Add FindBin code to rgw/s3 tests to include s3_utilities.pm properly.

Fixes: 7863
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-03-25 16:30:03 -07:00
David Zafman
01b99668ab qa: Add ceph_multi_stress_watch for rep and ec
Fixes: #7438

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-03-24 14:04:48 -07:00
Sage Weil
c9b4e7d9be Merge pull request #1465 from ceph/wip-s3pm-wusui
Remove code duplication from s3 tests.
2014-03-20 16:05:48 -07:00
Sage Weil
513e4ad8ec Merge remote-tracking branch 'gh/firefly'
2014-03-20 10:42:26 -07:00
Ilya Dryomov
a4849fb914 qa: test_alloc_hint: flush journal before prodding the FS
OSDs that for some reason get behind on processing their op queue break
expect_alloc_hint_eq(), as it pokes the FS and not the journal.  Fix it
by flushing the journal before proceeding with anything else.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-03-19 10:59:27 +02:00
Loic Dachary
b273011874 osd,mon: use profile instead of properties
The qa and functional tests are adapted to the new command prototype
requiring a profile instead of a list of properties. When possible the
implicit ruleset creation is used to simplify the test setup.

Signed-off-by: Loic Dachary <loic@dachary.org>
2014-03-18 00:23:42 +01:00
Warren Usui
b2fcc6e195 Remove code duplication from s3 tests.
A new module, s3_utilities.pm has been created.  It contains
subroutines common to at least two of the workunits in this
directory. Code was moved here from the other pl files, and
some minor changes (parameters and scope changes) were needed.

Fixes: 7472
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-03-14 21:52:37 -07:00
Sage Weil
5873b0f03c Merge remote-tracking branch 'gh/wip-da-SCA-firefly-20140304'
Reviewed-by: Sage Weil <sage@inktank.com>
2014-03-14 11:42:30 -07:00
Sage Weil
5676d6e5be Merge remote-tracking branch 'gh/firefly'
2014-03-14 08:30:04 -07:00
Sage Weil
c5b557ee88 qa/workunits: misc -> fs/misc
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-13 17:38:08 -07:00
Sage Weil
b70db7e735 Merge remote-tracking branch 'gh/firefly'
2014-03-11 19:20:27 -07:00
Sage Weil
d72c2bbccb Merge pull request #1400 from ceph/wip-hint-tests
wip-hint-tests

Reviewed-by: Sage Weil <sage@inktank.com>
2014-03-11 18:09:54 -07:00
Sage Weil
2fbd77206c qa/workunits/cephtool/test.sh: fix thrash (more)
If I have to touch this again I will remove it.  Ugh.  This time,

ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-11_02:30:01-rados-firefly-distro-basic-plana/125922

hit NXIO a few lines down because one of the OSDs was still down.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-11 17:03:23 -07:00
Warren Usui
1249b0bd76 Use pgrep radosgw to determine if rados gateway is running.
Fixes: 7528
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 5b88856cd2)
2014-03-10 20:40:52 -07:00
Warren Usui
745f72c5c2 Fixed get_status() to find client.radosgw fields inside of ps output.
Fixes: 7375
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 8020dcf779)
2014-03-10 20:40:41 -07:00
Warren Usui
880bc3a536 Fix get_status() to find client.rados text inside of ps command results.
Added port (fixed value for right now in teuthology) to hostname.
Fixes: 7374
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 8200b8a025)
2014-03-10 20:40:07 -07:00
Ilya Dryomov
55c23a1a82 qa: add script for testing rados allocation hint
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-03-09 19:28:22 +02:00
Sage Weil
db0c37829c Merge remote-tracking branch 'gh/wip-7210' into firefly
Reviewed-by: Sage Weil <sage@inktank.com>
2014-03-07 15:23:31 -08:00
Sage Weil
1c8c61897d qa/workunits/cephtool/test.sh: fix 'osd thrash' test
- fix the wait check for osds to come back up
- make sure they get marked back in, too

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2014-03-07 15:21:38 -08:00
Joao Eduardo Luis
38fd666ac6 qa: workunits/mon/rbd_snaps_ops.sh: ENOTSUP on snap rm from copied pool
'rados cppool' copies the contents but that doesn't make the destination
pool an unmanaged snaps pool.  Therefore, we must get an ENOTSUP when
we try to remove an unmanaged snap from a not-unmanaged pool.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2014-03-07 19:49:56 +00:00
Sage Weil
0fd636c488 Merge remote-tracking branch 'gh/firefly'
Conflicts:
	README
2014-03-07 08:19:20 -08:00
Sage Weil
b436930779 qa/workunits/rest/test.py: do not test 'osd thrash'
This wreaks havoc on our QA because it marks osds up and down and then
immediately after that we try to scrub and some osds are still down.

Adjust the CLI test to wait for all OSDs to come back up after thrashing.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-06 13:46:10 -08:00
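The "wait for all OSDs to come back up" step mentioned in b436930779 amounts to polling a condition with a timeout. The real test is a shell workunit; the Python sketch below only illustrates the pattern, and `wait_until` and the predicate passed to it are hypothetical names, not part of the Ceph test suite.

```python
import time


def wait_until(predicate, timeout=60.0, interval=1.0,
               sleep=time.sleep, clock=time.monotonic):
    """Poll predicate() until it returns True or the timeout expires.

    Returns True if the condition was met, False on timeout.
    `sleep` and `clock` are injectable so the loop can be tested quickly.
    """
    deadline = clock() + timeout
    while True:
        if predicate():
            return True
        if clock() >= deadline:
            return False
        sleep(interval)
```

A caller would pass a predicate that checks cluster state (for example, a hypothetical `all_osds_up` function) and only proceed to scrubbing once `wait_until` returns True.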
Gregory Farnum
9d387d38de Merge pull request #1378 from ceph/wip-7487
mon: no crush buckets with type 0 (#7487)

Reviewed-by: Greg Farnum <greg@inktank.com>
2014-03-05 20:58:40 -08:00
Gregory Farnum
b642e3884c Merge pull request #1343 from ceph/wip-cache-warn-full
mon: warn when cache tier is full

Reviewed-by: Loic Dachary <loic@dachary.org>
Reviewed-by: Greg Farnum <greg@inktank.com>
2014-03-05 14:22:34 -08:00
Sage Weil
b093958a11 Merge pull request #1351 from ceph/wip-7248
osd: OSD: limit the value of 'size' and 'count' on 'osd bench'

Reviewed-by: Sage Weil <sage@inktank.com>
2014-03-05 14:18:28 -08:00
Sage Weil
772968e60b mon/OSDMonitor: disallow crush buckets of type 0
Prevent creation of buckets of type 0 ('osd', 'device', etc.), as they
will confuse the mapping algorithm.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-05 13:15:58 -08:00
Sage Weil
ca12e0d92e qa/workunits/mon/crush_ops: use expect_false
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-05 12:52:08 -08:00
Sage Weil
0592368070 mon: warn when pool nears target max objects/bytes
The cache pools will throttle when they reach the target max size, so it
is important to make the administrator aware when they approach that point.
Unfortunately it is not particularly easy to efficiently keep track of
which PGs have hit their limit and use that for reporting.  However, it
is easy to raise a flag when we start to approach the target for the
entire pool, and that sort of early warning is arguably more useful
anyway.

Trigger the warning based on the target full ratio.  Not when we hit the
target, but when we are 2/3 between it and completely full.

Implements: #7442
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-05 11:59:07 -08:00
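One reading of the trigger rule in 0592368070 ("2/3 between it and completely full") is a threshold two thirds of the way from the target full ratio to 1. The Python sketch below works through that arithmetic. It is an interpretation of the commit message, not the monitor's code: the function names are hypothetical, and using `>=` for the comparison and expressing usage as a single fill ratio are assumptions.

```python
from fractions import Fraction


def warn_threshold(target_full_ratio):
    """Fill ratio two thirds of the way from the target to completely full."""
    target = Fraction(target_full_ratio)
    if not 0 <= target <= 1:
        raise ValueError("target_full_ratio must be between 0 and 1")
    return target + (1 - target) * Fraction(2, 3)


def should_warn(fill_ratio, target_full_ratio):
    """True once the pool's fill ratio reaches the warning threshold (assumed >=)."""
    return Fraction(fill_ratio) >= warn_threshold(target_full_ratio)
```

With a target full ratio of 0.8, `warn_threshold("0.8")` is 14/15 (about 0.933), so a pool at 0.9 of its target max would not yet warn, while one at 0.95 would.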
Danny Al-Gaaf
3a786caddc test_o_trunc.c: fix indentation to kernel style (tabs, 8 spaces)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2014-03-04 15:43:18 +01:00
Danny Al-Gaaf
f83690949f test_o_trunc.c: reduce scope of some variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2014-03-04 15:43:18 +01:00
Danny Al-Gaaf
c2f3f2b59e close file descriptor in error case
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2014-03-04 15:43:18 +01:00
Sage Weil
c029c2fbf1 mon/OSDMonitor: add 'osd tier add-cache <pool> <size>' command
This is a friendlier interface for setting up a cache tier with some
reasonable defaults (defined via config options).  This will simplify
the user experience and documentation.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-03 21:16:24 -08:00
Sage Weil
08efb45889 OSDMonitor: do not add non-empty tier pool unless forced
In general, users should not use non-empty pools as new tiers or else
things can behave strangely:

 - the data sets are unrelated, so behavior will be... strange.
 - if the cache pool is not "new" and does not do the OMAP flag, the OSD
   will not know not to flush omap objects to an EC base tier
 - probably other random stuff I'm forgetting

Allow a user to shoot themselves in the foot with --force-nonempty.

Implements: #7457
Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-03 21:11:17 -08:00
Joao Eduardo Luis
aca6ac343e qa: workunits: cephtool: test 'osd bench' limits
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2014-03-03 15:28:04 +00:00
Sage Weil
f3976c1653 Merge pull request #1334 from kazhang/wip-get-hit_set-parameter
OSDMonitor: enable getting hit set parameters

Reviewed-by: Sage Weil <sage@inktank.com>
2014-03-01 01:21:37 -08:00
Kai Zhang
dc20c7d674 OSDMonitor: enable getting hit set parameters
We would like to get the hit set parameters: hit_set_type |
hit_set_period | hit_set_count | hit_set_fpp via OSDMonitor

Signed-off-by: Kai Zhang <zakir.exe@gmail.com>
2014-03-01 00:22:14 -08:00
Warren Usui
5b88856cd2 Use pgrep radosgw to determine if rados gateway is running.
Fixes: 7528
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-28 21:43:31 -08:00
Yehuda Sadeh
20641b6761 Merge pull request #1280 from ceph/wip-7374-wusui
Fix sub check() to not exit if rgw is running.

Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
2014-02-25 18:00:37 -08:00
Warren Usui
8020dcf779 Fixed get_status() to find client.radosgw fields inside of ps output.
Fixes: 7375
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-25 16:52:28 -08:00
Warren Usui
8200b8a025 Fix get_status() to find client.rados text inside of ps command results.
Added port (fixed value for right now in teuthology) to hostname.
Fixes: 7374
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2014-02-25 16:49:39 -08:00
Loic Dachary
1340b360b5 erasure-code: test that changing the pool size is not allowed
http://tracker.ceph.com/issues/7360 fixes #7360

Signed-off-by: Loic Dachary <loic@dachary.org>
2014-02-23 07:52:59 -08:00