Commit Graph

37746 Commits

Author SHA1 Message Date
Sage Weil
2667eed93c Merge pull request #3158 from athanatos/wip-10259
osd_types: op_queue_age_hist and fs_perf_stat should be in osd_stat_t::o...

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-11 13:16:13 -08:00
Samuel Just
dce6f288ad ReplicatedPG::scan_range: an object can disappear between the list and the attr get
The first item in the range is often last_backfill, upon which writes
can be occuring.  It's trimmed off on the primary side anyway.

Fixes: 10150
Backport: dumpling, firefly, giant
Signed-off-by: Samuel Just <sjust@redhat.com>
2014-12-11 13:05:54 -08:00
Samuel Just
449d7d47e4 Merge pull request #2579 from yuandong1222/cache-hboject-filestore
Cache hobject filestore_key to avoid massive _reverse_nibbles calls

Reviewed-by: Samuel Just <sjust@redhat.com>
2014-12-11 13:02:22 -08:00
Samuel Just
b03fc61466 Merge pull request #3018 from tchaikov/wip-fix-safe_read_file
safe_io: do not set ending \0 in safe_read_file()

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-11 13:00:45 -08:00
Samuel Just
81af475318 Merge pull request #3070 from ceph/wip-10080
SimpleMessenger: allow RESETSESSION whenever we forget an endpoint

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-11 12:59:36 -08:00
Noah Watkins
61102203b8 debian: enable libgoogle-perftools-dev on arm64
These binaries haven't landed in Ubuntu, but they are in
sid and jessie for arm64. On Saucy I've installed them by
hand from ports.ubuntu.com and things seems pretty swell.

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2014-12-11 12:03:30 -07:00
Gregory Farnum
a09a87ad14 Merge pull request #3150 from ceph/wip-10288
mon: fix `fs ls` on peons

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2014-12-11 10:47:11 -08:00
Sage Weil
2246dca05f common/blkdev: fix block device discard check
- fix base name calculation (do not assume sda)
- reverse sense of check (it was returning false when true before?)
- add a generic helper to get other properties, too

Fixes: #10296
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 10:18:56 -08:00
Sage Weil
25e37837c3 common/blkdev: get_block_device_base
And a test.  Ideally the test would get run on a machine with weirdo
devices like /dev/cciss/c0d1.

This mirrors the logic in ceph-disk, which has been tested.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 10:18:56 -08:00
Loic Dachary
c64b3c5371 Merge pull request #3152 from dachary/wip-10296-journal-discard
os: do not attempt to detect discard by default

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-11 19:15:05 +01:00
Joao Eduardo Luis
beaa04e411 mon: MonitorDBStore: allow randomly injecting random delays on writes
Adds two new config options:

mon_inject_transaction_delay_probability : DOUBLE (0.0-1.0, default: 0.0)
mon_inject_transaction_delay_max : DOUBLE (seconds, default: 10.0)

If probability is set to a value greater than 0, just before applying
the transaction, the store will decide whether to inject a delay,
randomly choosing a value between 0 and the max.

Signed-off-by: Joao Eduardo Luis <joao@redhat.com>
2014-12-11 17:49:20 +00:00
Loic Dachary
2ad55cd601 tests: do not ignore ceph-disk.sh::test_activate_dev failure
The status of the test commands must be tested instead of being
silently ignored.

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-12-11 18:35:13 +01:00
Loic Dachary
9c5fd8c20a os: do not attempt to detect discard by default
If the journal_discard option is false, the journal should no try to
guess if the device has discard capabilities.

http://tracker.ceph.com/issues/10296 Refs: #10296

Signed-off-by: Loic Dachary <ldachary@redhat.com>
2014-12-11 18:35:06 +01:00
Sage Weil
279c199928 ceph_test_rados_api_watch_notify: fix a zillion warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 06:57:43 -08:00
Sage Weil
531e8bb722 ceph_test_rados_api_watch_notify: use GT macro
so we can see the value

Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 06:57:43 -08:00
Sage Weil
008d788f9b ceph_test_rados_api_watch_notify: print err to debug
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 06:57:43 -08:00
Sage Weil
c1dd92bda7 osdc/Objecter: normalize watch error (ENOENT on delete)
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-11 06:26:06 -08:00
John Spray
5559e6aea9 mon: fix fs ls on peons
This was incorrectly using pending_mdsmap instead
of mdsmap.  We didn't notice in test because of
single-mon configurations.

Fixes: #10288
Backport: giant

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-11 14:00:57 +00:00
Loic Dachary
44a4f9578f Merge pull request #3149 from ceph/wip-typo
mds: fix log typo

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-11 14:25:50 +01:00
John Spray
b025f56e72 mds: fix log typo
s/trvserse/traverse/

Signed-off-by: John Spray <john.spray@redhat.com>
2014-12-11 12:40:08 +00:00
Sage Weil
1dc064a8d4 Merge pull request #3137 from thesues/fix-strtol
Fix libstriprados::remove, use strtoll insdead of strtol

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-10 21:10:57 -08:00
Sage Weil
a8f85dccbb mon: allow full flag to be manually cleared
This is like a temporary measure as the mon will try to set them again,
but we have run into cases where the mon was misbehaving (failing to clear
the flag) and we wanted to do it.  Note that the mon will likely set it
again on the next tick() anyway.

If we're going to clear it, we may as well be able to set it, too (again,
even if the mon is going to clear it soon).  If nothing else this is useful
for writing tests.

Fixes: #9323
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 21:00:26 -08:00
Gregory Farnum
a1eb443ee5 Merge pull request #3038 from ceph/wip-mds-readonly
Wip mds readonly

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2014-12-10 20:39:37 -08:00
Sage Weil
69c1f4869d Merge pull request #3145 from XinzeChi/wip-finisher
Finisher: call cond signal if necessary in function queue

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-10 20:34:34 -08:00
Xinze Chi
2e5ee3080b Finisher: call signal if necessary in function queue
Signed-off-by: Xinze Chi <xmdxcxz@gmail.com>
2014-12-11 04:33:34 +00:00
Noah Watkins
d92e2cafee lttng: add int type definitions
The normal path through #include <lttng/tracepoint.h> doesn't
appear to include int defintions like uint64_t that are used
in Ceph so we add our definitions file.

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2014-12-10 18:48:43 -07:00
Noah Watkins
060cbaacef cycles: add reader for i386 and aarch64
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2014-12-10 18:47:02 -07:00
Loic Dachary
d3f2ec34d2 Merge pull request #3119 from ceph/wip-8797
rados.py: remove Rados.__del__(); it just causes problems

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-11 00:21:46 +01:00
Dan Mick
b038e8fbf9 Call Rados.shutdown() explicitly before exit
This is mostly a demonstration of good behavior, as the resources will
be reclaimed on exit anyway.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2014-12-10 15:09:32 -08:00
Dan Mick
5ba9b8f21f rados.py: remove Rados.__del__(); it just causes problems
Recent versions of Python contain a change to thread shutdown that
causes ceph to hang on exit; see http://bugs.python.org/issue21963.
As it turns out, this is relatively easy to avoid by not spawning
threads on exit, as Rados.__del__() will certainly do by calling
shutdown(); I suspect, but haven't proven, that the problem is
that shutdown() tries to start() a threading.Thread() that never
makes it all the way back to signal start().

Also add a PendingReleaseNote and extra doc comments to clarify.

Fixes: #8797
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2014-12-10 13:19:16 -08:00
Gregory Farnum
616c46e521 Merge pull request #3129 from ceph/wip-gfcomplete-dirty
gf-complete: update submodule to newest upstream

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-10 13:14:05 -08:00
Gregory Farnum
24ca9f1c25 Merge pull request #3071 from ceph/wip-10164
mds: drop dirty dentries in deleted directory

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2014-12-10 11:04:04 -08:00
VRan Liu
8b195eccad check endpoints is not empty before use it.
Signed-off-by: VRan Liu <gliuwr@gmail.com>
(cherry picked from commit 9d5e4edc15)
2014-12-10 10:46:10 -08:00
Sage Weil
320c25607c osdc/Objecter: only issue one error per watch; do it through one path
Send all errors through C_DoWatchError for simplicity.

Only deliver one error per watch.  This simplifies callers logic and avoids
a very rare case that they need to deal with.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 10:05:57 -08:00
Sage Weil
1e32e3bd82 ceph_test_rados_api_watch_notify: wait longer for osd to time us out
If cluster is thrashing this can fail.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 10:01:40 -08:00
Sage Weil
78eed527e7 ceph_test_rados_api_watch_notify: weak assert cookie is valid (it's a ptr)
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 10:01:40 -08:00
Sage Weil
6f43c6c53a osdc/Objecter: pass correct cookie value to error
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-10 09:44:01 -08:00
Sage Weil
f02d2d721a Merge pull request #3135 from XinzeChi/wip-3
os/FileStore.cc: insert not empty list<Context*> to op_finisher/ondisk_finisher

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-10 08:39:32 -08:00
Xinze Chi
b34e54521c os/FileStore.cc: insert not empty list<Context*> to op_finisher/ondisk_finisher.
Signed-off-by: Xinze Chi <xmdxcxz@gmail.com>
2014-12-10 17:00:43 +00:00
Sage Weil
d56eccdc7f Merge pull request #3140 from leseb/sample-ceph.conf
ceph.conf: update sample

Reviewed-by: Sage Weil <sage@redhat.com>
2014-12-10 06:43:07 -08:00
Sébastien Han
7ab4a3931c ceph.conf: update sample
This commit adds:

* client admin socket
* change some default
* re-indent

Signed-off-by: Sébastien Han <sebastien.han@enovance.com>
2014-12-10 15:32:17 +01:00
Jason Dillaman
efd9d8df1e tests: Minor cleanup to librbd test
The tests can now be repeated to increase the chance of hitting
edge condition failures.  Also added additional logic to immediately
fail IO tests when an error is encountered.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2014-12-10 08:56:59 -05:00
Dongmao Zhang
78a15ee4c6 Fix libstriprados::remove, use strtoll insdead of strtol
Signed-off-by: Dongmao Zhang <deanraccoon@gmail.com>
2014-12-10 18:55:55 +08:00
Loic Dachary
a3912d2308 Merge pull request #3120 from ceph/wip-10010
ceph-osd: don't close stderr if not daemonizing

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-10 11:12:45 +01:00
Yan, Zheng
6b51a9f509 mds: set dirfrag version when fetching dirfrag is skipped
Signed-off-by: Yan, Zheng <zyan@redhat.com>
2014-12-10 11:32:43 +08:00
Dan Mick
17c72f591f ceph-osd: remove extra close of stderr
Otherwise, one loses log messages when running with -f or -d.  When
daemonizing, stderr is already closed in global_init_postfork_finish.

Fixes: #10010, #10113, #9810

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2014-12-09 19:28:49 -08:00
Greg Farnum
606b004d94 gf-complete: update submodule to newest upstream
Keep the tree cleaner!

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2014-12-09 15:28:47 -08:00
David Zafman
bf0d8d3aa9 osd: Remove dead code related to old pg removal mechanism
Signed-off-by: David Zafman <dzafman@redhat.com>
2014-12-09 11:17:42 -08:00
Loic Dachary
cdb110539f Merge pull request #3073 from cholcombe973/get_pool_values_docs
Updating docs for Get Pool Value 

Reviewed-by: Loic Dachary <ldachary@redhat.com>
2014-12-09 11:41:17 +01:00
Sage Weil
9902383c69 crush/CrushWrapper: fix create_or_move_item when name exists but item does not
We were using item_exists(), which simply checks if we have a name defined
for the item.  Instead, use _search_item_exists(), which looks for an
instance of the item somewhere in the hierarchy.  This matches what
get_item_weightf() is doing, which ensures we get a non-negative weight
that converts properly to floating point.

Backport: giant, firefly
Fixes: #9998
Reported-by: Pawel Sadowski <ceph@sadziu.pl>
Signed-off-by: Sage Weil <sage@redhat.com>
2014-12-08 16:21:24 -08:00