Commit Graph

35041 Commits

Author SHA1 Message Date
Gregory Farnum
d9cf299134 Merge pull request #2279 from ceph/wip-hadoop
fix and reorg hadoop workunits

Reviewed-by: Greg Farnum <greg@inktank.com>
2014-08-19 11:47:07 -07:00
Sage Weil
c36b72c1d1 Merge pull request #2199 from ceph/wip-reweight
mon: allow reweighting of osds by pg (isntead of bytes used)

Reviewed-by: Guang Yang <yguang@yahoo-inc.com>
2014-08-19 10:40:42 -07:00
Sage Weil
33048410c8 mon/OSDMonitor: respect CRUSH weights for reweight-by-pg
Do not assume that all OSDs are weighted equally for reweight-by-pg.

Note that reweight-by-utilization already reweights based on the size of
the OSD volume; we presume that this is already reflected by the CRUSH
weights.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-19 08:16:56 -07:00
Sage Weil
1ecf44eb57 mon/OSDMonitor: reweight-by-pg for pool(s)
Allow the reweight-by-pg to look at a specific set of pools.  If the list
is ommitted, use PGs from all pools.  This allows you to focus on a
specific pool (the one that will dominate data usage).  Otherwise things
may not be quite right because other pools may have PGs that contain
much less data.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-19 08:16:55 -07:00
Sage Weil
8b971e94d4 mon/OSDMonitor: adjust weights up, when possible
Note when OSDs are underloaded, as well.  If that is the case, adjust the
OSD reweight value if, if possible.  (It won't always be possible since
weights are capped at 1.)

Note that we set the underload threshold to the average, as we want to
aggressively adjust weights up (back to 1.0) whenever possible.  This gets
us a more efficient mapping calculation and reduces the amount of "noise"
in the weights.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-19 08:16:41 -07:00
Sage Weil
977f85279f qa/workunits/cephtool/test.sh: test reweight-by-pg
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-19 08:16:41 -07:00
Sage Weil
01cb40547c mon/OSDMonitor: reweight-by-pg
This is just like reweight-by-utilization, but looks purely at the PG to
OSD mapping, not at the number of bytes used on the target disks.  This
allows the reweighting to be done before any data is written into the
cluster, when no data will need to migrate as a result of the reweight.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-19 08:16:39 -07:00
Sage Weil
92b227e1c0 Merge remote-tracking branch 'gh/next' 2014-08-18 21:10:32 -07:00
John Wilkins
ab886c4a0b doc: Removed quick guide and wireshark from top-level IA.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2014-08-18 14:29:09 -07:00
John Wilkins
acee2e5833 doc: Move wireshark documentation to dev.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2014-08-18 14:28:38 -07:00
Sage Weil
ce6e9a916b doc/release-notes: v0.84
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-18 11:57:59 -07:00
Sage Weil
a59bc86594 Merge pull request #2280 from ceph/wip-fs-docs
doc: add notes on using "ceph fs new"

Reviewed-by: Sage Weil <sage@redhat.com>
2014-08-18 10:04:41 -07:00
john
b016f84682 doc: add notes on using "ceph fs new"
Signed-off-by: John Spray <john.spray@redhat.com>
2014-08-18 17:47:31 +01:00
Jenkins
8336f81c5c 0.84 2014-08-18 09:02:20 -07:00
Sage Weil
bda230186f qa/workunits/rbd/qemu-iotests: touch common.env
This seems to be necessary on trusty.

Backport: firefly, dumpling
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 055be68cf8)
2014-08-18 08:47:36 -07:00
Sage Weil
1dc1fb8a60 qa/workunits/hadoop: move all hadoop tests into a hadoop/ dir
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-18 08:39:14 -07:00
Sage Weil
3d3fcc98be qa/workunits/hadoop-wordcount: fix/use -rmr command
-rm -r -f ... doesn't seem to work; use -rmr instead.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-18 08:38:10 -07:00
Sage Weil
adaf5a6a88 qa/workunits/hadoop-wordcount: use -x
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-18 08:37:38 -07:00
Sage Weil
055be68cf8 qa/workunits/rbd/qemu-iotests: touch common.env
This seems to be necessary on trusty.

Backport: firefly, dumpling
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-17 20:54:28 -07:00
Sage Weil
313e60b360 Merge pull request #2010 from ceph/wip-misplaced
osd: track misplaced objects separately from degraded objects

Reviewed-by: Samuel Just <sam.just@inktank.com>
2014-08-17 20:49:05 -07:00
Sage Weil
5045c5cb4c qa/workunits/rest/test.py: use rbd instead of data pool for size tests
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 22:07:56 -07:00
Sage Weil
3279f3e737 qa/workunits/rest/test.py: do snap test on our data2/3 pool
This way it works when a 'data' pool doesn't already exist.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 22:07:56 -07:00
Sage Weil
6d7a229c14 qa/workunits/rest/test.py: fix rd_kb -> rd_bytes
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 22:07:56 -07:00
Sage Weil
284647f350 Merge pull request #2272 from ceph/wip-8621
Wip 8621

Reviewed-by: Sage Weil <sage@redhat.com>
2014-08-16 22:04:13 -07:00
Sage Weil
0e07f7f045 osd: fix theoretical use-after-free of OSDMap
In practice, the map will remain pinned for a while, but this
will make coverity happy.

*** CID 1231685:  Use after free  (USE_AFTER_FREE)
/osd/OSD.cc: 6223 in OSD::handle_osd_map(MOSDMap *)()
6217
6218           if (o->test_flag(CEPH_OSDMAP_FULL))
6219            last_marked_full = e;
6220           pinned_maps.push_back(add_map(o));
6221
6222           bufferlist fbl;
>>>     CID 1231685:  Use after free  (USE_AFTER_FREE)
>>>     Calling "encode" dereferences freed pointer "o".
6223           o->encode(fbl);
6224
6225           hobject_t fulloid = get_osdmap_pobject_name(e);
6226           t.write(coll_t::META_COLL, fulloid, 0, fbl.length(), fbl);
6227           pin_map_bl(e, fbl);
6228           continue;

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 14:51:31 -07:00
Sage Weil
44a0e3766a Merge pull request #2259 from ceph/wip-9039
Wip 9039

Reviewed-by: Sage Weil <sage@redhat.com>
2014-08-16 13:41:41 -07:00
Sage Weil
904a5f1c31 vstart.sh: make filestore fd cache size smaller
I hit the fd limit on a vstart cluster with the default 128; reduce this
to 16.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:46 -07:00
Sage Weil
932e478783 mon: track stuck undersized
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:46 -07:00
Sage Weil
190dc2f38f mon: track pgs that get stuck degraded
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:46 -07:00
Sage Weil
5168907fe2 osd: track last_fullsized in pg_stat_t
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:46 -07:00
Sage Weil
dbc3f65046 osd: track last_undegraded pg stat
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:46 -07:00
Sage Weil
1907066fee osd/osd_types: add last_undegraded, last_undersized to pg_stat_t
Keep track of the last time the PG was known to not be degraded or
undersized.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:19:45 -07:00
Sage Weil
6d6767d34c osd/PG: track PG_STATE_UNDERSIZED separately from DEGRADED
DEGRADED means there are objects without complete reduncancy; also check
for needs_recovery().

UNDERSIZED means acting set is too small.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:18:54 -07:00
Sage Weil
b037e47a36 osd: add PG_STATE_UNDERSIZED
This is a distinct concept from degraded.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:18:54 -07:00
Sage Weil
6c0a213436 osd/PG: account for misplaces separately than degraded
A degraded object does not have enough replicas or shards, while a
misplaced object is not stored in the correct place.  Account for them
separately.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:18:54 -07:00
Sage Weil
988d7419e7 librados: approximate legacy 'degraded' value
The librados API returns a degraded count and no misplaced count.  Sum them
to approximate the old behavior.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:18:53 -07:00
Sage Weil
d734d7fcac mon: warn about misplaced objects, just like degraded
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:18:53 -07:00
Sage Weil
a3149994e8 osd: num_objects_misplaced
Signed-off-by: Sage Weil <sage@inktank.com>
2014-08-16 13:18:53 -07:00
Sage Weil
34fe7a8214 Merge pull request #2217 from ceph/wip-problem-osds
mon: 'ceph osd blocked-by' for histogram of peers OSDs are waiting for

Reviewed-by: Samuel Just <sam.just@inktank.com>
2014-08-16 13:15:10 -07:00
Sage Weil
14614e013f qa/workunits/rest/test.py: fix 'df' test to use total_used_bytes
This changed back in ee2dbdb0f5

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 13:06:02 -07:00
Sage Weil
93c5b25be8 Revert "os/FileJournal: Update the journal header when closing journal"
This reverts commit 4eb18dd487.

This may be responsible for #9073.  Until that is resolved, revert.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-16 12:56:39 -07:00
Sage Weil
ee9e1eadab Merge pull request #2271 from ceph/wip-9053
paxos: fix problem with disjoint quorum members

Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
2014-08-16 09:18:19 -07:00
Yehuda Sadeh
c83c90ca07 rgw: update civetweb submodule
Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2014-08-15 21:37:50 -07:00
Alfredo Deza
a14a700acc Merge pull request #2270 from ceph/wip-init-ceph
init-ceph: don't use bashism

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2014-08-15 19:42:59 -04:00
Sage Weil
0d6d1aa7e0 init-ceph: don't use bashism
-z STRING
              the length of STRING is zero

Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-15 16:41:43 -07:00
Alfredo Deza
dc611e864b Merge pull request #2247 from ceph/wip-ceph-disk
ceph-disk: fix various dmcrypt bugs

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2014-08-15 19:40:15 -04:00
Loic Dachary
082db05c81 Merge pull request #2269 from ceph/wip-osd-mon-feature
osd: fix mon feature requirement

Reviewed-by: Loic Dachary <loic@dachary.org>
2014-08-16 00:19:59 +02:00
Sage Weil
1d0c66ae3a Merge remote-tracking branch 'gh/next' 2014-08-15 15:01:23 -07:00
Boris Ranto
7df67a544f Fix -Wno-format and -Werror=format-security options clash
This causes build failure in latest fedora builds, ceph_test_librbd_fsx adds -Wno-format cflag but the default AM_CFLAGS already contain -Werror=format-security, in previous releases, this was tolerated but in the latest fedora rawhide it no longer is, ceph_test_librbd_fsx builds fine without -Wno-format on x86_64 so there is likely no need for the flag anymore

Signed-off-by: Boris Ranto <branto@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2014-08-15 15:00:59 -07:00
Sage Weil
ae0b9f1776 osd: fix feature requirement for mons
These features should be set on the client_messenger, not
cluster_messenger.

Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
2014-08-15 14:29:11 -07:00