Commit Graph

25849 Commits

Author SHA1 Message Date
Sage Weil
14777ec1b5 Merge remote-tracking branch 'gh/next'
Conflicts:
	ceph.spec.in
2013-04-24 08:51:25 -07:00
leseb
31399d1776 Fix typo of the keystone service-create command
Signed-off-by: leseb <sebastien.han@enovance.com>
2013-04-24 08:49:58 -07:00
Yehuda Sadeh
9abec309e8 rgw: list container only shows stats if needed
Fixes: #4759
Add a new request param 'stats' for the swift list containers
request. If set to 'false' it disables stats retrieval, which
makes it go faster. Also, don't dump stats if format is plain,
as they're not going to be dumped.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-04-24 08:49:28 -07:00
Sage Weil
c7a0477bad rbd: fix cli-integration tests for striping change
We don't set the striping feature when we are using backward-compatible
(default) striping now; fix the test accordingly.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-24 08:36:06 -07:00
Gary Lowell
446641aa34 95-ceph-osd-alt.rules: Fix missing parent parameter
Signed-off-by: Gary Lowell  <gary.lowell@inktank.com>
2013-04-24 08:22:04 -07:00
Samuel Just
1f7ff412ab ReplicatedPG: timeout watches based on last_became_active
This way a notify on an object with a single defunct watcher
won't necessarily have to wait the full timeout if the pg
has been active for a while.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 20:54:57 -07:00
Samuel Just
a40772bedc osd_types: add last_became_active to pg_stats
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 20:54:57 -07:00
Samuel Just
d44cfc524f Merge branch 'wip_4552' into next
Fixes: #4552
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 20:51:37 -07:00
Samuel Just
d196b5badf OSD: don't report peers down if hbclient_messenger is backed up
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 18:27:28 -07:00
Samuel Just
49eeaeba3f Messenger: add interface to get oldest queued message arrival time
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 18:27:28 -07:00
Samuel Just
297c6714b3 DispatchQueue: track queued message arrival times and expose oldest
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 18:27:28 -07:00
Sage Weil
0cd86dfb64 Merge pull request #237 from ceph/wip-4794
init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
2013-04-23 17:23:32 -07:00
Sage Weil
e09efda7c8 Merge pull request #241 from ceph/wip-4798
#4798

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 17:17:02 -07:00
Sage Weil
48631c114a mon: revert part of PaxosService::is_readable() change
In 98e23980f4ab7ba289303f72da06721c84767293 is_readable() was changed to
call is_active(), but that has a check for is_bootstrapping(), so there is
a semantic change.

As a result, we may fail PaxosService::is_readable() (due to bootstrapping)
and then try to call Paxos::wait_for_readable().  That will assert that
Paxos::is_readable() is false, but it will be true and we will crash.

Revert that part of the change, since the semantic change was not
intentional.

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 17:16:31 -07:00
Sage Weil
0093d704e6 librbd: fix i386 build
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 16:18:53 -07:00
Josh Durgin
5349ee3056 Merge pull request #240 from ceph/wip-4665
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-23 16:11:44 -07:00
Sage Weil
857c88e017 librbd: add read_iterate2 call with fixed argument type
The existing read_iterate takes a size_t for the length, which is only 4GB
on 32-bit machines.  Instead, take a uint64_t length for the new
read_iterate2().

Return 0 instead of the number of bytes read; this makes the user-facing
API a bit simpler.

Fixes: #4665
Signed-off-by: Sage Weil <sage@inktank.com>

keep bytes return from internal method
2013-04-23 15:57:26 -07:00
Sage Weil
6c798ed940 librbd: implement read not in terms of read_iterate
The read() method returns the bytes read, trimmed to the end of the image;
use the other read() variant to do this (which use aio_read()) instead of
read_iterate().

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 15:45:19 -07:00
Sage Weil
95ed73a73b mon: drop forwarded requests after an election
On each election, we resend routed requests to the new leader (or
requeue for ourselves).  Therefore, if we receive a forwarded request,
we should drop it on the floor if there is a new election.  Add a field
in the PaxosServiceMessage struct to track which election epoch we
received the request in, and drop it in PaxosService::dispatch() if
that is in the past.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 14:06:41 -07:00
Sage Weil
ab25707092 mon: requeue routed_requests for self if elected leader
If we have requests that we have forwarded, and are elected leader,
requeue those requests for ourself and queue them normally and clear out
the routed_requests map.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 13:45:59 -07:00
Sage Weil
4b07d6928c mon: track original Connection* for forwarded requests
Keep a reference to the source Connection* for forwarded requests.  This
makes the reply path slightly cleaner, and will help us later.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 13:40:27 -07:00
Gregory Farnum
426e3be64e Merge pull request #222 from ceph/wip-3495
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 12:44:05 -07:00
Samuel Just
8402107c65 test_filejournal: adjust corrupt entry tests to force header write
The journal no longer assumes corruption if it finds a valid entry
after an inavlid entry.  Instead, these tests will exercise the
corruption detection via the header committed_up_to member.

Fixes: #4792
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 12:28:36 -07:00
Yehuda Sadeh
556bb6491d rgw: stream list buckets (containers) request
Fixes: #4760
Instead of retrieving the entire list of buckets in one
chunk, streamline it. This makes it so that if the request
takes too long, client isn't going to timeout before getting
any data.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-04-23 10:48:43 -07:00
Sage Weil
ccbc4dbc6e init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
The old code would only do the push once per remote node (due to the
list in $pushed_to) but would reset $unique on each attempt.  This would
break if a remote host was processed twice.

Fix by just skipping the $pushed_to optimization entirely.

Fixes: #4794
Reported-by: Andreas Friedrich <andreas.friedrich@ts.fujitsu.com>
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 10:00:38 -07:00
Gary Lowell
7ad63d23d7 ceph-disk: OSD hotplug fixes for Centos
Two fixes for Centos 6.3 and other systems with udev versions
prior to 172.  The disk peristant name using the GPT UUID does
not exist, so use the by_path persistent name instead for the
journal symlink.

The gpt label fields are not available for use in udev rules. Add
ceph-disk-udev wrapper script that extracts the partition
type guid from the label and calls ceph-disk-activate if it is
a ceph guid type. (Bug #4632)

Signed-off-by: Gary Lowell  <gary.lowell@inktank.com>
2013-04-22 22:30:39 -07:00
John Wilkins
3dd9574bbf doc: Usage requires --num_osds.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-22 21:03:15 -07:00
John Wilkins
b71ec9c25a doc: Added some detail. Calculating PGs, maps; reorganized a bit.
fixes: #2968

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-22 21:02:45 -07:00
Joao Eduardo Luis
b73ef010bf mon: [MDS]Monitor: remove 'stop_cluster' and 'do_stop()'
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-23 00:18:28 +01:00
Joao Eduardo Luis
f42fc0e462 mon: MDSMonitor: tighter leash on cross-proposals to the osdmon
Fixes: #3495

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-23 00:18:28 +01:00
Gregory Farnum
2501980350 Merge pull request #234 from ceph/wip-4758
Fixes #4758.

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-22 15:22:04 -07:00
Joao Eduardo Luis
fa77e1e732 mon: PaxosService: add request_proposal() to perform cross-proposals
Instead of allowing services to directly use 'propose_pending()' on
other services, we instead add two new functions:

  - request_proposal() to request 'this' service to propose its
    pending value; and
  - request_proposal(PaxosService *other) so that 'this' service
    can request a proposal to 'other'

These functions should allow us to enforce a greater set of
constraints at time of a cross-proposal, either by making sure a
service will (e.g.) hold-off his own proposals until said proposal
is performed, or even that the other service will enforce a tighter
set of constraints that wouldn't otherwise be enforced by using
'propose_pending()' directly.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:26 +01:00
Joao Eduardo Luis
a634bb17e4 mon: PaxosService: is_writeable() depends on being ready to be written to
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:21 +01:00
Joao Eduardo Luis
98e23980f4 mon: PaxosService: is_readable/writeable() depending on is_active()
Instead of depending on individual conditions.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:18 +01:00
Joao Eduardo Luis
b29a5b151f mon: PaxosService: consider is_recovering() on is_writeable()
A service is never writeable while it's recovering.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:14 +01:00
Sage Weil
b33fae4ea1 mon: commit LogSummary on every message
This moves our version pointer up so that we don't re-log (by re-consuming)
log messages to /var/log/ceph/ceph.log on ceph-mon restart.  OTOH, it means
we rewrite the summary of the last 50 messages, but we consider that to be
relatively cheap (and something we *always* did prior for bobtail and
earlier anyway).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 15:12:22 -07:00
Sage Weil
59d6953ca0 mon: set threshold to periodically stash_full
Set an interval to periodically write a full copy of the map that is lower
than the trim point (which is generally a very large number of commits).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 15:12:22 -07:00
Sage Weil
5792be81ad Merge pull request #230 from ceph/wip-mon-paxos-fixes
Wip mon paxos fixes

Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-04-22 15:11:46 -07:00
Gregory Farnum
c200cdb081 Merge pull request #225 from ceph/wip-4543
Fixes #4543

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-22 15:05:14 -07:00
John Wilkins
660752a276 doc: Added users to Getting Started.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-22 15:03:11 -07:00
Joao Eduardo Luis
1164345a6e ceph-mon: Attempt to obtain monmap from several possible sources
In order of interest/priority:

  - our latest monmap version
  - a backup monmap version created during sync start, if the store
    appears to be in a post-aborted sync state
  - a mkfs monmap version

If none of these are found, we should go ahead and try to build a
monmap from ceph.conf to join an existing cluster.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 22:55:18 +01:00
Joao Eduardo Luis
9ba32404ae mon: Monitor: backup monmap prior to starting a store sync
If by fate we end up attempting a store sync after failing at
least one before, we might not have a monmap to read from the
store to backup.  Therefore, in that case, we shall backup the
current monmap being used by the monitor.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 22:53:14 +01:00
Yehuda Sadeh
de5d1da810 rgw: don't send tail to gc if copying object to itself
Fixes: #4776
Backport: bobtail
Need to make sure that when copying an object into itself we don't
send the tail to the garbage collection.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-04-22 13:44:19 -07:00
Josh Durgin
cec5282b2a Merge pull request #232 from ceph/wip-4710
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-22 13:36:38 -07:00
Sage Weil
70e1e47da0 Merge pull request #233 from ceph/wip-mon-idempotent
Wip mon idempotent

Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-04-22 12:58:08 -07:00
Sage Weil
85fd2ca20d mon: make 'osd pool rmsnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:50:09 -07:00
Sage Weil
43d62c00c9 mon: make 'osd pool mksnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:49:58 -07:00
Sage Weil
08e3ec116f mon: make 'osd blacklist rm ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:48:49 -07:00
Sage Weil
5926ffa576 rbd: only set STRIPINGV2 feature when needed
Only set the STRIPINGV2 feature if the striping parameters are non-default.
Specifically, fix the case where the passed-in size and count are == 0.

Fixes: #4710
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:41:49 -07:00
Sage Weil
5446218f7b rbd: fix feature display for --info
Only include the feature if it is set!

Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:38:11 -07:00