Commit Graph

25553 Commits

Author SHA1 Message Date
Samuel Just
49eeaeba3f Messenger: add interface to get oldest queued message arrival time
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 18:27:28 -07:00
Samuel Just
297c6714b3 DispatchQueue: track queued message arrival times and expose oldest
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 18:27:28 -07:00
Sage Weil
0cd86dfb64 Merge pull request #237 from ceph/wip-4794
init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
2013-04-23 17:23:32 -07:00
Sage Weil
e09efda7c8 Merge pull request #241 from ceph/wip-4798
#4798

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 17:17:02 -07:00
Sage Weil
48631c114a mon: revert part of PaxosService::is_readable() change
In 98e23980f4 is_readable() was changed to
call is_active(), but that has a check for is_bootstrapping(), so there is
a semantic change.

As a result, we may fail PaxosService::is_readable() (due to bootstrapping)
and then try to call Paxos::wait_for_readable().  That will assert that
Paxos::is_readable() is false, but it will be true and we will crash.

Revert that part of the change, since the semantic change was not
intentional.

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 17:16:31 -07:00
Sage Weil
0093d704e6 librbd: fix i386 build
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 16:18:53 -07:00
Josh Durgin
5349ee3056 Merge pull request #240 from ceph/wip-4665
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-23 16:11:44 -07:00
Sage Weil
857c88e017 librbd: add read_iterate2 call with fixed argument type
The existing read_iterate takes a size_t for the length, which is only 4GB
on 32-bit machines.  Instead, take a uint64_t length for the new
read_iterate2().

Return 0 instead of the number of bytes read; this makes the user-facing
API a bit simpler.

Fixes: #4665
Signed-off-by: Sage Weil <sage@inktank.com>

keep bytes return from internal method
2013-04-23 15:57:26 -07:00
Sage Weil
6c798ed940 librbd: implement read not in terms of read_iterate
The read() method returns the bytes read, trimmed to the end of the image;
use the other read() variant to do this (which use aio_read()) instead of
read_iterate().

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 15:45:19 -07:00
Sage Weil
95ed73a73b mon: drop forwarded requests after an election
On each election, we resend routed requests to the new leader (or
requeue for ourselves).  Therefore, if we receive a forwarded request,
we should drop it on the floor if there is a new election.  Add a field
in the PaxosServiceMessage struct to track which election epoch we
received the request in, and drop it in PaxosService::dispatch() if
that is in the past.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 14:06:41 -07:00
Sage Weil
ab25707092 mon: requeue routed_requests for self if elected leader
If we have requests that we have forwarded, and are elected leader,
requeue those requests for ourself and queue them normally and clear out
the routed_requests map.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 13:45:59 -07:00
Sage Weil
4b07d6928c mon: track original Connection* for forwarded requests
Keep a reference to the source Connection* for forwarded requests.  This
makes the reply path slightly cleaner, and will help us later.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 13:40:27 -07:00
Gregory Farnum
426e3be64e Merge pull request #222 from ceph/wip-3495
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 12:44:05 -07:00
Samuel Just
8402107c65 test_filejournal: adjust corrupt entry tests to force header write
The journal no longer assumes corruption if it finds a valid entry
after an inavlid entry.  Instead, these tests will exercise the
corruption detection via the header committed_up_to member.

Fixes: #4792
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-23 12:28:36 -07:00
Sage Weil
ccbc4dbc6e init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
The old code would only do the push once per remote node (due to the
list in $pushed_to) but would reset $unique on each attempt.  This would
break if a remote host was processed twice.

Fix by just skipping the $pushed_to optimization entirely.

Fixes: #4794
Reported-by: Andreas Friedrich <andreas.friedrich@ts.fujitsu.com>
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-23 10:00:38 -07:00
Gary Lowell
7ad63d23d7 ceph-disk: OSD hotplug fixes for Centos
Two fixes for Centos 6.3 and other systems with udev versions
prior to 172.  The disk peristant name using the GPT UUID does
not exist, so use the by_path persistent name instead for the
journal symlink.

The gpt label fields are not available for use in udev rules. Add
ceph-disk-udev wrapper script that extracts the partition
type guid from the label and calls ceph-disk-activate if it is
a ceph guid type. (Bug #4632)

Signed-off-by: Gary Lowell  <gary.lowell@inktank.com>
2013-04-22 22:30:39 -07:00
John Wilkins
3dd9574bbf doc: Usage requires --num_osds.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-22 21:03:15 -07:00
John Wilkins
b71ec9c25a doc: Added some detail. Calculating PGs, maps; reorganized a bit.
fixes: #2968

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-22 21:02:45 -07:00
Joao Eduardo Luis
b73ef010bf mon: [MDS]Monitor: remove 'stop_cluster' and 'do_stop()'
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-23 00:18:28 +01:00
Joao Eduardo Luis
f42fc0e462 mon: MDSMonitor: tighter leash on cross-proposals to the osdmon
Fixes: #3495

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-23 00:18:28 +01:00
Gregory Farnum
2501980350 Merge pull request #234 from ceph/wip-4758
Fixes #4758.

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-22 15:22:04 -07:00
Joao Eduardo Luis
fa77e1e732 mon: PaxosService: add request_proposal() to perform cross-proposals
Instead of allowing services to directly use 'propose_pending()' on
other services, we instead add two new functions:

  - request_proposal() to request 'this' service to propose its
    pending value; and
  - request_proposal(PaxosService *other) so that 'this' service
    can request a proposal to 'other'

These functions should allow us to enforce a greater set of
constraints at time of a cross-proposal, either by making sure a
service will (e.g.) hold-off his own proposals until said proposal
is performed, or even that the other service will enforce a tighter
set of constraints that wouldn't otherwise be enforced by using
'propose_pending()' directly.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:26 +01:00
Joao Eduardo Luis
a634bb17e4 mon: PaxosService: is_writeable() depends on being ready to be written to
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:21 +01:00
Joao Eduardo Luis
98e23980f4 mon: PaxosService: is_readable/writeable() depending on is_active()
Instead of depending on individual conditions.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:18 +01:00
Joao Eduardo Luis
b29a5b151f mon: PaxosService: consider is_recovering() on is_writeable()
A service is never writeable while it's recovering.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 23:20:14 +01:00
Sage Weil
b33fae4ea1 mon: commit LogSummary on every message
This moves our version pointer up so that we don't re-log (by re-consuming)
log messages to /var/log/ceph/ceph.log on ceph-mon restart.  OTOH, it means
we rewrite the summary of the last 50 messages, but we consider that to be
relatively cheap (and something we *always* did prior for bobtail and
earlier anyway).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 15:12:22 -07:00
Sage Weil
59d6953ca0 mon: set threshold to periodically stash_full
Set an interval to periodically write a full copy of the map that is lower
than the trim point (which is generally a very large number of commits).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 15:12:22 -07:00
Sage Weil
5792be81ad Merge pull request #230 from ceph/wip-mon-paxos-fixes
Wip mon paxos fixes

Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-04-22 15:11:46 -07:00
Gregory Farnum
c200cdb081 Merge pull request #225 from ceph/wip-4543
Fixes #4543

Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-22 15:05:14 -07:00
Joao Eduardo Luis
1164345a6e ceph-mon: Attempt to obtain monmap from several possible sources
In order of interest/priority:

  - our latest monmap version
  - a backup monmap version created during sync start, if the store
    appears to be in a post-aborted sync state
  - a mkfs monmap version

If none of these are found, we should go ahead and try to build a
monmap from ceph.conf to join an existing cluster.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 22:55:18 +01:00
Joao Eduardo Luis
9ba32404ae mon: Monitor: backup monmap prior to starting a store sync
If by fate we end up attempting a store sync after failing at
least one before, we might not have a monmap to read from the
store to backup.  Therefore, in that case, we shall backup the
current monmap being used by the monitor.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 22:53:14 +01:00
Yehuda Sadeh
de5d1da810 rgw: don't send tail to gc if copying object to itself
Fixes: #4776
Backport: bobtail
Need to make sure that when copying an object into itself we don't
send the tail to the garbage collection.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-04-22 13:44:19 -07:00
Josh Durgin
cec5282b2a Merge pull request #232 from ceph/wip-4710
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-22 13:36:38 -07:00
Sage Weil
70e1e47da0 Merge pull request #233 from ceph/wip-mon-idempotent
Wip mon idempotent

Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-04-22 12:58:08 -07:00
Sage Weil
85fd2ca20d mon: make 'osd pool rmsnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:50:09 -07:00
Sage Weil
43d62c00c9 mon: make 'osd pool mksnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:49:58 -07:00
Sage Weil
08e3ec116f mon: make 'osd blacklist rm ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:48:49 -07:00
Sage Weil
5926ffa576 rbd: only set STRIPINGV2 feature when needed
Only set the STRIPINGV2 feature if the striping parameters are non-default.
Specifically, fix the case where the passed-in size and count are == 0.

Fixes: #4710
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:41:49 -07:00
Sage Weil
5446218f7b rbd: fix feature display for --info
Only include the feature if it is set!

Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 12:38:11 -07:00
Sage Weil
568101fa72 rbd: avoid clobbering return value with udevadm settle
Fixes: #4707
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-22 11:41:27 -07:00
Samuel Just
8db9d0a274 FileJournal: a valid entry after invalid entry =/=> corrupt
Out of order journal entry writes using aio may cause entry
n+2 to be written prior to n.  This does not indicate
corruption.

Fixes: #4736
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-22 11:32:25 -07:00
Sylvain Munaut
9b953aa410 radosgw: Fix duplicate 'Content-Type' when using 'response-content-type'
Signed-off-by: Sylvain Munaut <s.munaut@whatever-company.com>
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
2013-04-22 09:14:10 -07:00
Joao Eduardo Luis
4b9a2a39c9 mon: MonmapMonitor: add function to obtain latest monmap
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 17:08:27 +01:00
Joao Eduardo Luis
41b874cb70 mon: PaxosService: add 'exists_key/version' helper functions
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-04-22 17:08:06 +01:00
Gary Lowell
1a8b30eff1 ceph-create-keys: Don't wait if permission denied
If get or create keys returns permssion denied, exit
gracefully instead of retrying.

Signed-off-by: Gary Lowell  <gary.lowell@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-04-21 22:08:49 -07:00
John Wilkins
1fa719d59e doc: Aesthetic improvements. Removed unnecessary graphic and overrode margin for h3 tag.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-20 11:10:51 -07:00
John Wilkins
3749ffe652 doc: Added a scenario to PG troubleshooting.
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-20 11:08:08 -07:00
John Wilkins
cf915941be doc: Changed usage to "bucket-name". Description was okay.
fixes: #4102

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
2013-04-20 11:06:44 -07:00
David Zafman
c4f8adca83 Merge branch 'wip-4201' into next
Reviewed-by: Samuel Just <sam.just@inktank.com>
2013-04-19 18:14:28 -07:00
David Zafman
870f47c7cb tools/ceph-filestore-dump: Implement remove, export and import
Change local names to be clearer
Break real_log() into common function get_log()
Move infos_oid, biginfo_oid and log_oid to globals for general use

Feature: #4201 (osd: data loss: pg export/import/remove)

Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-04-19 18:11:26 -07:00