29422 Commits

Author SHA1 Message Date
Joao Eduardo Luis
42c4137cbf mon: OSDMonitor: fix some annoying whitespace
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-29 20:30:37 +00:00
Sage Weil
c2cd460950 Merge pull request #765 from ceph/wip-6635
mon: OSDMonitor: Make 'osd pool rename' idempotent

Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-25 17:53:30 -07:00
Sage Weil
8282e24dd6 mon/OSDMonitor: make racing dup pool rename behave
If we get dup pool rename requests that are racing, make sure the second
one comes back with 'success' if the rename entry already exists in the
pending_inc map.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-25 17:45:06 -07:00
Joao Eduardo Luis
c14c98d3f0 mon: OSDMonitor: Make 'osd pool rename' idempotent
'ceph osd pool rename' takes two arguments: source pool and dest pool.
If by chance 'source pool' does not exist and 'destination pool' does,
then, to ensure the command is idempotent, we assume that 'source pool'
no longer exists because it was already renamed.

However, while we will return success in such a case, we want to make sure
to let the user know that we made such an assumption.  Mostly to warn the
user of such a thing in case of a mistake on the user's part (say, the
user didn't notice that the source pool didn't exist, while the dest did),
but also to make sure that the user is not surprised by the command
returning success if the user expected an ENOENT or EEXIST.

Fixes: #6635

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-26 01:28:10 +01:00
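
The idempotent rename described in c14c98d3f0 can be sketched roughly as follows. This is a minimal illustration, not the OSDMonitor code: the function `rename_pool`, the use of a plain set for the pool names, the message strings, and the choice of EEXIST when both pools exist are all assumptions made for the example.

```python
import errno

def rename_pool(pools, src, dst):
    """Rename src to dst in the set `pools`; return (return_code, message).

    Hypothetical helper: names, messages and the EEXIST case are assumed.
    """
    if src in pools and dst in pools:
        return -errno.EEXIST, f"pool '{dst}' already exists"
    if src not in pools and dst not in pools:
        return -errno.ENOENT, f"unrecognized pool '{src}'"
    if src not in pools:
        # dst exists and src does not: treat it as an earlier, successful
        # rename, but tell the user that we made this assumption.
        return 0, (f"pool '{src}' does not exist; pool '{dst}' does; "
                   "assuming successful rename")
    pools.remove(src)
    pools.add(dst)
    return 0, f"pool '{src}' renamed to '{dst}'"
```

Calling it twice with the same arguments returns success both times, and the second message carries the warning.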
Gregory Farnum
0f1fed6fe7 Merge pull request #769 from ceph/wip-copy-get
With this branch we make copy-get significantly easier to extend by applying our standard encode/decode stuff to it, instead of doing an inline encode-onto-the-payload. We also add some infrastructure for dealing with completion of RepGathers.

Reviewed-by: Sage Weil <sage@inktank.com>
2013-10-25 13:57:21 -07:00
Greg Farnum
aea985c142 Objecter: expose the copy-get()'ed object's category
In the OSD, store the category in the CopyOp using this.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:57 -07:00
Greg Farnum
06b5bf675a osd: add category to object_copy_data_t
We don't bump the encoding version -- and stick it in the middle --
since it's still brand-new. For simplicity, we encode it unconditionally
rather than trying to embed it alongside the attrs or with its own
"complete" flag in the cursor.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:56 -07:00
Greg Farnum
61f2e5d994 OSD: add back CEPH_OSD_OP_COPY_GET, and use it in the Objecter
This one is encoded with version information. We are not doing anything
to control which op gets sent by the client, but after discussion with
Sam we think this op isn't accessible enough to clients (right now it's
only triggered by a client sending copy-from, which can only happen via
ceph-test-rados) to require compatibility versioning.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:56 -07:00
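
As an illustration of what "encoded with version information" buys in 61f2e5d994, here is a small sketch of a version-tagged payload. It is not Ceph's wire format: the header layout, `encode_versioned`, and `decode_versioned` are assumptions chosen to show the idea that a decoder can check compatibility and skip bytes it does not understand.

```python
import struct

# Assumed header: struct version, oldest compatible version, payload length.
HEADER = struct.Struct("<BBI")

def encode_versioned(version, compat, payload):
    """Prefix payload bytes with a small version header."""
    return HEADER.pack(version, compat, len(payload)) + payload

def decode_versioned(buf, understood):
    """Return (version, payload), refusing payloads that are too new."""
    version, compat, length = HEADER.unpack_from(buf, 0)
    if compat > understood:
        raise ValueError(f"needs decoder >= v{compat}, have v{understood}")
    start = HEADER.size
    # The length field lets an older decoder stop at the end of the payload
    # even if a newer encoder appended fields it does not know about.
    return version, buf[start:start + length]
```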
Greg Farnum
15c8267e34 OSD: rename CEPH_OSD_OP_COPY_GET -> CEPH_OSD_OP_COPY_GET_CLASSIC
In order to introduce versioning of copy-get, we need to make it a
different op that has the versioning infrastructure from the start.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:56 -07:00
Greg Farnum
b75b7ad679 ReplicatedPG: copy: move the COPY_GET implementation into its own function
It was getting long, isn't terribly dependent on access to do_osd_ops()
state, and will be easier to make generic as its own function.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:56 -07:00
Greg Farnum
80f36963b7 osd: Add a new object_copy_data_t, and use it in the OSD/Objecter
Right now this is very primitive, but we're about to extend it to
deal with request versioning appropriately, and to add some
extra fields.
Sadly we are doing a little extra copying in the Objecter as a result, but
too bad -- being able to do updates will be worth it.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:52:56 -07:00
Greg Farnum
808fa9ad39 ReplicatedPG: cache: don't handle cache if the obc is blocked
Right now the only way that can happen is if we're in the middle of a
promote!

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:45 -07:00
Greg Farnum
91b589fb1f ReplicatedPG: copy: add a C_KickBlockedObject
As the name says, you give it an obc and it kicks the block list
when finish()ed.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:45 -07:00
Greg Farnum
ade8f19650 ReplicatedPG: add a Context *ondone to RepGathers
Make a few changes to make sure we trigger it when appropriate. We'll use
this shortly for object promotion, and perhaps for other things in future.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:45 -07:00
Greg Farnum
b403ca80d9 ReplicatedPG: copy: rename CopyOp::version -> user_version
This version is a user version, and since we're in the OSD we
should call it such. (In particular, we may want to keep track
of the internal version too when doing cache promotes.)

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:44 -07:00
Greg Farnum
4e139fc318 ReplicatedPG: copy: do not let start_copy() return error codes
There's no failure it can actually run into, and handling error
codes in some of its callers is going to be a pain.
While we're here, document the parameters.

Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:44 -07:00
Greg Farnum
178f9a2a45 ObjectStore: add a bufferlist-based getattrs() function
Signed-off-by: Greg Farnum <greg@inktank.com>
2013-10-25 13:36:44 -07:00
Sage Weil
4f7114a945 Merge branch 'wip-osd-fixes' into next
Reviewed-by: Samuel Just <sam.just@inktank.com>
2013-10-25 12:56:02 -07:00
Sage Weil
e17ff196a4 osd/osd_types: init SnapSet::seq in ctor
Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-25 12:50:17 -07:00
Sage Weil
d2b661d0ef os/FileStore: fix getattr return value when using omap
The return value should be the length of the value, even when it was
stored in omap.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-25 12:49:57 -07:00
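
The contract stated in d2b661d0ef (the return value is the length of the value, wherever it is stored) can be sketched like this. The function `get_attr` and the two dictionaries standing in for inline xattrs and omap are hypothetical, not the FileStore interface.

```python
import errno

def get_attr(name, xattrs, omap):
    """Return (length, value), or (-ENODATA, None) if the attr is missing.

    `xattrs` and `omap` are plain dicts standing in for the two stores.
    """
    if name in xattrs:
        value = xattrs[name]
    elif name in omap:
        value = omap[name]
    else:
        return -errno.ENODATA, None
    # Same return convention for both storage locations.
    return len(value), value
```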
Sage Weil
3a469bb2ae os/ObjectStore: fix RMATTRS encoding
Apparently nobody uses this!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-25 12:49:51 -07:00
Samuel Just
847ea60592 PGLog::read_log: don't add items past backfill line to missing
Fixes: #6574
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
2013-10-25 12:34:13 -07:00
Sage Weil
4be4abe932 Merge pull request #764 from ceph/wip-rbd-parent-info
rbd.py: increase parent name size limit

Reviewed-by: Sage Weil <sage@inktank.com>
2013-10-25 10:09:28 -07:00
Josh Durgin
3c0042cde5 rbd.py: increase parent name size limit
64 characters isn't all that long. 4096 ought to be enough for anyone.

Fixes: #6072
Backport: dumpling, cuttlefish
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 17:31:04 -07:00
Samuel Just
87d3f88742 PGMap::dirty_all should be asserting about osd_epochs, not in.osd_epochs
Fixes: #6627
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-24 16:44:04 -07:00
Adam Twardowski
0388b712b4 Update init-rbdmap
Add a chkconfig line for RHEL-based distros to make chkconfig start rbdmap earlier on boot and stop it later on shutdown. This will help prevent shutdown/reboot from hanging your system forever in the event that some daemon has a file held open on an rbd-mounted filesystem.

Signed-off-by: Adam Twardowski <adam.twardowski@gmail.com>
(cherry picked from commit 80384a1a24)
2013-10-24 14:49:53 -07:00
Greg Farnum
0d326c3fa5 ceph: tolerate commands without any child args
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 12:35:47 -07:00
Josh Durgin
9fa9ed122e Merge branch 'wip-rgw-sync-next' into next
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
2013-10-24 11:45:06 -07:00
Josh Durgin
cfe845115b rgw: eliminate one unnecessary case statement
0x21 '!' is the first character that doesn't need encoding, so we can
expand the lower bound check.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 11:37:07 -07:00
Josh Durgin
f9a6d71904 radosgw-admin: remove unused function escape_str()
This was added before formatters were used for dumping logs.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:59:07 -07:00
Josh Durgin
ec45b3b88c rgw: escape bucket and object names in StreamReadRequests
This fixes copy operations for objects that contain unsafe characters,
like a newline, which would return a 403 otherwise, since the GET to
the source rgw would be unable to verify the signature on a partially
valid bucket name.

Fixes: #6604
Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:58:59 -07:00
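
To illustrate why escaping matters in ec45b3b88c, the sketch below percent-encodes a bucket and object name before building a request path. It uses Python's standard `urllib.parse.quote` rather than the rgw `url_encode()` function, and `escaped_path` is a hypothetical helper.

```python
from urllib.parse import quote, unquote

def escaped_path(bucket, obj):
    """Build '/bucket/object' with URL-unsafe characters percent-encoded."""
    # safe="" also escapes '/' in the bucket name; the object name keeps
    # quote()'s default of leaving '/' alone.
    return "/" + quote(bucket, safe="") + "/" + quote(obj)

path = escaped_path("bucket", "new\nline")
```

Here the newline becomes `%0A`, so the receiving side sees one well-formed path and `unquote` recovers the original name.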
Josh Durgin
dd308cd481 rgw: move url escaping to a common place
This is useful outside of the s3 interface. Rename url_escape() to
url_encode() for consistency with the existing common url_decode()
function. This is in preparation for the next commit, which needs
to escape url-unsafe characters in another place.

Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:58:48 -07:00
Josh Durgin
e0e8fb1b2b rgw: update metadata log list to match data log list
Send the last marker and whether the log is truncated, in the same format
as data log list, so clients don't have more needless complexity
handling the difference.  Keep bucket index logs the same, since they
contain the marker already, and are not used in exactly the same way
metadata and data logs are.

Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:56:48 -07:00
Josh Durgin
c275912509 rgw: include marker and truncated flag in data log list api
Consumers of this api need to know their position in the log. It's
readily available when fetching the log, so return it.  Without the
marker in this call, a client could not easily or efficiently figure
out its position in the log, since it would require getting the global
last marker in the log, and then reading all the log entries.

This would be slow for large logs, and would be subject to races that
would cause potentially very expensive duplicate work.

Returning this atomically while fetching the log entries simplifies
all of this.

Fixes: #6615
Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:56:45 -07:00
Josh Durgin
e74776f417 cls_log: always return final marker from log_list
There's no reason to restrict returning the marker to the case where
less than the whole log is returned, since there's already a truncated
flag to tell the client what happened.

Giving the client the last marker makes it easy to consume when the
log entries do not contain their own marker. If the last marker is not
returned, the client cannot get the last marker without racing with
updates to the log.

Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:56:07 -07:00
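
The listing behaviour described in c275912509 and e74776f417 (always return the last marker together with a truncated flag) can be sketched as below. The function `log_list`, the list-of-tuples log, and the string markers are assumptions for the example, not the cls_log or rgw API.

```python
def log_list(log, from_marker, max_entries):
    """Return (entries, last_marker, truncated) for one page of the log.

    `log` is a list of (marker, entry) pairs sorted by marker.
    """
    pending = [(m, e) for m, e in log if m > from_marker]
    page = pending[:max_entries]
    truncated = len(pending) > len(page)
    # Always report a marker, even when the whole log fit in one page,
    # so the caller never has to ask for it in a separate (racy) call.
    last_marker = page[-1][0] if page else from_marker
    return [e for _, e in page], last_marker, truncated

def read_all(log, page_size):
    """Consume the log page by page using only the returned marker."""
    marker, out = "", []
    while True:
        entries, marker, truncated = log_list(log, marker, page_size)
        out.extend(entries)
        if not truncated:
            return out, marker
```

Because the marker arrives with the entries, a client that resumes later can pass it straight back in without rereading the log.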
Josh Durgin
ea816c1c2f rgw: skip read_policy checks for system_users
A system user should still be able to examine suspended buckets, and
get -ENOENT instead of -EACCES for a deleted object.

Fixes: #6616
Backport: dumpling
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-10-24 08:56:02 -07:00
Sage Weil
71d1a28806 Merge pull request #761 from ceph/wip-6620
mds: MDSMap: adjust buffer size for uint64 values with more than 5 chars

Backport: dumpling, cuttlefish
2013-10-23 16:24:23 -07:00
Joao Eduardo Luis
0e8182edd8 mds: MDSMap: adjust buffer size for uint64 values with more than 5 chars
Fixes: #6620

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-24 00:17:45 +01:00
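
As a quick check on the sizing concern in 0e8182edd8, the arithmetic below shows how many characters a uint64 can need when printed in decimal. The commit does not state the buffer size that was chosen; `buffer_size` here is only the minimum for a NUL-terminated C string and is an assumption of this example.

```python
UINT64_MAX = 2**64 - 1

# Number of decimal digits in the largest uint64 value.
digits = len(str(UINT64_MAX))

# A C string buffer also needs one byte for the terminating NUL.
buffer_size = digits + 1
```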
João Eduardo Luís
c2cf8489bc Merge pull request #730 from ceph/wip-monc-ping
mon: MonClient: ping monitors without authenticating

* add support on the monitor to reply to MPing messages with the contents of
  'mon_status' and 'health', regardless of a client having authenticated beforehand.

* add support on the MonClient to send a MPing message to a randomly picked
  monitor (it was easier this way, '-m ip:port' allows for targeted ping) and block
  waiting for a reply.

* add support on librados, pybind/rados.py and the 'ceph' tool to send pings to
  monitors.

Resolves: #5984

Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-10-22 19:18:55 -07:00
Joao Eduardo Luis
7ba4bc4ab7 cli: ceph: add support to ping monitors
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 03:04:23 +01:00
Joao Eduardo Luis
400cb18bbc pybind: rados: ping a monitor via librados
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:54:40 +01:00
Joao Eduardo Luis
1a2e0ebaf1 pybind: rados: support ETIMEDOUT on make_ex()
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:54:34 +01:00
Joao Eduardo Luis
2d7ccab382 librados: support pinging a monitor without auth via RadosClient
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:52:01 +01:00
Joao Eduardo Luis
6a4b196a5b mon: MonClient: allow pinging a monitor without authenticating first
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:52:01 +01:00
Joao Eduardo Luis
c521ba78b1 mon: MonClient: adjust whitespaces
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:52:01 +01:00
Joao Eduardo Luis
5e4652eb8c mon: Monitor: reply to ping messages, letting them know we're alive
Fixes: #5984

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:52:01 +01:00
Joao Eduardo Luis
4ca1407fd9 mon: Monitor: do not flush formatter at end of _mon_status()
Delegate that to the caller so that we can combine the result of
_mon_status() with the result of other functions.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-10-23 02:52:01 +01:00
Sage Weil
dbbf9938e3 Merge remote-tracking branch 'gh/wip-6242-b' into next
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-10-22 13:32:01 -07:00
Sage Weil
1821ad781b pybind/rados: create InterruptedOrTimeoutError exception
Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-22 13:12:59 -07:00
Sage Weil
12308862f7 ceph: move timeout
Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-22 13:02:22 -07:00