RepoMirrors/ceph

mirror of https://github.com/ceph/ceph synced 2024-12-18 01:16:55 +00:00

Author	SHA1	Message	Date
Greg Farnum	12122b11c7	OSDMap: add primary out params to _pg_to_osds and _raw_to_up_osds Switch to use pointers for the out parameters instead of references. These functions are still just pointing at the front of the generated lists for the "primary" params, but now that all their callers respect these outputs we can add programmatic leader assignment with just these two functions. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:07 -08:00
Greg Farnum	0c3050932b	OSDMap: add primary out params to internal _pg_to_up_acting_osds function And use pointers instead of references for out params. Now pg_to_up_acting_osds and pg_to_acting_osds can plug in to this slightly more real implementation, instead of making up their own. (We are still just using the first member anyway, but we're about to plug it into the bottom layer of functions.) Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:07 -08:00
Greg Farnum	1c750c65f5	OSDMonitor: implement remove_down_primary_temp() Same as remove_down_pg_temp() Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	412afea976	OSDMonitor: make remove_redundant_pg_temp clear primary, too So that this works with future CRUSH changes, we copy the map and clear out the primary_temp, then compare its output with the real map's output. If they match, remove the primary_temp from the real map. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	e2db4aea87	OSDMonitor: remove primary_temp entries when you remove their pool Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	a246039553	OSDMap: expose the primary_temp in print() Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	1f81fda58c	OSDMap: dedup the primary_temp Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	e9e615cb12	OSDMap: add primary_temp to apply_incremental() Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	74bdcb69e3	OSDMap: add [new_]primary_temp to the map and Incremental It's not used actively yet, but there it is. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	b55c45e85d	OSDMap: update Incremental encode/decode to match the full map's Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	3d7c69fb09	OSDMap: add a CEPH_FEATURE_OSDMAP_ENC feature, and use new encoding Bring our OSDMap encoding into the modern Ceph world! :) This is fairly straightforward, but has a few rough edges: Previously we had a "struct_v" which went at the beginning of the OSDMap encoding, and then later on an ev "extended version" which was used to store the more-frequently-changed OSDMap pieces. There was no size information stored explicitly to let clients skip this, but osd maps were always encoded into their own bufferlist before being sent to clients, which had the same effect. We now use the modern ENCODE_START three times: 1) for the overall OSDMap encoding, 2) for the client-usable portion of the map, 3) for the "extended" portion of the map This will let us independently rev everything, which may come in useful if we want to (for instance) add a "monitor" portion to the map that the OSDs don't care about. It also makes adding new client information a lot easier since older clients will still be able to decode the map as a whole. We may want to merge this OSDMAP_ENC feature with one of the others we are creating during this cycle, since they're all very closely related. That will also let us protect more naturally against old clients getting a map they need to understand but can't (because we only need the new map features-to-come when used with erasure-encoded PGs, etc). Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	2646d5edb1	OSDMap: add primary out param to pg_to_raw_up, and use pointers instead of refs The only user is in the OSDMonitor, and it's going to want that information anyway. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:06 -08:00
Greg Farnum	045e1d75a7	OSDMap: add primary-specifying pg_to_acting_osds This works the same as pg_to_up_acting_osds Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	93d481a5d2	mon, osdmaptool: switch to primary-specifying pg_to_up_acting_osds Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	9749f30cdf	OSDMap: implement pg_to_up_acting_osds with primary interface Use our pointer calling conventions instead of a reference for the new version of the function. Right now we're just setting the primaries equal to the first member of up and acting (or -1 if none), but very shortly we'll modify our private OSDMap functions to export them based on the contents of temp_primary. While in general anybody querying for the mapping information will need to pay attention to whom the primary is as well, we have lots of callers who will need real code changes to do so. To serve them, we keep a version that does not export the primary, but asserts that the primary matches the first entry in its list. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	5b699782d1	OSDMap: switch pg_to_osds to have an explicit primary param Use pointers instead of references for the out params, too! Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	5367d92e0b	OSDMap: rename _raw_to_temp_osds() -> _get_temp_osds() This function does not (and never has!) used the raw vector, so remove it and don't use a name which implies it is doing any sort of conversion. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	69a2ec27f0	OSDMap: unify the pg_to_acting_osds and pg_to_up_acting_osds implementations These were the same except for a call to _raw_to_up_osds(). Move the existing pg_to_up_acting_osds into a private function taking a pointer, only fill in the up vector if it's a non-NULL pointer, and call it via the obvious header implementations. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	c1a95f83ac	OSDMap: remove get_pg_primary() function This was used only by SyntheticClient, and that wants get_pg_acting_primary() anyway. Delete the easily-misused get_pg_primary() and switch. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	7a9c1712f4	OSDMap: doc the different pg->OSD mapping functions Some of these look like what you should use for mapping and they absolutely are not suitable for that. Make it clearer. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	268ae82ac3	osd: do not misuse calc_pg_role We've been using the role returned from this to determine if we're the primary or not. Don't. This is mostly about removing a few asserts; while in there I also redirected some calls to use static dereference instead of going through the osdmap lookup path. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:05 -08:00
Greg Farnum	a09d4f171e	PG: do not use role == 0 as a determinant of primacy We already have an is_primary() function to use instead. Signed-off-by: Greg Farnum <greg@inktank.com>	2014-01-15 16:33:04 -08:00
Josh Durgin	c60ae09b38	Merge pull request #978 from ceph/wip-3454 Reviewed-by: Josh Durgin <josh.durgin@inktank.com>	2014-01-15 15:28:31 -08:00
Yehuda Sadeh	644afd67b9	radosgw-admin: add temp url params to usage Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>	2014-01-15 15:12:40 -08:00
athanatos	980ef0e8b3	Merge pull request #1089 from dachary/wip-mailmap mailmap: add athanatos <sam.just@inktank.com> Reviewed-by: Samuel Just <sam.just@inktank.com>	2014-01-15 10:25:28 -08:00
athanatos	73e469c966	Merge pull request #963 from dachary/wip-erasure-code-api erasure code interface helpers Reviewed-by: Samuel Just <sam.just@inktank.com>	2014-01-15 10:22:47 -08:00
John Wilkins	970f9387bd	doc: Updated paths for OSDs using the OS disk. fixes: #6682 Signed-off-by: John Wilkins <john.wilkins@inktank.com>	2014-01-15 10:08:28 -08:00
Loic Dachary	1ffe4226c2	mailmap: add athanatos <sam.just@inktank.com> Signed-off-by: Loic Dachary <loic@dachary.org>	2014-01-15 09:23:09 +01:00
Sage Weil	4050eae32c	Merge pull request #1084 from dachary/wip-cephtool-test qa: cleanup cephtool/test.sh tmp files Reviewed-by: Sage Weil <sage@inktank.com>	2014-01-14 21:57:48 -08:00
Gregory Farnum	f19adc919d	Merge pull request #1085 from dachary/ceph-master Reviewed-by: Greg Farnum <greg@inktank.com>	2014-01-14 15:12:34 -08:00
Loic Dachary	4b5f2570e9	common: fix bufferlist::append(istream) test bufferlist::append(istream) now filters out empty lines; reflect this in the test Signed-off-by: Loic Dachary <loic@dachary.org>	2014-01-15 00:08:41 +01:00
Sage Weil	e55a08964f	doc/release-notes: v0.75 Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-14 09:37:52 -08:00
Loic Dachary	08c17b7c5c	qa: cleanup cephtool/test.sh tmp files When run in a shared environment ( as opposed as a machine created for the purpose of running this test only ), it is important to cleanup leftovers to avoid poluting the /tmp space. Create a common temporary directory for all tmp files. Signed-off-by: Loic Dachary <loic@dachary.org>	2014-01-14 17:31:04 +01:00
Ken Dreyer	03d7d97d5d	Merge branch 'next'	2014-01-14 16:16:41 +00:00
Loic Dachary	a520026eb4	Merge pull request #1076 from dachary/wip-vector-op erasure-code: use uintptr_t instead of long long Reviewed-by: Andreas Peters <andreas.joachim.peters@cern.ch>	2014-01-14 08:10:59 -08:00
Loic Dachary	dc4e212d6a	Merge pull request #1078 from ceph/wip-mon-pgmap mon: make 'pg getmap' not include a trailing newline Reviewed-by: Loic Dachary <loic@dachary.org>	2014-01-13 22:38:09 -08:00
Sage Weil	66a4f8a291	Merge pull request #1071 from ceph/wip-max-file-size allow mds max file size to be adjusted Reviewed-by: Yan, Zheng <zheng.z.yan@intel.com>	2014-01-13 17:43:49 -08:00
Sage Weil	c5cacf4e56	Merge pull request #1058 from ceph/wip-cache-snap snap/clone promotion, flush, and other goodies This is now passing the thrashing with both cache and snap ops: sage-2014-01-13_15:45:26-rados:thrash-wip-cache-snap-testing-basic-plana Reviewed-by: Samuel Just <sam.just@inktank.com>	2014-01-13 16:50:17 -08:00
Sage Weil	be8db8c338	osd/ReplicatedPG: use get_object_context in trim_object find_object_context() has all the logic to choose a particular clone given a logical snap. In the trim case, we want none of that: we just need to pull the obc for a specific clone instance. Note that this changes none of the failure cases (previous we asserted r == 0). Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:50 -08:00
Sage Weil	b5ae76e8fe	ceph_test_rados: do not delete in-use snaps There are a bunch of ops that read from snaps. Do not delete a snap while they are in use. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	8b39719a10	osd/OSDMonitor: fix 'osd tier add ...' pool mangling Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	d41a1d3d82	osd/ReplicatedPG: update ObjectContext's object_info_t for new hit_set objects We were fabricating an object_info_t correctly and writing it to disk, but it was not reflected by the in-memory ObjectContext. If something came along quickly (like backfill) and tried to use it, the info would be invalid. Fix this by fabricating it in the obc and copying it to the new_obs for the update. Fixes: #7122 Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	10547e6713	osd/ReplicatedPG: always return ENOENT on deleted snap Previously, if a snap was deleted but the clone was there and we hadn't trimmed it yet, we would still return the data. Instead, return ENOENT unconditionally (even it's not removed yet). This makes the behavior from the client perspective more predictable and conistent. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	8cab9e7657	ceph_test_rados_api_tier: partial test for promote vs snap trim race This reliably returns ENODEV due to the test at the finish of flush. Not because we are actually racing with trim, though: the trimmer doesn't run at all. I believe it captures the important property, though. Namely: we should not write a promoted object that is "behind" the snap trimmer's progress. The fact that we are in front of it (the trimmer hasn't started yet) should not matter since the object is logically deleted anyway. We probably want to make the OSD return ENODEV on read in the normal case when you try to access a clone that is pending trimming. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	8221a2a54d	osd/ReplicatedPG: cleanly abort flush if the object no longer exists If the object no longer exists (for example, because the snap trimmer just killed it) clean up the flush state without trying to mark the object clean. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	f3ce2549c5	osd/Replicated: mark obc !exists on snap trim Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:49 -08:00
Sage Weil	48306e47d0	mon: debug propagate_snaps_to_tiers Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:48 -08:00
Sage Weil	6719d30288	osd: fix propagation of removed snaps to other tiers When we update removed_snaps we do not update snap_seq. Drop this broken optimization. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:48 -08:00
Sage Weil	7e80fa068e	osd/ReplicatedPG: handle promote that races with snap deletion If we are promoting a clone and realize that the object is no longer defined for any snaps, abort the copy and delete any temp object. If the defined snaps have changed, make sure they are updated in memory so that on promote completion the snapshot metadata is correct. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:48 -08:00
Sage Weil	cd42368e3c	osd/ReplicatedPG: simplify copy-from temp object handling Previously the caller was generating a temp object name and passing it down in severaly different ways. Instead, generate one when we realize that we need it, and store it in one place (CopyResults), where the completions can get at the information. Signed-off-by: Sage Weil <sage@inktank.com>	2014-01-13 16:19:48 -08:00

1 2 3 4 5 ...

30843 Commits