Commit Graph

23514 Commits

Author SHA1 Message Date
Sage Weil
83f12e4630 ceph_test_rados: submit a racing read with every COPY_FROM
Verify that the racing read completes after the COPY_FROM does (i.e., is
blocked by it).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
d050596a0d ceph_test_rados: use get_version64() throughout
Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
cdc637f0f1 osd: block requests on object during COPY_FROM
Block any request on an object (read or write) during the COPY_FROM
operation.

This could potentially be broken down into read vs write operations without
much difficulty, but blocking any op indescriminately is sufficient for
now, so let's keep it simple.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
f97277cc50 osd: add infrastructure to block io on an obc
Add an is_blocked() method for the obc, and add infrastructure to block
any operations if it returns true.  Clean up on_change(), and add a helper
to kick an obc when whatever condition leading to it being blocked is no
longer true.

For now, is_blocked() is always false...

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
df7c36ac08 osd/ReplicatedPG: factor some bits into finish_copy
This is a bit cleaner and hopefully more usable for osd-driven cache
promotion.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
5921703eb5 ceph_test_rados: more readable output
Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
4e29e362e7 osd/ReplicatedPG: stage object chunks to replicas during COPY_FROM
As we get each chunk of data during the COPY_FROM operation, write it out
to a temporary object on the replicas.  When we get all the pieces, move
it into place.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-17 11:06:27 -07:00
Sage Weil
25a608cdcc osd/ReplicatedPG: allow RepGathers with no version
If the repop has no version set, skip the updates to last_update and
last_update_{applied,ondisk} and last_complete_ondisk.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:56:12 -07:00
Sage Weil
155cdd29a3 osd: flag new/old temp objects in MOSDSubOp
Allow us to mark when we start and stop using a temporary object in a
sub_op.  If we start to use it, make sure the collection exists on the
replica.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:56:12 -07:00
Sage Weil
c4260fad29 osd/ReplicatedPG: drop unused issue_repop() arguments
These are cruft from the old parallelexec mode and can be removed.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:19:18 -07:00
Sage Weil
27ca5d2e06 osd/ReplicatedPG: drop dead code from parallelexec issue_repop
This is unused and references otherwise unused variables we can remove!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:19:18 -07:00
Sage Weil
c9885e7487 osd/ReplicatedPG: generate one-off unique temp object names
Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:19:18 -07:00
Sage Weil
6cecd0db56 osd/osd_types: move coll_t::META_COLL definition
Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:19:18 -07:00
Sage Weil
17c5d765d7 os/FileStore: implement collection_move_rename
This is similar to a collection_add + collection_move sequence in that we
apply the same replay guards.  The difference is that we roll it up into
a single operation, change the filename, and make the omap content carry
over by calling DBObjectMap->clone (as there is no rename function or
collection awareness in the DBObjectMap).

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:19:18 -07:00
Sage Weil
ef7cffc34f os/ObjectStore: add collection_move_rename
Add method to move an object between collections *and* change its name.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-11 15:09:14 -07:00
Sage Weil
661b377c28 os/FileStore: pass old + new object name to lfn_link
For now, the only caller leaves the name the same.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-09 16:55:21 -07:00
Loic Dachary
132e403938 autoconf: use $(LIBOSD) $(LIBCOMMON) instead of libosd.a libcommon.la
hardcoded library names in unit test dependencies break compilation

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-10 00:30:57 +02:00
athanatos
07ff0215e2 Merge pull request #518 from dachary/wip-5878
erasure plugin mechanism and abstract API

Reviewed-by: Samuel Just <sam.just@inktank.com>
2013-09-09 14:54:39 -07:00
Loic Dachary
392179478e ErasureCodePlugin: plugin registry tests and example
libec_example.la is a fully functional plugin based on
ErasureCodeExample to test the ErasureCodePlugin abstract
interface. It is dynamically loaded to test the
ErasureCodePluginRegistry implementation.

Although the plugin is built in the test directory, it will be
installed. noinst_LTLIBRARIES won't build the shared library, only the
static version which is not suitable for testing.

http://tracker.ceph.com/issues/5878 refs #5878

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-09 23:34:12 +02:00
Loic Dachary
916901f913 ErasureCodePlugin: plugin registry
A ErasureCodePluginRegistry singleton holds all erasure plugin objects
derived from ErasureCodePlugin and dlopen(2) handles for the lifetime
of the OSD and is cleaned up by the destructor.

The registry has a single entry point ( method factory ) and should
be used as follows:

  map<std::string,std::string> parameters;
  parameters["directory"] = "/usr/lib/ceph/erasure-code";
  ErasureCodeInterfaceRef erasure_code;
  ErasureCodePluginRegistry &instance = ErasureCodePluginRegistry::instance();
  instance.factory("jerasure", parameters, &erasure_code));

If the plugin requested ( "jerasure" in the example above ) is not
found in the *plugins* data member, the load method is called and will:

  * dlopen(parameters["erasure-code-directory"] + "jerasure")
  * f = dlsym("__erasure_code_init")
  * f("jerasure")
  * check that it registered "jerasure"

The plugin is expected to do something like

  instance.add(plugin_name, new ErasureCodePluginJerasure());

to register itself.

The factory method is protected with a Mutex to avoid race
conditions when using the same plugin from two threads.

The erasure_codelib_LTLIBRARIES variable is added to the Makefile
and the plugins are expected to add themselves and be installed
in the $(libdir)/erasure-code

http://tracker.ceph.com/issues/5878 refs #5878

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-09 23:32:47 +02:00
Loic Dachary
b61369c803 ErasureCodePlugin: plugin interface
When dynamically loaded, a plugin is expected to define

    int __erasure_code_init(char *plugin_name);

When called, it is responsible for registering an ErasureCodePlugin
derived object that provides a factory method from which the concrete
implementation of the ErasureCodeInterface object can be generated:

  virtual int factory(const map<std::string,std::string> &parameters,
                      ErasureCodeInterfaceRef *erasure_code) {
    *erasure_code = ErasureCodeInterfaceRef(new ErasureCodeExample(parameters));
    return 0;
  }

The plugin instance contains the library data member which is used to
store the handle of the shared library. It is opaque to the plugin.

http://tracker.ceph.com/issues/5878 refs #5878

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-09 23:28:37 +02:00
Loic Dachary
640f2f213f ErasureCode: example implementation : K=2 M=1
An erasure code implementation designed for tests. Although it is fully
functional and could be used on actual data, it is mainly provided for
testing purposes. It splits data in two, computes an XOR parity and
can sustain the loss of one chunk.

The constructor will usleep(3) for parameters["usleep"] microseconds
so that the caller can create race conditions.

http://tracker.ceph.com/issues/5878 refs #5878

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-09 23:26:42 +02:00
David Zafman
95d61c1a86 Fix usage to include export and import --type options
Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-09-09 14:10:13 -07:00
Loic Dachary
dde21bdfeb ErasureCode: abstract interface
The erasure coded pool relies on this abstract interface to encode and
decode the chunks stored in the OSD. It has been designed to be
generic enough to accomodate the libraries and algorithms that are
most likely to be used. It does not claim to be universal.

http://tracker.ceph.com/issues/5878 refs #5878

Signed-off-by: Loic Dachary <loic@dachary.org>
2013-09-09 21:23:17 +02:00
Sage Weil
583ffaf47a Merge pull request #577 from roaldvanloon/wip-automake
automake cleanup: implementing non-recursive make

Reviewed-by: Sage Weil <sage@inktank.com>
2013-09-09 09:02:59 -07:00
Sage Weil
726fe73d0d rados: fix typo
cct is a member of the ObjBencher parent that we are trying to
initialize; need to pass in cct_ here.

Broken in 5dd0a83f91.

Fixes: #6256
Signed-off-by: Sage Weil <sage@inktank.com>
2013-09-08 21:40:33 -07:00
Roald J. van Loon
6949d221ad automake cleanup: implementing non-recursive make
- Enabling subdir objects
- Created a Makefile-env.am with basic automake init
- Created .am files per subdir, included from src/Makefile.am

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-08 00:11:09 +02:00
Sage Weil
b2caa799d6 Merge remote-tracking branch 'gh/next' 2013-09-07 14:11:00 -07:00
Sage Weil
bc552647d4 Merge pull request #576 from ceph/wip-6078-2
rgw: fix get cors, delete cors

Reviewed-by: Sage Weil <sage@inktank.com>
2013-09-07 14:09:23 -07:00
Roald J. van Loon
99c6be2fcf automake cleanup: renaming buildtest
The previous name could cause confusion (it's not only used to test
build libcommon anymore).

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:44:12 +02:00
Roald J. van Loon
417f1f2a9d automake cleanup: moving rados tool to tools subdir
Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
3daf6ea96d automake cleanup: moving misc tools to tools subdir
Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
0b267b3038 automake cleanup: moving tests to test subdir
This are tests and should be in the src/test subdir.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
bf6341fd88 automake cleanup: moving code away from include to common
Includes should only be includes. Moving C code to libcommon.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
bb312b689a automake cleanup: moving hobject from os to common
This file is used by libcommon, so lets put it in src/common instead of
src/os.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
09b42c033f automake cleanup: renamed inttypes.h
- In "includes", inttypes.h was cluttering the system's one. This caused
  random build errors on some systems/in some conditions. Renaming it.
- Add emergency defs of PRI*64 headers when int_types.h does not define
  them (which, unfortunately, can happen on some systems).

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 22:41:10 +02:00
Roald J. van Loon
c17d134937 fix: build tests need libcrc.la
Build tests are failing because the need crc32c.cc's functions, so we
need to link them against libcrc.la

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 19:25:17 +02:00
Sage Weil
13e42447f4 Merge pull request #570 from roaldvanloon/wip-cleanup-globals
cleanup: state_name in NamedState init list

Reviewed-by: Sage Weil <sage@inktank.com>
2013-09-07 08:45:02 -07:00
Danny Al-Gaaf
b5137baf65 init-radosgw*: fix status return value if radosgw isn't running
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2013-09-07 11:30:15 +02:00
Roald J. van Loon
55a527142c cleanup: state_name in NamedState init list
Added const char *state_name to the NamedState initializer list. This
correctly enforces a state name.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-07 11:15:00 +02:00
Yehuda Sadeh
13872785ae rgw: fix get cors, delete cors
Remove a couple of variables that overrode class member. Not
really clear how it was working before, might have been a bad
merge / rebase.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2013-09-06 22:33:38 -07:00
Sage Weil
712b7a957d Merge pull request #568 from roaldvanloon/wip-cleanup-globals
Wip cleanup globals

Reviewed-by: Sage Weil <sage@inktank.com>
2013-09-06 14:02:03 -07:00
Sage Weil
be889fa956 Merge remote-tracking branch 'gh/next' 2013-09-06 09:50:41 -07:00
Sage Weil
39c89dcfed common/crc32c_intel_fast: avoid reading partial trailing word
The optimized intel code reads in word-sized chunks, knowing that the
allocator will only hand out memory in word-sized increments.   This makes
valgrind unhappy.  Whitelisting doesn't work because for some reason there
is no caller context (probably because of some interaction with yasm?).

Instead, just use the baseline code for the last few bytes.  This should
not be significant.

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-09-06 09:31:59 -07:00
Roald J. van Loon
c3529ed004 cleanup: passing context to NamedState for ceph_clock
This makes the constructor call on the subclasses explicit, and passes
the cct to the NamedState constructor. This cct is used by ceph_clock
to set enter_time.

Removes the last reference to g_ceph_context from libosd.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-06 10:36:35 +02:00
Roald J. van Loon
159693de00 cleanup: globals from libosd
There are some new globals introduced in libosd lately. This patch
cleans them.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-06 10:11:08 +02:00
Roald J. van Loon
8d0a1dbc79 cleanup: reduced number of pointer indirection in osd
Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-05 21:02:47 +02:00
Roald van Loon
5dd0a83f91 cleanup: removing globals from common/obj_bencher
This file is in common/ but cant be included in libcommon.la because of
this reference. Removing it, making the binary calling it to pass the
correct cephcontext (rados, rest-bench).

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-05 21:02:47 +02:00
Roald van Loon
cf93c83858 cleanup: removing refs to globals from libosd
Removed almost all references to globals, apart from two;

- In OSD.cc: reference to g_lockdep
- In PG.h: NamedState uses ceph_clock_now(g_ceph_context)

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-05 21:02:47 +02:00
Roald van Loon
9c4e62660a cleanup: removed last references to globals from client
- There were some refs in SyntheticClient to g_(ceph_context|conf), I
  replaced them with client->cct and client->cct->_conf.
- There were some refs in fuse_ll to g_conf, also replaced them with
  client->cct or cfuse->client->cct where applicable.

This makes everything in src/client completely independent from globals.

Signed-off-by: Roald J. van Loon <roaldvanloon@gmail.com>
2013-09-05 21:01:31 +02:00