Ceph is a distributed object, block, and file storage platform
Go to file
Matt Benjamin 610d66f531 Ceph Accelio/RDMA Transport (XioMessenger).
XioMessenger implements a Ceph Messenger provider for Accelio,
a high-performance messaging transport by Mellanox.  Current
Accelio is layered on ibverbs, and supports Infiniband, ROCE,
and other RDMA transports.  Future Accelio verions will support
alternative transports (including TCP), and flexible transport
selection.

config: cluster_rdma drives messenger creation
ceph_mds ceph_mon and ceph_osd use XioMessengers for cluster
	communication when cluster_rdma is set
Move XioMessenger to msg/xio.
	This matches the other new Messenger locations.
test: tests for tcp and xio messengers
	(Not tests only.)
buffer: add subclass for xio buffers
xio: convert to Connection::send_message interface
config: -x, --xio as aliases for client_rdma
ceph-fuse: create xio messenger if client_rdma
Find XioMessenger.h and QueueStrategy.h in msg/xio.
ceph-syn: create xio messenger if client_rdma
librados: create xio messenger if client_rdma
Find XioMessenger.h and QueueStrategy.h in msg/xio.
Restore non-abort from Xio Mon integration.
Fix xio_client send count, again.
xio: must signal cond under mutex lock
xio: dispatch strategies support ms_fast_dispatch
xio: config variable xio_port_shift
remove set_port_shift() from XioMessenger, and just use the value
	from the configuration
xio: don't depend on g_ceph_context for dout
XioMessenger now uses its own cct for all logging operations
	the accelio log function, however, still depends on a global
	CephContext. so we maintain an extra one, separate from g_ceph_context,
	in XioMessenger.cc that is initialized on first construction and a
	reference is held indefinitely
script: cephfsnew to automate pool and fs creation
Use new on_ow_msg_send_complete hook.
	Replaces on_msg_delivered for one-way message style.
Prototype new xio_discon behavior.
	On shutdown, XioPortal threads should not exit before Accelio
	finalizes all sessions.
Inline join_sessions, it needs sh_mtx held across wait loop.
Fix assert on Cond::Signal.  Adds Cond2.
Avoid deadlock, xio_disconnect can deliver a session teardown event.
Also Mutex2.
	(Note, Mutex2 and Cond2 are replaced by standard C++ downstream.)
Restore SimpleDispatcher Timings.
	The simple_client/simple_server timings are based on a ping/pong
	of messages between the client and server, unlike those of the
	xio_client/server programs, which are one-way (so their corresponding
	1-way bandwidth is appx. 2x what the test reports).  We assert
	that the results are in general comparable, because in both setups,
	a fixed number of messages (def. 50) is maintained in flight.
Wrap Accelio mempool in XioPool, add stats.
	To enable stat prints, set xio_trace_mempool.  Currently, prints
	to stdout at each 64K messages sent or received.
Restore _send_message(..).
Fix merge errors in simple_client, simple_dispatcher.
xio: fix for size in pool stats
Add in/outbound msg counters to XioPoolStats.
Pool stats are easier to read.
	Pool stats are easier to read, and if enabled, print on session
	teardown. This is a convenient time to view stats, and with a small
Make pool stats counters atomic.
Track requests using hook ctor/dtor.
	Lockless, portal thread provides atomicity.
Adapt to recent changes on Accelio for_next
	* Accelio options now of opaque type
	* on_msg_err with extra direction param
	* RDMA behavior now governed by 2 new options
    		XIO_OPTNAME_MAX_INLINE_DATA
   	 	XIO_OPTNAME_MAX_INLINE_HEADER
	* Separated send and recv queue depth
xio_messenger: Change xio optname queue depth msgs
	* Set 16k threshold to rdma buffers instead of send
	* Change xio optname for queue depth msgs
  		XIO_OPTNAME_SND/RCV_QUEUE_DEPTH_MSGS
xio_messenger: Protect Accelio queue depth.
	(Minimal send flow control.)
	The guard is per xio_connection, and considering batches.
	Increment happens only if xio_send_msg succeeded, decrement in
	on_ms_ow_send_complete and on_msg_error.  Note that we don't need
	atomics because counters are touched only in the correct portal
	thread.
Find XioMsg.h in msg/xio
Find XioMessenger.h and QueueStrategy.h in msg/xio (tests).
Adapt to 2 Accelio API changes.
	1. xio_context_stop loop takes only 1 argument
	2. xio_connect() now takes a structure argument, by reference
Set CMP0046 iif CMake version >= 3
Move XioMessenger to msg/xio
xio: fix for segfault on xio_connect()
No more Mutex2, Cond2.
xio: number of portal threads is configurable
xio: only create additional portals on bind()
xio: use QueueStrategy(1) as default
xio: Messenger factory accepts ms_type "xio"
xio: use ms_type instead of client,cluster_rdma
     removing the ability to configure the client and cluster networks
     separately in favor of a single global messenger type
     --xio is now a command-line alias for --ms_type xio
     all daemons now use the Messenger::create() factory function instead of
     conditionally creating XioMessengers
     the OSD and Monitor classes no longer need separate messengers to
     deal with both tcp/rdma clients
xio: portal binding honors ms_bind_port_min,max
xio: remove xio_port_shift
     port shifting is no longer necessary, because we won't create both tcp
     and xio messengers for the same service
     Use Accelio sglist helper macros.
     xio: make xio buffer unshareable
xio: Nuke special_handling.
Replace GENERIC with MON (requested by Sage).

Signed-off-by: Casey Bodley <casey@cohortfs.com>
Signed-off-by: Vu Pham <vu@mellanox.com>
Signed-off-by: Matt Benjamin <matt@cohortfs.com>
2015-01-14 16:43:30 -05:00
admin be nitpicky about missing references 2014-03-24 13:01:38 -04:00
ceph-erasure-code-corpus@f1f95a1de9 erasure-code: workunit to check for encoding regression 2014-11-10 14:56:43 +01:00
ceph-object-corpus@da43eb1195 ceph-object-corpus: drop compat with old ObjectStore::Transaction 2015-01-06 13:29:41 -08:00
cmake/modules Combined CMake Build for Hammer 2015-01-14 16:40:05 -05:00
debian Merge remote-tracking branch 'gh/next' 2015-01-14 08:57:33 -08:00
doc Merge pull request #2697 from ceph/wip-8900 2015-01-13 11:17:29 -08:00
examples rbd-replay: Convert prep-for-replay.py to rbd-replay-prep.cc 2014-08-21 10:57:33 -07:00
fusetrace remove superfluous second semicolons at end of lines 2014-05-08 10:09:25 +02:00
keys keys: renew autobuild.asc key 2013-02-07 22:31:40 -08:00
m4 autotools: ARM/AArch64 NEON detection similar to x86 2014-11-25 14:17:37 +01:00
man Combined CMake Build for Hammer 2015-01-14 16:40:05 -05:00
qa Merge pull request #3364 from ceph/wip-quota-test 2015-01-13 07:08:30 -08:00
rpm Move fedora patch file (used by ceph.spec.in) to rpm/ subdir 2014-08-27 12:56:43 -07:00
share ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
src Ceph Accelio/RDMA Transport (XioMessenger). 2015-01-14 16:43:30 -05:00
systemd better systemd support 2014-12-01 14:45:31 +01:00
udev udev: /dev/disk/by-parttypeuuid/$type-$uuid 2013-06-17 09:49:53 -07:00
.gitignore examples: Add example for the librados C API 2014-08-21 10:57:28 -07:00
.gitmodule_mirrors build: add ceph-erasure-code-corpus to gitmodule_mirrors 2014-10-08 21:22:02 +02:00
.gitmodules .gitmodules: update ceph-object-corpus auth repo to github 2014-12-18 14:09:21 -08:00
.mailmap mailmap: Chris Holcombe affiliation 2014-12-27 01:11:31 +01:00
.organizationmap mailmap: Chris Holcombe affiliation 2014-12-27 01:11:31 +01:00
.peoplemap mailmap: Alfredo Deza affiliation : Fix Typo. 2014-12-27 01:11:31 +01:00
AUTHORS AUTHORS 2014-07-17 21:33:22 -07:00
autogen.sh autogen.sh: git submodule sync 2014-11-24 12:35:50 +01:00
ceph.spec.in packages: add python-virtualenv and xmlstarlet 2014-12-22 00:22:44 +01:00
ChangeLog
CMakeLists.txt Ceph Accelio/RDMA Transport (XioMessenger). 2015-01-14 16:43:30 -05:00
CodingStyle Clarify CodingStyle with respect to tab compression of space runs 2012-08-31 15:47:12 -07:00
configure.ac Merge remote-tracking branch 'gh/next' 2015-01-14 08:57:33 -08:00
CONTRIBUTING.rst doc: Add a contributing file at the root of repo 2014-09-06 00:23:24 +05:30
COPYING erasure-code: HTML display of benchmark results 2014-08-04 11:42:05 +02:00
COPYING-LGPL2.1 COPYING: note licenses for all files, not just the default 2012-01-12 10:03:27 -08:00
do_autogen.sh jemalloc-support: Added --with-jemalloc configure option 2014-12-15 11:22:29 +05:30
doc_deps.deb.txt doc: The dot package is included in graphviz package 2014-10-20 10:47:51 -07:00
Doxyfile
INSTALL Update INSTALL to mention the submodules/recursive 2014-05-31 11:22:41 +02:00
install-deps.sh install-deps.sh: do not require sudo when root 2014-12-21 17:52:18 +01:00
make-debs.sh debian: create a repository from sources 2015-01-02 12:14:39 +01:00
Makefile.am documentation: replace deps.{deb,rpm}.txt with scripts 2014-12-01 16:15:21 +01:00
NEWS
PendingReleaseNotes PendingReleaseNotes: make a note about librados flag changes 2015-01-13 12:23:37 -08:00
pom.xml update license for libcephfs 2014-08-26 15:52:37 -04:00
README README: symlink from README.md 2014-08-06 08:15:35 -07:00
README.cmake Combined CMake Build for Hammer 2015-01-14 16:40:05 -05:00
README.md documentation: replace deps.{deb,rpm}.txt with scripts 2014-12-01 16:15:21 +01:00
run-make-check.sh tests: run-make-check.sh arguments are passed to configure 2014-12-21 17:53:00 +01:00
SubmittingPatches documentation: simplify running make check 2014-12-13 15:01:40 +01:00

============================================ Ceph - a scalable distributed storage system

Please see http://ceph.com/ for current info.

Contributing Code

Most of Ceph is licensed under the LGPL version 2.1. Some miscellaneous code is under BSD-style license or is public domain. The documentation is licensed under Creative Commons Attribution-ShareAlike (CC BY-SA). There are a handful of headers included here that are licensed under the GPL. Please see the file COPYING for a full inventory of licenses by file.

Code contributions must include a valid "Signed-off-by" acknowledging the license for the modified or contributed file. Please see the file SubmittingPatches for details on what that means and on how to generate and submit patches.

We do not require assignment of copyright to contribute code; code is contributed under the terms of the applicable license.

Build Prerequisites

debian-based

The list of debian packages dependencies can be installed with:

./install-deps.sh

Note: libsnappy-dev and libleveldb-dev are not available upstream for natty, oneiric, and squeeze. Backports for Ceph can be found at ceph.com/debian-leveldb.

rpm-based

The list of RPM packages dependencies can be installed with:

     ./install-deps.sh

Building Ceph

Developers, please refer to the Developer Guide for more information, otherwise, you can build the server daemons, and FUSE client, by executing the following:

./autogen.sh
./configure
make

(Note that the FUSE client will only be built if libfuse is present.)

Dependencies

The configure script will complain about any missing dependencies as it goes. You can also refer to debian/control or ceph.spec.in for the package build dependencies on those platforms. In many cases, dependencies can be avoided with --with-foo or --without-bar switches. For example,

./configure --with-nss         # use libnss instead of libcrypto++
./configure --without-radosgw  # do not build radosgw
./configure --without-tcmalloc # avoid google-perftools dependency

Building packages

You can build packages for Debian or Debian-derived (e.g., Ubuntu) systems with

sudo apt-get install dpkg-dev
dpkg-checkbuilddeps        # make sure we have all dependencies
dpkg-buildpackage

For RPM-based systems (Red Hat, SUSE, etc.),

rpmbuild

Building the Documentation

Prerequisites

The list of package dependencies for building the documentation can be found in doc_deps.deb.txt:

sudo apt-get install `cat doc_deps.deb.txt`

Building the Documentation

To build the documentation, ensure that you are in the top-level `/ceph directory, and execute the build script. For example:

admin/build-doc