Ceph is a distributed object, block, and file storage platform
Go to file
Alexandre Oliva 617dc36d47 enable mds rejoin with active inodes' old parent xattrs
When the parent xattrs of active inodes that the mds attempts to open
during rejoin lack pool info (struct_v < 5), this field will be filled
in with -1, causing the mds to retry fetching a backtrace with a pool
number that matches the expected value, which fails and causes the
err==-ENOENT branch to be taken and retry pool 1, which succeeds, but
with pool -1, and so keeps on bouncing between the two retry cases
forever.

This patch arranges for the mds to go along with pool -1 instead of
insisting that it be refetched, enabling it to complete recovery
instead of eating cpu, network bandwidth and metadata osd's resources
like there's no tomorrow, in what AFAICT is an infinite and very busy
loop.

This is not a new problem: I've had it even before upgrading from
Cuttlefish to Dumpling, I'd just never managed to track it down, and
force-unmounting the filesystem and then restarting the mds was an
easier (if inconvenient) work-around, particularly because it always
hit when the filesystem was under active, heavy-ish use (or there
wouldn't be much reason for caps recovery ;-)

There are two issues not addressed in this patch, however.  One is
that nothing seems to proactively update the parent xattr when it is
found to be outdated, so it remains out of date forever.  Not even
renaming top-level directories causes the xattrs to be recursively
rewritten.  AFAICT that's a bug.

The other is that inodes that don't have a parent xattr (created by
even older versions of ceph) are reported as non-existing in the mds
rejoin message, because the absence of the parent xattr is signaled as
a missing inode (?failed to reconnect caps for missing inodes?).  I
suppose this may cause more serious recovery problems.

I suppose a global pass over the filesystem tree updating parent
xattrs that are out-of-date would be desirable, if we find any parent
xattrs still lacking current information; it might make sense to
activate it as a background thread from the backtrace decoding
function, when it finds a parent xattr that's too out-of-date, or as a
separate client (ceph-fsck?).

Backport: dumpling, cuttlefish
Signed-off-by: Alexandre Oliva <oliva@gnu.org>
Reviewed-by: Zheng, Yan <zheng.z.yan@intel.com>
2013-08-22 08:13:29 -07:00
admin
ceph-object-corpus@84a153afa7 ceph-object-corpus: added cuttlefish objects 2013-07-12 13:33:55 -07:00
debian ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
doc Merge remote-tracking branch 'gh/next' 2013-08-19 12:41:54 -07:00
fusetrace fusetrace_ll.cc: handle return value of fuse_session_loop() 2013-06-03 15:22:58 +02:00
keys keys: renew autobuild.asc key 2013-02-07 22:31:40 -08:00
m4 ac_prog_javah.m4: Use AC_CANONICAL_TARGET instead of AC_CANONICAL_SYSTEM. 2013-01-14 14:11:54 -08:00
man ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
qa cls/hello: hello, world rados class 2013-08-15 17:21:29 -07:00
share ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
src enable mds rejoin with active inodes' old parent xattrs 2013-08-22 08:13:29 -07:00
udev udev: /dev/disk/by-parttypeuuid/$type-$uuid 2013-06-17 09:49:53 -07:00
wireshark Adding new Wireshark dissector. This is loosely based on the original 2013-07-04 17:00:55 +01:00
.gitignore .gitignore: ignore test-driver 2013-08-20 16:54:20 -07:00
.gitmodules remove leveldb from master branch 2013-02-27 14:22:48 +01:00
AUTHORS Relax Throttle::_reset_max conditions and associated unit tests 2013-02-05 20:06:04 +01:00
autogen.sh Build: Change build to always use system leveldb 2013-02-26 20:07:49 -08:00
ceph.spec.in ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
ChangeLog
CodingStyle
configure.ac store: Add (experimental) ZFS parallel journal support 2013-08-15 09:48:22 -07:00
COPYING rbd.cc: relicense as LGPL2 2013-08-13 17:16:08 -07:00
COPYING-LGPL2.1
do_autogen.sh
Doxyfile
INSTALL
Makefile.am ceph-post-file: single command to upload a file to cephdrop 2013-08-16 17:59:11 -07:00
NEWS
PendingReleaseNotes librados: synchronous commands should return on commit instead of ack 2013-08-19 10:29:49 -07:00
README Add 'ceph-rest-api' 2013-07-10 20:58:51 -07:00
SubmittingPatches

============================================
Ceph - a scalable distributed storage system
============================================

Please see http://ceph.com/ for current info.

Contributing Code
=================

Most of Ceph is licensed under the LGPL version 2.1.  Some
miscellaneous code is under BSD-style license or is public domain.
The documentation is licensed under Creative Commons
Attribution-ShareAlike (CC BY-SA).  There are a handful of headers
included here that are licensed under the GPL.  Please see the file
COPYING for a full inventory of licenses by file.

Code contributions must include a valid "Signed-off-by" acknowledging
the license for the modified or contributed file.  Please see the file
SubmittingPatches for details on what that means and on how to
generate and submit patches.

We do not require assignment of copyright to contribute code; code is
contributed under the terms of the applicable license.


Building Ceph
=============

To prepare the source tree after it has been git cloned,

	$ git submodule update --init

To build the server daemons, and FUSE client, execute the following:

	$ ./autogen.sh
	$ ./configure
	$ make

(Note that the FUSE client will only be built if libfuse is present.)

Dependencies
------------

The configure script will complain about any missing dependencies as
it goes.  You can also refer to debian/control or ceph.spec.in for the
package build dependencies on those platforms.  In many cases,
dependencies can be avoided with --with-foo or --without-bar switches.
For example,

$ ./configure --with-nss         # use libnss instead of libcrypto++
$ ./configure --without-radosgw  # do not build radosgw and avoid libfcgi-dev
$ ./configure --without-tcmalloc # avoid google-perftools dependency


Building packages
-----------------

You can build packages for Debian or Debian-derived (e.g., Ubuntu)
systems with

$ sudo apt-get dpkg-dev
$ dpkg-checkbuilddeps        # make sure we have all dependencies
$ dpkg-buildpackage

For RPM-based systems (Redhat, Suse, etc.),

$ rpmbuild


Building the Documentation
==========================

Prerequisites
-------------
To build the documentation, you must install the following:

- python-dev
- python-pip
- python-virtualenv
- doxygen
- ditaa
- libxml2-dev
- libxslt-dev
- dot
- graphviz

For example:

	sudo apt-get install python-dev python-pip python-virtualenv doxygen ditaa libxml2-dev libxslt-dev dot graphviz

Building the Documentation
--------------------------

To build the documentation, ensure that you are in the top-level `/ceph directory, and execute the build script. For example:

	$ admin/build-doc


Build Prerequisites
-------------------
To build the source code, you must install the following:

- automake
- autoconf
- pkg-config
- gcc
- g++
- make
- libboost-dev
- libedit-dev
- libssl-dev
- libtool
- libfcgi
- libfcgi-dev
- libfuse-dev
- linux-kernel-headers
- libcrypto++-dev
- libaio-dev
- libgoogle-perftools-dev
- libkeyutils-dev
- uuid-dev
- libatomic-ops-dev
- libboost-program-options-dev
- libboost-thread-dev
- libexpat1-dev
- libleveldb-dev
- libsnappy-dev
- libcurl4-gnutls-dev
- python-argparse
- python-flask

For example:

	$ apt-get install automake autoconf pkg-config gcc g++ make libboost-dev libedit-dev libssl-dev libtool libfcgi libfcgi-dev libfuse-dev linux-kernel-headers libcrypto++-dev libaio-dev libgoogle-perftools-dev libkeyutils-dev uuid-dev libatomic-ops-dev libboost-program-options-dev libboost-thread-dev libexpat1-dev libleveldb-dev libsnappy-dev libcurl4-gnutls-dev python-argparse python-flask