This might happen if, e.g., the file_layout specifies an osd that later
is removed from the cluster entirely. Just ignore it instead of making
upper layers duplicate this check.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
This reverts commit 6fbab6da69.
This fails a unit test.
And I change my mind.. I think this is most cleanly handled inside crush, so
we don't duplicate the same check that is generating the error with an different
data structure.
It was not an index, and seems to contain recommendations
for system configuration. I have renamed it to confusing.txt
and will merge it in a future commit.
Signed-off-by: Mark Kampe <mark.kampe@dreamhost.com>
which was a copy of PlanningImplementation.txt
(and not html at all).
restored previous index.rst, which was overwritten with a copy
of PlanninImplementation.txt, but removed all of the recursively
included content from the document.
I will cherry-pick merge the new contents in a subsequent commit.
Signed-off-by: Mark Kampe <mark.kampe@dreamhost.com>
it was accidentally overwritten with a version of the product
had a somewhat different audience/focus and a few sphinx
formatting errors.
I will cherry-pick the corrections in a subsequent commit.
Signed-off-by: Mark Kampe <mark.kampe@dreamhost.com>
We haven't made explicit that the Hadoop Java code is under the Apache
License. Do so (with permission from the other contributors, thanks!).
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
We need to hold mylock before waiting on the cond or else we get
./common/Cond.h: In function 'int Cond::Wait(Mutex&)', in thread '7f37fe0c8700'
./common/Cond.h: 46: FAILED assert(mutex.is_locked())
ceph version 0.38-2-g73f99a1 (commit:73f99a189f491866da2be88adcfe0bd512282755)
1: (MDLog::_replay_thread()+0x2483) [0x6c4393]
2: (MDLog::ReplayThread::entry()+0xd) [0x4decbd]
3: (()+0x6d8c) [0x7f3803e8fd8c]
4: (clone()+0x6d) [0x7f38028d504d]
ceph version 0.38-2-g73f99a1 (commit:73f99a189f491866da2be88adcfe0bd512282755)
1: (MDLog::_replay_thread()+0x2483) [0x6c4393]
2: (MDLog::ReplayThread::entry()+0xd) [0x4decbd]
3: (()+0x6d8c) [0x7f3803e8fd8c]
4: (clone()+0x6d) [0x7f38028d504d]
*** Caught signal (Aborted) **
in thread 7f37fe0c8700
Signed-off-by: Sage Weil <sage@newdream.net>
This eliminates some flags and avoids annoying cases where the banner is
printed but we don't want to see it.
Signed-off-by: Sage Weil <sage@newdream.net>
Looks like this was missed when flocklock was added. Did a quick grep and
it doesn't look like it is missing anywhere else.
Signed-off-by: Sage Weil <sage@newdream.net>
Otherwise we don't actually journal this value, and we get confused when
we replay a start_truncate and try to restart it.
Fixes: #1756
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
These flags are probably relics from when the function got split;
they belong in send_op now.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
If an address isn't explicitly specified during mkfs, look for an unnamed
monitor in the (generated) monmap and see if any of those addresses is
configured on the local machine. If so, assume it's us, and name ourselves
in the seed monmap.
Signed-off-by: Sage Weil <sage@newdream.net>
Previously, the overlaps for snapdirs would not be included in
cstat causing the computed total to be incorrect.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Previously, we accounted for clone space usage inconsistently in
write_update_size_and_usage etc when walking through the operations.
make_writeable may change the most recent clone overlap, however, so we
can't handle it until then.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Shut down MonClient before messenger, to avoid race with MonClient::tick()
and MonClient::shutdown().
Fixes
#0 __lll_lock_wait () at ../nptl/sysdeps/unix/sysv/linux/x86_64/lowlevellock.S:136
#1 0x00007f44475e2849 in _L_lock_953 () from /lib/libpthread.so.0
#2 0x00007f44475e266b in __pthread_mutex_lock (mutex=0x14d8dc8) at pthread_mutex_lock.c:61
#3 0x00000000005ae090 in Mutex::Lock (this=0x14d8db8, no_lockdep=false) at ./common/Mutex.h:108
#4 0x000000000068440e in MonClient::shutdown (this=0x14d8c30) at mon/MonClient.cc:386
#5 0x00000000005b2653 in ceph_tool_common_shutdown (ctx=0x14d84c0) at tools/common.cc:661
#6 0x00000000005ada29 in main (argc=7, argv=0x7fff8a2394c8) at tools/ceph.cc:304
vs
#0 0x00007f44475e8a0b in raise (sig=<value optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:42
#1 0x00000000005eff6b in reraise_fatal (signum=11) at global/signal_handler.cc:59
#2 0x00000000005f0165 in handle_fatal_signal (signum=11) at global/signal_handler.cc:106
#3 <signal handler called>
#4 0x0000000000000000 in ?? ()
#5 0x000000000068661a in MonClient::tick (this=0x14d8c30) at mon/MonClient.cc:621
#6 0x0000000000689e3b in MonClient::C_Tick::finish(int) ()
#7 0x000000000061b3c5 in SafeTimer::timer_thread (this=0x14d8df8) at common/Timer.cc:102
#8 0x000000000061c6f0 in SafeTimerThread::entry() ()
#9 0x00000000005f1219 in Thread::_entry_func (arg=0x14e1a00) at common/Thread.cc:41
#10 0x00007f44475e0971 in start_thread (arg=<value optimized out>) at pthread_create.c:304
#11 0x00007f4445ead92d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:112
#12 0x0000000000000000 in ?? ()
Signed-off-by: Sage Weil <sage@newdream.net>
Different sockaddr_* have the actual address (sin_addr, sin6_addr)
at different offsets, and sockaddr->sa_data just isn't enough.
inet_ntop conspires by taking a void*. I could figure out the right
offset with a switch (found->sa_family), but let's go for the
supposedly write-once-run-with-any-AF solution, getnameinfo.
Which, naturally, takes an extra length argument that is AF-specific,
and not provided anywhere nicely by getifaddrs. Huzzah!
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>