Commit Graph

3131 Commits

Author SHA1 Message Date
Thomas Schoebel-Theuer
8c0fc33630 main: replace old do_additional with oneshot 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
eb938b488d main: remove old do_communicate 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
6d7c752f2c main: remove old doing_additional 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
ac7dd94c80 main: remove old additional_peers() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
1a96616efe main: new oneshot parameter at activate_peer() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
7a0c66393f main: split activate_peer() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
ca07239410 main: automatic deallocate of new floating peers 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
2cb1d425c6 main: new oneshot peer threads 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer
0dcd357b6e all: release mars0.1astable113 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
e84d98e4b1 doc: checkout new mars-user-manual.pdf 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
1ebea116b9 doc: explain new "up" semantics after successful "primary" 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
a0df67c51d doc: document the new primitives 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
18319eed23 marsadm: fix parsing of backslash-terminated lines
Suggested-by: dhrmn <notifications@github.com>
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
70a4aae762 marsadm: primitives {is,todo,nr}-secondary 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
7427478957 marsadm: primitives wait-{is,todo}-{primary,secondary}-{on,off} 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
e03e89ab17 main: make my_uuid global 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
68596789f5 main: better error message 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
461ac8b4cd marsadm: new switch semantics on marsadm primary
Apparently, sysadmins often forget to execute "marsadm up mydata"
(or similar) after a failover.

Recall the failover command sequence:
"marsadm pause-fetch mydata; marsadm primary --force mydata"

Some months later, other sysadmins in the group are stumbling over
the very old "pause-fetch" after a regular planned handover via
"marsadm primary mydata". It works, but the former primary
(which is now secondary) does no longer fetch data, because of the
very old pause-fetch command which was never reverted.

Afterwards, /mars is filling up slowly over a long time.

Somewhen later (e.g. a few days), a monitoring alert "/mars too full"
is happening at midnight, leading to an unnecessary on-duty call.

A different type of monitoring could help, by not only
tracking the filling level of /mars, but also view-todo-fetch or
similar. However, some people dislike this, because there
exist operational use cases (like creation of backups) where pause-fetch
is executed _deliberately_ for a longer time.

Here is a workaround for a forgotten resume-fetch / up after
the first failover:

After the  _original_ "marsadm primary" or "primary --force" has
succeeded by appearance of /dev/mars/mydata, we simply execute
the equivalent of "marsadm up mydata".

This changes the semantics of the "primary" command. Hopefully
no scripts on this world will break.
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
08c41805ec marsadm: purge any left-over probe dirs 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
838b85c508 marsadm: global purge at cron 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
93ef671cf3 marsadm: global purge at link-purge-all 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
4d05bb3796 marsadm: split up link_purge_global 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
bd5412d4f5 marsadm: fix version detection for gone members 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
a11f8666a3 main: allow silent reporting 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
fb186be1f1 main: silence annoying error msg 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
f1a388827c main: push my IP to any preliminary peers 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
5b1ca6773a marsadm: safeguard missing old deletions 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
533b13b3df marsadm: fix initial join-resource on slow metadata communication 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
c3585565be marsadm: fix join-cluster on unknown peer 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
1dd31c1285 marsadm: only ask myself upon self wait-cluster 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
90947c1b14 marsadm: fix wait-cluster race abort 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
ddd025e5ca main: rename ambiguous lock 2020-11-07 08:25:46 +01:00
Thomas Schoebel-Theuer
8ac109ad22 server: activate peer on ip push 2020-11-07 08:25:46 +01:00
Thomas Schoebel-Theuer
9231052ce1 main: simplify activate_peer 2020-11-07 08:25:46 +01:00
Thomas Schoebel-Theuer
8fc98684bc main: allow push to non-activated peer 2020-11-07 08:25:44 +01:00
Thomas Schoebel-Theuer
38ce235e86 main: speedup peer search 2020-11-07 08:01:07 +01:00
Thomas Schoebel-Theuer
577fbac8ed main: factor out peer allocation 2020-11-07 08:01:07 +01:00
Thomas Schoebel-Theuer
5fd960b91c main: move make_peer_dir_list 2020-11-07 08:01:07 +01:00
Thomas Schoebel-Theuer
72cbf7b8be marsadm: skip unnecessary wait-cluster restart 2020-11-07 08:01:07 +01:00
Thomas Schoebel-Theuer
2cb3520e01 infra: rework net address parsing 2020-11-07 08:01:06 +01:00
Thomas Schoebel-Theuer
530a938a62 net: safeguard recv of bad string size 2020-11-07 07:58:53 +01:00
Thomas Schoebel-Theuer
26bab9b43d all: release mars0.1astable112 2020-10-28 07:02:32 +01:00
Thomas Schoebel-Theuer
b2cd7ddf23 marsadm: clear any local caches 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
d3acf3f9c8 marsadm: fix join-cluster missing dirs 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
c9b7fcf7f9 marsadm: safeguard join-resource endless loop 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
e2ef6f4b04 main: do not write alivelinks before {create,join}-cluster 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
8f510f5936 main: better error msg 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
c77d656c01 infra: remove old method 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
dd428841d0 infra: ignore more name prefixes 2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
f7f357669a main: skip annoying message 2020-10-28 06:09:11 +01:00