Commit Graph

3056 Commits

Author SHA1 Message Date
Thomas Schoebel-Theuer 4797219f7e main: smarter activation of new oneshot peers 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer bf4343e1bb main: testing of new ssh-free methods 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer a12717518b main: determine marsadm versions globally in the cluster 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer 54cb4605d0 all: bump versions 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer 3aa037d976 marsadm: push new replay links to primary 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer 69199818c8 marsadm: include probe_dir in transitive closure 2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer 1c0e4cf9a9 marsadm: new ssh-less split-cluster method 2020-11-10 16:03:50 +01:00
Thomas Schoebel-Theuer 25cec3526e marsadm: new ssh-less merge-cluster method 2020-11-10 16:02:46 +01:00
Thomas Schoebel-Theuer 9cb9e81310 marsadm: new _push_link_foreign onto foreign IP 2020-11-10 16:02:05 +01:00
Thomas Schoebel-Theuer 9a72f86c60 marsadm: new option --no-ssh 2020-11-07 08:56:09 +01:00
Thomas Schoebel-Theuer 7511ebadcf marsadm: local peer and resource cache 2020-11-07 08:56:09 +01:00
Thomas Schoebel-Theuer 29b22a779f marsadm: check peer activations 2020-11-07 08:56:07 +01:00
Thomas Schoebel-Theuer ab6990593d marsadm: better _get_ip 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer 8b507da75e main: simplify new resolv method 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer 21afb79c32 infra: new push_link with foreign IP 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer c26fb3bef2 infra: invalidate user caches 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer 166fefb676 all: allow rebased fetch 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer 0d79408487 main: try to retrieve unknown peers 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer ad0051ba51 main: better report unknown peers 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer 77748d31ff main: allow fetch of initial uuid 2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer bd22838db3 main: no_fetch when pushing to foreign peers 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 06a9237723 main: push_info to foreign peers 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer d8e641a40e infra: new push_check operation 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer a4a2034be5 main: fetch exhausting metadata on full trigger 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer cc6fd0755b main: prepare smarter activation of new oneshot peers 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 8c0fc33630 main: replace old do_additional with oneshot 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer eb938b488d main: remove old do_communicate 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 6d7c752f2c main: remove old doing_additional 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer ac7dd94c80 main: remove old additional_peers() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 1a96616efe main: new oneshot parameter at activate_peer() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 7a0c66393f main: split activate_peer() 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer ca07239410 main: automatic deallocate of new floating peers 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 2cb1d425c6 main: new oneshot peer threads 2020-11-07 08:34:56 +01:00
Thomas Schoebel-Theuer 0dcd357b6e all: release mars0.1astable113 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer e84d98e4b1 doc: checkout new mars-user-manual.pdf 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 1ebea116b9 doc: explain new "up" semantics after successful "primary" 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer a0df67c51d doc: document the new primitives 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 18319eed23 marsadm: fix parsing of backslash-terminated lines
Suggested-by: dhrmn <notifications@github.com>
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 70a4aae762 marsadm: primitives {is,todo,nr}-secondary 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 7427478957 marsadm: primitives wait-{is,todo}-{primary,secondary}-{on,off} 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer e03e89ab17 main: make my_uuid global 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 68596789f5 main: better error message 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 461ac8b4cd marsadm: new switch semantics on marsadm primary
Apparently, sysadmins often forget to execute "marsadm up mydata"
(or similar) after a failover.

Recall the failover command sequence:
"marsadm pause-fetch mydata; marsadm primary --force mydata"

Some months later, other sysadmins in the group are stumbling over
the very old "pause-fetch" after a regular planned handover via
"marsadm primary mydata". It works, but the former primary
(which is now secondary) does no longer fetch data, because of the
very old pause-fetch command which was never reverted.

Afterwards, /mars is filling up slowly over a long time.

Somewhen later (e.g. a few days), a monitoring alert "/mars too full"
is happening at midnight, leading to an unnecessary on-duty call.

A different type of monitoring could help, by not only
tracking the filling level of /mars, but also view-todo-fetch or
similar. However, some people dislike this, because there
exist operational use cases (like creation of backups) where pause-fetch
is executed _deliberately_ for a longer time.

Here is a workaround for a forgotten resume-fetch / up after
the first failover:

After the  _original_ "marsadm primary" or "primary --force" has
succeeded by appearance of /dev/mars/mydata, we simply execute
the equivalent of "marsadm up mydata".

This changes the semantics of the "primary" command. Hopefully
no scripts on this world will break.
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 08c41805ec marsadm: purge any left-over probe dirs 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 838b85c508 marsadm: global purge at cron 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 93ef671cf3 marsadm: global purge at link-purge-all 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer 4d05bb3796 marsadm: split up link_purge_global 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer bd5412d4f5 marsadm: fix version detection for gone members 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer a11f8666a3 main: allow silent reporting 2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer fb186be1f1 main: silence annoying error msg 2020-11-07 08:25:47 +01:00