Thomas Schoebel-Theuer
3aa037d976
marsadm: push new replay links to primary
2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer
69199818c8
marsadm: include probe_dir in transitive closure
2020-11-10 16:04:01 +01:00
Thomas Schoebel-Theuer
1c0e4cf9a9
marsadm: new ssh-less split-cluster method
2020-11-10 16:03:50 +01:00
Thomas Schoebel-Theuer
25cec3526e
marsadm: new ssh-less merge-cluster method
2020-11-10 16:02:46 +01:00
Thomas Schoebel-Theuer
9cb9e81310
marsadm: new _push_link_foreign onto foreign IP
2020-11-10 16:02:05 +01:00
Thomas Schoebel-Theuer
9a72f86c60
marsadm: new option --no-ssh
2020-11-07 08:56:09 +01:00
Thomas Schoebel-Theuer
7511ebadcf
marsadm: local peer and resource cache
2020-11-07 08:56:09 +01:00
Thomas Schoebel-Theuer
29b22a779f
marsadm: check peer activations
2020-11-07 08:56:07 +01:00
Thomas Schoebel-Theuer
ab6990593d
marsadm: better _get_ip
2020-11-07 08:34:57 +01:00
Thomas Schoebel-Theuer
18319eed23
marsadm: fix parsing of backslash-terminated lines
...
Suggested-by: dhrmn <notifications@github.com>
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
70a4aae762
marsadm: primitives {is,todo,nr}-secondary
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
7427478957
marsadm: primitives wait-{is,todo}-{primary,secondary}-{on,off}
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
461ac8b4cd
marsadm: new switch semantics on marsadm primary
...
Apparently, sysadmins often forget to execute "marsadm up mydata"
(or similar) after a failover.
Recall the failover command sequence:
"marsadm pause-fetch mydata; marsadm primary --force mydata"
Some months later, other sysadmins in the group are stumbling over
the very old "pause-fetch" after a regular planned handover via
"marsadm primary mydata". It works, but the former primary
(which is now secondary) does no longer fetch data, because of the
very old pause-fetch command which was never reverted.
Afterwards, /mars is filling up slowly over a long time.
Somewhen later (e.g. a few days), a monitoring alert "/mars too full"
is happening at midnight, leading to an unnecessary on-duty call.
A different type of monitoring could help, by not only
tracking the filling level of /mars, but also view-todo-fetch or
similar. However, some people dislike this, because there
exist operational use cases (like creation of backups) where pause-fetch
is executed _deliberately_ for a longer time.
Here is a workaround for a forgotten resume-fetch / up after
the first failover:
After the _original_ "marsadm primary" or "primary --force" has
succeeded by appearance of /dev/mars/mydata, we simply execute
the equivalent of "marsadm up mydata".
This changes the semantics of the "primary" command. Hopefully
no scripts on this world will break.
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
08c41805ec
marsadm: purge any left-over probe dirs
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
838b85c508
marsadm: global purge at cron
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
93ef671cf3
marsadm: global purge at link-purge-all
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
4d05bb3796
marsadm: split up link_purge_global
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
bd5412d4f5
marsadm: fix version detection for gone members
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
5b1ca6773a
marsadm: safeguard missing old deletions
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
533b13b3df
marsadm: fix initial join-resource on slow metadata communication
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
c3585565be
marsadm: fix join-cluster on unknown peer
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
1dd31c1285
marsadm: only ask myself upon self wait-cluster
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
90947c1b14
marsadm: fix wait-cluster race abort
2020-11-07 08:25:47 +01:00
Thomas Schoebel-Theuer
72cbf7b8be
marsadm: skip unnecessary wait-cluster restart
2020-11-07 08:01:07 +01:00
Thomas Schoebel-Theuer
b2cd7ddf23
marsadm: clear any local caches
2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
d3acf3f9c8
marsadm: fix join-cluster missing dirs
2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
c9b7fcf7f9
marsadm: safeguard join-resource endless loop
2020-10-28 06:09:11 +01:00
Thomas Schoebel-Theuer
e3ebc5762b
marsadm: view disk-error
2020-09-30 14:24:27 +02:00
Thomas Schoebel-Theuer
26b40474cb
marsadm: re-activate any forgotten fetch on handover
2020-09-21 14:40:48 +02:00
Thomas Schoebel-Theuer
ed95e24496
marsadm: allow leave-resource --force on empty resource
2020-09-19 17:42:34 +02:00
Thomas Schoebel-Theuer
ae2668b265
marsadm: hint admins on --ignore-sync
2020-09-18 17:45:57 +02:00
Thomas Schoebel-Theuer
23748272ca
marsadm: remove stray nonsense
2020-09-18 17:45:57 +02:00
Thomas Schoebel-Theuer
87064c1c5a
marsadm: fix primitive disk-present
2020-09-10 11:21:38 +02:00
Thomas Schoebel-Theuer
11792c250e
marsadm: remove annoying doubled error code
2020-09-05 23:08:30 +02:00
Thomas Schoebel-Theuer
60baf9c378
marsadm: fix old deletions max_nr detection
2020-09-05 23:06:38 +02:00
Thomas Schoebel-Theuer
24bb735d5a
marsadm: report summary on non-reachable non-member hosts
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
2dbc0769d0
marsadm: old deletion method must ignore non-members
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
3a727a04b7
marsadm: use ssh-free push at lowlevel-delete-host
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
1e30e0c945
marsadm: use ssh-free push at lowlevel-set-host-ip
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
f9044fc9bf
marsadm: workaround versionlink appearance race with log-rotate
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
ac689b8640
marsadm: workaround race with primary logrotate
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
80f18138d3
marsadm: now simplify get_alive_links()
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
bcc1a63318
marsadm: new concept guest members
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
2180337e85
marsadm: avoid old rsync method at join-resource
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
aecccd547c
marsadm: unify naming of versionlink
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
c7983a6fb6
marsadm: purge stray and/or transient guest links
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
6d2091eb8e
marsadm: add --keep-backups for alivelink purge
2020-09-03 16:29:55 +02:00
Thomas Schoebel-Theuer
8cddbc1851
marsadm: do not delete versionlinks during ongoing join-resource
2020-09-01 19:35:10 +02:00
Thomas Schoebel-Theuer
6750a4fc63
marsadm: join-resource needs preliminary guest-like activation
2020-09-01 19:35:10 +02:00
Thomas Schoebel-Theuer
019b991cda
marsadm: earlier device check at {create,join}-resource
2020-09-01 19:35:10 +02:00