Commit de9eb41
test(dm): kill dm-masters sequentially in cleanup_process
In multi-master HA tests (3-node etcd cluster), sending SIGHUP to
all masters simultaneously causes etcd to lose quorum — each master
tries to transfer leadership but no peer can accept it. The leader
transfer blocks for 120s, failing the test.
Fix: kill dm-masters one at a time (SIGHUP + 30s wait per master),
so each graceful shutdown completes while quorum is maintained.
Escalate to SIGKILL after 30s for any stuck master. Workers and
syncers are still killed in parallel (no quorum dependency).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 2caf0e5 commit de9eb41
1 file changed
+17
-5
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
21 | | - | |
22 | | - | |
23 | | - | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
24 | 36 | | |
25 | | - | |
| 37 | + | |
26 | 38 | | |
27 | 39 | | |
28 | 40 | | |
29 | | - | |
| 41 | + | |
30 | 42 | | |
31 | 43 | | |
32 | 44 | | |
| |||
0 commit comments