
keith at midnighthax
Nov 10, 2004, 9:18 AM
Post #3 of 4
(710 views)
Permalink
|
On Wed, 10 Nov 2004 08:49:46 -0700 Alan Robertson <alanr [at] unix> wrote: > If you pull all communications links out of bravo, then you've created a > split-brain. Putting them back should cause both systems to restart > heartbeat and reacquire resources. That makes sense, thanks. > First, having your ha.cf configuration, the version of heartbeat > (probably an older one) you're using and being more precise about what > cables you pulled and which ones you left in place (if any) would be > good. Setup: alpha and bravo are directors, dual NICs, "public" subnet 10.0.0.0/24. Real servers are charlie and delta, subnet 192.168.6.0/24. ha.cf, decommented: =========================== debugfile /var/log/ha-debug logfile /var/log/ha-log logfacility local0 keepalive 2 deadtime 30 warntime 10 initdead 120 nice_failback on udpport 694 bcast eth0 eth1 mcast eth0 225.0.0.1 694 1 0 ucast eth0 192.168.6.2 node alpha node bravo respawn hacluster /usr/lib/heartbeat/ipfail ping 192.168.6.4 192.168.6.5 =========================== Heartbeat: 1.0.4-1.woody. Cables pulled: both NICs on alpha while alpha is master. Cluster fails over, successfully, but fails back when cables replaced. > Upgrading to 1.2.x would also be good. 1.2.3 is the current version. We > like it. I was using the Debian package, but I can compile a later heartbeat by hand if that will help. Thanks for your help, it's appreciated. Keith _______________________________________________ Linux-HA mailing list Linux-HA [at] lists http://lists.linux-ha.org/mailman/listinfo/linux-ha
|