
Mailing List Archive: Linux-HA: Users

Failover Failure

 

 



Yount.William at menloworldwide

Aug 2, 2012, 1:43 AM


Attached is my cib.xml file.

I have a two-node DRBD cluster set up in Active/Active. For whatever reason, all my resources seem to be tied to Node2. By that I mean that although the resources show as colocated, whenever I turn Node2 off or unplug a cable from it, the cluster goes down. I wait to see whether the resources come back up on the other node (although they should already be running there, since it is an Active/Active cluster), but they never do, even after 10 minutes. With Node2 off, I can't even ping the colocated IP address. However, if I turn off Node1 while Node2 is running, nothing goes down.

I am using LCMC to get a graphical overview of the setup, and the screen seems to indicate that everything is okay. I believe the problem has to do with my fencing agent, which is Pacemaker. Even though it is set to turn a node off if there is an issue, the node never seems to shut down. It complains that devices are busy and that it can't reboot.

I am just hoping someone can take a look at my configuration and see if there is anything that stands out. If it is the fencing agent, is there a better fencing agent?
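
For anyone who wants to scan a CIB file quickly, below is a minimal sketch of how the settings that usually matter for failover could be pulled out of it: the stonith-enabled and no-quorum-policy cluster properties, any stonith-class resources, and any location constraints. The SAMPLE_CIB fragment, the resource and node names, and the summarize_cib helper are made up for illustration; they are not taken from the attached cib.xml, and the output is only a list of things to look at, not a diagnosis.

```python
import xml.etree.ElementTree as ET

# Hypothetical CIB fragment for illustration only; NOT the attached cib.xml.
SAMPLE_CIB = """
<cib>
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <nvpair id="opt-stonith" name="stonith-enabled" value="true"/>
        <nvpair id="opt-quorum" name="no-quorum-policy" value="stop"/>
      </cluster_property_set>
    </crm_config>
    <resources>
      <primitive id="ip_shared" class="ocf" provider="heartbeat" type="IPaddr2"/>
    </resources>
    <constraints>
      <rsc_location id="loc_ip" rsc="ip_shared" node="node2" score="INFINITY"/>
    </constraints>
  </configuration>
</cib>
"""

# Cluster-wide properties that are commonly checked when failover misbehaves.
PROPERTIES_OF_INTEREST = ("stonith-enabled", "no-quorum-policy")


def summarize_cib(xml_text):
    """Return failover-related settings found in a CIB XML string."""
    root = ET.fromstring(xml_text.strip())

    # Only look at nvpairs under crm_config, not resource parameters.
    properties = {}
    for crm_config in root.iter("crm_config"):
        for nvpair in crm_config.iter("nvpair"):
            if nvpair.get("name") in PROPERTIES_OF_INTEREST:
                properties[nvpair.get("name")] = nvpair.get("value")

    # Fencing devices are primitives whose class attribute is "stonith".
    stonith_resources = [
        prim.get("id")
        for prim in root.iter("primitive")
        if prim.get("class") == "stonith"
    ]

    # Simple location constraints; rule-based ones will show node=None.
    location_constraints = [
        (loc.get("rsc"), loc.get("node"), loc.get("score"))
        for loc in root.iter("rsc_location")
    ]

    return {
        "properties": properties,
        "stonith_resources": stonith_resources,
        "location_constraints": location_constraints,
    }


if __name__ == "__main__":
    for key, value in summarize_cib(SAMPLE_CIB).items():
        print(key, "=", value)
```

To run it against a real file, the contents of cib.xml can be read into a string and passed to summarize_cib in place of SAMPLE_CIB. An empty stonith_resources list alongside stonith-enabled set to true, or a location constraint naming a single node, would be things to mention when asking for help.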


William
Attachments: cib.xml (6.92 KB)
