
krienke at uni-koblenz
Apr 3, 2012, 6:59 AM
Re: Cluster node hanging upon access to ocfs2 fs when second cluster node dies ?
On 03.04.2012 15:51, Lars Marowsky-Bree wrote:
> On 2012-04-03T15:50:29, Rainer Krienke <krienke [at] uni-koblenz> wrote:
>
>> rzinstal4:~ # sbd -d /dev/disk/by-id/scsi-259316a7265713551-part1 dump
>> ==Dumping header on disk /dev/disk/by-id/scsi-259316a7265713551-part1
>> Header version     : 2
>> Number of slots    : 255
>> Sector size        : 512
>> Timeout (watchdog) : 90
>> Timeout (allocate) : 2
>> Timeout (loop)     : 1
>> Timeout (msgwait)  : 180
>
> You have configured a msgwait of 180s - e.g., the message will be
> considered delivered after 180s earliest; your stonith-timeout is set to
> 30s, so this can *never* result in a successful fence.
>
> I suggest to increase stonith-timeout to 300s.
>
>
> Regards,
> Lars
>

Hi Lars,

this is something I had already noticed, and I changed the timeout in the
cluster configuration to 200 seconds. So the log I posted was the result of
the configuration below (200 seconds). Is this still too small?

$ crm configure show
...
primitive stonith_sbd stonith:external/sbd \
        op monitor interval="200" timeout="200" start-delay="200" \
        params sbd_device="/dev/disk/by-id/scsi-259316a7265713551-part1"
...

Rainer
--
Rainer Krienke, Uni Koblenz, Rechenzentrum, A22, Universitaetsstrasse 1
56070 Koblenz, http://userpages.uni-koblenz.de/~krienke, Tel: +49261287 1312
PGP: http://userpages.uni-koblenz.de/~krienke/mypgp.html,Fax: +49261287 1001312
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
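
The timing argument in the quoted reply (a fence message counts as delivered after msgwait at the earliest, so a shorter stonith-timeout can never succeed) can be written down as a small check. This is only a sketch of that one necessary condition, in Python; the function name and the list of candidate values are made up for illustration, and it does not say how much headroom above msgwait is actually needed, which is the open question in this message.

```python
# Sketch of the timing condition described in the quoted reply.
# Assumption: a fence via sbd can only be reported as successful if the
# stonith timeout is longer than msgwait. The names below are hypothetical
# and are not part of sbd, crmsh, or Pacemaker.

MSGWAIT_S = 180  # "Timeout (msgwait)" from the sbd dump in this thread


def fence_can_succeed(msgwait_s: int, stonith_timeout_s: int) -> bool:
    """Return True if the stonith timeout outlasts msgwait.

    This is a necessary condition only; it says nothing about how much
    extra margin a real cluster needs.
    """
    return stonith_timeout_s > msgwait_s


if __name__ == "__main__":
    # 30s: original setting, 200s: the value tried here, 300s: the suggestion
    for timeout_s in (30, 200, 300):
        ok = fence_can_succeed(MSGWAIT_S, timeout_s)
        print(f"stonith timeout {timeout_s:>3}s vs msgwait {MSGWAIT_S}s: "
              f"{'may succeed' if ok else 'can never succeed'}")
```

By this check alone, 30s fails while 200s and 300s both pass; whether 200s leaves enough margin in practice is not settled by the condition itself.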