Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Japanese

DRBD primary$B$K>:3J(B$B$7$J$$(B

 

 

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded


hayato at starsystems

Jan 11, 2012, 4:09 AM

Post #1 of 11 (551 views)
Permalink
DRBD primary$B$K>:3J(B$B$7$J$$(B

$B<D5\$H?=$7$^$9!#(B

$B$O$8$a$FEj9F$5$;$F$$$?$@$-$^$9!#(B
heartbeat v3 $B$*$h$S(B DRBD$B$K4X$7$F!"0J2<$NLdBj$G:$$C$F$$$^$9!#(B

$B!cLdBjE@!d(B
$B!!N>%N!<%I$,@5>o$K5/F0$7!"(Bactive$BB&$r%7%c%C%H%@%&%s$7$?$H$3$m(B
$B!!(Bstandby$BB&$N(BDRBD$B$,!"(Bslave$B>uBV$N$^$^%U%'%$%k%*!<%P!<$7$^$;$s!#(B
$B!!"(0l=V(Bprimary$B$K$J$k$N$G$9$,!"(Bslave$B>uBV$K$J$j$^$9!#(B

$B!c@5>o;~!d(B
============
Last updated: Wed Jan 11 20:58:18 2012
Stack: Heartbeat
Current DC: node1 (58598433-729f-4266-9c7f-2a02e306e090) - partition with
quorum
Version: 1.0.12-unknown
2 Nodes configured, unknown expected votes
1 Resources configured.
============

Online: [ node1 node2 ]

Master/Slave Set: ms_drbd
Masters: [ node1 ]
Slaves: [ node2 ]

$B!c0[>o;~!d(B
============
Last updated: Wed Jan 11 20:59:41 2012
Stack: Heartbeat
Current DC: node2 (58598433-729f-4266-9c7f-2a02e306e090) - partition
with quorum
Version: 1.0.12-unknown
2 Nodes configured, unknown expected votes
1 Resources configured.
============

Online: [ node2 ]
OFFLINE: [ node1 ]

Master/Slave Set: ms_drbd
Slaves: [ node2 ]
Stopped: [ drbd_hadoop:1 ]

Failed actions:
drbd_hadoop:0_promote_0 (node=node2, call=641, rc=1,
status=complete): unknown error

$B!c4D6-!d(B
$B!!(Bheartbeat-3.0.3-2.3.el5
$B!!(Bpacemaker-1.0.12-1.el5.centos
$B!!(Bdrbd83-8.3.8-1.el5.centos
$B!!(Bkernel 2.6.18-238.12.1.el5$B!!(B(OS:CentOS 5.7)
$B!!(B

$B!c(BPacemaker$B [at] _D!d(B
$B!!0J2<$N [at] _D$O!"0JA0%a!<%j%s%0%j%9%H$KEj9F$5$l$?FbMF$r85$K(B
$B!!@_Dj$7$F$$$^$9!#(B
$B!!;29M85!'(Bhttp://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-December/000996.html

$B!!(Bprimitive drbd_hadoop ocf:linbit:drbd \
params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
op monitor interval="10s" \
op start interval="0s" timeout="240s" on-fail="restart" \
op monitor interval="10s" role="Master" timeout="20s" on-fail="restart" \
op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
op promote interval="0s" timeout="90s" on-fail="restart" \
op demote interval="0s" timeout="90s" on-fail="block" \
op stop interval="0s" timeout="100s" on-fail="block"
ms ms_drbd drbd_hadoop \
meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
location l_hadoop ms_drbd \
rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
property $id="cib-bootstrap-options" \
dc-version="1.0.12-unknown" \
cluster-infrastructure="Heartbeat" \
last-lrm-refresh="1326270814" \
stonith-enabled="false" \
no-quorum-policy="stop" \
default-action-timeout="240" \
default-resource-stickiness="0" \
symmetric-cluster="true" \
startup-fencing="true" \
stop-orphan-resources="true" \
remove-after-stop="false"

$BBgJQ$*<j?t$G$9$,!"$I$J$?$+$465<x$/$@$5$$$^$9$h$&!"$*4j$$CW$7$^$9!#(B

$B0J>e$G$9!#(B

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


hkuroki at 3ware

Jan 11, 2012, 4:01 PM

Post #2 of 11 (511 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

$B9uLZ$H?=$7$^$9!#(B

$B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
$B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
$B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
$B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B

$B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
$B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B

$B0J>e#2$D$N3NG'$G(BDRBD$B$N%(%i!<$r8+$D$1$i$l$k$H;W$$$^$9!#(B

On Wed, 11 Jan 2012 21:09:07 +0900
Hayato Shinomiya <hayato [at] starsystems> wrote:

> $B<D5\$H?=$7$^$9!#(B
>
> $B$O$8$a$FEj9F$5$;$F$$$?$@$-$^$9!#(B
> heartbeat v3 $B$*$h$S(B DRBD$B$K4X$7$F!"0J2<$NLdBj$G:$$C$F$$$^$9!#(B
>
> $B!cLdBjE@!d(B
> $B!!N>%N!<%I$,@5>o$K5/F0$7!"(Bactive$BB&$r%7%c%C%H%@%&%s$7$?$H$3$m(B
> $B!!(Bstandby$BB&$N(BDRBD$B$,!"(Bslave$B>uBV$N$^$^%U%'%$%k%*!<%P!<$7$^$;$s!#(B
> $B!!"(0l=V(Bprimary$B$K$J$k$N$G$9$,!"(Bslave$B>uBV$K$J$j$^$9!#(B
>
> $B!c@5>o;~!d(B
> ============
> Last updated: Wed Jan 11 20:58:18 2012
> Stack: Heartbeat
> Current DC: node1 (58598433-729f-4266-9c7f-2a02e306e090) - partition with
> quorum
> Version: 1.0.12-unknown
> 2 Nodes configured, unknown expected votes
> 1 Resources configured.
> ============
>
> Online: [ node1 node2 ]
>
> Master/Slave Set: ms_drbd
> Masters: [ node1 ]
> Slaves: [ node2 ]
>
> $B!c0[>o;~!d(B
> ============
> Last updated: Wed Jan 11 20:59:41 2012
> Stack: Heartbeat
> Current DC: node2 (58598433-729f-4266-9c7f-2a02e306e090) - partition
> with quorum
> Version: 1.0.12-unknown
> 2 Nodes configured, unknown expected votes
> 1 Resources configured.
> ============
>
> Online: [ node2 ]
> OFFLINE: [ node1 ]
>
> Master/Slave Set: ms_drbd
> Slaves: [ node2 ]
> Stopped: [ drbd_hadoop:1 ]
>
> Failed actions:
> drbd_hadoop:0_promote_0 (node=node2, call=641, rc=1,
> status=complete): unknown error
>
> $B!c4D6-!d(B
> $B!!(Bheartbeat-3.0.3-2.3.el5
> $B!!(Bpacemaker-1.0.12-1.el5.centos
> $B!!(Bdrbd83-8.3.8-1.el5.centos
> $B!!(Bkernel 2.6.18-238.12.1.el5$B!!(B(OS:CentOS 5.7)
> $B!!(B
>
> $B!c(BPacemaker$B [at] _D!d(B
> $B!!0J2<$N [at] _D$O!"0JA0%a!<%j%s%0%j%9%H$KEj9F$5$l$?FbMF$r85$K(B
> $B!!@_Dj$7$F$$$^$9!#(B
> $B!!;29M85!'(Bhttp://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-December/000996.html
>
> $B!!(Bprimitive drbd_hadoop ocf:linbit:drbd \
> params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
> op monitor interval="10s" \
> op start interval="0s" timeout="240s" on-fail="restart" \
> op monitor interval="10s" role="Master" timeout="20s" on-fail="restart" \
> op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
> op promote interval="0s" timeout="90s" on-fail="restart" \
> op demote interval="0s" timeout="90s" on-fail="block" \
> op stop interval="0s" timeout="100s" on-fail="block"
> ms ms_drbd drbd_hadoop \
> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> location l_hadoop ms_drbd \
> rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
> rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
> property $id="cib-bootstrap-options" \
> dc-version="1.0.12-unknown" \
> cluster-infrastructure="Heartbeat" \
> last-lrm-refresh="1326270814" \
> stonith-enabled="false" \
> no-quorum-policy="stop" \
> default-action-timeout="240" \
> default-resource-stickiness="0" \
> symmetric-cluster="true" \
> startup-fencing="true" \
> stop-orphan-resources="true" \
> remove-after-stop="false"
>
> $BBgJQ$*<j?t$G$9$,!"$I$J$?$+$465<x$/$@$5$$$^$9$h$&!"$*4j$$CW$7$^$9!#(B
>
> $B0J>e$G$9!#(B
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>


--
----------------------------------------------------------------------
$B9uLZ(B $BGn(B ($B3t(B)$B%5!<%I%&%'%"(B

Kuroki Hiroshi 135-0034 $BEl5~ET9>El6h1JBe(B2-31-13 $B%t%#%i%*!<%/%i(B2F
hkuroki [at] 3ware URL: http://www.3ware.co.jp/
Phone: 03-4530-8670 Fax: 03-5809-8260

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


hayato at starsystems

Jan 11, 2012, 4:51 PM

Post #3 of 11 (526 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B9uLZMM(B

$B<D5\$G$9!#(B


$BAaB.$N$4JV?.!"@?$KM-$jFq$&$4$6$$$^$9!#(B

$B0J2<!"$4;XE&D:$$$?2U=j$K$D$$$F!"%m%0Ey$NH4?h$r5-:\(B
$B$5$;$FD:$-$^$9!#(B
> $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
> $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
> $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
> $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
> messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B

$B!c%m%0H4?h!d(B
Jan 12 03:07:33 node1 lrmd: [3649]: info: rsc:res_drbd:1:19: stop
Jan 12 03:07:33 node1 crmd: [3652]: info: process_lrm_event: LRM operation res_drbd:1_monitor_20000 (call=17, status=1, cib-update=0, confirmed=true) Cancelled
Jan 12 03:07:33 node1 kernel: block drbd0: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Jan 12 03:07:33 node1 kernel: block drbd0: short read expecting header on sock: r=-512
Jan 12 03:07:33 node1 kernel: block drbd0: asender terminated
Jan 12 03:07:33 node1 kernel: block drbd0: Terminating asender thread
Jan 12 03:07:33 node1 kernel: block drbd0: Connection closed
Jan 12 03:07:33 node1 kernel: block drbd0: conn( Disconnecting -> StandAlone )
Jan 12 03:07:33 node1 kernel: block drbd0: receiver terminated
Jan 12 03:07:33 node1 kernel: block drbd0: Terminating receiver thread
Jan 12 03:07:33 node1 kernel: block drbd0: disk( UpToDate -> Diskless )
Jan 12 03:07:33 node1 kernel: block drbd0: drbd_bm_resize called with capacity == 0
Jan 12 03:07:33 node1 kernel: block drbd0: worker terminated
Jan 12 03:07:33 node1 kernel: block drbd0: Terminating worker thread
Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)
Jan 12 03:07:33 node1 kernel: block drbd0: State change failed: Disk state is lower than outdated
Jan 12 03:07:33 node1 kernel: block drbd0: state = { cs:StandAlone ro:Secondary/Unknown ds:Diskless/DUnknown r--- }
Jan 12 03:07:33 node1 kernel: block drbd0: wanted = { cs:StandAlone ro:Secondary/Unknown ds:Outdated/DUnknown r--- }
Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)


> $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
> Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
> $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
$B0J2<$N%3%^%s%I$r<B9T$7$F!"<jF0$K$F(BDRBD$B$,@Z$jBX$o$k(B
$B$3$H$r3NG'$7$^$7$?!#(B

$B!c<B;\FbMF!d(B
$B!!(Bprimary$BB&(B(node1)$B$K$F<B;\(B
$B!!!!(Bumount /drbd
$B!!!!(Bdrbdadm secondary r0

$B!!85(Bsecondary$BB&(B(node2)$B$K$F<B;\(B
$B!!!!(Bdrbdadm primary r0
$B!!!!(Bmount /dev/drbd0 /drbd

$B!c<B;\7k2L!d(B
$B!!!!(B/var/log/messages$B$K0J2<$,=PNO(B
$B!!!!!!(Bkernel: block drbd0: role( Primary -> Secondary )


$B0J>e$H$J$j$^$9!#(B
$B$465<x$NDx!"59$7$/$*4j$$CW$7$^$9!#(B


On Thu, 12 Jan 2012 09:01:06 +0900
Hiroshi Kuroki <hkuroki [at] 3ware> wrote:

> $B<D5\MM(B
>
> $B9uLZ$H?=$7$^$9!#(B
>
> $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
> $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
> $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
> $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
> messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B
>
> $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
> Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
> $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
>
> $B0J>e#2$D$N3NG'$G(BDRBD$B$N%(%i!<$r8+$D$1$i$l$k$H;W$$$^$9!#(B
>
> On Wed, 11 Jan 2012 21:09:07 +0900
> Hayato Shinomiya <hayato [at] starsystems> wrote:
>
> > $B<D5\$H?=$7$^$9!#(B
> >
> > $B$O$8$a$FEj9F$5$;$F$$$?$@$-$^$9!#(B
> > heartbeat v3 $B$*$h$S(B DRBD$B$K4X$7$F!"0J2<$NLdBj$G:$$C$F$$$^$9!#(B
> >
> > $B!cLdBjE@!d(B
> > $B!!N>%N!<%I$,@5>o$K5/F0$7!"(Bactive$BB&$r%7%c%C%H%@%&%s$7$?$H$3$m(B
> > $B!!(Bstandby$BB&$N(BDRBD$B$,!"(Bslave$B>uBV$N$^$^%U%'%$%k%*!<%P!<$7$^$;$s!#(B
> > $B!!"(0l=V(Bprimary$B$K$J$k$N$G$9$,!"(Bslave$B>uBV$K$J$j$^$9!#(B
> >
> > $B!c@5>o;~!d(B
> > ============
> > Last updated: Wed Jan 11 20:58:18 2012
> > Stack: Heartbeat
> > Current DC: node1 (58598433-729f-4266-9c7f-2a02e306e090) - partition with
> > quorum
> > Version: 1.0.12-unknown
> > 2 Nodes configured, unknown expected votes
> > 1 Resources configured.
> > ============
> >
> > Online: [ node1 node2 ]
> >
> > Master/Slave Set: ms_drbd
> > Masters: [ node1 ]
> > Slaves: [ node2 ]
> >
> > $B!c0[>o;~!d(B
> > ============
> > Last updated: Wed Jan 11 20:59:41 2012
> > Stack: Heartbeat
> > Current DC: node2 (58598433-729f-4266-9c7f-2a02e306e090) - partition
> > with quorum
> > Version: 1.0.12-unknown
> > 2 Nodes configured, unknown expected votes
> > 1 Resources configured.
> > ============
> >
> > Online: [ node2 ]
> > OFFLINE: [ node1 ]
> >
> > Master/Slave Set: ms_drbd
> > Slaves: [ node2 ]
> > Stopped: [ drbd_hadoop:1 ]
> >
> > Failed actions:
> > drbd_hadoop:0_promote_0 (node=node2, call=641, rc=1,
> > status=complete): unknown error
> >
> > $B!c4D6-!d(B
> > $B!!(Bheartbeat-3.0.3-2.3.el5
> > $B!!(Bpacemaker-1.0.12-1.el5.centos
> > $B!!(Bdrbd83-8.3.8-1.el5.centos
> > $B!!(Bkernel 2.6.18-238.12.1.el5$B!!(B(OS:CentOS 5.7)
> > $B!!(B
> >
> > $B!c(BPacemaker$B [at] _D!d(B
> > $B!!0J2<$N [at] _D$O!"0JA0%a!<%j%s%0%j%9%H$KEj9F$5$l$?FbMF$r85$K(B
> > $B!!@_Dj$7$F$$$^$9!#(B
> > $B!!;29M85!'(Bhttp://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-December/000996.html
> >
> > $B!!(Bprimitive drbd_hadoop ocf:linbit:drbd \
> > params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
> > op monitor interval="10s" \
> > op start interval="0s" timeout="240s" on-fail="restart" \
> > op monitor interval="10s" role="Master" timeout="20s" on-fail="restart" \
> > op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
> > op promote interval="0s" timeout="90s" on-fail="restart" \
> > op demote interval="0s" timeout="90s" on-fail="block" \
> > op stop interval="0s" timeout="100s" on-fail="block"
> > ms ms_drbd drbd_hadoop \
> > meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> > location l_hadoop ms_drbd \
> > rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
> > rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
> > property $id="cib-bootstrap-options" \
> > dc-version="1.0.12-unknown" \
> > cluster-infrastructure="Heartbeat" \
> > last-lrm-refresh="1326270814" \
> > stonith-enabled="false" \
> > no-quorum-policy="stop" \
> > default-action-timeout="240" \
> > default-resource-stickiness="0" \
> > symmetric-cluster="true" \
> > startup-fencing="true" \
> > stop-orphan-resources="true" \
> > remove-after-stop="false"
> >
> > $BBgJQ$*<j?t$G$9$,!"$I$J$?$+$465<x$/$@$5$$$^$9$h$&!"$*4j$$CW$7$^$9!#(B
> >
> > $B0J>e$G$9!#(B
> >
> > _______________________________________________
> > Linux-ha-japan mailing list
> > Linux-ha-japan [at] lists
> > http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >
>
>
> --
> ----------------------------------------------------------------------
> $B9uLZ(B $BGn(B ($B3t(B)$B%5!<%I%&%'%"(B
>
> Kuroki Hiroshi 135-0034 $BEl5~ET9>El6h1JBe(B2-31-13 $B%t%#%i%*!<%/%i(B2F
> hkuroki [at] 3ware URL: http://www.3ware.co.jp/
> Phone: 03-4530-8670 Fax: 03-5809-8260

--

********************************
$B%9%?!<%7%9%F%`%:3t<02q<R(B
$BEl5~ET9A6hFn@D;3(B7-10-3
$BFn@D;3(BST$B%S%k(B5F
$B<D5\!!H;?M(B
TEL:03-5774-4086
FAX:03-3409-3135
E-Mail:hayato [at] starsystems
********************************

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


tsukishima.ha at gmail

Jan 11, 2012, 5:17 PM

Post #4 of 11 (518 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

NTT$B%G!<%?@hC<5;=Q$NCSED$G$9!#(B

crm$B$N [at] _D$G$9$,!"2<5-(B3$BE@$rJQ99$7$F(B
$BF0:n$N3NG'$r$*4j$$$$$?$7$^$9!#(B

(1) 3$B9TL\$N(Bmonitor$B$r:o=|(B
5,6$B9TL\$G#M#a#s#t#e#r(B/Slave$B$N(Bmonitor$B$r [at] _D$7$F$$$k$N$G(B
3$B9TL\$OITMW$G$9!#(B

(2) property$B$N [at] _D$K$D$$$F(B
no-quorum-policy="ignore"$B$r [at] _D$7$F$/$@$5$$!#(B
$B$b$7!"(Bsymmetric-cluster, stop-orphan-resources, remove-after-stop$B$r(B
$BL@<(E*$K [at] _D$5$l$F$$$k>l9g$O!"$H$j$"$($:(B
$B$3$l$i$N [at] _D$b:o=|$7$FF0:n$r3NG'$7$F$/$@$5$$!#(B

(3) rsc_defaults$B$N [at] _D$K$D$$$F(B
resource-stickiness, migration-threshold$B$r(B
$B [at] _D$7$F$/$@$5$$!#(B

$B [at] _DjN(B

primitive drbd_hadoop ocf:linbit:drbd \
params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
op start interval="0s" timeout="240s" on-fail="restart" \
op monitor interval="10s" role="Master" timeout="20s"
on-fail="restart" \
op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
op promote interval="0s" timeout="90s" on-fail="restart" \
op demote interval="0s" timeout="90s" on-fail="block" \
op stop interval="0s" timeout="100s" on-fail="block"
ms ms_drbd drbd_hadoop \
meta master-max="1" master-node-max="1" clone-max="2"
clone-node-max="1" notify="true"
location l_hadoop ms_drbd \
rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
property \
no-quorum-policy="ignore" \
stonith-enabled="false" \
startup-fencing="false" \
crmd-transition-delay="2s"
rsc_defaults \
resource-stickiness="INFINITY" \
migration-threshold="1"


$B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$BCSED=_;R(B


2012$BG/(B1$B7n(B12$BF|(B9:51 Hayato Shinomiya <hayato [at] starsystems>:
> $B9uLZMM(B
>
> $B<D5\$G$9!#(B
>
>
> $BAaB.$N$4JV?.!"@?$KM-$jFq$&$4$6$$$^$9!#(B
>
> $B0J2<!"$4;XE&D:$$$?2U=j$K$D$$$F!"%m%0Ey$NH4?h$r5-:\(B
> $B$5$;$FD:$-$^$9!#(B
>> $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
>> $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
>> $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
>> $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
>> messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B
>
> $B!c%m%0H4?h!d(B
> Jan 12 03:07:33 node1 lrmd: [3649]: info: rsc:res_drbd:1:19: stop
> Jan 12 03:07:33 node1 crmd: [3652]: info: process_lrm_event: LRM operation res_drbd:1_monitor_20000 (call=17, status=1, cib-update=0, confirmed=true) Cancelled
> Jan 12 03:07:33 node1 kernel: block drbd0: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
> Jan 12 03:07:33 node1 kernel: block drbd0: short read expecting header on sock: r=-512
> Jan 12 03:07:33 node1 kernel: block drbd0: asender terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating asender thread
> Jan 12 03:07:33 node1 kernel: block drbd0: Connection closed
> Jan 12 03:07:33 node1 kernel: block drbd0: conn( Disconnecting -> StandAlone )
> Jan 12 03:07:33 node1 kernel: block drbd0: receiver terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating receiver thread
> Jan 12 03:07:33 node1 kernel: block drbd0: disk( UpToDate -> Diskless )
> Jan 12 03:07:33 node1 kernel: block drbd0: drbd_bm_resize called with capacity == 0
> Jan 12 03:07:33 node1 kernel: block drbd0: worker terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating worker thread
> Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)
> Jan 12 03:07:33 node1 kernel: block drbd0: State change failed: Disk state is lower than outdated
> Jan 12 03:07:33 node1 kernel: block drbd0: state = { cs:StandAlone ro:Secondary/Unknown ds:Diskless/DUnknown r--- }
> Jan 12 03:07:33 node1 kernel: block drbd0: wanted = { cs:StandAlone ro:Secondary/Unknown ds:Outdated/DUnknown r--- }
> Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)
>
>
>> $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
>> Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
>> $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
> $B0J2<$N%3%^%s%I$r<B9T$7$F!"<jF0$K$F(BDRBD$B$,@Z$jBX$o$k(B
> $B$3$H$r3NG'$7$^$7$?!#(B
>
> $B!c<B;\FbMF!d(B
> $B!!(Bprimary$BB&(B(node1)$B$K$F<B;\(B
> $B!!!!(Bumount /drbd
> $B!!!!(Bdrbdadm secondary r0
>
> $B!!85(Bsecondary$BB&(B(node2)$B$K$F<B;\(B
> $B!!!!(Bdrbdadm primary r0
> $B!!!!(Bmount /dev/drbd0 /drbd
>
> $B!c<B;\7k2L!d(B
> $B!!!!(B/var/log/messages$B$K0J2<$,=PNO(B
> $B!!!!!!(Bkernel: block drbd0: role( Primary -> Secondary )
>
>
> $B0J>e$H$J$j$^$9!#(B
> $B$465<x$NDx!"59$7$/$*4j$$CW$7$^$9!#(B
>
>
> On Thu, 12 Jan 2012 09:01:06 +0900
> Hiroshi Kuroki <hkuroki [at] 3ware> wrote:
>
>> $B<D5\MM(B
>>
>> $B9uLZ$H?=$7$^$9!#(B
>>
>> $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
>> $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
>> $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
>> $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
>> messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B
>>
>> $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
>> Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
>> $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
>>
>> $B0J>e#2$D$N3NG'$G(BDRBD$B$N%(%i!<$r8+$D$1$i$l$k$H;W$$$^$9!#(B
>>
>> On Wed, 11 Jan 2012 21:09:07 +0900
>> Hayato Shinomiya <hayato [at] starsystems> wrote:
>>
>> > $B<D5\$H?=$7$^$9!#(B
>> >
>> > $B$O$8$a$FEj9F$5$;$F$$$?$@$-$^$9!#(B
>> > heartbeat v3 $B$*$h$S(B DRBD$B$K4X$7$F!"0J2<$NLdBj$G:$$C$F$$$^$9!#(B
>> >
>> > $B!cLdBjE@!d(B
>> >$B!!(B $BN>%N!<%I$,@5>o$K5/F0$7!"(Bactive$BB&$r%7%c%C%H%@%&%s$7$?$H$3$m(B
>> >$B!!(B standby$BB&$N(BDRBD$B$,!"(Bslave$B>uBV$N$^$^%U%'%$%k%*!<%P!<$7$^$;$s!#(B
>> >$B!!(B $B"(0l=V(Bprimary$B$K$J$k$N$G$9$,!"(Bslave$B>uBV$K$J$j$^$9!#(B
>> >
>> > $B!c@5>o;~!d(B
>> > ============
>> > Last updated: Wed Jan 11 20:58:18 2012
>> > Stack: Heartbeat
>> > Current DC: node1 (58598433-729f-4266-9c7f-2a02e306e090) - partition with
>> > quorum
>> > Version: 1.0.12-unknown
>> > 2 Nodes configured, unknown expected votes
>> > 1 Resources configured.
>> > ============
>> >
>> > Online: [ node1 node2 ]
>> >
>> > Master/Slave Set: ms_drbd
>> > Masters: [ node1 ]
>> > Slaves: [ node2 ]
>> >
>> > $B!c0[>o;~!d(B
>> > ============
>> > Last updated: Wed Jan 11 20:59:41 2012
>> > Stack: Heartbeat
>> > Current DC: node2 (58598433-729f-4266-9c7f-2a02e306e090) - partition
>> > with quorum
>> > Version: 1.0.12-unknown
>> > 2 Nodes configured, unknown expected votes
>> > 1 Resources configured.
>> > ============
>> >
>> > Online: [ node2 ]
>> > OFFLINE: [ node1 ]
>> >
>> > Master/Slave Set: ms_drbd
>> > Slaves: [ node2 ]
>> > Stopped: [ drbd_hadoop:1 ]
>> >
>> > Failed actions:
>> > drbd_hadoop:0_promote_0 (node=node2, call=641, rc=1,
>> > status=complete): unknown error
>> >
>> > $B!c4D6-!d(B
>> >$B!!(B heartbeat-3.0.3-2.3.el5
>> >$B!!(B pacemaker-1.0.12-1.el5.centos
>> >$B!!(B drbd83-8.3.8-1.el5.centos
>> >$B!!(B kernel 2.6.18-238.12.1.el5$B!!(B(OS:CentOS 5.7)
>> >
>> >
>> > $B!c(BPacemaker$B [at] _D!d(B
>> >$B!!(B $B0J2<$N [at] _D$O!"0JA0%a!<%j%s%0%j%9%H$KEj9F$5$l$?FbMF$r85$K(B
>> >$B!!(B $B [at] _D$7$F$$$^$9!#(B
>> >$B!!(B $B;29M85!'(Bhttp://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-December/000996.html
>> >
>> >$B!!(B primitive drbd_hadoop ocf:linbit:drbd \
>> > params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
>> > op monitor interval="10s" \
>> > op start interval="0s" timeout="240s" on-fail="restart" \
>> > op monitor interval="10s" role="Master" timeout="20s" on-fail="restart" \
>> > op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
>> > op promote interval="0s" timeout="90s" on-fail="restart" \
>> > op demote interval="0s" timeout="90s" on-fail="block" \
>> > op stop interval="0s" timeout="100s" on-fail="block"
>> > ms ms_drbd drbd_hadoop \
>> > meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>> > location l_hadoop ms_drbd \
>> > rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
>> > rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
>> > property $id="cib-bootstrap-options" \
>> > dc-version="1.0.12-unknown" \
>> > cluster-infrastructure="Heartbeat" \
>> > last-lrm-refresh="1326270814" \
>> > stonith-enabled="false" \
>> > no-quorum-policy="stop" \
>> > default-action-timeout="240" \
>> > default-resource-stickiness="0" \
>> > symmetric-cluster="true" \
>> > startup-fencing="true" \
>> > stop-orphan-resources="true" \
>> > remove-after-stop="false"
>> >
>> > $BBgJQ$*<j?t$G$9$,!"$I$J$?$+$465<x$/$@$5$$$^$9$h$&!"$*4j$$CW$7$^$9!#(B
>> >
>> > $B0J>e$G$9!#(B
>> >
>> > _______________________________________________
>> > Linux-ha-japan mailing list
>> > Linux-ha-japan [at] lists
>> > http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>> >
>>
>>
>> --
>> ----------------------------------------------------------------------
>> $B9uLZ(B $BGn(B ($B3t(B)$B%5!<%I%&%'%"(B
>>
>> Kuroki Hiroshi 135-0034 $BEl5~ET9>El6h1JBe(B2-31-13 $B%t%#%i%*!<%/%i(B2F
>> hkuroki [at] 3ware URL: http://www.3ware.co.jp/
>> Phone: 03-4530-8670 Fax: 03-5809-8260
>
> --
>
> ********************************
> $B%9%?!<%7%9%F%`%:3t<02q<R(B
> $BEl5~ET9A6hFn@D;3(B7-10-3
> $BFn@D;3(BST$B%S%k(B5F
> $B<D5\!!H;?M(B
> TEL:03-5774-4086
> FAX:03-3409-3135
> E-Mail:hayato [at] starsystems
> ********************************
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


hkuroki at 3ware

Jan 11, 2012, 5:37 PM

Post #5 of 11 (519 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

$B9uLZ$G$9!#(B

$B%m%0$N!V(BJan 12 03:07:33 node1 lrmd: [3649]:...$B!W$NA0$K(B
$B%(%i!<$d%o!<%K%s%0$,=P$F$*$i$:!"$3$N;~4V$+$i@Z$jBX$($,(B
$BH/@8$7$F$$$k$N$J$i$P!"(BDRBD$B$NLdBj$G$OL5$/!"CSEDMM$N;XE&$N(B
$B$h$&$K!"(BPacemaker$B$N [at] _D$K5/0x$7$?F0:n$K$h$j(Bdrbd$B%j%=!<%9$N(B
$BDd;_$,H/@8$7!"$=$N7k2L(BDRBD$B$N%(%i!<$,=P$F$$$k$H$$$&>uBV$@$H(B
$B;W$o$l$^$9!#(B

Pacemaker$B$N [at] _DjJQ9$7$F!"F0:n3NG'$r$*4j$$$7$^$9!#(B


On Thu, 12 Jan 2012 09:51:05 +0900
Hayato Shinomiya <hayato [at] starsystems> wrote:

> $B9uLZMM(B
>
> $B<D5\$G$9!#(B
>
>
> $BAaB.$N$4JV?.!"@?$KM-$jFq$&$4$6$$$^$9!#(B
>
> $B0J2<!"$4;XE&D:$$$?2U=j$K$D$$$F!"%m%0Ey$NH4?h$r5-:\(B
> $B$5$;$FD:$-$^$9!#(B
> > $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
> > $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
> > $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
> > $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
> > messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B
>
> $B!c%m%0H4?h!d(B
> Jan 12 03:07:33 node1 lrmd: [3649]: info: rsc:res_drbd:1:19: stop
> Jan 12 03:07:33 node1 crmd: [3652]: info: process_lrm_event: LRM operation res_drbd:1_monitor_20000 (call=17, status=1, cib-update=0, confirmed=true) Cancelled
> Jan 12 03:07:33 node1 kernel: block drbd0: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
> Jan 12 03:07:33 node1 kernel: block drbd0: short read expecting header on sock: r=-512
> Jan 12 03:07:33 node1 kernel: block drbd0: asender terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating asender thread
> Jan 12 03:07:33 node1 kernel: block drbd0: Connection closed
> Jan 12 03:07:33 node1 kernel: block drbd0: conn( Disconnecting -> StandAlone )
> Jan 12 03:07:33 node1 kernel: block drbd0: receiver terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating receiver thread
> Jan 12 03:07:33 node1 kernel: block drbd0: disk( UpToDate -> Diskless )
> Jan 12 03:07:33 node1 kernel: block drbd0: drbd_bm_resize called with capacity == 0
> Jan 12 03:07:33 node1 kernel: block drbd0: worker terminated
> Jan 12 03:07:33 node1 kernel: block drbd0: Terminating worker thread
> Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)
> Jan 12 03:07:33 node1 kernel: block drbd0: State change failed: Disk state is lower than outdated
> Jan 12 03:07:33 node1 kernel: block drbd0: state = { cs:StandAlone ro:Secondary/Unknown ds:Diskless/DUnknown r--- }
> Jan 12 03:07:33 node1 kernel: block drbd0: wanted = { cs:StandAlone ro:Secondary/Unknown ds:Outdated/DUnknown r--- }
> Jan 12 03:07:33 node1 lrmd: [3649]: info: RA output: (res_drbd:1:stop:stdout)
>
>
> > $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
> > Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
> > $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
> $B0J2<$N%3%^%s%I$r<B9T$7$F!"<jF0$K$F(BDRBD$B$,@Z$jBX$o$k(B
> $B$3$H$r3NG'$7$^$7$?!#(B
>
> $B!c<B;\FbMF!d(B
> $B!!(Bprimary$BB&(B(node1)$B$K$F<B;\(B
> $B!!!!(Bumount /drbd
> $B!!!!(Bdrbdadm secondary r0
>
> $B!!85(Bsecondary$BB&(B(node2)$B$K$F<B;\(B
> $B!!!!(Bdrbdadm primary r0
> $B!!!!(Bmount /dev/drbd0 /drbd
>
> $B!c<B;\7k2L!d(B
> $B!!!!(B/var/log/messages$B$K0J2<$,=PNO(B
> $B!!!!!!(Bkernel: block drbd0: role( Primary -> Secondary )
>
>
> $B0J>e$H$J$j$^$9!#(B
> $B$465<x$NDx!"59$7$/$*4j$$CW$7$^$9!#(B
>
>
> On Thu, 12 Jan 2012 09:01:06 +0900
> Hiroshi Kuroki <hkuroki [at] 3ware> wrote:
>
> > $B<D5\MM(B
> >
> > $B9uLZ$H?=$7$^$9!#(B
> >
> > $B$3$N$h$&$J>l9g$O(B/var/log/messages$B$rD4$Y$k$N$,(B
> > $B%U%!!<%9%H%9%F%C%W$K$J$j$^$9!#(Bmessages$B$K(BDRBD$B$,(BPrimary$B$K(B
> > $B$J$C$F!"(BSecandary$B$K$J$k2aDx$,5-O?$5$l$^$9!#(Bcrm_mon$B$N(B
> > $B7k2L$r8+$k8B$j(BDRBD$B$N%(%i!<$,H/@8$7$F$$$k$h$&$G$9$N$G!"(B
> > messages$B$+$i%(%i!<$NItJ,$rH4$-=P$7$F$_$F2<$5$$!#(B
> >
> > $B<!$K(BDRBD$BC1BN$GF0:n3NG'$r9T$J$C$F$_$F2<$5$$!#(B
> > Heartbeat$B$r;_$a$F!"(Bdrbdadm$B%3%^%s%I@5>o$K(B
> > $B@Z$jBX$o$k$+$I$&$+3NG'$7$^$9!#(B
> >
> > $B0J>e#2$D$N3NG'$G(BDRBD$B$N%(%i!<$r8+$D$1$i$l$k$H;W$$$^$9!#(B
> >
> > On Wed, 11 Jan 2012 21:09:07 +0900
> > Hayato Shinomiya <hayato [at] starsystems> wrote:
> >
> > > $B<D5\$H?=$7$^$9!#(B
> > >
> > > $B$O$8$a$FEj9F$5$;$F$$$?$@$-$^$9!#(B
> > > heartbeat v3 $B$*$h$S(B DRBD$B$K4X$7$F!"0J2<$NLdBj$G:$$C$F$$$^$9!#(B
> > >
> > > $B!cLdBjE@!d(B
> > > $B!!N>%N!<%I$,@5>o$K5/F0$7!"(Bactive$BB&$r%7%c%C%H%@%&%s$7$?$H$3$m(B
> > > $B!!(Bstandby$BB&$N(BDRBD$B$,!"(Bslave$B>uBV$N$^$^%U%'%$%k%*!<%P!<$7$^$;$s!#(B
> > > $B!!"(0l=V(Bprimary$B$K$J$k$N$G$9$,!"(Bslave$B>uBV$K$J$j$^$9!#(B
> > >
> > > $B!c@5>o;~!d(B
> > > ============
> > > Last updated: Wed Jan 11 20:58:18 2012
> > > Stack: Heartbeat
> > > Current DC: node1 (58598433-729f-4266-9c7f-2a02e306e090) - partition with
> > > quorum
> > > Version: 1.0.12-unknown
> > > 2 Nodes configured, unknown expected votes
> > > 1 Resources configured.
> > > ============
> > >
> > > Online: [ node1 node2 ]
> > >
> > > Master/Slave Set: ms_drbd
> > > Masters: [ node1 ]
> > > Slaves: [ node2 ]
> > >
> > > $B!c0[>o;~!d(B
> > > ============
> > > Last updated: Wed Jan 11 20:59:41 2012
> > > Stack: Heartbeat
> > > Current DC: node2 (58598433-729f-4266-9c7f-2a02e306e090) - partition
> > > with quorum
> > > Version: 1.0.12-unknown
> > > 2 Nodes configured, unknown expected votes
> > > 1 Resources configured.
> > > ============
> > >
> > > Online: [ node2 ]
> > > OFFLINE: [ node1 ]
> > >
> > > Master/Slave Set: ms_drbd
> > > Slaves: [ node2 ]
> > > Stopped: [ drbd_hadoop:1 ]
> > >
> > > Failed actions:
> > > drbd_hadoop:0_promote_0 (node=node2, call=641, rc=1,
> > > status=complete): unknown error
> > >
> > > $B!c4D6-!d(B
> > > $B!!(Bheartbeat-3.0.3-2.3.el5
> > > $B!!(Bpacemaker-1.0.12-1.el5.centos
> > > $B!!(Bdrbd83-8.3.8-1.el5.centos
> > > $B!!(Bkernel 2.6.18-238.12.1.el5$B!!(B(OS:CentOS 5.7)
> > > $B!!(B
> > >
> > > $B!c(BPacemaker$B [at] _D!d(B
> > > $B!!0J2<$N [at] _D$O!"0JA0%a!<%j%s%0%j%9%H$KEj9F$5$l$?FbMF$r85$K(B
> > > $B!!@_Dj$7$F$$$^$9!#(B
> > > $B!!;29M85!'(Bhttp://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-December/000996.html
> > >
> > > $B!!(Bprimitive drbd_hadoop ocf:linbit:drbd \
> > > params drbd_resource="r0" drbdconf="/etc/drbd.conf" \
> > > op monitor interval="10s" \
> > > op start interval="0s" timeout="240s" on-fail="restart" \
> > > op monitor interval="10s" role="Master" timeout="20s" on-fail="restart" \
> > > op monitor interval="20s" role="Slave" timeout="20s" on-fail="restart" \
> > > op promote interval="0s" timeout="90s" on-fail="restart" \
> > > op demote interval="0s" timeout="90s" on-fail="block" \
> > > op stop interval="0s" timeout="100s" on-fail="block"
> > > ms ms_drbd drbd_hadoop \
> > > meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> > > location l_hadoop ms_drbd \
> > > rule $id="l_hadoop-rule" $role="master" 200: #uname eq node1 \
> > > rule $id="l_hadoop-rule-0" $role="master" 100: #uname eq node2
> > > property $id="cib-bootstrap-options" \
> > > dc-version="1.0.12-unknown" \
> > > cluster-infrastructure="Heartbeat" \
> > > last-lrm-refresh="1326270814" \
> > > stonith-enabled="false" \
> > > no-quorum-policy="stop" \
> > > default-action-timeout="240" \
> > > default-resource-stickiness="0" \
> > > symmetric-cluster="true" \
> > > startup-fencing="true" \
> > > stop-orphan-resources="true" \
> > > remove-after-stop="false"
> > >
> > > $BBgJQ$*<j?t$G$9$,!"$I$J$?$+$465<x$/$@$5$$$^$9$h$&!"$*4j$$CW$7$^$9!#(B
> > >
> > > $B0J>e$G$9!#(B
> > >
> > > _______________________________________________
> > > Linux-ha-japan mailing list
> > > Linux-ha-japan [at] lists
> > > http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> > >
> >
> >
> > --
> > ----------------------------------------------------------------------
> > $B9uLZ(B $BGn(B ($B3t(B)$B%5!<%I%&%'%"(B
> >
> > Kuroki Hiroshi 135-0034 $BEl5~ET9>El6h1JBe(B2-31-13 $B%t%#%i%*!<%/%i(B2F
> > hkuroki [at] 3ware URL: http://www.3ware.co.jp/
> > Phone: 03-4530-8670 Fax: 03-5809-8260
>
> --
>
> ********************************
> $B%9%?!<%7%9%F%`%:3t<02q<R(B
> $BEl5~ET9A6hFn@D;3(B7-10-3
> $BFn@D;3(BST$B%S%k(B5F
> $B<D5\!!H;?M(B
> TEL:03-5774-4086
> FAX:03-3409-3135
> E-Mail:hayato [at] starsystems
> ********************************
>


--
----------------------------------------------------------------------
$B9uLZ(B $BGn(B ($B3t(B)$B%5!<%I%&%'%"(B

Kuroki Hiroshi 135-0034 $BEl5~ET9>El6h1JBe(B2-31-13 $B%t%#%i%*!<%/%i(B2F
hkuroki [at] 3ware URL: http://www.3ware.co.jp/
Phone: 03-4530-8670 Fax: 03-5809-8260

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


hayato at starsystems

Jan 11, 2012, 7:49 PM

Post #6 of 11 (509 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B9uLZMM(B
$BCSEDMM(B

$B$*@$OC$K$J$j$^$9!#(B
$B<D5\$G$9!#(B


$BCSEDMM$+$i$N$4Ds<($5$l$?@_Dj$rN.$79~$_(B
$B:FEY!"(Bnode1$B$N(BHeartbeat$B%W%m%;%9$rMn$H$9(B
$B;n83$r9T$$$^$7$?$,!">u67$KJQ2=$"$j$^$;(B
$B$s$G$7$?!#(B
$B"(0l=V(Bnode2$BB&$,(Bprimary$B$K$J$k$N$b0l=o$G$9(B

$B0J2<!"%m%0$rH4?h$5$;$FD:$-$^$9$N$G!"(B
$B$*<j?t$r$*3]$1CW$7$^$9$,!"$465<x$NDx!"(B
$B59$7$/$*4j$$CW$7$^$9!#(B
$B"(BgNL$N%m%0$G!"?=$7Lu$"$j$^$;$s!#(B

$B!c%m%0!d(B
Jan 12 11:31:25 node2 pengine: [3735]: notice: clone_print: Master/Slave Set: ms_drbd
Jan 12 11:31:25 node2 pengine: [3735]: notice: short_print: Masters: [ node1 ]
Jan 12 11:31:25 node2 pengine: [3735]: notice: short_print: Slaves: [ node2 ]
Jan 12 11:31:25 node2 pengine: [3735]: info: native_color: Resource drbd_hadoop:1 cannot run anywhere
Jan 12 11:31:25 node2 pengine: [3735]: info: master_color: Promoting drbd_hadoop:0 (Slave node2)
Jan 12 11:31:25 node2 pengine: [3735]: info: master_color: ms_drbd: Promoted 1 instances of a possible 1 to master
Jan 12 11:31:25 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (10s) for drbd_hadoop:0 on node2
Jan 12 11:31:25 node2 pengine: [3735]: info: RecurringOp: Cancelling action drbd_hadoop:0_monitor_20000 (Slave vs. Master)
Jan 12 11:31:25 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (10s) for drbd_hadoop:0 on node2
Jan 12 11:31:25 node2 pengine: [3735]: info: RecurringOp: Cancelling action drbd_hadoop:0_monitor_20000 (Slave vs. Master)
Jan 12 11:31:25 node2 pengine: [3735]: info: stage6: Scheduling Node node1 for shutdown
Jan 12 11:31:25 node2 pengine: [3735]: notice: LogActions: Promote drbd_hadoop:0 (Slave -> Master node2)
Jan 12 11:31:25 node2 pengine: [3735]: notice: LogActions: Demote drbd_hadoop:1 (Master -> Stopped node1)
Jan 12 11:31:25 node2 pengine: [3735]: notice: LogActions: Stop resource drbd_hadoop:1 (node1)
Jan 12 11:31:25 node2 pengine: [3735]: info: process_pe_message: Transition 3: PEngine Input stored in: /var/lib/pengine/pe-input-1520.bz2
Jan 12 11:31:25 node2 crmd: [3731]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jan 12 11:31:25 node2 crmd: [3731]: info: unpack_graph: Unpacked transition 3: 34 actions in 34 synapses
Jan 12 11:31:25 node2 crmd: [3731]: info: do_te_invoke: Processing graph 3 (ref=pe_calc-dc-1326335485-33) derived from /var/lib/pengine/pe-input-1520.bz2
Jan 12 11:31:25 node2 crmd: [3731]: info: te_rsc_command: Initiating action 2: cancel drbd_hadoop:0_monitor_20000 on node2 (local)
Jan 12 11:31:25 node2 lrmd: [3728]: info: cancel_op: operation monitor[5] on ocf::drbd::drbd_hadoop:0 for client 3731, its parameters: CRM_meta_interval=[20000] CRM_meta_role=[Slave] drbdconf=[/etc/drbd.conf] drbd_resource=[r0] CRM_meta_master_max=[1] CRM_meta_on_fail=[restart] CRM_meta_timeout=[20000] CRM_meta_clone_max=[2] CRM_meta_master_node_max=[1] crm_feature_set=[3.0.1] CRM_meta_globally_unique=[false] CRM_meta_name=[monitor] CRM_meta_clone=[0] CRM_meta_clone_node_max=[1] CRM_meta_notify=[true] cancelled
Jan 12 11:31:25 node2 crmd: [3731]: info: send_direct_ack: ACK'ing resource op drbd_hadoop:0_monitor_20000 from 2:3:0:3ab7eec5-827b-4fa1-b137-65c9363ad006: lrm_invoke-lrmd-1326335485-35
Jan 12 11:31:25 node2 crmd: [3731]: info: process_te_message: Processing (N)ACK lrm_invoke-lrmd-1326335485-35 from node2
Jan 12 11:31:25 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_monitor_20000 (2) confirmed on node2 (rc=0)
Jan 12 11:31:25 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 34 fired and confirmed
Jan 12 11:31:25 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_monitor_20000 (call=5, status=1, cib-update=0, confirmed=true) Cancelled
Jan 12 11:31:25 node2 crmd: [3731]: info: te_rsc_command: Initiating action 50: notify drbd_hadoop:0_pre_notify_demote_0 on node2 (local)
Jan 12 11:31:25 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=50:3:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:25 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:6: notify
Jan 12 11:31:25 node2 crmd: [3731]: info: te_rsc_command: Initiating action 52: notify drbd_hadoop:1_pre_notify_demote_0 on node1
Jan 12 11:31:25 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=6, rc=0, cib-update=57, confirmed=true) ok
Jan 12 11:31:25 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_pre_notify_demote_0 (50) confirmed on node2 (rc=0)
Jan 12 11:31:27 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:1_pre_notify_demote_0 (52) confirmed on node1 (rc=0)
Jan 12 11:31:27 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 35 fired and confirmed
Jan 12 11:31:27 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 32 fired and confirmed
Jan 12 11:31:27 node2 crmd: [3731]: info: te_rsc_command: Initiating action 12: demote drbd_hadoop:1_demote_0 on node1
Jan 12 11:31:27 node2 kernel: block drbd0: peer( Primary -> Secondary )
Jan 12 11:31:28 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:1_demote_0 (12) confirmed on node1 (rc=0)
Jan 12 11:31:28 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 33 fired and confirmed
Jan 12 11:31:28 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 36 fired and confirmed
Jan 12 11:31:28 node2 crmd: [3731]: info: te_rsc_command: Initiating action 51: notify drbd_hadoop:0_post_notify_demote_0 on node2 (local)
Jan 12 11:31:28 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=51:3:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:28 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:7: notify
Jan 12 11:31:28 node2 crmd: [3731]: info: te_rsc_command: Initiating action 53: notify drbd_hadoop:1_post_notify_demote_0 on node1
Jan 12 11:31:28 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:28 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=7, rc=0, cib-update=58, confirmed=true) ok
Jan 12 11:31:28 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_post_notify_demote_0 (51) confirmed on node2 (rc=0)
Jan 12 11:31:30 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:1_post_notify_demote_0 (53) confirmed on node1 (rc=0)
Jan 12 11:31:30 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 37 fired and confirmed
Jan 12 11:31:30 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 22 fired and confirmed
Jan 12 11:31:30 node2 crmd: [3731]: info: te_rsc_command: Initiating action 45: notify drbd_hadoop:0_pre_notify_stop_0 on node2 (local)
Jan 12 11:31:30 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=45:3:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:30 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:8: notify
Jan 12 11:31:30 node2 crmd: [3731]: info: te_rsc_command: Initiating action 47: notify drbd_hadoop:1_pre_notify_stop_0 on node1
Jan 12 11:31:30 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=8, rc=0, cib-update=59, confirmed=true) ok
Jan 12 11:31:30 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_pre_notify_stop_0 (45) confirmed on node2 (rc=0)
Jan 12 11:31:32 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:1_pre_notify_stop_0 (47) confirmed on node1 (rc=0)
Jan 12 11:31:32 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 23 fired and confirmed
Jan 12 11:31:32 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 20 fired and confirmed
Jan 12 11:31:32 node2 crmd: [3731]: info: te_rsc_command: Initiating action 13: stop drbd_hadoop:1_stop_0 on node1
Jan 12 11:31:33 node2 kernel: block drbd0: peer( Secondary -> Unknown ) conn( Connected -> TearDown ) pdsk( UpToDate -> DUnknown )
Jan 12 11:31:33 node2 kernel: block drbd0: meta connection shut down by peer.
Jan 12 11:31:33 node2 kernel: block drbd0: asender terminated
Jan 12 11:31:33 node2 kernel: block drbd0: Terminating asender thread
Jan 12 11:31:33 node2 kernel: block drbd0: Connection closed
Jan 12 11:31:33 node2 kernel: block drbd0: conn( TearDown -> Unconnected )
Jan 12 11:31:33 node2 kernel: block drbd0: receiver terminated
Jan 12 11:31:33 node2 kernel: block drbd0: Restarting receiver thread
Jan 12 11:31:33 node2 kernel: block drbd0: receiver (re)started
Jan 12 11:31:33 node2 kernel: block drbd0: conn( Unconnected -> WFConnection )
Jan 12 11:31:34 node2 attrd: [3730]: info: attrd_ha_callback: flush message from node1
Jan 12 11:31:34 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:164 - Triggered transition abort (complete=0, tag=transient_attributes, id=d70cae93-bce1-4389-808f-facb2ce776f4, magic=NA, cib=0.8.27) : Transient attribute: removal
Jan 12 11:31:34 node2 crmd: [3731]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jan 12 11:31:34 node2 crmd: [3731]: info: update_abort_priority: Abort action done superceeded by restart
Jan 12 11:31:34 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:1_stop_0 (13) confirmed on node1 (rc=0)
Jan 12 11:31:34 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 21 fired and confirmed
Jan 12 11:31:34 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 24 fired and confirmed
Jan 12 11:31:34 node2 crmd: [3731]: info: te_rsc_command: Initiating action 46: notify drbd_hadoop:0_post_notify_stop_0 on node2 (local)
Jan 12 11:31:34 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=46:3:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:34 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:9: notify
Jan 12 11:31:34 node2 attrd: [3730]: info: attrd_trigger_update: Sending flush op to all hosts for: master-drbd_hadoop:0 (1000)
Jan 12 11:31:34 node2 attrd: [3730]: info: attrd_perform_update: Sent update 20: master-drbd_hadoop:0=1000
Jan 12 11:31:34 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:34 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=9, rc=0, cib-update=60, confirmed=true) ok
Jan 12 11:31:34 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0, name=NA, value=1000, magic=NA, cib=0.8.29) : Transient attribute: update
Jan 12 11:31:34 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_post_notify_stop_0 (46) confirmed on node2 (rc=0)
Jan 12 11:31:34 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 25 fired and confirmed
Jan 12 11:31:34 node2 crmd: [3731]: info: run_graph: ====================================================
Jan 12 11:31:34 node2 crmd: [3731]: notice: run_graph: Transition 3 (Complete=22, Pending=0, Fired=0, Skipped=8, Incomplete=4, Source=/var/lib/pengine/pe-input-1520.bz2): Stopped
Jan 12 11:31:34 node2 crmd: [3731]: info: te_graph_trigger: Transition 3 is now complete
Jan 12 11:31:36 node2 crmd: [3731]: info: crm_timer_popped: New Transition Timer (I_PE_CALC) just popped!
Jan 12 11:31:36 node2 crmd: [3731]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_TIMER_POPPED origin=crm_timer_popped ]
Jan 12 11:31:36 node2 crmd: [3731]: info: do_state_transition: Progressed to state S_POLICY_ENGINE after C_TIMER_POPPED
Jan 12 11:31:36 node2 crmd: [3731]: info: do_state_transition: All 2 cluster nodes are eligible to run resources.
Jan 12 11:31:36 node2 crmd: [3731]: info: do_pe_invoke: Query 61: Requesting the current CIB: S_POLICY_ENGINE
Jan 12 11:31:36 node2 crmd: [3731]: info: do_pe_invoke_callback: Invoking the PE: query=61, ref=pe_calc-dc-1326335496-45, seq=2, quorate=1
Jan 12 11:31:36 node2 pengine: [3735]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 12 11:31:36 node2 pengine: [3735]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 12 11:31:36 node2 pengine: [3735]: WARN: unpack_nodes: Blind faith: not fencing unseen nodes
Jan 12 11:31:36 node2 pengine: [3735]: info: determine_online_status: Node node1 is shutting down
Jan 12 11:31:36 node2 pengine: [3735]: info: determine_online_status: Node node2 is online
Jan 12 11:31:36 node2 pengine: [3735]: notice: unpack_rsc_op: Operation drbd_hadoop:1_monitor_0 found resource drbd_hadoop:1 active on node1
Jan 12 11:31:36 node2 pengine: [3735]: notice: unpack_rsc_op: Operation drbd_hadoop:0_monitor_0 found resource drbd_hadoop:0 active on node2
Jan 12 11:31:36 node2 pengine: [3735]: notice: clone_print: Master/Slave Set: ms_drbd
Jan 12 11:31:36 node2 pengine: [3735]: notice: short_print: Slaves: [ node2 ]
Jan 12 11:31:36 node2 pengine: [3735]: notice: short_print: Stopped: [ drbd_hadoop:1 ]
Jan 12 11:31:36 node2 pengine: [3735]: info: native_color: Resource drbd_hadoop:1 cannot run anywhere
Jan 12 11:31:36 node2 pengine: [3735]: info: master_color: Promoting drbd_hadoop:0 (Slave node2)
Jan 12 11:31:36 node2 pengine: [3735]: info: master_color: ms_drbd: Promoted 1 instances of a possible 1 to master
Jan 12 11:31:36 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (10s) for drbd_hadoop:0 on node2
Jan 12 11:31:36 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (10s) for drbd_hadoop:0 on node2
Jan 12 11:31:36 node2 pengine: [3735]: info: stage6: Scheduling Node node1 for shutdown
Jan 12 11:31:36 node2 pengine: [3735]: notice: LogActions: Promote drbd_hadoop:0 (Slave -> Master node2)
Jan 12 11:31:36 node2 pengine: [3735]: notice: LogActions: Leave resource drbd_hadoop:1 (Stopped)
Jan 12 11:31:36 node2 crmd: [3731]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jan 12 11:31:36 node2 crmd: [3731]: WARN: destroy_action: Cancelling timer for action 2 (src=75)
Jan 12 11:31:36 node2 crmd: [3731]: info: unpack_graph: Unpacked transition 4: 11 actions in 11 synapses
Jan 12 11:31:36 node2 crmd: [3731]: info: do_te_invoke: Processing graph 4 (ref=pe_calc-dc-1326335496-45) derived from /var/lib/pengine/pe-input-1521.bz2
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 24 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_crm_command: Executing crm-event (36): do_shutdown on node1
Jan 12 11:31:36 node2 crmd: [3731]: info: te_rsc_command: Initiating action 45: notify drbd_hadoop:0_pre_notify_promote_0 on node2 (local)
Jan 12 11:31:36 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=45:4:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:36 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:10: notify
Jan 12 11:31:36 node2 pengine: [3735]: info: process_pe_message: Transition 4: PEngine Input stored in: /var/lib/pengine/pe-input-1521.bz2
Jan 12 11:31:36 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=10, rc=0, cib-update=62, confirmed=true) ok
Jan 12 11:31:36 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_pre_notify_promote_0 (45) confirmed on node2 (rc=0)
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 25 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 22 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_rsc_command: Initiating action 8: promote drbd_hadoop:0_promote_0 on node2 (local)
Jan 12 11:31:36 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=8:4:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_promote_0 )
Jan 12 11:31:36 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:11: promote
Jan 12 11:31:36 node2 kernel: block drbd0: role( Secondary -> Primary )
Jan 12 11:31:36 node2 kernel: block drbd0: Creating new current UUID
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:promote:stdout)
Jan 12 11:31:36 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_promote_0 (call=11, rc=0, cib-update=63, confirmed=true) ok
Jan 12 11:31:36 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_promote_0 (8) confirmed on node2 (rc=0)
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 23 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 26 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_rsc_command: Initiating action 46: notify drbd_hadoop:0_post_notify_promote_0 on node2 (local)
Jan 12 11:31:36 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=46:4:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:36 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:12: notify
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_trigger_update: Sending flush op to all hosts for: master-drbd_hadoop:0 (10000)
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_perform_update: Sent update 22: master-drbd_hadoop:0=10000
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:36 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0, name=NA, value=10000, magic=NA, cib=0.8.33) : Transient attribute: update
Jan 12 11:31:36 node2 crmd: [3731]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jan 12 11:31:36 node2 crmd: [3731]: info: update_abort_priority: Abort action done superceeded by restart
Jan 12 11:31:36 node2 kernel: block drbd0: State change failed: Need access to UpToDate data
Jan 12 11:31:36 node2 kernel: block drbd0: state = { cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown r--- }
Jan 12 11:31:36 node2 kernel: block drbd0: wanted = { cs:WFConnection ro:Primary/Unknown ds:Outdated/DUnknown r--- }
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Called drbdadm -c /etc/drbd.conf outdate r0
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Exit code 17
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Command output:
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_trigger_update: Sending flush op to all hosts for: master-drbd_hadoop:0 (-INFINITY)
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_perform_update: Sent update 24: master-drbd_hadoop:0=-INFINITY
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:36 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=12, rc=0, cib-update=64, confirmed=true) ok
Jan 12 11:31:36 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0, name=NA, value=-INFINITY, magic=NA, cib=0.8.34) : Transient attribute: update
Jan 12 11:31:36 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_post_notify_promote_0 (46) confirmed on node2 (rc=0)
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 27 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: run_graph: ====================================================
Jan 12 11:31:36 node2 crmd: [3731]: notice: run_graph: Transition 4 (Complete=10, Pending=0, Fired=0, Skipped=1, Incomplete=0, Source=/var/lib/pengine/pe-input-1521.bz2): Stopped
Jan 12 11:31:36 node2 crmd: [3731]: info: te_graph_trigger: Transition 4 is now complete
Jan 12 11:31:37 node2 crmd: [3731]: notice: crmd_client_status_callback: Status update: Client node1/crmd now has status [offline] (DC=true)
Jan 12 11:31:37 node2 crmd: [3731]: info: crm_update_peer_proc: node1.crmd is now offline
Jan 12 11:31:37 node2 crmd: [3731]: info: erase_node_from_join: Removed node node1 from join calculations: welcomed=0 itegrated=0 finalized=0 confirmed=1
Jan 12 11:31:37 node2 cib: [3727]: info: cib_process_shutdown_req: Shutdown REQ from node1
Jan 12 11:31:37 node2 cib: [3727]: info: cib_process_request: Operation complete: op cib_shutdown_req for section 'all' (origin=node1/node1/(null), version=0.8.36): ok (rc=0)
Jan 12 11:31:38 node2 crmd: [3731]: info: crm_timer_popped: New Transition Timer (I_PE_CALC) just popped!
Jan 12 11:31:38 node2 crmd: [3731]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_TIMER_POPPED origin=crm_timer_popped ]
Jan 12 11:31:38 node2 crmd: [3731]: info: do_state_transition: Progressed to state S_POLICY_ENGINE after C_TIMER_POPPED
Jan 12 11:31:38 node2 crmd: [3731]: WARN: do_state_transition: Only 1 of 2 cluster nodes are eligible to run resources - continue 0
Jan 12 11:31:38 node2 crmd: [3731]: info: do_pe_invoke: Query 66: Requesting the current CIB: S_POLICY_ENGINE
Jan 12 11:31:38 node2 crmd: [3731]: info: do_pe_invoke_callback: Invoking the PE: query=66, ref=pe_calc-dc-1326335498-50, seq=2, quorate=1
Jan 12 11:31:38 node2 pengine: [3735]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 12 11:31:38 node2 pengine: [3735]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 12 11:31:38 node2 pengine: [3735]: WARN: unpack_nodes: Blind faith: not fencing unseen nodes
Jan 12 11:31:38 node2 pengine: [3735]: info: determine_online_status: Node node2 is online
Jan 12 11:31:38 node2 pengine: [3735]: notice: unpack_rsc_op: Operation drbd_hadoop:0_monitor_0 found resource drbd_hadoop:0 active on node2
Jan 12 11:31:38 node2 pengine: [3735]: notice: clone_print: Master/Slave Set: ms_drbd
Jan 12 11:31:38 node2 pengine: [3735]: notice: short_print: Masters: [ node2 ]
Jan 12 11:31:38 node2 pengine: [3735]: notice: short_print: Stopped: [ drbd_hadoop:1 ]
Jan 12 11:31:38 node2 pengine: [3735]: info: native_color: Resource drbd_hadoop:1 cannot run anywhere
Jan 12 11:31:38 node2 pengine: [3735]: info: master_color: ms_drbd: Promoted 0 instances of a possible 1 to master
Jan 12 11:31:38 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (20s) for drbd_hadoop:0 on node2
Jan 12 11:31:38 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (20s) for drbd_hadoop:0 on node2
Jan 12 11:31:38 node2 pengine: [3735]: notice: LogActions: Demote drbd_hadoop:0 (Master -> Slave node2)
Jan 12 11:31:38 node2 pengine: [3735]: notice: LogActions: Leave resource drbd_hadoop:1 (Stopped)
Jan 12 11:31:38 node2 crmd: [3731]: info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Jan 12 11:31:38 node2 crmd: [3731]: info: unpack_graph: Unpacked transition 5: 10 actions in 10 synapses
Jan 12 11:31:38 node2 crmd: [3731]: info: do_te_invoke: Processing graph 5 (ref=pe_calc-dc-1326335498-50) derived from /var/lib/pengine/pe-input-1522.bz2
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 28 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: te_rsc_command: Initiating action 44: notify drbd_hadoop:0_pre_notify_demote_0 on node2 (local)
Jan 12 11:31:38 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=44:5:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:38 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:13: notify
Jan 12 11:31:38 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=13, rc=0, cib-update=67, confirmed=true) ok
Jan 12 11:31:38 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_pre_notify_demote_0 (44) confirmed on node2 (rc=0)
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 29 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 26 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: te_rsc_command: Initiating action 6: demote drbd_hadoop:0_demote_0 on node2 (local)
Jan 12 11:31:38 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=6:5:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_demote_0 )
Jan 12 11:31:38 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:14: demote
Jan 12 11:31:38 node2 pengine: [3735]: info: process_pe_message: Transition 5: PEngine Input stored in: /var/lib/pengine/pe-input-1522.bz2
Jan 12 11:31:38 node2 kernel: block drbd0: role( Primary -> Secondary )
Jan 12 11:31:38 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:demote:stdout)
Jan 12 11:31:38 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_demote_0 (call=14, rc=0, cib-update=68, confirmed=true) ok
Jan 12 11:31:38 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_demote_0 (6) confirmed on node2 (rc=0)
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 27 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 30 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: te_rsc_command: Initiating action 45: notify drbd_hadoop:0_post_notify_demote_0 on node2 (local)
Jan 12 11:31:38 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing key=45:5:0:3ab7eec5-827b-4fa1-b137-65c9363ad006 op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:38 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:15: notify
Jan 12 11:31:38 node2 attrd: [3730]: info: attrd_trigger_update: Sending flush op to all hosts for: master-drbd_hadoop:0 (1000)
Jan 12 11:31:38 node2 attrd: [3730]: info: attrd_perform_update: Sent update 26: master-drbd_hadoop:0=1000
Jan 12 11:31:38 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:38 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0, name=NA, value=1000, magic=NA, cib=0.8.39) : Transient attribute: update
Jan 12 11:31:38 node2 crmd: [3731]: info: update_abort_priority: Abort priority upgraded from 0 to 1000000
Jan 12 11:31:38 node2 crmd: [3731]: info: update_abort_priority: Abort action done superceeded by restart
Jan 12 11:31:38 node2 kernel: block drbd0: disk( UpToDate -> Outdated )
Jan 12 11:31:38 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:38 node2 attrd: [3730]: info: attrd_trigger_update: Sending flush op to all hosts for: master-drbd_hadoop:0 (-INFINITY)
Jan 12 11:31:38 node2 attrd: [3730]: info: attrd_perform_update: Sent update 28: master-drbd_hadoop:0=-INFINITY
Jan 12 11:31:38 node2 lrmd: [3728]: info: RA output: (drbd_hadoop:0:notify:stdout)
Jan 12 11:31:38 node2 crmd: [3731]: info: process_lrm_event: LRM operation drbd_hadoop:0_notify_0 (call=15, rc=0, cib-update=69, confirmed=true) ok
Jan 12 11:31:38 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:150 - Triggered transition abort (complete=0, tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0, name=NA, value=-INFINITY, magic=NA, cib=0.8.40) : Transient attribute: update
Jan 12 11:31:38 node2 crmd: [3731]: info: match_graph_event: Action drbd_hadoop:0_post_notify_demote_0 (45) confirmed on node2 (rc=0)
Jan 12 11:31:38 node2 crmd: [3731]: info: te_pseudo_action: Pseudo action 31 fired and confirmed
Jan 12 11:31:38 node2 crmd: [3731]: info: run_graph: ====================================================
Jan 12 11:31:38 node2 crmd: [3731]: notice: run_graph: Transition 5 (Complete=9, Pending=0, Fired=0, Skipped=1, Incomplete=0, Source=/var/lib/pengine/pe-input-1522.bz2): Stopped
Jan 12 11:31:38 node2 crmd: [3731]: info: te_graph_trigger: Transition 5 is now complete
Jan 12 11:31:38 node2 ccm: [3726]: info: Break tie for 2 nodes cluster
Jan 12 11:31:38 node2 crmd: [3731]: info: mem_handle_event: Got an event OC_EV_MS_INVALID from ccm
Jan 12 11:31:38 node2 crmd: [3731]: info: mem_handle_event: no mbr_track info
Jan 12 11:31:38 node2 crmd: [3731]: info: mem_handle_event: Got an event OC_EV_MS_NEW_MEMBERSHIP from ccm
Jan 12 11:31:38 node2 crmd: [3731]: info: mem_handle_event: instance=3, nodes=1, new=0, lost=1, n_idx=0, new_idx=1, old_idx=3
Jan 12 11:31:38 node2 crmd: [3731]: info: crmd_ccm_msg_callback: Quorum (re)attained after event=NEW MEMBERSHIP (id=3)
Jan 12 11:31:38 node2 crmd: [3731]: info: ccm_event_detail: NEW MEMBERSHIP: trans=3, nodes=1, new=0, lost=1 n_idx=0, new_idx=1, old_idx=3
Jan 12 11:31:38 node2 crmd: [3731]: info: ccm_event_detail: CURRENT: node2 [nodeid=1, born=3]
Jan 12 11:31:38 node2 crmd: [3731]: info: ccm_event_detail: LOST: node1 [nodeid=0, born=2]
Jan 12 11:31:38 node2 crmd: [3731]: info: ais_status_callback: status: node1 is now lost (was member)
Jan 12 11:31:38 node2 crmd: [3731]: info: crm_update_peer: Node node1: id=0 state=lost (new) addr=(null) votes=-1 born=2 seen=2 proc=00000000000000000000000000000002
Jan 12 11:31:38 node2 crmd: [3731]: info: populate_cib_nodes_ha: Requesting the list of configured nodes
Jan 12 11:31:38 node2 cib: [3727]: info: cib_client_status_callback: Status update: Client node1/cib now has status [leave]
Jan 12 11:31:38 node2 cib: [3727]: info: crm_update_peer_proc: node1.cib is now offline
Jan 12 11:31:38 node2 cib: [3727]: info: mem_handle_event: Got an event OC_EV_MS_INVALID from ccm
Jan 12 11:31:38 node2 cib: [3727]: info: mem_handle_event: no mbr_track info
Jan 12 11:31:38 node2 cib: [3727]: info: mem_handle_event: Got an event OC_EV_MS_NEW_MEMBERSHIP from ccm
Jan 12 11:31:38 node2 cib: [3727]: info: mem_handle_event: instance=3, nodes=1, new=0, lost=1, n_idx=0, new_idx=1, old_idx=3
Jan 12 11:31:38 node2 cib: [3727]: info: cib_ccm_msg_callback: Processing CCM event=NEW MEMBERSHIP (id=3)
Jan 12 11:31:38 node2 cib: [3727]: info: crm_update_peer: Node node1: id=0 state=lost (new) addr=(null) votes=-1 born=2 seen=2 proc=00000000000000000000000000000202
Jan 12 11:31:38 node2 cib: [3727]: info: cib_process_request: Operation complete: op cib_modify for section nodes (origin=local/crmd/70, version=0.8.41): ok (rc=0)
Jan 12 11:31:38 node2 crmd: [3731]: WARN: match_down_event: No match for shutdown action on d70cae93-bce1-4389-808f-facb2ce776f4
Jan 12 11:31:38 node2 crmd: [3731]: info: te_update_diff: Stonith/shutdown of d70cae93-bce1-4389-808f-facb2ce776f4 not matched
Jan 12 11:31:38 node2 crmd: [3731]: info: abort_transition_graph: te_update_diff:198 - Triggered transition abort (complete=1, tag=node_state, id=d70cae93-bce1-4389-808f-facb2ce776f4, magic=NA, cib=0.8.42) : Node failure
Jan 12 11:31:40 node2 crmd: [3731]: info: crm_timer_popped: New Transition Timer (I_PE_CALC) just popped!
Jan 12 11:31:40 node2 crmd: [3731]: info: do_state_transition: State transition S_TRANSITION_ENGINE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_TIMER_POPPED origin=crm_timer_popped ]
Jan 12 11:31:40 node2 crmd: [3731]: info: do_state_transition: Progressed to state S_POLICY_ENGINE after C_TIMER_POPPED
Jan 12 11:31:40 node2 crmd: [3731]: info: do_state_transition: All 1 cluster nodes are eligible to run resources.
Jan 12 11:31:40 node2 crmd: [3731]: info: do_pe_invoke: Query 72: Requesting the current CIB: S_POLICY_ENGINE
Jan 12 11:31:40 node2 crmd: [3731]: info: do_pe_invoke_callback: Invoking the PE: query=72, ref=pe_calc-dc-1326335500-55, seq=3, quorate=1
Jan 12 11:31:40 node2 pengine: [3735]: notice: unpack_config: On loss of CCM Quorum: Ignore
Jan 12 11:31:40 node2 pengine: [3735]: info: unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 12 11:31:40 node2 pengine: [3735]: WARN: unpack_nodes: Blind faith: not fencing unseen nodes
Jan 12 11:31:40 node2 pengine: [3735]: info: determine_online_status: Node node2 is online
Jan 12 11:31:40 node2 pengine: [3735]: notice: unpack_rsc_op: Operation drbd_hadoop:0_monitor_0 found resource drbd_hadoop:0 active on node2
Jan 12 11:31:40 node2 pengine: [3735]: notice: clone_print: Master/Slave Set: ms_drbd
Jan 12 11:31:40 node2 pengine: [3735]: notice: short_print: Slaves: [ node2 ]
Jan 12 11:31:40 node2 pengine: [3735]: notice: short_print: Stopped: [ drbd_hadoop:1 ]
Jan 12 11:31:40 node2 pengine: [3735]: info: native_color: Resource drbd_hadoop:1 cannot run anywhere
Jan 12 11:31:40 node2 pengine: [3735]: info: master_color: ms_drbd: Promoted 0 instances of a possible 1 to master
Jan 12 11:31:40 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (20s) for drbd_hadoop:0 on node2
Jan 12 11:31:40 node2 pengine: [3735]: notice: RecurringOp: Start recurring monitor (20s) for drbd_hadoop:0 on node2
Jan 12 11:31:40 node2 pengine: [3735]: notice: LogActions: Leave resource drbd_hadoop:0 (Slave node2)
Jan 12 11:31:40 node2 pengine: [3735]: notice: LogActions: Leave resource drbd_hadoop:1 (Stopped)


--

********************************
$B%9%?!<%7%9%F%`%:3t<02q<R(B
$BEl5~ET9A6hFn@D;3(B7-10-3
$BFn@D;3(BST$B%S%k(B5F
$B<D5\!!H;?M(B
TEL:03-5774-4086
FAX:03-3409-3135
E-Mail:hayato [at] starsystems
********************************

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


tsukishima.ha at gmail

Jan 11, 2012, 9:06 PM

Post #7 of 11 (560 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

$BCSED$G$9!#(B
$BE:IU$7$F$$$?$@$$$?%m%0$N(B131$B9TL\0J9_(B
$B2<5-$NF0:n$,=PNO$5$l$F$$$^$9!#(B

(1) node2$B$O(BPrimary$B$+$i(BSecondary$B$X>:3JF0:n3+;O(B
(2) node2$B$N(BDRBD$B$N>uBV$,!V(BOutdated$B!W>uBV$N$?$a>:3JF0:n$rDd;_(B

$B!V(BOutdated$B!W>uBV$H$O(Bnode1$B$H(Bnode2$B$N%G!<%?$KIT [at] 09$,H/@8$7$?2DG=@-$,(B
$B$"$k$3$H$r<($7$F$$$^$9!#(B

DRBD$BC1BN$G<jF0%U%'%$%k%*!<%P$O@.8y$7$?$H$N$3$H$G$9$,(B
DRBD, Pacemaker$B$N%M%C%H%o!<%/@_Dj$O$I$N$h$&$K$J$C$F$$$k$N$G$7$g$&$+!#(B
$B$^$?!"(Bdrbd.conf$B$G%U%'%s%7%s%0$N [at] _D(B(crm-fence-peer.sh)$B$O9T$C$F$$$^$9$+!)(B

$B2DG=$G$"$l$P!"(Bha.cf$B$H(Bdrbd.conf$B$N [at] _D$rE:IU$7$F$$$?$@$1$J$$$G$7$g$&$+!#(B
$B$^$?!"(BHeartbeat$B$N%$%s%?!<%3%M%/%H(BLAN, DRBD$B$NF14|(BLAN$B$N(B
$B%M%C%H%o!<%/9=@.$r3+<($7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B


Jan 12 11:31:36 node2 kernel: block drbd0: role( Secondary -> Primary )
Jan 12 11:31:36 node2 kernel: block drbd0: Creating new current UUID
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output:
(drbd_hadoop:0:promote:stdout)
Jan 12 11:31:36 node2 crmd: [3731]: info: process_lrm_event: LRM
operation drbd_hadoop:0_promote_0 (call=11, rc=0, cib-update=63,
confirmed=true) ok
Jan 12 11:31:36 node2 crmd: [3731]: info: match_graph_event: Action
drbd_hadoop:0_promote_0 (8) confirmed on node2 (rc=0)
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo
action 23 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_pseudo_action: Pseudo
action 26 fired and confirmed
Jan 12 11:31:36 node2 crmd: [3731]: info: te_rsc_command: Initiating
action 46: notify drbd_hadoop:0_post_notify_promote_0 on node2 (local)
Jan 12 11:31:36 node2 crmd: [3731]: info: do_lrm_rsc_op: Performing
key=46:4:0:3ab7eec5-827b-4fa1-b137-65c9363ad006
op=drbd_hadoop:0_notify_0 )
Jan 12 11:31:36 node2 lrmd: [3728]: info: rsc:drbd_hadoop:0:12: notify
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_trigger_update:
Sending flush op to all hosts for: master-drbd_hadoop:0 (10000)
Jan 12 11:31:36 node2 attrd: [3730]: info: attrd_perform_update: Sent
update 22: master-drbd_hadoop:0=10000
Jan 12 11:31:36 node2 lrmd: [3728]: info: RA output:
(drbd_hadoop:0:notify:stdout)
Jan 12 11:31:36 node2 crmd: [3731]: info: abort_transition_graph:
te_update_diff:150 - Triggered transition abort (complete=0,
tag=nvpair, id=status-0a6c069f-e618-4f5b-a7b4-4ba53e5ff890-master-drbd_hadoop:0,
name=NA, value=10000, magic=NA, cib=0.8.33) : Transient attribute:
update
Jan 12 11:31:36 node2 crmd: [3731]: info: update_abort_priority:
Abort priority upgraded from 0 to 1000000
Jan 12 11:31:36 node2 crmd: [3731]: info: update_abort_priority:
Abort action done superceeded by restart
Jan 12 11:31:36 node2 kernel: block drbd0: State change failed: Need
access to UpToDate data
Jan 12 11:31:36 node2 kernel: block drbd0: state = {
cs:WFConnection ro:Primary/Unknown ds:UpToDate/DUnknown r--- }
Jan 12 11:31:36 node2 kernel: block drbd0: wanted = {
cs:WFConnection ro:Primary/Unknown ds:Outdated/DUnknown r--- }
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Called drbdadm -c
/etc/drbd.conf outdate r0
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Exit code 17
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Command output:

$B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$BCSED=_;R(B

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


hayato at starsystems

Jan 15, 2012, 7:43 PM

Post #8 of 11 (475 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$BCSEDMM(B

$B$*@$OC$K$J$j$^$9!#(B
$B<D5\$G$9!#(B


$B%m%0$N$43NG'!"@?$KM-$jFq$&$4$6$$$^$7$?!#(B

ha.cf$B$H(Bdrbd.conf$B$OE:IU$5$;$FD:$-$^$9$N$G(B
$B0z$-B3$-$4;XE&D:$1$^$9MM!"59$7$/$*4j$$CW$7$^$9!#(B

$B0J2<!"%$%s%i%$%s$K$F<:NiCW$7$^$9!#(B
> (1) node2$B$O(BPrimary$B$+$i(BSecondary$B$X>:3JF0:n3+;O(B
> (2) node2$B$N(BDRBD$B$N>uBV$,!V(BOutdated$B!W>uBV$N$?$a>:3JF0:n$rDd;_(B

> $B!V(BOutdated$B!W>uBV$H$O(Bnode1$B$H(Bnode2$B$N%G!<%?$KIT [at] 09$,H/@8$7$?2DG=@-$,(B
> $B$"$k$3$H$r<($7$F$$$^$9!#(B
$B!!!&!V(BOutdated$B!W>uBV$K0\9T$5$;$F$$$k$N$O!"(BDRBD$BB&$NF0:n$J$N$G$7$g$&$+!)(B
$B!!!&(BDRBD$B%5!<%S%95/F0 [at] _D(B(chkconfig)$B$O!"A4%i%s%l%Y%k$K$*$$$F(BOFF$B$GLdBj(B
$B!!!!$J$$$G$7$g$&$+!)(B

> DRBD$BC1BN$G<jF0%U%'%$%k%*!<%P$O@.8y$7$?$H$N$3$H$G$9$,(B
> DRBD, Pacemaker$B$N%M%C%H%o!<%/@_Dj$O$I$N$h$&$K$J$C$F$$$k$N$G$7$g$&$+!#(B
> $B$^$?!"(Bdrbd.conf$B$G%U%'%s%7%s%0$N [at] _D(B(crm-fence-peer.sh)$B$O9T$C$F$$$^$9$+!)(B
drbd.conf$B>e$KL [at] 5$O$7$F$$$^$;$s!#(B

> $B$^$?!"(BHeartbeat$B$N%$%s%?!<%3%M%/%H(BLAN, DRBD$B$NF14|(BLAN$B$N(B
> $B%M%C%H%o!<%/9=@.$r3+<($7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B
$B0J2<$K:G=*E*$JL\I8$N9=@.?^$r5-:\$5$;$FD:$-$^$9!#(B
eth0$BB&$K(BVIP$B$r [at] _D$7(Bhttpd$BEy$N%5!<%S%9Ds6!$r9M$($F$$$^$9!#(B
$B"(8=@_Dj$O!"(BDRBD$B$N$_$G$9!#B>$N%5!<%S%9$O5/F0$7$F$$$^$;$s!#(B
eth1$BB&$O!"(Bheartbeat$B$H(BDRBD$B$NF14|DL?.$H$7$F [at] _D$7$F$$$^$9!#(B

VIP(172.30.5.30/16)
|(service)
---------------------------------
|eth0 |eth0
--------------- -----------------
| | | |
| node1 | | node2 |
| | | |
--------------- -----------------
|eth1 |eth1
---------------------------------
heartbeat,DRBD inter-connect(192.168.0.0/24)


$B0J>e$H$J$j$^$9!#(B
$B59$7$/$*4j$$CW$7$^$9!#(B

--

********************************
$B%9%?!<%7%9%F%`%:3t<02q<R(B
$BEl5~ET9A6hFn@D;3(B7-10-3
$BFn@D;3(BST$B%S%k(B5F
$B<D5\!!H;?M(B
TEL:03-5774-4086
FAX:03-3409-3135
E-Mail:hayato [at] starsystems
********************************
Attachments: drbd.conf (0.66 KB)
  ha.cf (0.17 KB)


tsukishima.ha at gmail

Jan 15, 2012, 11:50 PM

Post #9 of 11 (495 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

> $B!!!&!V(BOutdated$B!W>uBV$K0\9T$5$;$F$$$k$N$O!"(BDRBD$BB&$NF0:n$J$N$G$7$g$&$+!)(B

DRBD$BB&$NF0:n$G$9!#(B

> $B!!!&(BDRBD$B%5!<%S%95/F0 [at] _D(B(chkconfig)$B$O!"A4%i%s%l%Y%k$K$*$$$F(BOFF$B$GLdBj(B
> $B!!!!$J$$$G$7$g$&$+!)(B

$BLdBj$"$j$^$;$s!#(B
Pacemaker$B$,(BDRBD$B$r5/F0$7$^$9$N$G!"(BDRBD$B%5!<%S%95/F0 [at] _D(B(chkconfig)$B$O!"(B
$BA4%i%s%l%Y%k$K$*$$$F(BOFF$B$r [at] _D$7$F$/$@$5$$!#(B

>> $B$^$?!"(Bdrbd.conf$B$G%U%'%s%7%s%0$N [at] _D(B(crm-fence-peer.sh)$B$O9T$C$F$$$^$9$+!)(B
> drbd.conf$B>e$KL [at] 5$O$7$F$$$^$;$s!#(B
> eth1$BB&$O!"(Bheartbeat$B$H(BDRBD$B$NF14|DL?.$H$7$F [at] _D$7$F$$$^$9!#(B

drbd.conf$B$G%U%'%s%7%s%0$O [at] _D$5$l$F$$$^$;$s$,(B
$B0JA0AwIU$7$F$$$?$@$$$?%m%0(B(/var/log/ha-log)$B$N(B150$B9TL\$K(B
$B2<5-$N%a%C%;!<%8$,=PNO$5$l$F$$$^$9!#(B

Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Called drbdadm -c
/etc/drbd.conf outdate r0
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Exit code 17
Jan 12 11:31:36 node2 drbd[4205]: ERROR: r0: Command output:

node2$B$,(BPrimary$B$X>:3J$9$k:]$K!"(Bnode1$B$N%G!<%?$r!V(BOutdated$B!W>uBV$K(B
$BJQ99$7$h$&$H$7$F<:GT$7$F$$$^$9!#(B
# node1$B$O$9$G$KDd;_$7$F$$$k$N$G<:GT$7$F$$$k$N$@$H;W$$$^$9!#(B
$BA02s$N%a!<%k$G!V(BOutdated$B!W>uBV$N$?$a>:3J<:GT$H5-=R$7$^$7$?$,(B
$B!V(BOutdated$B!W>uBV$X$NJQ99<+BN$K<:GT$7$F$$$k$h$&$G$9!#(B
$B<:Ni$$$?$7$^$7$?!#(B
$B$I$A$i$K$7$m!"<D5\MM$N [at] _D$G$O%U%'%s%7%s%0$r;XDj$7$F$$$J$$$N$G(B
Outdate$B$&$s$L$s$O$"$^$j4X78$J$$$h$&$J5$$,$9$k$N$G$9$,!D(B

ha.cf, drbd.conf $B$OFC$KLdBj$J$$$h$&$G$9$N$G(B
$B8=:_$N [at] _D$N$^$^$G2<5-$NF0:n$r3NG'$7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B

(1) node1, node2$B$G(BDRBD$B$r!V<jF0!W5/F0(B
# service drbd start

(2) node1$B$r(BPriamry$B$X(B
# drbdadm primary all

(3) $B$3$N;~E@$GN>%N!<%I$N>uBV$r3NG'(B
# cat /proc/drbd

(4) node1$B$r:F5/F0(B
# shutdown -r now

(5) node2$B$r(BPrimary$B2=(B
# drbdadm primary all
# cat /proc/drbd

(1)$B!A(B(4)$B<B9T;~$N(B/var/log/messages$B$r(B
$BE:IU$7$F$/$@$5$$!#(B

$B$*<j?t$r$*$+$1$$$?$7$^$9$,(B
$B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$BCSED=_;R(B

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


tsukishima.ha at gmail

Jan 26, 2012, 7:38 PM

Post #10 of 11 (458 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$B<D5\MM(B

$B%a!<%j%s%0%j%9%H$NJL%9%l%C%I$G(B
DRBD 8.3.8 + Pacemaker 1.0.11
$B$+$i(B
DRBD 8.3.12 + Pacemaker 1.0.11
$B$X%"%C%W%G!<%H$9$k$H@5>o$KF0:n$7$?$H$$$&(B
$B$4Js9p$r$$$?$@$$$F$$$^$9!#(B

DRBD$B$r(B8.3.8$B$+$i(B8.3.12$B$X%"%C%W%G!<%H$7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B

DRBD 8.3$B7O$N:G?7HG$O(B8.3.12$B$H$J$j$^$9!#(B
http://oss.linbit.com/drbd/

8.4$B7O$K$O?75,5!G=$,DI2C$5$l$F$$$^$9$,$^$@3+H/ESCf$J$N$G(B
$B0BDjHG$N(B8.3$B7O$r;HMQ$5$l$k$3$H$r$*$9$9$a$7$^$9!#(B

$B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$BCSED=_;R(B


hayato at starsystems

Jan 31, 2012, 5:03 PM

Post #11 of 11 (450 views)
Permalink
Re: DRBD primary$B$K>:3J(B$B$7$J$$(B [In reply to]

$BCSEDMM(B

$B$*@$OC$K$J$C$F$*$j$^$9!#(B
$B<D5\$G$9!#(B

$B$4O"MmD:$-M-$jFq$&$4$6$$$^$9!#(B


$B$3$A$i$NET9g$G!"0JA0D:$$$?>pJs$G8!>Z$,(B
$B9T$($F$$$J$$$N$G$9$,!"0J2<$N4D6-2<$G(B
$B;~4V$,<h$l<!Bh!"8!>Z$r9T$C$F$_$^$9!#(B
> $B%a!<%j%s%0%j%9%H$NJL%9%l%C%I$G(B
> DRBD 8.3.8 + Pacemaker 1.0.11
> $B$+$i(B
> DRBD 8.3.12 + Pacemaker 1.0.11

$B?'!9$H$4=u8@$rD:$-!"@?$KM-$jFq$&$4$6$$$^$7$?!#(B

>
> 1. Re: DRBD primary$B$K>:3J$7$J$$(B (Junko IKEDA)
>
> $B<D5\MM(B
>
> $B%a!<%j%s%0%j%9%H$NJL%9%l%C%I$G(B
> DRBD 8.3.8 + Pacemaker 1.0.11
> $B$+$i(B
> DRBD 8.3.12 + Pacemaker 1.0.11
> $B$X%"%C%W%G!<%H$9$k$H@5>o$KF0:n$7$?$H$$$&(B
> $B$4Js9p$r$$$?$@$$$F$$$^$9!#(B
>
> DRBD$B$r(B8.3.8$B$+$i(B8.3.12$B$X%"%C%W%G!<%H$7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B
>
> DRBD 8.3$B7O$N:G?7HG$O(B8.3.12$B$H$J$j$^$9!#(B
> http://oss.linbit.com/drbd/
>
> 8.4$B7O$K$O?75,5!G=$,DI2C$5$l$F$$$^$9$,$^$@3+H/ESCf$J$N$G(B
> $B0BDjHG$N(B8.3$B7O$r;HMQ$5$l$k$3$H$r$*$9$9$a$7$^$9!#(B
>
> $B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B
>
> $BCSED=_;R(B
> -------------- next part --------------
> HTML$B$NE:IU%U%!%$%k$rJ]4I$7$^$7$?(B...
> URL: http://lists.sourceforge.jp/mailman/archives/linux-ha-japan/attachments/20120127/7ea1148d/attachment.html
>
> ------------------------------
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>
>
> $B0J>e(B: Linux-ha-japan $B$^$H$aFI$_(B, 50 $B4,(B, 25 $B9f(B
> *********************************************

--

********************************
$B%9%?!<%7%9%F%`%:3t<02q<R(B
$BEl5~ET9A6hFn@D;3(B7-10-3
$BFn@D;3(BST$B%S%k(B5F
$B<D5\!!H;?M(B
TEL:03-5774-4086
FAX:03-3409-3135
E-Mail:hayato [at] starsystems
********************************

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.