Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Japanese

Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k%P%C%/(B$B$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F(B$B$7$^$&!#(B

 

 

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded


wyama at kke

Nov 2, 2011, 9:06 PM

Post #1 of 5 (483 views)
Permalink
Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k%P%C%/(B$B$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F(B$B$7$^$&!#(B

$B>>EgMM(B
$B;3K\$G$9!#(B

$B$5$C$=$/$42sEz$$$?$@$-!"$^$3$H$K$"$j$,$H$&$4$6$$$^$9!#(B

(2011/11/03 ($BLZ(B) 6:39), Takehiro Matsushima wrote:
> $B$O$8$a$^$7$F!">>Eg$H?=$7$^$9!#(B
> $B;d$b?t%+7nA0$K(BHeartbeatV3 + Pacemaker$B$r;O$a$?AG?M$G$9!#(B
>
> $B$$$/$D$+5$$K$J$kE@$,$"$j$^$7$F!"$*;G$$$5$;$FD:$-$^$9!#(B
>
> $B4X78$J$$$+$b$7$l$^$;$s$,!"(Beth0$BB&$N%M%C%H%o!<%/$,%k!<%?!<$H(BServer$B$N(BNIC$B$H(B
> Mask$B$,0c$&$N$O$J$K$+M}M3$,$"$C$F$N$3$H$G$7$g$&$+!#(B
> Router2$B$N(BServer Network$BB&$O(B193-198$B$N%"%I%l%9$,F1$8%M%C%H%o!<%/$G$9$,!"(B
> Server$B$,$=$l$N30$K$"$k$h$&$J5$$,$7$^$9!#(B

$B?=$7Lu$4$6$^$;$s!#?^$N8m?"$G$7$?!#(B

Router2$B$N%"%I%l%9(B($B%M%C%H%^%9%/(B)$B$O(B
$B!!8m(B=> aa.bb.cc.193/29
$B!!@5(B=> aa.bb.cc.193/28
$B$H$J$j$^$9!#(B

$B0l1~!"0J2<$K=$@5$7$?%M%C%H%o!<%/?^$r:\$;$F$*$-$^$9!#(B

+---------+
| Router1 |
+---------+
(aa.bb.cc.241/29)
|
|
+---------$B2>A[(BIP1------+
| (aa.bb.cc.242/29) |
| |
| |
eth2 eth2
(aa.bb.cc.243/29) (aa.bb.cc.244/29)
+--------+ +--------+ (dd.ee.ff.1/28)
eth3| |eth1 eth1| |eth3 +---------+
+-----| hostM1 |-------------| hostB1 |--------+----| |
| | | | | | | Router3 |
| +--------+ + -------+ | | |
| (aa.bb.cc.202/28) (aa.bb.cc.203/28) | +---------+
| eth0 eth0 |
| | | |
| | (aa.bb.cc.201/28) | |
| +---------$B2>A[(BIP2------+ |
| | |
| | |
| (aa.bb.cc.193/28) |
| +---------+ |
| | Router2 | |
| +---------+ |
| |
+----------------------------------------------+

>
>
>> hostM1$B$N(Beth2$B$K$D$$$F$b0JA0$OF1MM$G$7$?!#$7$+$7$3$N(BhostM1$B$H(BhostB1$B$r(B
>> $BB>$N4D6-$K;}$C$F$$$/$H!"(Beth0$B$K$D$$$F$O%U%'!<%k%*!<%P!<(B&$B%P%C%/$9$k$b(B
>> $B$N$N!"(Beth2$B$K$D$$$F$O$=$&$J$i$J$/$J$j$^$7$?!#(B
>
> $B$3$3$G$$$&B>$N4D6-$H$$$&$N$O$I$&$$$&$3$H$G$7$g$&$+!#(B
> IP$B%"%I%l%9$rJQ$($?$j%M%C%H%o!<%/$N%;%0%a%s%H$rJQ$($?$j!"$O$?$^$?2>A[4D6-$K(B
> $BF~$l$?!"$H$$$&$3$H$G$7$g$&$+!)(B

Router1,2,3$B$O$*5R$5$s$N4D6-$G!"$=$3$K;d$,@_Dj$7F0:n3NG'$7$F$-$?(BhostM1,B1
$B$r;}$C$F$-$?!"$H$$$&<!Bh$G$9!#0\F0A0$N;d$N4D6-$G$b0l1~(BRouter1,2,3$B$r%@%_!<(B
$B$H$7$FCV$$$F;n83$7$F$$$^$7$?!#%@%_!<$H$O$$$C$F$b<B%[%9%H$G%"%I%l%9$b>e$N(B
$B5R@h$HF1$8$b$N$K$7$F$$$^$7$?!#3F%M%C%H%o!<%/%"%I%l%9$bF1MM$G$9!#(B


>
>
>> $B6qBNE*$K$O(BhostM1$B$K$F(Bifdown eth2$B$H$9$k$H!"(Bcrm_mon$B$N2>A[.(BIP$BMs>e$G$O0l(B
>> $B=V(BFATAL$B$HI=<($5$l$k$b$N$N!"%U%'!<%k%*!<%P!<$;$:(Bcrm_mon$B$G$O(BhostM1$B$N(B
>> $B$^$^$H$J$C$F$$$^$9!#$5$i$K$=$N;~E@$G(BhostM1$B$G(Bifconfig$B$9$k$H$=$N(Beth2
>> $B$K2>A[(BIP(aa.bb.cc.242/29)$B$,D>@\?6$i$l$F$7$^$C$F$$$^$9!#$5$i$K$=$N>u(B
>> $BBV$G(Bservice heartbeat stop$B$H$7$F(Bifconfig$B$G(BI/F$B$r3NG'$9$k$H!"(Beth1$B$N(BIP
>> $B%"%I%l%9$,A4$/?6$i$l$F$$$J$$>uBV$H$J$C$F$7$^$C$F$$$^$9!#(B
>
> $B$3$3$G!"(Bip addr show$B$7$?$H$-$O$$$+$,$G$7$g$&$+!#(B
> fail$BB&!"(Bfailover$BB&$G!"$b$7:9;Y$(L5$1$l$P!#(B
> $BAPJ}$N(Bhost$B$GDL>o;~!"IT6q9g;~$NN>J}$r$*4j$$$$$?$7$^$9!#(B
> $B!JE*30$l$+$b$7$l$^$;$s$,!K(B

$B:#!"5R@h$+$iE1<}$7$F$$$k$N$G3NG'$G$-$^$;$s!#Mh=5$^$?9T$/$N$G$=$N:]$K(B
$B3NG'$7$h$&$H;W$C$F$$$^$9!#(B

$B$?$@!"<B$O<j85$K$b$&#1%;%C%H;D$7$F$$$^$9!#%"%I%l%9BN7O$O<c430c$&$N(B
$B$G$9$,!"%M%C%H%o!<%/9=@.$O>e$HA4$/$$$C$7$g$G$9!#$?$@$3$l$b$-$A$s$HF0:n(B
$B$7$^$9$7!"$b$A$m$s5R@h$XG<F~$7$?$b$N$b$3$A$i$G$O$&$^$/F0:n$7$F$$$^$7$?!#(B

$B$=$N<j85$N4D6-$G!"(Bip addr show$B%3%^%s%I$rBG$C$F$_$^$7$?!#2TF/7O%[%9%H(B
$B$G$N(Bip addr show$B$G2>A[(BIP$B$,3NG'$G$-$k$s$G$9$M!#$9$_$^$;$s!"CN$j$^$;$s(B
$B$G$7$?!#$"$j$,$H$&$4$6$$$^$7$?!#(B


>
>
>> $B4N?4$N(B/var/log/ha-log$B$G$9$,$-$A$s$H<}=8$G$-$F$$$^$;$s$,!"(B
>> $B$3$N(Beth2$B$N%U%'!<%k%*!<%P!<<:GT$N8=>]$,=P$k:]$K$O(B
>> $B!!(Bget_failcount: ip-s $B1>!9(B
>> $B$N$h$&$J(BWARNING$B%a%C%;!<%8(B?$B$,=P$F$$$^$7$?!#(B
>
> $B>u67$r$_$k$?$a$K(Blog$B$O$=$l$J$j$KBg@Z$@$H9M$($F$*$j$^$7$F!"2DG=$G$7$?$i(B
> $BIT6q9gH/@8IU6a$N(Blog$B$rH4?h$7$F$$$?$@$1$k$H%R%s%H$,8+$D$+$k$+$b$7$l$^$;$s!#(B
>
> $BAG?M$J$N$GA4$/E*$O$:$l$J$3$H$r8@$C$F$$$?$i%O%:%+%7%$$N$G$9$,!"$h$m$7$1$l$P(B
> $B$42sEz$/$@$5$$!#(B

log$B$NDs<($NI,MW@-$K$D$$$F$bA4$/$*$C$7$c$kDL$j$G$9!#$3$A$i$3$=%O%:%+%7%$8B$j$G$9!#(B

$B<B$O(Bha-log$B$N%U%!%$%k$O(BhostM1,hostB1$B$H$b<hF@$7$F$$$^$9!#$7$+$75R@h$G;~4V$,$J$/(B
$B:xAn$7!"(BlinuxHA$B$^$o$j$N$$$m$s$J9`L\$NJQ99$r<!!9$H<B;\$7$?$?$a!"$I$N;v>]$,%m%0(B
$B$N$I$l$d$iBgJQ$o$+$j$K$/$/$J$C$F$$$?$N$G!"$"$($FE:IU$7$^$;$s$G$7$?!#(B

$B$=$l$G$b!"$A$g$C$H4hD%$C$F5-:\$7$F$_$^$9!#<j854D6-$K$F(Bifdown eth2$B$r<B;\$7$=$N(B
$B%m%0$N=PNO798~$r$D$+$_$^$7$?$N$G!"5R [at] h4D6%m%0$G$N4XO"$7$=$&$J2U=j$r0J9_$K<($7$F$_$^$9!#(B

$B>0!"K\%m%0$O<c43=$@5$7$F$$$^$9!#6qBNE*$K$O!"<j854D6-$H$N0c$&$H;W$o$l$k2U=j$K(B"<<point1>>",
"<<point2>>"$B$H$N#29T$rA^F~$7$^$7$?!#>0!"%[.%9%HL>$d%"%I%l%9$K$D$$$F$O>e$N?^$K9g$o$;$F(B
$BF?L>2=$7$F$$$^$9!#0J2<$G$9!#(B

---------------- ha-log failover err start -----------------------------
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_monitor_10000
(call=20, rc=7, cib-update=80, confirmed=false) not running
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_graph_event: Action ip-s_monitor_10000 arrived
after a completed transition
Nov 2 10:02:42 hostM1 crmd: [20254]: info: abort_transition_graph: process_graph_event:486 -
Triggered transition abort (complete=1, tag=lrm_rsc_op, id=ip-s_monitor_10000,
magic=0:7;4:5:0:db95561c-6613-41e2-b7a0-17b8701178c3, cib=0.7.38) : Inactive graph
Nov 2 10:02:42 hostM1 crmd: [20254]: WARN: update_failcount: Updating failcount for ip-s on hostm1
after failed monitor: rc=7 (update=value++, time=1320195762)
Nov 2 10:02:42 hostM1 attrd: [20253]: info: find_hash_entry: Creating hash entry for fail-count-ip-s
Nov 2 10:02:42 hostM1 attrd: [20253]: info: attrd_local_callback: Expanded fail-count-ip-s=value++ to 1
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_state_transition: State transition S_IDLE ->
S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph ]
Nov 2 10:02:42 hostM1 attrd: [20253]: info: attrd_trigger_update: Sending flush op to all hosts
for: fail-count-ip-s (1)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_state_transition: All 2 cluster nodes are eligible to
run resources.
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_pe_invoke: Query 81: Requesting the current CIB:
S_POLICY_ENGINE
Nov 2 10:02:42 hostM1 attrd: [20253]: info: attrd_perform_update: Sent update 44: fail-count-ip-s=1
Nov 2 10:02:42 hostM1 attrd: [20253]: info: find_hash_entry: Creating hash entry for last-failure-ip-s
Nov 2 10:02:42 hostM1 attrd: [20253]: info: attrd_trigger_update: Sending flush op to all hosts
for: last-failure-ip-s (1320195762)
Nov 2 10:02:42 hostM1 attrd: [20253]: info: attrd_perform_update: Sent update 47:
last-failure-ip-s=1320195762
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_pe_invoke_callback: Invoking the PE: query=81,
ref=pe_calc-dc-1320195762-53, seq=2, quorate=1
Nov 2 10:02:42 hostM1 crmd: [20254]: info: abort_transition_graph: te_update_diff:150 - Triggered
transition abort (complete=1, tag=nvpair,
id=status-396fe8ff-762a-41bb-a3e6-da8270fa144f-fail-count-ip-s, magic=NA, cib=0.7.39) : Transient
attribute: update
Nov 2 10:02:42 hostM1 crmd: [20254]: info: abort_transition_graph: te_update_diff:150 - Triggered
transition abort (complete=1, tag=nvpair,
id=status-396fe8ff-762a-41bb-a3e6-da8270fa144f-last-failure-ip-s, magic=NA, cib=0.7.40) : Transient
attribute: update
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_pe_invoke: Query 82: Requesting the current CIB:
S_POLICY_ENGINE
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_pe_invoke: Query 83: Requesting the current CIB:
S_POLICY_ENGINE
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: unpack_config: On loss of CCM Quorum: Ignore
Nov 2 10:02:42 hostM1 pengine: [20268]: info: unpack_config: Node scores: 'red' = -INFINITY,
'yellow' = 0, 'green' = 0
Nov 2 10:02:42 hostM1 pengine: [20268]: info: determine_online_status: Node hostm1 is online
Nov 2 10:02:42 hostM1 pengine: [20268]: info: determine_online_status: Node hostb1 is online
Nov 2 10:02:42 hostM1 pengine: [20268]: WARN: unpack_rsc_op: Processing failed op
ip-s_monitor_10000 on hostm1: not running (7)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: group_print: Resource Group: hoge-grp
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
ip-g#011(ocf::heartbeat:IPaddr2):#011Started hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
ip-s#011(ocf::heartbeat:IPaddr2):#011Started hostm1 FAILED
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
hoge-r#011(ocf::hoge:hoge_proxy.ra):#011Started hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: clone_print: Clone Set: clone_ping
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: short_print: Started: [ hostm1 hostb1 ]
Nov 2 10:02:42 hostM1 pengine: [20268]: info: get_failcount: ip-g has failed 1 times on hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: common_apply_stickiness: ip-g can fail 999999 more
times on hostm1 before being forced off
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: RecurringOp: Start recurring monitor (10s) for
ip-s on hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ip-g#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Recover resource ip-s#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Restart resource hoge-r#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:0#011(Started
hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:1#011(Started
hostb1)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: handle_response: pe_calc calculation
pe_calc-dc-1320195762-53 is obsolete
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_pe_invoke_callback: Invoking the PE: query=83,
ref=pe_calc-dc-1320195762-54, seq=2, quorate=1
Nov 2 10:02:42 hostM1 pengine: [20268]: info: process_pe_message: Transition 6: PEngine Input
stored in: /var/lib/pengine/pe-input-614.bz2
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: unpack_config: On loss of CCM Quorum: Ignore
Nov 2 10:02:42 hostM1 pengine: [20268]: info: unpack_config: Node scores: 'red' = -INFINITY,
'yellow' = 0, 'green' = 0
Nov 2 10:02:42 hostM1 pengine: [20268]: info: determine_online_status: Node hostm1 is online
Nov 2 10:02:42 hostM1 pengine: [20268]: info: determine_online_status: Node hostb1 is online
Nov 2 10:02:42 hostM1 pengine: [20268]: WARN: unpack_rsc_op: Processing failed op
ip-s_monitor_10000 on hostm1: not running (7)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: group_print: Resource Group: hoge-grp
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
ip-g#011(ocf::heartbeat:IPaddr2):#011Started hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
ip-s#011(ocf::heartbeat:IPaddr2):#011Started hostm1 FAILED
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: native_print:
hoge-r#011(ocf::hoge:hoge_proxy.ra):#011Started hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: clone_print: Clone Set: clone_ping
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: short_print: Started: [ hostm1 hostb1 ]
Nov 2 10:02:42 hostM1 pengine: [20268]: info: get_failcount: ip-g has failed 1 times on hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: common_apply_stickiness: ip-g can fail 999999 more
times on hostm1 before being forced off
Nov 2 10:02:42 hostM1 pengine: [20268]: info: get_failcount: ip-s has failed 1 times on hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: common_apply_stickiness: ip-s can fail 999999 more
times on hostm1 before being forced off
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: RecurringOp: Start recurring monitor (10s) for
ip-s on hostm1
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ip-g#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Recover resource ip-s#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Restart resource hoge-r#011(Started hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:0#011(Started
hostm1)
Nov 2 10:02:42 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:1#011(Started
hostb1)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_state_transition: State transition S_POLICY_ENGINE ->
S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Nov 2 10:02:42 hostM1 crmd: [20254]: info: unpack_graph: Unpacked transition 7: 11 actions in 11
synapses
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_te_invoke: Processing graph 7
(ref=pe_calc-dc-1320195762-54) derived from /var/lib/pengine/pe-input-615.bz2
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 18 fired and confirmed
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 14: stop hoge-r_stop_0
on hostm1 (local)
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: cancel_op: operation monitor[22] on
ocf::hoge_proxy.ra::hoge-r for client 20254, its parameters: CRM_meta_name=[monitor]
crm_feature_set=[3.0.1] CRM_meta_on_fail=[restart] CRM_meta_interval=[15000]
CRM_meta_timeout=[20000] cancelled
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=14:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_stop_0 )
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: rsc:hoge-r:23: stop
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_monitor_15000
(call=22, status=1, cib-update=0, confirmed=true) Cancelled
Nov 2 10:02:42 hostM1 hoge_proxy.ra[22228]: INFO: Stopping hoge_proxy ...
Nov 2 10:02:42 hostM1 pengine: [20268]: info: process_pe_message: Transition 7: PEngine Input
stored in: /var/lib/pengine/pe-input-615.bz2
Nov 2 10:02:42 hostM1 hoge_proxy.ra[22228]: INFO: hoge_proxy stopped
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_stop_0 (call=23,
rc=0, cib-update=84, confirmed=true) ok
Nov 2 10:02:42 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_stop_0 (14) confirmed
on hostm1 (rc=0)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 4: stop ip-s_stop_0 on
hostm1 (local)
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: cancel_op: operation monitor[20] on ocf::IPaddr2::ip-s
for client 20254, its parameters: CRM_meta_name=[monitor] cidr_netmask=[29] crm_feature_set=[3.0.1]
CRM_meta_timeout=[20000] CRM_meta_interval=[10000] nic=[eth2] ip=[aa.bb.cc.242] cancelled
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=4:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_stop_0 )
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: rsc:ip-s:24: stop
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_monitor_10000
(call=20, status=1, cib-update=0, confirmed=true) Cancelled
Nov 2 10:02:42 hostM1 IPaddr2[22259]: INFO: IP status = no, IP_CIP=
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_stop_0 (call=24,
rc=0, cib-update=85, confirmed=true) ok
Nov 2 10:02:42 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_stop_0 (4) confirmed on
hostm1 (rc=0)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 19 fired and confirmed
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 7 fired and confirmed
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 16 fired and confirmed
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 13: start ip-s_start_0
on hostm1 (local)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=13:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_start_0 )
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: rsc:ip-s:25: start
Nov 2 10:02:42 hostM1 IPaddr2[22296]: INFO: ip -f inet addr add aa.bb.cc.242/29 brd aa.bb.cc.247
dev eth2
Nov 2 10:02:42 hostM1 IPaddr2[22296]: INFO: ip link set eth2 up
Nov 2 10:02:42 hostM1 IPaddr2[22296]: INFO: /usr/lib64/heartbeat/send_arp -i 200 -r 5 -p
/var/run/heartbeat/rsctmp/send_arp-aa.bb.cc.242 eth2 aa.bb.cc.242 auto not_used not_used
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_start_0 (call=25,
rc=0, cib-update=86, confirmed=true) ok
Nov 2 10:02:42 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_start_0 (13) confirmed on
hostm1 (rc=0)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 3: monitor
ip-s_monitor_10000 on hostm1 (local)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=3:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_monitor_10000 )
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: rsc:ip-s:26: monitor
Nov 2 10:02:42 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 15: start
hoge-r_start_0 on hostm1 (local)
Nov 2 10:02:42 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=15:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_start_0 )
Nov 2 10:02:42 hostM1 lrmd: [20251]: info: rsc:hoge-r:27: start
Nov 2 10:02:42 hostM1 hoge_proxy.ra[22355]: INFO: Starting hoge_proxy ...
Nov 2 10:02:42 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_monitor_10000
(call=26, rc=0, cib-update=87, confirmed=false) ok
Nov 2 10:02:42 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_monitor_10000 (3)
confirmed on hostm1 (rc=0)
Nov 2 10:02:43 hostM1 hoge_proxy.ra[22355]: INFO: hoge_proxy started
Nov 2 10:02:43 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_start_0
(call=27, rc=0, cib-update=88, confirmed=true) ok
Nov 2 10:02:43 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_start_0 (15) confirmed
on hostm1 (rc=0)
Nov 2 10:02:43 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 17 fired and confirmed
Nov 2 10:02:43 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 5: monitor
hoge-r_monitor_15000 on hostm1 (local)
Nov 2 10:02:43 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=5:7:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_monitor_15000 )
Nov 2 10:02:43 hostM1 lrmd: [20251]: info: rsc:hoge-r:28: monitor
Nov 2 10:02:43 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_monitor_15000
(call=28, rc=0, cib-update=89, confirmed=false) ok
Nov 2 10:02:43 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_monitor_15000 (5)
confirmed on hostm1 (rc=0)
Nov 2 10:02:43 hostM1 crmd: [20254]: info: run_graph:
====================================================
Nov 2 10:02:43 hostM1 crmd: [20254]: notice: run_graph: Transition 7 (Complete=11, Pending=0,
Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pengine/pe-input-615.bz2): Complete
Nov 2 10:02:43 hostM1 crmd: [20254]: info: te_graph_trigger: Transition 7 is now complete
Nov 2 10:02:43 hostM1 crmd: [20254]: info: notify_crmd: Transition 7 status: done - <null>
Nov 2 10:02:43 hostM1 crmd: [20254]: info: do_state_transition: State transition
S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 2 10:02:43 hostM1 crmd: [20254]: info: do_state_transition: Starting PEngine Recheck Timer
<<point1>>
Nov 2 10:02:45 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.241 is unreachable (read)
Nov 2 10:02:46 hostM1 pingd: [20434]: info: ping_read: Retrying...
Nov 2 10:02:46 hostM1 lrmd: [20251]: info: RA output: (ip-s:start:stderr) ARPING aa.bb.cc.242 from
aa.bb.cc.242 eth2#012Sent 5 probes (5 broadcast(s))#012Received 0 response(s)
Nov 2 10:04:01 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:02 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:03 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:04 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:05 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:06 hostM1 attrd: [20253]: info: attrd_trigger_update: Sending flush op to all hosts
for: default_ping_set (200)
Nov 2 10:04:06 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
Nov 2 10:04:07 hostM1 attrd: [20253]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:04:07 hostM1 attrd: [20253]: info: attrd_perform_update: Sent update 49: default_ping_set=200
Nov 2 10:04:07 hostM1 crmd: [20254]: info: abort_transition_graph: te_update_diff:150 - Triggered
transition abort (complete=1, tag=nvpair,
id=status-396fe8ff-762a-41bb-a3e6-da8270fa144f-default_ping_set, magic=NA, cib=0.7.47) : Transient
attribute: update
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_state_transition: State transition S_IDLE ->
S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph ]
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_state_transition: All 2 cluster nodes are eligible to
run resources.
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_pe_invoke: Query 90: Requesting the current CIB:
S_POLICY_ENGINE
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_pe_invoke_callback: Invoking the PE: query=90,
ref=pe_calc-dc-1320195847-61, seq=2, quorate=1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: unpack_config: On loss of CCM Quorum: Ignore
Nov 2 10:04:07 hostM1 pengine: [20268]: info: unpack_config: Node scores: 'red' = -INFINITY,
'yellow' = 0, 'green' = 0
Nov 2 10:04:07 hostM1 pengine: [20268]: info: determine_online_status: Node hostm1 is online
Nov 2 10:04:07 hostM1 pengine: [20268]: info: determine_online_status: Node hostb1 is online
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: group_print: Resource Group: hoge-grp
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: native_print:
ip-g#011(ocf::heartbeat:IPaddr2):#011Started hostm1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: native_print:
ip-s#011(ocf::heartbeat:IPaddr2):#011Started hostm1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: native_print:
hoge-r#011(ocf::hoge:hoge_proxy.ra):#011Started hostm1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: clone_print: Clone Set: clone_ping
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: short_print: Started: [ hostm1 hostb1 ]
Nov 2 10:04:07 hostM1 pengine: [20268]: info: get_failcount: ip-g has failed 1 times on hostm1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: common_apply_stickiness: ip-g can fail 999999 more
times on hostm1 before being forced off
Nov 2 10:04:07 hostM1 pengine: [20268]: info: get_failcount: ip-s has failed 1 times on hostm1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: common_apply_stickiness: ip-s can fail 999999 more
times on hostm1 before being forced off
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: RecurringOp: Start recurring monitor (10s) for
ip-g on hostb1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: RecurringOp: Start recurring monitor (10s) for
ip-s on hostb1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: RecurringOp: Start recurring monitor (15s) for
hoge-r on hostb1
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: LogActions: Move resource ip-g#011(Started
hostm1 -> hostb1)
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: LogActions: Move resource ip-s#011(Started
hostm1 -> hostb1)
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: LogActions: Move resource hoge-r#011(Started
hostm1 -> hostb1)
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:0#011(Started
hostm1)
Nov 2 10:04:07 hostM1 pengine: [20268]: notice: LogActions: Leave resource ping-r:1#011(Started
hostb1)
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_state_transition: State transition S_POLICY_ENGINE ->
S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
Nov 2 10:04:07 hostM1 crmd: [20254]: info: unpack_graph: Unpacked transition 8: 14 actions in 14
synapses
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_te_invoke: Processing graph 8
(ref=pe_calc-dc-1320195847-61) derived from /var/lib/pengine/pe-input-616.bz2
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 21 fired and confirmed
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 16: stop hoge-r_stop_0
on hostm1 (local)
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: cancel_op: operation monitor[28] on
ocf::hoge_proxy.ra::hoge-r for client 20254, its parameters: CRM_meta_name=[monitor]
crm_feature_set=[3.0.1] CRM_meta_on_fail=[restart] CRM_meta_interval=[15000]
CRM_meta_timeout=[20000] cancelled
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=16:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_stop_0 )
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: rsc:hoge-r:29: stop
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_monitor_15000
(call=28, status=1, cib-update=0, confirmed=true) Cancelled
Nov 2 10:04:07 hostM1 hoge_proxy.ra[23276]: INFO: Stopping hoge_proxy ...
Nov 2 10:04:07 hostM1 pengine: [20268]: info: process_pe_message: Transition 8: PEngine Input
stored in: /var/lib/pengine/pe-input-616.bz2
Nov 2 10:04:07 hostM1 hoge_proxy.ra[23276]: INFO: hoge_proxy stopped
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation hoge-r_stop_0 (call=29,
rc=0, cib-update=91, confirmed=true) ok
Nov 2 10:04:07 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_stop_0 (16) confirmed
on hostm1 (rc=0)
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 13: stop ip-s_stop_0
on hostm1 (local)
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: cancel_op: operation monitor[26] on ocf::IPaddr2::ip-s
for client 20254, its parameters: CRM_meta_name=[monitor] cidr_netmask=[29] crm_feature_set=[3.0.1]
CRM_meta_timeout=[20000] CRM_meta_interval=[10000] nic=[eth2] ip=[aa.bb.cc.242] cancelled
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=13:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_stop_0 )
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: rsc:ip-s:30: stop
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_monitor_10000
(call=26, status=1, cib-update=0, confirmed=true) Cancelled
Nov 2 10:04:07 hostM1 IPaddr2[23307]: INFO: IP status = ok, IP_CIP=
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-s_stop_0 (call=30,
rc=0, cib-update=92, confirmed=true) ok
Nov 2 10:04:07 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_stop_0 (13) confirmed on
hostm1 (rc=0)
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 10: stop ip-g_stop_0
on hostm1 (local)
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: cancel_op: operation monitor[18] on ocf::IPaddr2::ip-g
for client 20254, its parameters: CRM_meta_name=[monitor] cidr_netmask=[28] crm_feature_set=[3.0.1]
CRM_meta_timeout=[20000] CRM_meta_interval=[10000] nic=[eth0] ip=[aa.bb.cc.201] cancelled
Nov 2 10:04:07 hostM1 crmd: [20254]: info: do_lrm_rsc_op: Performing
key=10:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-g_stop_0 )
Nov 2 10:04:07 hostM1 lrmd: [20251]: info: rsc:ip-g:31: stop
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-g_monitor_10000
(call=18, status=1, cib-update=0, confirmed=true) Cancelled
Nov 2 10:04:07 hostM1 IPaddr2[23347]: INFO: IP status = ok, IP_CIP=
Nov 2 10:04:07 hostM1 crmd: [20254]: info: process_lrm_event: LRM operation ip-g_stop_0 (call=31,
rc=0, cib-update=93, confirmed=true) ok
Nov 2 10:04:07 hostM1 crmd: [20254]: info: match_graph_event: Action ip-g_stop_0 (10) confirmed on
hostm1 (rc=0)
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 22 fired and confirmed
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 6 fired and confirmed
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 19 fired and confirmed
Nov 2 10:04:07 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 11: start ip-g_start_0
on hostb1
<<point2>>
Nov 2 10:04:07 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:07 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.241 is unreachable (write)
Nov 2 10:04:08 hostM1 crmd: [20254]: info: match_graph_event: Action ip-g_start_0 (11) confirmed on
hostb1 (rc=0)
Nov 2 10:04:08 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 12: monitor
ip-g_monitor_10000 on hostb1
Nov 2 10:04:08 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 14: start ip-s_start_0
on hostb1
Nov 2 10:04:08 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:08 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.241 is unreachable (write)
Nov 2 10:04:09 hostM1 crmd: [20254]: info: match_graph_event: Action ip-g_monitor_10000 (12)
confirmed on hostb1 (rc=0)
Nov 2 10:04:09 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_start_0 (14) confirmed on
hostb1 (rc=0)
Nov 2 10:04:09 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 15: monitor
ip-s_monitor_10000 on hostb1
Nov 2 10:04:09 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 17: start
hoge-r_start_0 on hostb1
Nov 2 10:04:09 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:09 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.193 is unreachable (write)
Nov 2 10:04:10 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:10 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.193 is unreachable (write)
Nov 2 10:04:11 hostM1 crmd: [20254]: info: match_graph_event: Action ip-s_monitor_10000 (15)
confirmed on hostb1 (rc=0)
Nov 2 10:04:11 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_start_0 (17) confirmed
on hostb1 (rc=0)
Nov 2 10:04:11 hostM1 crmd: [20254]: info: te_pseudo_action: Pseudo action 20 fired and confirmed
Nov 2 10:04:11 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 18: monitor
hoge-r_monitor_15000 on hostb1
Nov 2 10:04:11 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:11 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (write)
Nov 2 10:04:12 hostM1 crmd: [20254]: info: match_graph_event: Action hoge-r_monitor_15000 (18)
confirmed on hostb1 (rc=0)
Nov 2 10:04:12 hostM1 crmd: [20254]: info: run_graph:
====================================================
Nov 2 10:04:12 hostM1 crmd: [20254]: notice: run_graph: Transition 8 (Complete=14, Pending=0,
Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pengine/pe-input-616.bz2): Complete
Nov 2 10:04:12 hostM1 crmd: [20254]: info: te_graph_trigger: Transition 8 is now complete
Nov 2 10:04:12 hostM1 crmd: [20254]: info: notify_crmd: Transition 8 status: done - <null>
Nov 2 10:04:12 hostM1 crmd: [20254]: info: do_state_transition: State transition
S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
Nov 2 10:04:12 hostM1 crmd: [20254]: info: do_state_transition: Starting PEngine Recheck Timer
Nov 2 10:04:12 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
Nov 2 10:04:12 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (write)
Nov 2 10:04:12 hostM1 attrd: [20253]: info: attrd_trigger_update: Sending flush op to all hosts
for: default_ping_set (100)
Nov 2 10:04:13 hostM1 attrd: [20253]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:04:13 hostM1 attrd: [20253]: info: attrd_perform_update: Sent update 51: default_ping_set=100
Nov 2 10:04:13 hostM1 crmd: [20254]: info: abort_transition_graph: te_update_diff:150 - Triggered
transition abort (complete=1, tag=nvpair,
id=status-396fe8ff-762a-41bb-a3e6-da8270fa144f-default_ping_set, magic=NA, cib=0.7.57) : Transient
attribute: update
Nov 2 10:04:13 hostM1 crmd: [20254]: info: do_state_transition: State transition S_IDLE ->
S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph ]

---------------- ha-log failover err end -----------------------------

$B"#>e$N%m%0$K$D$$$F(B
$B!&(B<<point1>>$B$K$D$$$F(B
>Nov 2 10:04:01 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
>Nov 2 10:04:02 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
>...
$B$3$3$G!"(BNode dd.ee.ff.241$B$KBP$7$F(Bping$B$,(Bunreachable(read)$B$K$J$C$F$$$^$9!#(B
$B<j854D6-$G$O$3$N$h$&$J%a%C%;!<%8$O=P$F$$$^$;$s!#(B

$B!&(B<<point2>>$B$K$D$$$F(B
>Nov 2 10:04:07 hostM1 pingd: [20434]: WARN: ping_write: Wrote -1 of 39 chars: Network is
unreachable (101)
>Nov 2 10:04:07 hostM1 pingd: [20434]: info: stand_alone_ping: Node aa.bb.cc.241 is unreachable (write)
>Nov 2 10:04:08 hostM1 crmd: [20254]: info: match_graph_event: Action ip-g_start_0 (11) confirmed
on hostb1 (rc=0)
>Nov 2 10:04:08 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 12: monitor
ip-g_monitor_10000 on hostb1
>Nov 2 10:04:08 hostM1 crmd: [20254]: info: te_rsc_command: Initiating action 14: start
ip-s_start_0 on hostb1
$B$3$3$G$O(BNode aa.bb.cc.241$B$KBP$7$F(Bping$B$,(Bunreachable(read)$B$K$J$C$F$$$^$9!#0J9_!"$3$N%Q%?!<%s$r(B
$B7+$jJV$9%1!<%9$,B?$$$h$&$G$9!#(B


$B"#=j4I(B
$B$I$&$b%U%'!<%k%*!<%P$7$?8e$K(Bpingd$B$N08@h$K(Bping$B$,DL$i$J$/$J$C$F$$$k$h$&$J5$$,$7$^$9!#>/$J$/$H$b(B
eth2$B$N@h$G$"$k(BRouter1(aa.bb.cc.241)$B$K$D$$$FDL$i$J$$!"$H$$$&%1!<%9$,B?$$$h$&$G$9!#$A$J$_$K!"(B
heartbeat$B5/F0$7$J$$>uBV$G$N(Baa.bb.cc.241$B08$F$N(Bping$B$O!"DL$k$3$H$r3NG'$7$F$$$^$9!#$=$N:](Beth2$B0J30(B
$B$N(Bif$B$r(Bdown$B$5$;$F$N(Bping$B$b#O#K$G$7$?!#(Bheartbeat$B5/F08e$b(Bfailover$B$5$($J$1$l$PDL$j$^$9!#(B

$B0J9_!"$h$/$o$+$i$J$$$N$G>!<j$K?dB,$7$^$9$,!"(BlinuxHA$B$C$F%U%'!<%k%*!<%P$7$F2>A[(BIP$B$rJ];}$7$D$E$1(B
$B$F$b$=$N%Q%1%C%H$N(Bmac$B%"%I%l%9$OJQ$o$C$F$7$^$$!"$=$l$r(BRouter1$B$J$I$,7y$C$F(Bping$B$NJV;v$r(B($B$9$0$K$O!)(B)
$BJV$5$J$$!"$J$s$F$3$H(B($B$d [at] _D(B)$B$,$"$k$N$G$7$g$&$+!):#2s$N(BRouter1,2,3$B$O5R@h$N$b$N$J$N$G!"$A$g$C$H(B
$B:#$9$0$O3NG'$G$-$J$$$N$G$9$,!"$=$N(Bmac$B%"%I%l%9$+$i;!$9$k$K(BCisco$B$N%k!<%?$@$+%9%$%C%A$@$+$N$h$&$G$9!#(B

$B:#EY5R@h$K=P8~$$$?$H$-$K$G$b3NG'$7$^$9$,!"$J$K$+$3$&$$$C$??4Ev$?$j$,$"$l$P$465<x4j$$$^$9!#(B

$B$^$?!":#;W$($P!"(Bha.cf$B$G!V(Bauto_failback on$B!W$H$7$F$*$j!">u67$,$D$+$a$K$/$/$J$C$F$$$?$N$+$b$7$l(B
$B$^$;$s!#$3$l$r(Boff$B$H$7$F%U%'!<%k%P%C%/$J$7$G$d$C$F$_$h$&$+$H$b;W$C$F$$$^$9!#(B

$B0J>e$G$9!#$48!F$$$$?$@$1$k$H9,$$$G$9!#(B

---
yamamoto

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


takehiro.dreamizm at gmail

Nov 2, 2011, 9:50 PM

Post #2 of 5 (454 views)
Permalink
Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k(B$B%P%C%/$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F$7$^$&!#(B [In reply to]

$B;3K\MM!">>Eg$G$9!#(B

$B;d$O(BHA$B$r$[$H$s$IM}2r$7$F$$$J$$$N$G$9$,!"$$$m$$$m9M$($F$_$F$$$^$9!#(B


> $B$=$l$G$b!"$A$g$C$H4hD%$C$F5-:\$7$F$_$^$9!#<j854D6-$K$F(Bifdown eth2$B$r<B;\$7$=$N(B
> $B%m%0$N=PNO798~$r$D$+$_$^$7$?$N$G!"5R [at] h4D6%m%0$G$N4XO"$7$=$&$J2U=j$r0J9_$K<($7$F$_$^$9!#(B
ifdown$B$7$?$N$O(BhostM1$B$H$$$&$3$H$G$h$m$7$$$G$7$g$&$+!)(B


> $B"#>e$N%m%0$K$D$$$F(B
> $B!&(B<<point1>>$B$K$D$$$F(B
>>Nov 2 10:04:01 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
>>Nov 2 10:04:02 hostM1 pingd: [20434]: info: stand_alone_ping: Node dd.ee.ff.1 is unreachable (read)
>>...
> $B$3$3$G!"(BNode dd.ee.ff.241$B$KBP$7$F(Bping$B$,(Bunreachable(read)$B$K$J$C$F$$$^$9!#(B
> $B<j854D6-$G$O$3$N$h$&$J%a%C%;!<%8$O=P$F$$$^$;$s!#(B
ifdown eth2$B$7$F(Beth3$B$+$i$NJV;v$b$J$/$J$C$F$$$k$N$,IT;W5D$G$9$M!#(B

$B$U$H;W$C$?$N$G$9$,!"(BRouter 1$B$H(BRouter 2$B$OF1$8%M%C%H%o!<%/%;%0%a%s%H$J$N$G$9$M!#(B
VIP$B$H$=$N3MF@!"%Q%1%C%H$r$I$N%$%s%?!<%U%'%$%9$+$i=P$9$+!"(Broute$B$J$I$G%7%9%F%`$,:.Mp$7$F$$$k$N$+$b(B
$B$7$l$^$;$s!JCN<1$NN"IU$1$,;d$K$O$"$j$^$;$s!K!#(B


> $B"#=j4I(B
> $B$I$&$b%U%'!<%k%*!<%P$7$?8e$K(Bpingd$B$N08@h$K(Bping$B$,DL$i$J$/$J$C$F$$$k$h$&$J5$$,$7$^$9!#>/$J$/$H$b(B
> eth2$B$N@h$G$"$k(BRouter1(aa.bb.cc.241)$B$K$D$$$FDL$i$J$$!"$H$$$&%1!<%9$,B?$$$h$&$G$9!#$A$J$_$K!"(B
> heartbeat$B5/F0$7$J$$>uBV$G$N(Baa.bb.cc.241$B08$F$N(Bping$B$O!"DL$k$3$H$r3NG'$7$F$$$^$9!#$=$N:](Beth2$B0J30(B
> $B$N(Bif$B$r(Bdown$B$5$;$F$N(Bping$B$b#O#K$G$7$?!#(Bheartbeat$B5/F08e$b(Bfailover$B$5$($J$1$l$PDL$j$^$9!#(B
ifdown eth2$B$r(BhostM1$B$G$d$C$F$$$?$i!"(BhostM1$B$+$i(BRouter1$B$K$OE~C#$G$-$J$$$N$G$O!"$H$$$&$N$O(B
$B;d$N4*0c$$$G$7$g$&$+!#(B


> $B0J9_!"$h$/$o$+$i$J$$$N$G>!<j$K?dB,$7$^$9$,!"(BlinuxHA$B$C$F%U%'!<%k%*!<%P$7$F2>A[(BIP$B$rJ];}$7$D$E$1(B
> $B$F$b$=$N%Q%1%C%H$N(Bmac$B%"%I%l%9$OJQ$o$C$F$7$^$$!"$=$l$r(BRouter1$B$J$I$,7y$C$F(Bping$B$NJV;v$r(B($B$9$0$K$O!)(B)
> $BJV$5$J$$!"$J$s$F$3$H(B($B$d [at] _D(B)$B$,$"$k$N$G$7$g$&$+!):#2s$N(BRouter1,2,3$B$O5R@h$N$b$N$J$N$G!"$A$g$C$H(B
> $B:#$9$0$O3NG'$G$-$J$$$N$G$9$,!"$=$N(Bmac$B%"%I%l%9$+$i;!$9$k$K(BCisco$B$N%k!<%?$@$+%9%$%C%A$@$+$N$h$&$G$9!#(B
IPaddr2$B$N(BRA$B$O(BARPing$B$7$F$$$k$h$&$G$9$N$G!"(BRouter 1$B$dESCf$N(BSwitch$B$,(BMACtable$B$r(B($B4|8B@Z$l$J$I$G(B)
$BGK4~$7$F$$$J$1$l$P!"LdBj$J$$$H$O;W$$$^$9!#(B
ICMP$B$r(BReal IP$B$G$b(BVirtual IP$B$G$bGK4~$7$F$$$k$h$&$K$O8+$($^$;$s$7!&!&!&(B

$B2?$+;W$$=P$7$?$i$*CN$i$;$$$?$7$^$9!#(B

----
$B>>Eg(B

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


wyama at kke

Nov 3, 2011, 12:09 AM

Post #3 of 5 (454 views)
Permalink
Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k%P%C%/(B$B$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F(B$B$7$^$&!#(B [In reply to]

$B>>EgMM(B
$B;3K\$G$9!#(B

$B$42sEz$"$j$,$H$&$4$6$$$^$9!#(B

> ifdown$B$7$?$N$O(BhostM1$B$H$$$&$3$H$G$h$m$7$$$G$7$g$&$+!)(B
$B$=$&$G$9!#(B


> $B$U$H;W$C$?$N$G$9$,!"(BRouter 1$B$H(BRouter 2$B$OF1$8%M%C%H%o!<%/%;%0%a%s%H$J$N$G$9$M!#(B
> VIP$B$H$=$N3MF@!"%Q%1%C%H$r$I$N%$%s%?!<%U%'%$%9$+$i=P$9$+!"(Broute$B$J$I$G%7%9%F%`$,:.Mp$7$F$$$k$N$+$b(B
> $B$7$l$^$;$s!JCN<1$NN"IU$1$,;d$K$O$"$j$^$;$s!K!#(B
$B$9$_$^$;$s!#%M%C%H%o!<%/%"%I%l%9$r5-=R$7$F$$$^$;$s$G$7$?!#(B
$BF10l%;%0%a%s%H$G$O$"$j$^$;$s!#(B
$B!&(BRouter1$BB&$N%M%C%H%o!<%/%"%I%l%9$O!"(Baa.bb.cc.240/29
$B!&(BRouter2$BB&$N%M%C%H%o!<%/%"%I%l%9$O!"(Baa.bb.cc.192/28
$B$H$J$j$^$9!#(B

$B?^$K$bDI5-$7$F$_$^$7$?!#2?EY$b$9$_$^$;$s!#(B

+---------+
| Router1 |
+---------+
(aa.bb.cc.241/29)
|
netaddr |
(aa.bb.cc.240/29) |
|
+---------$B2>A[(BIP1------+
| (aa.bb.cc.242/29) |
| |
| |
eth2 eth2
(aa.bb.cc.243/29) (aa.bb.cc.244/29)
+--------+ +--------+ (dd.ee.ff.1/28)
eth3| |eth1 eth1| |eth3 +---------+
+-----| hostM1 |-------------| hostB1 |--------+----| |
| | | | | | | Router3 |
| +--------+ +--------+ | | |
| (aa.bb.cc.202/28) (aa.bb.cc.203/28) | +---------+
| eth0 eth0 |
| | | |
| | (aa.bb.cc.201/28) | |
| +---------$B2>A[(BIP2------+ |
| | |
| netaddr | | netaddr
| (aa.bb.cc.192/28)| |(dd.ee.ff.0/28)
| | |
| (aa.bb.cc.193/28) |
| +---------+ |
| | Router2 | |
| +---------+ |
| |
+----------------------------------------------+

$B2>$K!"(Baa.bb.cc$B$,(B1.2.3$B$@$H$9$k$H!"(B

$B!&(BRouter1$BB&$N%M%C%H%o!<%/%"%I%l%9$H;HMQ2DG=$J(BIP$B%"%I%l%9$O!"(B
1.2.3.240/29$B!"(B1.2.3.241$B!A(B1.2.3.246

$B!&(BRouter2$BB&$N%M%C%H%o!<%/%"%I%l%9$H;HMQ2DG=$J(BIP$B%"%I%l%9$O!"(B
1.2.3.192/28$B!"(B1.2.3.192.193$B!A(B1.2.3.192.206

$B$H$J$j!"LdBj$J$$$h$&$K;W$($^$9!#(B($B;d!"$3$N7W;;6l<j$J$N$G$9$,(B)

$B<B$OEv=i!"$*5RMM$+$i(BRouter1,2$B$rF10l%M%C%H%o!<%/%"%I%l%9$G(B
$B$d$C$F$[$7$$$H$NMWK>$,$"$j$^$7$?!#0l1~$=$l$G$bF0$$$?$N$r>/$7$@$1(B
$B3NG'$7$?$N$G$9$,!"$d$C$Q$j%j%9%-!<$K;W$($?$N$G!"$3$N$h$&$K%;%0%a%s%H(B
$B$r$o$1$F$b$C$?<!Bh$G$9!#%;%0%a%s%H$NCM$O$*5RMM$,7h$a$^$7$?!#(B

$B$A$J$_$K:#2s$NLdBj$,H/@8$7$?;~$K!"2>A[(BIP2$B$r$D$V$7$F2>A[(BIP1$B$@$1$K(B
$B$7$F$b!"F1$88=>]$G$7$?!#(B


> ifdown eth2$B$7$F(Beth3$B$+$i$NJV;v$b$J$/$J$C$F$$$k$N$,IT;W5D$G$9$M!#(B
$B$=$&$J$s$G$9!#(B


> ifdown eth2$B$r(BhostM1$B$G$d$C$F$$$?$i!"(BhostM1$B$+$i(BRouter1$B$K$OE~C#$G$-$J$$$N$G$O!"(B
$B$&$&$%!"$^$5$7$/$=$NDL$j$G$9!#$3$3$+$i$N(BRouter1$B$H$N(Bping$B$NMM;R$O!"(BhostM1$B$G$J$/(B
$B%U%'!<%k%*!<%P@h$N(BhostB1$B$N%m%0$r;2>H$9$Y$-$G$7$?!*Bg4VH4$1$G$9$_$^$;$s!*(B
$B0J9_$K%U%'!<%k%*!<%PD>8e$H;W$o$l$k(BhostB1$B$N%m%0$r<($7$^$9!#Nc$K$h$C$FF?L>2=$7$F!"(B
<<point b1>>,<<point b2>>,<<point b3>>$B$rF~$l$F$$$^$9!#(B

-------- ha-log failover err @ hostB1 start -----------
Nov 2 10:02:42 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:02:42 hostB1 attrd: [30546]: info: find_hash_entry: Creating hash entry for fail-count-ip-s
Nov 2 10:02:42 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:02:42 hostB1 attrd: [30546]: info: find_hash_entry: Creating hash entry for last-failure-ip-s
Nov 2 10:04:07 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:04:07 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=11:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-g_start_0 )
Nov 2 10:04:07 hostB1 lrmd: [30544]: info: rsc:ip-g:8: start
<<point b1>>
Nov 2 10:04:07 hostB1 IPaddr2[30799]: INFO: ip -f inet addr add aa.bb.cc.201/28 brd aa.bb.cc.207
dev eth0
Nov 2 10:04:07 hostB1 IPaddr2[30799]: INFO: ip link set eth0 up
Nov 2 10:04:07 hostB1 IPaddr2[30799]: INFO: /usr/lib64/heartbeat/send_arp -i 200 -r 5 -p
/var/run/heartbeat/rsctmp/send_arp-aa.bb.cc.201 eth0 aa.bb.cc.201 auto not_used not_used
Nov 2 10:04:07 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation ip-g_start_0 (call=8,
rc=0, cib-update=20, confirmed=true) ok
Nov 2 10:04:08 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=12:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-g_monitor_10000 )
Nov 2 10:04:08 hostB1 lrmd: [30544]: info: rsc:ip-g:9: monitor
Nov 2 10:04:08 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=14:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_start_0 )
Nov 2 10:04:08 hostB1 lrmd: [30544]: info: rsc:ip-s:10: start
Nov 2 10:04:08 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation ip-g_monitor_10000
(call=9, rc=0, cib-update=21, confirmed=false) ok
<<point b2>>
Nov 2 10:04:08 hostB1 IPaddr2[30858]: INFO: ip -f inet addr add aa.bb.cc.242/29 brd aa.bb.cc.247
dev eth2
Nov 2 10:04:08 hostB1 IPaddr2[30858]: INFO: ip link set eth2 up
Nov 2 10:04:08 hostB1 IPaddr2[30858]: INFO: /usr/lib64/heartbeat/send_arp -i 200 -r 5 -p
/var/run/heartbeat/rsctmp/send_arp-aa.bb.cc.242 eth2 aa.bb.cc.242 auto not_used not_used
Nov 2 10:04:08 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation ip-s_start_0 (call=10,
rc=0, cib-update=22, confirmed=true) ok
Nov 2 10:04:09 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=15:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=ip-s_monitor_10000 )
Nov 2 10:04:09 hostB1 lrmd: [30544]: info: rsc:ip-s:11: monitor
Nov 2 10:04:09 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=17:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_start_0 )
Nov 2 10:04:09 hostB1 lrmd: [30544]: info: rsc:hoge-r:12: start
Nov 2 10:04:09 hostB1 hoge_proxy.ra[30942]: INFO: Starting hoge_proxy ...
Nov 2 10:04:09 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation ip-s_monitor_10000
(call=11, rc=0, cib-update=23, confirmed=false) ok
Nov 2 10:04:10 hostB1 hoge_proxy.ra[30942]: INFO: hoge_proxy started
Nov 2 10:04:10 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation hoge-r_start_0
(call=12, rc=0, cib-update=24, confirmed=true) ok
Nov 2 10:04:11 hostB1 crmd: [30547]: info: do_lrm_rsc_op: Performing
key=18:8:0:db95561c-6613-41e2-b7a0-17b8701178c3 op=hoge-r_monitor_15000 )
Nov 2 10:04:11 hostB1 lrmd: [30544]: info: rsc:hoge-r:13: monitor
Nov 2 10:04:11 hostB1 crmd: [30547]: info: process_lrm_event: LRM operation hoge-r_monitor_15000
(call=13, rc=0, cib-update=25, confirmed=false) ok
Nov 2 10:04:11 hostB1 lrmd: [30544]: info: RA output: (ip-g:start:stderr) ARPING aa.bb.cc.201 from
aa.bb.cc.201 eth0#012Sent 5 probes (5 broadcast(s))#012Received 0 response(s)
Nov 2 10:04:12 hostB1 lrmd: [30544]: info: RA output: (ip-s:start:stderr) ARPING aa.bb.cc.242 from
aa.bb.cc.242 eth2#012Sent 5 probes (5 broadcast(s))#012Received 0 response(s)
Nov 2 10:04:13 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:04:19 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
<<point b3>>
Nov 2 10:04:30 hostB1 heartbeat: [30529]: info: Link hostm1:eth3 dead.
Nov 2 10:04:41 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:06:38 hostB1 attrd: [30546]: info: attrd_ha_callback: flush message from hostm1
Nov 2 10:06:38 hostB1 crmd: [30547]: notice: crmd_client_status_callback: Status update: Client
hostm1/crmd now has status [offline] (DC=false)
Nov 2 10:06:38 hostB1 crmd: [30547]: info: crm_update_peer_proc: hostm1.crmd is now offline
Nov 2 10:06:38 hostB1 crmd: [30547]: info: crmd_client_status_callback: Got client status callback
- our DC is dead
Nov 2 10:06:38 hostB1 crmd: [30547]: info: do_state_transition: State transition S_NOT_DC ->
S_ELECTION [ input=I_ELECTION cause=C_CRMD_STATUS_CALLBACK origin=crmd_client_status_callback ]
Nov 2 10:06:38 hostB1 crmd: [30547]: info: update_dc: Unset DC hostm1
Nov 2 10:06:38 hostB1 cib: [30543]: info: cib_process_shutdown_req: Shutdown REQ from hostm1
Nov 2 10:06:38 hostB1 cib: [30543]: info: cib_process_request: Operation complete: op
cib_shutdown_req for section 'all' (origin=hostm1/hostm1/(null), version=0.7.62): ok (rc=0)
Nov 2 10:06:39 hostB1 crmd: [30547]: info: mem_handle_event: Got an event OC_EV_MS_NOT_PRIMARY from ccm
Nov 2 10:06:39 hostB1 crmd: [30547]: info: mem_handle_event: instance=2, nodes=2, new=2, lost=0,
n_idx=0, new_idx=0, old_idx=4
Nov 2 10:06:39 hostB1 crmd: [30547]: info: crmd_ccm_msg_callback: Quorum lost after event=NOT
PRIMARY (id=2)
Nov 2 10:06:39 hostB1 cib: [30543]: info: cib_client_status_callback: Status update: Client
hostm1/cib now has status [leave]
Nov 2 10:06:39 hostB1 cib: [30543]: info: crm_update_peer_proc: hostm1.cib is now offline
Nov 2 10:06:39 hostB1 cib: [30543]: info: mem_handle_event: Got an event OC_EV_MS_NOT_PRIMARY from ccm
Nov 2 10:06:39 hostB1 cib: [30543]: info: mem_handle_event: instance=2, nodes=2, new=2, lost=0,
n_idx=0, new_idx=0, old_idx=4
Nov 2 10:06:39 hostB1 cib: [30543]: info: cib_ccm_msg_callback: Processing CCM event=NOT PRIMARY (id=2)
Nov 2 10:06:49 hostB1 heartbeat: [30529]: info: killing /usr/lib64/heartbeat/crmd process group
30547 with signal 15
Nov 2 10:06:49 hostB1 crmd: [30547]: info: crm_signal_dispatch: Invoking handler for signal 15:
Terminated
Nov 2 10:06:49 hostB1 crmd: [30547]: info: crm_shutdown: Requesting shutdown
Nov 2 10:06:49 hostB1 crmd: [30547]: info: do_shutdown_req: Sending shutdown request to DC: <null>
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: quorum plugin: majority
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: cluster:linux-ha, member_count=1, member_quorum_votes=100
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: total_node_count=2, total_quorum_votes=200
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: quorum plugin: twonodes
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: cluster:linux-ha, member_count=1, member_quorum_votes=100
Nov 2 10:06:49 hostB1 ccm: [30542]: debug: total_node_count=2, total_quorum_votes=200
Nov 2 10:06:49 hostB1 ccm: [30542]: info: Break tie for 2 nodes cluster
Nov 2 10:06:49 hostB1 crmd: [30547]: info: mem_handle_event: Got an event OC_EV_MS_INVALID from ccm
Nov 2 10:06:49 hostB1 crmd: [30547]: info: mem_handle_event: no mbr_track info
Nov 2 10:06:49 hostB1 crmd: [30547]: info: mem_handle_event: Got an event OC_EV_MS_NEW_MEMBERSHIP
from ccm
Nov 2 10:06:49 hostB1 crmd: [30547]: info: mem_handle_event: instance=3, nodes=1, new=0, lost=1,
n_idx=0, new_idx=1, old_idx=3
Nov 2 10:06:49 hostB1 crmd: [30547]: info: crmd_ccm_msg_callback: Quorum (re)attained after
event=NEW MEMBERSHIP (id=3)
Nov 2 10:06:49 hostB1 crmd: [30547]: info: ccm_event_detail: NEW MEMBERSHIP: trans=3, nodes=1,
new=0, lost=1 n_idx=0, new_idx=1, old_idx=3
Nov 2 10:06:49 hostB1 crmd: [30547]: info: ccm_event_detail: #011CURRENT: hostb1 [nodeid=0, born=3]
Nov 2 10:06:49 hostB1 crmd: [30547]: info: ccm_event_detail: #011LOST: hostm1 [nodeid=1, born=1]
Nov 2 10:06:49 hostB1 crmd: [30547]: info: crm_update_peer: Node hostm1: id=1 state=lost (new)
addr=(null) votes=-1 born=1 seen=2 proc=00000000000000000000000000000002
Nov 2 10:06:49 hostB1 cib: [30543]: info: mem_handle_event: Got an event OC_EV_MS_INVALID from ccm
Nov 2 10:06:49 hostB1 cib: [30543]: info: mem_handle_event: no mbr_track info
Nov 2 10:06:49 hostB1 cib: [30543]: info: mem_handle_event: Got an event OC_EV_MS_NEW_MEMBERSHIP
from ccm
Nov 2 10:06:49 hostB1 cib: [30543]: info: mem_handle_event: instance=3, nodes=1, new=0, lost=1,
n_idx=0, new_idx=1, old_idx=3
Nov 2 10:06:49 hostB1 cib: [30543]: info: cib_ccm_msg_callback: Processing CCM event=NEW MEMBERSHIP
(id=3)
Nov 2 10:06:49 hostB1 crmd: [30547]: info: do_state_transition: State transition S_ELECTION ->
S_INTEGRATION [ input=I_ELECTION_DC cause=C_FSA_INTERNAL origin=do_election_check ]
Nov 2 10:06:49 hostB1 cib: [30543]: info: crm_update_peer: Node hostm1: id=1 state=lost (new)
addr=(null) votes=-1 born=1 seen=2 proc=00000000000000000000000000000202
Nov 2 10:06:49 hostB1 crmd: [30547]: info: do_te_control: Registering TE UUID:
91485bad-0cc7-4a69-8dc5-6b3880de3067
Nov 2 10:06:49 hostB1 crmd: [30547]: info: set_graph_functions: Setting custom graph functions
Nov 2 10:06:49 hostB1 crmd: [30547]: info: unpack_graph: Unpacked transition -1: 0 actions in 0
synapses
Nov 2 10:06:49 hostB1 crmd: [30547]: info: start_subsystem: Starting sub-system "pengine"
Nov 2 10:06:49 hostB1 pengine: [32308]: info: Invoked: /usr/lib64/heartbeat/pengine
Nov 2 10:06:49 hostB1 pengine: [32308]: info: main: Starting pengine
-------- ha-log failover err @ hostB1 end -------------

$B3N$+$K!"(B<<point b1>>, <<point b2>>$B$G!"(Beth0,eth2$B$=$l$>$l$K(Bip$B3d$jEv$F$F(Barp
$B=P$7$F$^$9$M!#JY6/$K$J$j$^$9!#$d$C$Q$j$-$A$s$H%m%0$_$J$$$H$@$a$G$9$M!#(B
$B$"$j$,$H$&$4$6$$$^$9!#(B

$B$?$@(B<<point b3>>$B$G!"(B
>Nov 2 10:04:30 hostB1 heartbeat: [30529]: info: Link hostm1:eth3 dead.
$B$H(Bhostm1$B$N(Beth3$B$,;`$s$G$7$^$C$F$$$^$9!#@h$[.$I$N(BhostM1$B$N%m%0$d!">>Eg$5$s$N;XE&$K$b(B
$B$"$j$^$9$H$*$j!"$3$l$,860x$J$N$G$7$g$&$+!)$G$b%U%'!<%k%*!<%P$7$F$7$^$C$?(B
$B%[.%9%H$N(BI/F$B$J$N$G!"$3$l$O$3$l$G$b$h$$$b$N$J$N$+$J!)!)(B

$B$A$J$_$K!"?^$G$b<($7$F$$$k$h$&$K(Beth3$B%5%$%I$K$D$$$F$O$*5R$5$s$NMWK>$G2>A[.(BIP
$B$r@_$1$F$$$^$;$s!#$b$7$+$7$F2>A[.(BIP$B$OA4(BI/F$B$K [at] _D$7$F$*$$$?J}$,$h$$$N$+$J!)(B
$B$5$i$K(Bpingd$B$b$=$l$K9g$o$;$F$*$$$?$[.$&$,$h$$$N$+$J!)(B

$B$G$b!"$=$b$=$b(Beth0$B$rH4$$$?;~$K$O$-$l$$$K%U%'!<%k%*!<%P!<$7$F$$$k!#(Beth3$B$b(B
$BF1MM!#$J$<(Beth2$B$@$1$,!)!)(B

$B$J$s$@$+$h$/$o$+$i$J$/$J$C$F$-$^$7$?!#8=>l$G$-$A$s$H%m%0$r<h$jD>$9I,MW$,$"$k$H;W$$$^$9!#(B

$B$A$J$_$KF1%m%0$r(BERROR$B$G(Bgrep$B$9$k$H0J2<$N$h$&$J$b$N$,=P$^$9!#(B
-----------------------------------------------------------------
Nov 2 10:23:09 hostB1 pengine: [4037]: ERROR: unpack_resources: Resource start-up disabled since no
STONITH resources have been defined
Nov 2 10:23:09 hostB1 pengine: [4037]: ERROR: unpack_resources: Either configure some or disable
STONITH with the stonith-enabled option
Nov 2 10:23:09 hostB1 pengine: [4037]: ERROR: unpack_resources: NOTE: Clusters with shared data
need STONITH to ensure data integrity
-----------------------------------------------------------------
STONITH$B$C$F$h$/$o$+$i$:@_Dj$bLLE]$G$7$?$N$G!";H$o$J$$$h$&@_Dj$7$F$$$?$H;W$$$^$9$,!"(B
$B$3$l$b$J$K$+4X78$"$j$=$&$G$7$g$&$+!#(B

$B$^$?$J$K$+;W$$=P$7$?;~$K$G$b9=$$$^$;$s!#$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$B0J>e$G$9!#(B

--
yamamoto

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


wyama at kke

Nov 4, 2011, 1:45 AM

Post #4 of 5 (429 views)
Permalink
Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k%P%C%/(B$B$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F(B$B$7$^$&!#(B [In reply to]

$B>>EgMM(B
$B;3K\$G$9!#(B

$B$*@$OC$K$J$C$F$$$^$9!#(B
$B$I$&$d$i2r7h$7$=$&$J46$8$G$9!#(B

$B$^$:!"(BhostM1$B$N(B/etc/ha.d/ha.cf$B$N0lIt$G$9!#(B
------------------
bcast eth1 eth3
node hostM1
node hostB1
------------------

$BN>%[%9%H$N(B/etc/hosts$B$K$F!"(BhostM1,hostB1$B$N%[%9%HL>$KBP$9$k%"%I%l%9$r!"(B
$B$=$l$>$l$N(Beth3$B$N%"%I%l%9!"$H$7$FDj5A$7$F$$$^$7$?!#0J2<$N?^$K$=$NCM$O(B
$B5-=R$7$F$$$J$$$N$G$9$,!#(B

> +---------+
> | Router1 |
> +---------+
> (aa.bb.cc.241/29)
> |
> netaddr |
> (aa.bb.cc.240/29) |
> |
> +---------$B2>A[(BIP1------+
> | (aa.bb.cc.242/29) |
> | |
> | |
> eth2 eth2
> (aa.bb.cc.243/29) (aa.bb.cc.244/29)
> +--------+ +--------+ (dd.ee.ff.1/28)
> eth3| |eth1 eth1| |eth3 +---------+
> +-----| hostM1 |-------------| hostB1 |--------+----| |
> | | | | | | | Router3 |
> | +--------+ +--------+ | | |
> | (aa.bb.cc.202/28) (aa.bb.cc.203/28) | +---------+
> | eth0 eth0 |
> | | | |
> | | (aa.bb.cc.201/28) | |
> | +---------$B2>A[(BIP2------+ |
> | | |
> | netaddr | | netaddr
> | (aa.bb.cc.192/28)| |(dd.ee.ff.0/28)
> | | |
> | (aa.bb.cc.193/28) |
> | +---------+ |
> | | Router2 | |
> | +---------+ |
> | |
> +----------------------------------------------+

$B$=$N>uBV$G(BLinuxHA$B$r9=C[$7F0$+$7$F$$$?$N$G$9$,!"$=$N8e!"$*5R$5$s$N(B
$BET9g$G(BhostB1$B$N(Beth3$B$N%"%I%l%9$,JQ99$K$J$j$^$7$?!#$7$+$7!"$=$NJQ99(B
$B$r(BhostM1$B$N(B/etc/hosts$B$KH?1G$9$k$N$rK:$l$F$$$k$h$&$G$9!*(B

$BK^%_%9$G$*A{$,$;$7$F?=$7Lu$4$6$$$^$;$s!#(B

$B$*$=$i$/$=$l$r@5$7$/H?1G$9$k$H!"$-$A$s$H%U%'!<%k%P%C%/$9$k$N$@$H(B
$B;W$$$^$9!#:#EY!"$*5R$5$s$N$H$3$m$X9T$C$?$i$5$C$=$/;n$9$D$b$j$G$9!#(B

$B$?$@!"8=:_$N$3$N>uBV$G$O!"(BhostM1$B$O(BhostB1$B$rCN$i$J$$$3$H$K$J$k(B($B [at] 53(B
$B$K$O(BhostM1$B$O(BhostB1$B$N%"%I%l%9$r4V0c$C$FGD0.$7$F$$$k(B)$B$N$G!"(BLinuxHA
$B<+BN$O!":G=i$+$i$&$^$/F0$+$J$/$J$k$h$&$J5$$,$7$^$9!#$7$+$7!"(B
eth0,eth3$B$N2>A[(BIP$B$N%U%'!<%k%*!<%P!<$O$&$^$/F0$$$F$$$^$7$?!#!#!#(B

$B5$;}$A0-$$$N$G!"<j;}$A$N4D6-$K$"$($F>e$N$h$&$K4V0c$C$?4D6-$r [at] _D(B
$B$7$FF0$+$7$F$_$^$7$?!#$3$l$K$h$C$F>c32$r:F8=$5$;$?$+$C$?$N$G$9$,!"(B
$B:F8=$;$:$-$A$s$HF0$-$^$7$?!#!#!#(Beth0,2,3$B$H$b$=$l$rH4$/$H$-$A$s$H(B
$B%U%'!<%k%*!<%P!<$7$^$7$?!#(B

bcast$B$G(Beth1,3$B$HFs$D;XDj$7$F$$$k$N$G!"$I$&$K$+4hD%$C$F$$$k$N$@$J$!(B
$B!<!"$H;W$$!":F8=$9$k$3$H$r$r$"$-$i$a!"85$N@5$7$$4D6-$KLa$7$?$H$3$m!"(B
$B$=$ND>8e$N0lEY$@$1:F8=$7$^$7$?!*!!$7$+$7!"(BOS$B%j%V!<%H$9$k$H:#$N$H$3$m(B
$B:F8=$7$J$/$J$j$^$7$?!#!#(B

$B85$N@5$7$$4D6-$KLa$7$?$i:F8=$7$?$@$J$s$F!"$J$s$@$+5$L#0-$$$N$G!"(B
$B$b$&>/$7F0:n3NG'$7$F$_$^$9!#(B

$B$A$J$_$K!"85$N4D6-$KLa$9$H$O!"%j%=!<%9 [at] _D$r$d$jD>$9$H$$$&$3$H$r(B
$B$d$C$F$$$^$9!#$=$N>l9g!";d$O0J2<$r<B9T$7$F$$$^$9!#(B

1.service heartbeat stop $B!JN>%N!<%I$G(B)
2.rm -f /var/lib/heartbeat/crm/* $B!JN>%N!<%I$G(B)
3.service heartbeat start $B!JN>%N!<%I$G(B)
4.crm_mon$B$GN>%N!<%I$,(Bonline$B$K$J$k$N$r3NG'(B $B!J$I$A$i$+%N!<%I$G(B)
5.crm configure load update hoge.crm $B!J2TF/Cf$N%N!<%I$G(B)

$B$$$m$$$m$J$d$jJ}$,$"$k$N$G$7$g$&$,!"$3$l$O$3$l$G$h$$$N$+$J!)!"(B
$B$3$NA`:n$@$1$G(BLinuxHA$B$N4D6-$O%/%j!<%s$9$k$N$+$J!)(BOS$B$N%j%V!<%H$O(B
$B$d$C$F$?J}$,L5Fq!)$4B8$8$NJ}$,$$$i$C$7$c$?$i$465<xD:$1$k$H=u$+(B
$B$j$^$9!#(B

$B$^$?!"(B5$B$N%3%^%s%IBG$C$F$$$F$o$+$C$?$N$G$9$,!"(B
crm configure load update
$B$@$1$G$J$/(B
crm configure load replace
$B$H$$$&%Q%i%a!<%?$b$"$k$N$G$9$M!#$3$N(Bupdate$B$H(Breplace$B$N:9$b$h$/$o$+(B
$B$C$F$$$^$;$s!#(B

$B0J>e$G$9!#(B

$B$$$:$l$K$;$h!":#2s$N7o$O!"$3$A$i$NK^%_%9$,1F6A5Z$\$7$F$$$k$3$H$O4V0c$$(B
$B$J$$$h$&$G$9!#>>EgMM$K$O$*;~4V$r<h$i$;$F$7$^$$!"BgJQ?=$7Lu$4$6$$$^$;$s(B
$B$G$7$?!#$^$?BgJQ$*@$OC$K$J$j$7$?!#$3$N>l$r<Z$j$F$*Ni$r?=$7>e$2$^$9!#(B

--
yamamoto

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


takehiro.dreamizm at gmail

Nov 6, 2011, 1:48 PM

Post #5 of 5 (420 views)
Permalink
Re: $B2>A[(BIP$B$,%U%'(B$B!<%k%*!<%P!<$9$k$b$9$0%U%'!<%k(B$B%P%C%/$7!"$5$i$K$=$l$,<B(B IP$B$H$J$C$F$7$^$&!#(B [In reply to]

$B;3K\MM(B
$B>>Eg$G$9!#CY$/$J$C$F$7$^$$?=$7Lu$4$6$$$^$;$s!#(B

ha.cf$B$NItJ,$G$9$,!"$I$&$9$k$N$,NI$$$N$G$7$g$&!#(B
$B7k6I!"(BBroadcast$B$G$d$j<h$j$r$7$F$$$k$N$G!"(B/etc/hosts$B$H$O$"$^$j4X78$,L5$$$h$&$J5$$,$7$F$$$^$9!#(B
$B$U$H(Biptables$B$GN>J}$N(Binterface$B$+$i$N(Bbroadcast$B$N=hM}$+$b!&!&!&$H$b;W$C$?$N$G$9$,(B
$B0lC6$O$&$^$/$$$C$F$$$k$N$G$*$=$i$/LdBj$O$J$$$N$G$7$g$&!#(B
$B$J$K$+!"O3$l$,$"$k$O$:$J$N$G$9$,!&!&!&(B

replace$B$H(Bupdate$B$K$D$$$F$G$9$,!"(BDocument(http://www.clusterlabs.org/doc/crm_cli.html)$B$K(B
"load

Load a part of configuration (or all of it) from a local file or a
network URL. The replace method replaces the current configuration
with the one from the source. The update tries to import the contents
into the current configuration. "
$B$H$$$&5-=R$,$"$j$^$7$?!#(B
$B!&(Breplace$B$O40A4$K [at] _D%U%!%$%k$GCV$-49$((B
$B!&(Bupdate$B$O8=:_$N [at] _D$r%Y!<%9$K!"@_Dj%U%!%$%k$NCf?H$rE,MQ$7$F$$$/(B
$B$H$$$&;v$N$h$&$G$9!#(B


$BB>$K$J$K$+;W$$$D$-$^$7$?$i$4O"Mm$$$?$7$^$9!#(B

----
$B>>Eg(B

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.