Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Japanese

Re: group内のリソースで発生した異常について

 

 

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded


renayama19661014 at ybb

Jan 18, 2012, 8:15 PM

Post #1 of 7 (239 views)
Permalink
Re: group内のリソースで発生した異常について

和田さん

こんにちは、山内です。

御連絡ありがとうございます。

確認して結果を御連絡します。

#似た事象が1.0.11では出ていますので、不具合を踏んでしまっている可能性が大きいです。

以上です。

--- On Thu, 2012/1/19, 和田 伸一朗 <wada.shinichiro [at] jp> wrote:

> 山内さん
>
> こんにちは。
> 和田です。
>
> いつもありがとうございます。
>
> バージョンは
> pacemaker-1.0.11-1.2.el6.x86_64
> corosync-1.3.1-1.1.el6.x86_64
> になります。
>
> http://sourceforge.jp/projects/linux-ha/releases/
> で公開されてる、
> pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
> に入っているものになります。
>
> 記憶が定かではないのですが、Scientific Linux 6.1にバンドルされているものでも
> 同様の事象が発生していたように記憶しています。
>
> お手数ですが、よろしくお願い致します。
>
> > 和田さん
> >
> > こんにちは、山内です。
> >
> > 事象を確認しますので、お手数ですが、
> > Pacemakerとcorosyncのバージョンをお知らせいただけますか?
> > #場合によっては、バージョンによる不具合の可能性があります。
> >
> > 以上、宜しく御願いいたします。
> >
> > --- On Thu, 2012/1/19, 和田 伸一朗<wada.shinichiro [at] jp>  wrote:
> >
> >> こんにちは。
> >> 和田です。
> >>
> >> いつもお世話になっております。
> >>
> >> 先日、N + 1構成でのスプリットブレインの質問をさせていただいた際に、
> >> 山内さんよりリソースのstart,monitorは、standbyではなくrestartのほうが
> >> よいのではというアドバイスをいただきました。
> >>
> >> その際にも少し触れていたのですが、group内の最後のリソース以外に異常が発生した場合、
> >> うまく切り替わらないという事象が、他のアドバイスも取り込み、検証を進めていったところ
> >> やはり同様の事象が発生しました。
> >>
> >> #現状、standbyにすることですぐに切り替わることは確認できているので、
> >> #困っているわけではないのですが、どうもすっきりせず。。。
> >>
> >> たとえば、
> >>
> >> --------------------------------------------------------------------
> >>
> >>   Resource Group: group_all
> >>       fs_db      (ocf::heartbeat:Filesystem):    Started it14
> >>       ip_db      (ocf::heartbeat:IPaddr2):       Started it14
> >>       prmPg      (ocf::heartbeat:pgsql): Started it14
> >>       apache     (ocf::heartbeat:apache):        Started it14
> >>
> >> --------------------------------------------------------------------
> >>
> >> というリソースがあった場合に、apacheの異常が発生した場合は正常に
> >> 切り替わるのですが、pgsqlで異常が発生した場合には、
> >>
> >> --------------------------------------------------------------------
> >>
> >>   Resource Group: group_all
> >>       fs_db      (ocf::heartbeat:Filesystem):    Started it14
> >>       ip_db      (ocf::heartbeat:IPaddr2):       Started it14
> >>       prmPg      (ocf::heartbeat:pgsql): Stopped
> >>       apache     (ocf::heartbeat:apache):        Stopped
> >>
> >> --------------------------------------------------------------------
> >>
> >> の状態でとまってしまいます。
> >> ただし、15分後に異常を検知されその後であれば切り替えが発生します。
> >> #15分はpropertyのcluster-recheck-intervalに依存するものと考えています。
> >>
> >> なお、apacheで異常が発生した場合は正常に切り替えが行われ、
> >> pgsqlとapacheの順序を入れ替えた場合はapacheで正常に切り替えが
> >> 行われない状態となります。
> >>
> >> 上記の挙動はgroupリソースでrestartを設定した場合の挙動として正しいのでしょうか?
> >> ご存知の方、設定方法の誤りなども含めてどなたかご教示いただけないでしょうか?
> >>
> >> なお、設定は以下のようにしています。
> >>
> >> --------------------------------------------------------------------
> >>
> >> primitive drbd_db ocf:linbit:drbd \
> >>          params drbd_resource="pgsql" \
> >>          op start interval="0s" timeout="240s" on-fail="restart" \
> >>          op monitor interval="11s" timeout="60s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
> >>          op stop interval="0s" timeout="100s" on-fail="fence"
> >>
> >> primitive ip_db ocf:heartbeat:IPaddr2 \
> >>          params ip="192.168.1.175" \
> >>                  nic="eth1" \
> >>                  cidr_netmask="24" \
> >>          op start interval="0s" timeout="90s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="100s" on-fail="fence"
> >>
> >> primitive prmPing ocf:pacemaker:ping \
> >>          params \
> >>                  name="ping_set" \
> >>                  host_list="192.168.1.1 192.168.2.1" \
> >>                  multiplier="100" \
> >>                  dampen="0" \
> >>          meta \
> >>                  migration-threshold="3" \
> >>                  failure-timeout="60s" \
> >>          op start interval="0s" timeout="90s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="100s" on-fail="ignore"
> >>
> >> primitive fs_db ocf:heartbeat:Filesystem \
> >>          params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
> >>          op start interval="0s" timeout="60s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="60s" on-fail="fence"
> >>
> >> primitive prmPg ocf:heartbeat:pgsql \
> >>          params pgctl="/usr/bin/pg_ctl" \
> >>          start_opt="-p 5432" \
> >>          psql="/usr/bin/psql" \
> >>          pgdata="/data/" \
> >>          pgdba="postgres" \
> >>          pgport="5432" \
> >>          pgdb="postgres" \
> >>          op start interval="0s" timeout="120s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="120s" on-fail="fence"
> >>
> >> primitive apache ocf:heartbeat:apache \
> >>          params configfile="/etc/httpd/conf/httpd.conf" \
> >>          port="80" \
> >>          op start interval="0s" timeout="40s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="60s" on-fail="fence"
> >>
> >> primitive prmDiskd ocf:pacemaker:diskd \
> >>          params name="diskd_set" \
> >>          device="/dev/sda1" \
> >>          op start interval="0s" timeout="60s" on-fail="restart" \
> >>          op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>          op stop interval="0s" timeout="60s" on-fail="ignore"
> >>
> >> primitive prmStonith1-1 stonith:external/stonith-helper \
> >>      params \
> >>          priority="1" \
> >>          stonith-timeout="60s" \
> >>          hostlist="it13" \
> >>          dead_check_target="192.168.1.173" \
> >>          run_standby_wait="no" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> primitive prmStonith1-2 stonith:external/ssh \
> >>      params \
> >>          priority="2" \
> >>          stonith-timeout="60s" \
> >>          hostlist="it13" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> primitive prmStonith1-3 stonith:meatware \
> >>      params \
> >>          priority="3" \
> >>          stonith-timeout="600" \
> >>          hostlist="it13" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> primitive prmStonith2-1 stonith:external/stonith-helper \
> >>      params \
> >>          priority="1" \
> >>          stonith-timeout="60s" \
> >>          hostlist="it14" \
> >>          dead_check_target="192.168.1.174" \
> >>          run_standby_wait="no" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> primitive prmStonith2-2 stonith:external/ssh \
> >>      params \
> >>          priority="2" \
> >>          stonith-timeout="60s" \
> >>          hostlist="it14" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> primitive prmStonith2-3 stonith:meatware \
> >>      params \
> >>          priority="3" \
> >>          stonith-timeout="600" \
> >>          hostlist="it14" \
> >>      op start interval="0s" timeout="60s" \
> >>      op monitor interval="3600s" timeout="60s" \
> >>      op stop interval="0s" timeout="60s"
> >>
> >> group group_all fs_db ip_db prmPg apache
> >>
> >> group grpStonith1 \
> >>      prmStonith1-1 \
> >>      prmStonith1-2 \
> >>      prmStonith1-3
> >>
> >> group grpStonith2 \
> >>      prmStonith2-1 \
> >>      prmStonith2-2 \
> >>      prmStonith2-3
> >>
> >> ms ms_drbd_db drbd_db \
> >>          meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> >>
> >> clone clnPing prmPing \
> >>          meta clone-max="2" clone-node-max="1"
> >>
> >> clone clnDiskd prmDiskd \
> >>          meta clone-max="2" clone-node-max="1"
> >>
> >> location group_all-location group_all \
> >>          rule 200: #uname eq it13 \
> >>          rule 100: #uname eq it14 \
> >>          rule -INFINITY: defined ping_set and ping_set lt 200 \
> >>          rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
> >>
> >> location master-location_db ms_drbd_db \
> >>          rule 200: #uname eq it13 \
> >>          rule 100: #uname eq it14 \
> >>          rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
> >>          rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
> >>
> >> location rsc_location-grpStonith1-1 grpStonith1 \
> >>      rule -INFINITY: #uname eq it13
> >>
> >> location rsc_location-grpStonith2-1 grpStonith2 \
> >>      rule -INFINITY: #uname eq it14
> >>
> >> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
> >> colocation clnPing-colocation INFINITY: group_all clnPing
> >> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
> >> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
> >> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
> >> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
> >>
> >> property no-quorum-policy="freeze" \
> >>      stonith-enabled="true" \
> >>          startup-fencing="false" \
> >>          stonith-timeout="430s"
> >>
> >> rsc_defaults resource-stickiness="INFINITY" \
> >>          migration-threshold="1"
> >>
> >> --------------------------------------------------------------------
> >>
> >> よろしくお願い致します。
> >>
> >> _______________________________________________
> >> Linux-ha-japan mailing list
> >> Linux-ha-japan [at] lists
> >> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >>
> >
> > _______________________________________________
> > Linux-ha-japan mailing list
> > Linux-ha-japan [at] lists
> > http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


renayama19661014 at ybb

Jan 18, 2012, 7:55 PM

Post #2 of 7 (228 views)
Permalink
Re: group内のリソースで発生した異常について [In reply to]

和田さん

こんにちは、山内です。

事象を確認しますので、お手数ですが、
Pacemakerとcorosyncのバージョンをお知らせいただけますか?
#場合によっては、バージョンによる不具合の可能性があります。

以上、宜しく御願いいたします。

--- On Thu, 2012/1/19, 和田 伸一朗 <wada.shinichiro [at] jp> wrote:

> こんにちは。
> 和田です。
>
> いつもお世話になっております。
>
> 先日、N + 1構成でのスプリットブレインの質問をさせていただいた際に、
> 山内さんよりリソースのstart,monitorは、standbyではなくrestartのほうが
> よいのではというアドバイスをいただきました。
>
> その際にも少し触れていたのですが、group内の最後のリソース以外に異常が発生した場合、
> うまく切り替わらないという事象が、他のアドバイスも取り込み、検証を進めていったところ
> やはり同様の事象が発生しました。
>
> #現状、standbyにすることですぐに切り替わることは確認できているので、
> #困っているわけではないのですが、どうもすっきりせず。。。
>
> たとえば、
>
> --------------------------------------------------------------------
>
> Resource Group: group_all
>      fs_db      (ocf::heartbeat:Filesystem):    Started it14
>      ip_db      (ocf::heartbeat:IPaddr2):       Started it14
>      prmPg      (ocf::heartbeat:pgsql): Started it14
>      apache     (ocf::heartbeat:apache):        Started it14
>
> --------------------------------------------------------------------
>
> というリソースがあった場合に、apacheの異常が発生した場合は正常に
> 切り替わるのですが、pgsqlで異常が発生した場合には、
>
> --------------------------------------------------------------------
>
> Resource Group: group_all
>      fs_db      (ocf::heartbeat:Filesystem):    Started it14
>      ip_db      (ocf::heartbeat:IPaddr2):       Started it14
>      prmPg      (ocf::heartbeat:pgsql): Stopped
>      apache     (ocf::heartbeat:apache):        Stopped
>
> --------------------------------------------------------------------
>
> の状態でとまってしまいます。
> ただし、15分後に異常を検知されその後であれば切り替えが発生します。
> #15分はpropertyのcluster-recheck-intervalに依存するものと考えています。
>
> なお、apacheで異常が発生した場合は正常に切り替えが行われ、
> pgsqlとapacheの順序を入れ替えた場合はapacheで正常に切り替えが
> 行われない状態となります。
>
> 上記の挙動はgroupリソースでrestartを設定した場合の挙動として正しいのでしょうか?
> ご存知の方、設定方法の誤りなども含めてどなたかご教示いただけないでしょうか?
>
> なお、設定は以下のようにしています。
>
> --------------------------------------------------------------------
>
> primitive drbd_db ocf:linbit:drbd \
>         params drbd_resource="pgsql" \
>         op start interval="0s" timeout="240s" on-fail="restart" \
>         op monitor interval="11s" timeout="60s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
>         op stop interval="0s" timeout="100s" on-fail="fence"
>
> primitive ip_db ocf:heartbeat:IPaddr2 \
>         params ip="192.168.1.175" \
>                 nic="eth1" \
>                 cidr_netmask="24" \
>         op start interval="0s" timeout="90s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="100s" on-fail="fence"
>
> primitive prmPing ocf:pacemaker:ping \
>         params \
>                 name="ping_set" \
>                 host_list="192.168.1.1 192.168.2.1" \
>                 multiplier="100" \
>                 dampen="0" \
>         meta \
>                 migration-threshold="3" \
>                 failure-timeout="60s" \
>         op start interval="0s" timeout="90s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="100s" on-fail="ignore"
>
> primitive fs_db ocf:heartbeat:Filesystem \
>         params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
>         op start interval="0s" timeout="60s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="60s" on-fail="fence"
>
> primitive prmPg ocf:heartbeat:pgsql \
>         params pgctl="/usr/bin/pg_ctl" \
>         start_opt="-p 5432" \
>         psql="/usr/bin/psql" \
>         pgdata="/data/" \
>         pgdba="postgres" \
>         pgport="5432" \
>         pgdb="postgres" \
>         op start interval="0s" timeout="120s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="120s" on-fail="fence"
>
> primitive apache ocf:heartbeat:apache \
>         params configfile="/etc/httpd/conf/httpd.conf" \
>         port="80" \
>         op start interval="0s" timeout="40s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="60s" on-fail="fence"
>
> primitive prmDiskd ocf:pacemaker:diskd \
>         params name="diskd_set" \
>         device="/dev/sda1" \
>         op start interval="0s" timeout="60s" on-fail="restart" \
>         op monitor interval="10s" timeout="60s" on-fail="restart" \
>         op stop interval="0s" timeout="60s" on-fail="ignore"
>
> primitive prmStonith1-1 stonith:external/stonith-helper \
>     params \
>         priority="1" \
>         stonith-timeout="60s" \
>         hostlist="it13" \
>         dead_check_target="192.168.1.173" \
>         run_standby_wait="no" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> primitive prmStonith1-2 stonith:external/ssh \
>     params \
>         priority="2" \
>         stonith-timeout="60s" \
>         hostlist="it13" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> primitive prmStonith1-3 stonith:meatware \
>     params \
>         priority="3" \
>         stonith-timeout="600" \
>         hostlist="it13" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> primitive prmStonith2-1 stonith:external/stonith-helper \
>     params \
>         priority="1" \
>         stonith-timeout="60s" \
>         hostlist="it14" \
>         dead_check_target="192.168.1.174" \
>         run_standby_wait="no" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> primitive prmStonith2-2 stonith:external/ssh \
>     params \
>         priority="2" \
>         stonith-timeout="60s" \
>         hostlist="it14" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> primitive prmStonith2-3 stonith:meatware \
>     params \
>         priority="3" \
>         stonith-timeout="600" \
>         hostlist="it14" \
>     op start interval="0s" timeout="60s" \
>     op monitor interval="3600s" timeout="60s" \
>     op stop interval="0s" timeout="60s"
>
> group group_all fs_db ip_db prmPg apache
>
> group grpStonith1 \
>     prmStonith1-1 \
>     prmStonith1-2 \
>     prmStonith1-3
>
> group grpStonith2 \
>     prmStonith2-1 \
>     prmStonith2-2 \
>     prmStonith2-3
>
> ms ms_drbd_db drbd_db \
>         meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>
> clone clnPing prmPing \
>         meta clone-max="2" clone-node-max="1"
>
> clone clnDiskd prmDiskd \
>         meta clone-max="2" clone-node-max="1"
>
> location group_all-location group_all \
>         rule 200: #uname eq it13 \
>         rule 100: #uname eq it14 \
>         rule -INFINITY: defined ping_set and ping_set lt 200 \
>         rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>
> location master-location_db ms_drbd_db \
>         rule 200: #uname eq it13 \
>         rule 100: #uname eq it14 \
>         rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
>         rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>
> location rsc_location-grpStonith1-1 grpStonith1 \
>     rule -INFINITY: #uname eq it13
>
> location rsc_location-grpStonith2-1 grpStonith2 \
>     rule -INFINITY: #uname eq it14
>
> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
> colocation clnPing-colocation INFINITY: group_all clnPing
> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
>
> property no-quorum-policy="freeze" \
>     stonith-enabled="true" \
>         startup-fencing="false" \
>         stonith-timeout="430s"
>
> rsc_defaults resource-stickiness="INFINITY" \
>         migration-threshold="1"
>
> --------------------------------------------------------------------
>
> よろしくお願い致します。
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


wada.shinichiro at jp

Jan 18, 2012, 8:13 PM

Post #3 of 7 (216 views)
Permalink
Re: group内のリソースで発生した異常について [In reply to]

$B;3Fb$5$s(B

$B$3$s$K$A$O!#(B
$BOBED$G$9!#(B

$B$$$D$b$"$j$,$H$&$4$6$$$^$9!#(B

$B%P!<%8%g%s$O(B
pacemaker-1.0.11-1.2.el6.x86_64
corosync-1.3.1-1.1.el6.x86_64
$B$K$J$j$^$9!#(B

http://sourceforge.jp/projects/linux-ha/releases/
$B$G8x3+$5$l$F$k!"(B
pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
$B$KF~$C$F$$$k$b$N$K$J$j$^$9!#(B

$B5-21$,Dj$+$G$O$J$$$N$G$9$,!"(BScientific Linux 6.1$B$K%P%s%I%k$5$l$F$$$k$b$N$G$b(B
$BF1MM$N;v>]$,H/@8$7$F$$$?$h$&$K5-21$7$F$$$^$9!#(B

$B$*<j?t$G$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B

> $BOBED$5$s(B
>
> $B$3$s$K$A$O!";3Fb$G$9!#(B
>
> $B;v>]$r3NG'$7$^$9$N$G!"$*<j?t$G$9$,!"(B
> Pacemaker$B$H(Bcorosync$B$N%P!<%8%g%s$r$*CN$i$;$$$?$@$1$^$9$+!)(B
> $B!t>l9g$K$h$C$F$O!"%P!<%8%g%s$K$h$kIT6q9g$N2DG=@-$,$"$j$^$9!#(B
>
> $B0J>e!"59$7$/8f4j$$$$$?$7$^$9!#(B
>
> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>
>> $B$3$s$K$A$O!#(B
>> $BOBED$G$9!#(B
>>
>> $B$$$D$b$*@$OC$K$J$C$F$*$j$^$9!#(B
>>
>> $B [at] h|!"(BN + 1$B9=@.$G$N%9%W%j%C%H%V%l%$%s$N<ALd$r$5$;$F$$$?$@$$$?:]$K!"(B
>> $B;3Fb$5$s$h$j%j%=!<%9$N(Bstart,monitor$B$O!"(Bstandby$B$G$O$J$/(Brestart$B$N$[$&$,(B
>> $B$h$$$N$G$O$H$$$&%"%I%P%$%9$r$$$?$@$-$^$7$?!#(B
>>
>> $B$=$N:]$K$b>/$7?($l$F$$$?$N$G$9$,!"(Bgroup$BFb$N:G8e$N%j%=!<%90J30$K0[>o$,H/@8$7$?>l9g!"(B
>> $B$&$^$/@Z$jBX$o$i$J$$$H$$$&;v>]$,!"B>$N%"%I%P%$%9$b<h$j9~$_!"8!>Z$r?J$a$F$$$C$?$H$3$m(B
>> $B$d$O$jF1MM$N;v>]$,H/@8$7$^$7$?!#(B
>>
>> $B!t8=>u!"(Bstandby$B$K$9$k$3$H$G$9$0$K@Z$jBX$o$k$3$H$O3NG'$G$-$F$$$k$N$G!"(B
>> $B!t:$$C$F$$$k$o$1$G$O$J$$$N$G$9$,!"$I$&$b$9$C$-$j$;$:!#!#!#(B
>>
>> $B$?$H$($P!"(B
>>
>> --------------------------------------------------------------------
>>
>> Resource Group: group_all
>> fs_db (ocf::heartbeat:Filesystem): Started it14
>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>> prmPg (ocf::heartbeat:pgsql): Started it14
>> apache (ocf::heartbeat:apache): Started it14
>>
>> --------------------------------------------------------------------
>>
>> $B$H$$$&%j%=!<%9$,$"$C$?>l9g$K!"(Bapache$B$N0[>o$,H/@8$7$?>l9g$O@5>o$K(B
>> $B@Z$jBX$o$k$N$G$9$,!"(Bpgsql$B$G0[>o$,H/@8$7$?>l9g$K$O!"(B
>>
>> --------------------------------------------------------------------
>>
>> Resource Group: group_all
>> fs_db (ocf::heartbeat:Filesystem): Started it14
>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>> prmPg (ocf::heartbeat:pgsql): Stopped
>> apache (ocf::heartbeat:apache): Stopped
>>
>> --------------------------------------------------------------------
>>
>> $B$N>uBV$G$H$^$C$F$7$^$$$^$9!#(B
>> $B$?$@$7!"(B15$BJ,8e$K0[>o$r8!CN$5$l$=$N8e$G$"$l$P@Z$jBX$($,H/@8$7$^$9!#(B
>> $B!t(B15$BJ,$O(Bproperty$B$N(Bcluster-recheck-interval$B$K0MB8$9$k$b$N$H9M$($F$$$^$9!#(B
>>
>> $B$J$*!"(Bapache$B$G0[>o$,H/@8$7$?>l9g$O@5>o$K@Z$jBX$($,9T$o$l!"(B
>> pgsql$B$H(Bapache$B$N=g=x$rF~$lBX$($?>l9g$O(Bapache$B$G@5>o$K@Z$jBX$($,(B
>> $B9T$o$l$J$$>uBV$H$J$j$^$9!#(B
>>
>> $B>e5-$N5sF0$O(Bgroup$B%j%=!<%9$G(Brestart$B$r [at] _D$7$?>l9g$N5sF0$H$7$F@5$7$$$N$G$7$g$&$+!)(B
>> $B$4B8CN$NJ}!"@_DjJ}K!$N8m$j$J$I$b4^$a$F$I$J$?$+$465<($$$?$@$1$J$$$G$7$g$&$+!)(B
>>
>> $B$J$*!"@_Dj$O0J2<$N$h$&$K$7$F$$$^$9!#(B
>>
>> --------------------------------------------------------------------
>>
>> primitive drbd_db ocf:linbit:drbd \
>> params drbd_resource="pgsql" \
>> op start interval="0s" timeout="240s" on-fail="restart" \
>> op monitor interval="11s" timeout="60s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
>> op stop interval="0s" timeout="100s" on-fail="fence"
>>
>> primitive ip_db ocf:heartbeat:IPaddr2 \
>> params ip="192.168.1.175" \
>> nic="eth1" \
>> cidr_netmask="24" \
>> op start interval="0s" timeout="90s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="100s" on-fail="fence"
>>
>> primitive prmPing ocf:pacemaker:ping \
>> params \
>> name="ping_set" \
>> host_list="192.168.1.1 192.168.2.1" \
>> multiplier="100" \
>> dampen="0" \
>> meta \
>> migration-threshold="3" \
>> failure-timeout="60s" \
>> op start interval="0s" timeout="90s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="100s" on-fail="ignore"
>>
>> primitive fs_db ocf:heartbeat:Filesystem \
>> params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
>> op start interval="0s" timeout="60s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="60s" on-fail="fence"
>>
>> primitive prmPg ocf:heartbeat:pgsql \
>> params pgctl="/usr/bin/pg_ctl" \
>> start_opt="-p 5432" \
>> psql="/usr/bin/psql" \
>> pgdata="/data/" \
>> pgdba="postgres" \
>> pgport="5432" \
>> pgdb="postgres" \
>> op start interval="0s" timeout="120s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="120s" on-fail="fence"
>>
>> primitive apache ocf:heartbeat:apache \
>> params configfile="/etc/httpd/conf/httpd.conf" \
>> port="80" \
>> op start interval="0s" timeout="40s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="60s" on-fail="fence"
>>
>> primitive prmDiskd ocf:pacemaker:diskd \
>> params name="diskd_set" \
>> device="/dev/sda1" \
>> op start interval="0s" timeout="60s" on-fail="restart" \
>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>> op stop interval="0s" timeout="60s" on-fail="ignore"
>>
>> primitive prmStonith1-1 stonith:external/stonith-helper \
>> params \
>> priority="1" \
>> stonith-timeout="60s" \
>> hostlist="it13" \
>> dead_check_target="192.168.1.173" \
>> run_standby_wait="no" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> primitive prmStonith1-2 stonith:external/ssh \
>> params \
>> priority="2" \
>> stonith-timeout="60s" \
>> hostlist="it13" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> primitive prmStonith1-3 stonith:meatware \
>> params \
>> priority="3" \
>> stonith-timeout="600" \
>> hostlist="it13" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> primitive prmStonith2-1 stonith:external/stonith-helper \
>> params \
>> priority="1" \
>> stonith-timeout="60s" \
>> hostlist="it14" \
>> dead_check_target="192.168.1.174" \
>> run_standby_wait="no" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> primitive prmStonith2-2 stonith:external/ssh \
>> params \
>> priority="2" \
>> stonith-timeout="60s" \
>> hostlist="it14" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> primitive prmStonith2-3 stonith:meatware \
>> params \
>> priority="3" \
>> stonith-timeout="600" \
>> hostlist="it14" \
>> op start interval="0s" timeout="60s" \
>> op monitor interval="3600s" timeout="60s" \
>> op stop interval="0s" timeout="60s"
>>
>> group group_all fs_db ip_db prmPg apache
>>
>> group grpStonith1 \
>> prmStonith1-1 \
>> prmStonith1-2 \
>> prmStonith1-3
>>
>> group grpStonith2 \
>> prmStonith2-1 \
>> prmStonith2-2 \
>> prmStonith2-3
>>
>> ms ms_drbd_db drbd_db \
>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>>
>> clone clnPing prmPing \
>> meta clone-max="2" clone-node-max="1"
>>
>> clone clnDiskd prmDiskd \
>> meta clone-max="2" clone-node-max="1"
>>
>> location group_all-location group_all \
>> rule 200: #uname eq it13 \
>> rule 100: #uname eq it14 \
>> rule -INFINITY: defined ping_set and ping_set lt 200 \
>> rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>
>> location master-location_db ms_drbd_db \
>> rule 200: #uname eq it13 \
>> rule 100: #uname eq it14 \
>> rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
>> rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>
>> location rsc_location-grpStonith1-1 grpStonith1 \
>> rule -INFINITY: #uname eq it13
>>
>> location rsc_location-grpStonith2-1 grpStonith2 \
>> rule -INFINITY: #uname eq it14
>>
>> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
>> colocation clnPing-colocation INFINITY: group_all clnPing
>> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
>> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
>> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
>> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
>>
>> property no-quorum-policy="freeze" \
>> stonith-enabled="true" \
>> startup-fencing="false" \
>> stonith-timeout="430s"
>>
>> rsc_defaults resource-stickiness="INFINITY" \
>> migration-threshold="1"
>>
>> --------------------------------------------------------------------
>>
>> $B$h$m$7$/$*4j$$CW$7$^$9!#(B
>>
>> _______________________________________________
>> Linux-ha-japan mailing list
>> Linux-ha-japan [at] lists
>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


wada.shinichiro at jp

Jan 18, 2012, 8:22 PM

Post #4 of 7 (217 views)
Permalink
Re: group内のリソースで発生した異常について [In reply to]

$B;3Fb$5$s(B

$B$3$s$K$A$O!#(B
$BOBED$G$9!#(B

> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B

$B$&!#!#!#$=$&$G$9$+!#!#(B

> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B

$B$*<j?t$*$+$1$$$?$7$^$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B

$B4JC1$G$9$,!"0J>e$G$9!#(B

> $BOBED$5$s(B
>
> $B$3$s$K$A$O!";3Fb$G$9!#(B
>
> $B8fO"Mm$"$j$,$H$&$4$6$$$^$9!#(B
>
> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B
>
> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B
>
> $B0J>e$G$9!#(B
>
> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>
>> $B;3Fb$5$s(B
>>
>> $B$3$s$K$A$O!#(B
>> $BOBED$G$9!#(B
>>
>> $B$$$D$b$"$j$,$H$&$4$6$$$^$9!#(B
>>
>> $B%P!<%8%g%s$O(B
>> pacemaker-1.0.11-1.2.el6.x86_64
>> corosync-1.3.1-1.1.el6.x86_64
>> $B$K$J$j$^$9!#(B
>>
>> http://sourceforge.jp/projects/linux-ha/releases/
>> $B$G8x3+$5$l$F$k!"(B
>> pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
>> $B$KF~$C$F$$$k$b$N$K$J$j$^$9!#(B
>>
>> $B5-21$,Dj$+$G$O$J$$$N$G$9$,!"(BScientific Linux 6.1$B$K%P%s%I%k$5$l$F$$$k$b$N$G$b(B
>> $BF1MM$N;v>]$,H/@8$7$F$$$?$h$&$K5-21$7$F$$$^$9!#(B
>>
>> $B$*<j?t$G$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B
>>
>>> $BOBED$5$s(B
>>>
>>> $B$3$s$K$A$O!";3Fb$G$9!#(B
>>>
>>> $B;v>]$r3NG'$7$^$9$N$G!"$*<j?t$G$9$,!"(B
>>> Pacemaker$B$H(Bcorosync$B$N%P!<%8%g%s$r$*CN$i$;$$$?$@$1$^$9$+!)(B
>>> $B!t>l9g$K$h$C$F$O!"%P!<%8%g%s$K$h$kIT6q9g$N2DG=@-$,$"$j$^$9!#(B
>>>
>>> $B0J>e!"59$7$/8f4j$$$$$?$7$^$9!#(B
>>>
>>> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>>>
>>>> $B$3$s$K$A$O!#(B
>>>> $BOBED$G$9!#(B
>>>>
>>>> $B$$$D$b$*@$OC$K$J$C$F$*$j$^$9!#(B
>>>>
>>>> $B [at] h|!"(BN + 1$B9=@.$G$N%9%W%j%C%H%V%l%$%s$N<ALd$r$5$;$F$$$?$@$$$?:]$K!"(B
>>>> $B;3Fb$5$s$h$j%j%=!<%9$N(Bstart,monitor$B$O!"(Bstandby$B$G$O$J$/(Brestart$B$N$[$&$,(B
>>>> $B$h$$$N$G$O$H$$$&%"%I%P%$%9$r$$$?$@$-$^$7$?!#(B
>>>>
>>>> $B$=$N:]$K$b>/$7?($l$F$$$?$N$G$9$,!"(Bgroup$BFb$N:G8e$N%j%=!<%90J30$K0[>o$,H/@8$7$?>l9g!"(B
>>>> $B$&$^$/@Z$jBX$o$i$J$$$H$$$&;v>]$,!"B>$N%"%I%P%$%9$b<h$j9~$_!"8!>Z$r?J$a$F$$$C$?$H$3$m(B
>>>> $B$d$O$jF1MM$N;v>]$,H/@8$7$^$7$?!#(B
>>>>
>>>> $B!t8=>u!"(Bstandby$B$K$9$k$3$H$G$9$0$K@Z$jBX$o$k$3$H$O3NG'$G$-$F$$$k$N$G!"(B
>>>> $B!t:$$C$F$$$k$o$1$G$O$J$$$N$G$9$,!"$I$&$b$9$C$-$j$;$:!#!#!#(B
>>>>
>>>> $B$?$H$($P!"(B
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> Resource Group: group_all
>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>> prmPg (ocf::heartbeat:pgsql): Started it14
>>>> apache (ocf::heartbeat:apache): Started it14
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> $B$H$$$&%j%=!<%9$,$"$C$?>l9g$K!"(Bapache$B$N0[>o$,H/@8$7$?>l9g$O@5>o$K(B
>>>> $B@Z$jBX$o$k$N$G$9$,!"(Bpgsql$B$G0[>o$,H/@8$7$?>l9g$K$O!"(B
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> Resource Group: group_all
>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>> prmPg (ocf::heartbeat:pgsql): Stopped
>>>> apache (ocf::heartbeat:apache): Stopped
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> $B$N>uBV$G$H$^$C$F$7$^$$$^$9!#(B
>>>> $B$?$@$7!"(B15$BJ,8e$K0[>o$r8!CN$5$l$=$N8e$G$"$l$P@Z$jBX$($,H/@8$7$^$9!#(B
>>>> $B!t(B15$BJ,$O(Bproperty$B$N(Bcluster-recheck-interval$B$K0MB8$9$k$b$N$H9M$($F$$$^$9!#(B
>>>>
>>>> $B$J$*!"(Bapache$B$G0[>o$,H/@8$7$?>l9g$O@5>o$K@Z$jBX$($,9T$o$l!"(B
>>>> pgsql$B$H(Bapache$B$N=g=x$rF~$lBX$($?>l9g$O(Bapache$B$G@5>o$K@Z$jBX$($,(B
>>>> $B9T$o$l$J$$>uBV$H$J$j$^$9!#(B
>>>>
>>>> $B>e5-$N5sF0$O(Bgroup$B%j%=!<%9$G(Brestart$B$r [at] _D$7$?>l9g$N5sF0$H$7$F@5$7$$$N$G$7$g$&$+!)(B
>>>> $B$4B8CN$NJ}!"@_DjJ}K!$N8m$j$J$I$b4^$a$F$I$J$?$+$465<($$$?$@$1$J$$$G$7$g$&$+!)(B
>>>>
>>>> $B$J$*!"@_Dj$O0J2<$N$h$&$K$7$F$$$^$9!#(B
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> primitive drbd_db ocf:linbit:drbd \
>>>> params drbd_resource="pgsql" \
>>>> op start interval="0s" timeout="240s" on-fail="restart" \
>>>> op monitor interval="11s" timeout="60s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>
>>>> primitive ip_db ocf:heartbeat:IPaddr2 \
>>>> params ip="192.168.1.175" \
>>>> nic="eth1" \
>>>> cidr_netmask="24" \
>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>
>>>> primitive prmPing ocf:pacemaker:ping \
>>>> params \
>>>> name="ping_set" \
>>>> host_list="192.168.1.1 192.168.2.1" \
>>>> multiplier="100" \
>>>> dampen="0" \
>>>> meta \
>>>> migration-threshold="3" \
>>>> failure-timeout="60s" \
>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="100s" on-fail="ignore"
>>>>
>>>> primitive fs_db ocf:heartbeat:Filesystem \
>>>> params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>
>>>> primitive prmPg ocf:heartbeat:pgsql \
>>>> params pgctl="/usr/bin/pg_ctl" \
>>>> start_opt="-p 5432" \
>>>> psql="/usr/bin/psql" \
>>>> pgdata="/data/" \
>>>> pgdba="postgres" \
>>>> pgport="5432" \
>>>> pgdb="postgres" \
>>>> op start interval="0s" timeout="120s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="120s" on-fail="fence"
>>>>
>>>> primitive apache ocf:heartbeat:apache \
>>>> params configfile="/etc/httpd/conf/httpd.conf" \
>>>> port="80" \
>>>> op start interval="0s" timeout="40s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>
>>>> primitive prmDiskd ocf:pacemaker:diskd \
>>>> params name="diskd_set" \
>>>> device="/dev/sda1" \
>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>> op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>
>>>> primitive prmStonith1-1 stonith:external/stonith-helper \
>>>> params \
>>>> priority="1" \
>>>> stonith-timeout="60s" \
>>>> hostlist="it13" \
>>>> dead_check_target="192.168.1.173" \
>>>> run_standby_wait="no" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> primitive prmStonith1-2 stonith:external/ssh \
>>>> params \
>>>> priority="2" \
>>>> stonith-timeout="60s" \
>>>> hostlist="it13" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> primitive prmStonith1-3 stonith:meatware \
>>>> params \
>>>> priority="3" \
>>>> stonith-timeout="600" \
>>>> hostlist="it13" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> primitive prmStonith2-1 stonith:external/stonith-helper \
>>>> params \
>>>> priority="1" \
>>>> stonith-timeout="60s" \
>>>> hostlist="it14" \
>>>> dead_check_target="192.168.1.174" \
>>>> run_standby_wait="no" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> primitive prmStonith2-2 stonith:external/ssh \
>>>> params \
>>>> priority="2" \
>>>> stonith-timeout="60s" \
>>>> hostlist="it14" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> primitive prmStonith2-3 stonith:meatware \
>>>> params \
>>>> priority="3" \
>>>> stonith-timeout="600" \
>>>> hostlist="it14" \
>>>> op start interval="0s" timeout="60s" \
>>>> op monitor interval="3600s" timeout="60s" \
>>>> op stop interval="0s" timeout="60s"
>>>>
>>>> group group_all fs_db ip_db prmPg apache
>>>>
>>>> group grpStonith1 \
>>>> prmStonith1-1 \
>>>> prmStonith1-2 \
>>>> prmStonith1-3
>>>>
>>>> group grpStonith2 \
>>>> prmStonith2-1 \
>>>> prmStonith2-2 \
>>>> prmStonith2-3
>>>>
>>>> ms ms_drbd_db drbd_db \
>>>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>>>>
>>>> clone clnPing prmPing \
>>>> meta clone-max="2" clone-node-max="1"
>>>>
>>>> clone clnDiskd prmDiskd \
>>>> meta clone-max="2" clone-node-max="1"
>>>>
>>>> location group_all-location group_all \
>>>> rule 200: #uname eq it13 \
>>>> rule 100: #uname eq it14 \
>>>> rule -INFINITY: defined ping_set and ping_set lt 200 \
>>>> rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>
>>>> location master-location_db ms_drbd_db \
>>>> rule 200: #uname eq it13 \
>>>> rule 100: #uname eq it14 \
>>>> rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
>>>> rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>
>>>> location rsc_location-grpStonith1-1 grpStonith1 \
>>>> rule -INFINITY: #uname eq it13
>>>>
>>>> location rsc_location-grpStonith2-1 grpStonith2 \
>>>> rule -INFINITY: #uname eq it14
>>>>
>>>> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
>>>> colocation clnPing-colocation INFINITY: group_all clnPing
>>>> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
>>>> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
>>>> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
>>>> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
>>>>
>>>> property no-quorum-policy="freeze" \
>>>> stonith-enabled="true" \
>>>> startup-fencing="false" \
>>>> stonith-timeout="430s"
>>>>
>>>> rsc_defaults resource-stickiness="INFINITY" \
>>>> migration-threshold="1"
>>>>
>>>> --------------------------------------------------------------------
>>>>
>>>> $B$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>>
>>>> _______________________________________________
>>>> Linux-ha-japan mailing list
>>>> Linux-ha-japan [at] lists
>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>
>>>
>>> _______________________________________________
>>> Linux-ha-japan mailing list
>>> Linux-ha-japan [at] lists
>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>> _______________________________________________
>> Linux-ha-japan mailing list
>> Linux-ha-japan [at] lists
>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


tsukishima.ha at gmail

Jan 18, 2012, 9:28 PM

Post #5 of 7 (215 views)
Permalink
Re: group$BFb$N%j%=!<%9(B$B$GH/@8$7$?0[>o$K$D$$$F(B [In reply to]

$BOBEDMM(B

NTT$B%G!<%?@hC<5;=Q$NCSED$G$9!#(B

Pacemaker 1.0.11$B$G(BMaster/Slave$B%j%=!<%9$H(Bgroup$B%j%=!<%9$K(B
$B=g=x@)Ls$*$h$SF15o@)Ls$r [at] _D$7$?>l9g(B
group$B%j%=!<%9$K4^$^$l$k%j%=!<%98N>c$r7@5!$K$7$?(B
Master/Slave$B%j%=!<%9$N%U%'%$%k%*!<%P$,<:GT$7$^$9!#(B

$B;29M(B
http://linux-ha.sourceforge.jp/wp/archives/2468/2

$B2r7hJ}K!(B
http://linux-ha.sourceforge.jp/wp/archives/2468/3

$B2r7hJ}K!$r;29M$K(Blocation$B [at] _D$rDI5-$7$FF0:n$r3NG'$7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B

$B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B

$BCSED=_;R(B

2012$BG/(B1$B7n(B19$BF|(B13:22 $BOBED!!?-0lO/(B <wada.shinichiro [at] jp>:
> $B;3Fb$5$s(B
>
> $B$3$s$K$A$O!#(B
> $BOBED$G$9!#(B
>
>> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B
>
> $B$&!#!#!#$=$&$G$9$+!#!#(B
>
>> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B
>
> $B$*<j?t$*$+$1$$$?$7$^$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B
>
> $B4JC1$G$9$,!"0J>e$G$9!#(B
>
>> $BOBED$5$s(B
>>
>> $B$3$s$K$A$O!";3Fb$G$9!#(B
>>
>> $B8fO"Mm$"$j$,$H$&$4$6$$$^$9!#(B
>>
>> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B
>>
>> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B
>>
>> $B0J>e$G$9!#(B
>>
>> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>>
>>> $B;3Fb$5$s(B
>>>
>>> $B$3$s$K$A$O!#(B
>>> $BOBED$G$9!#(B
>>>
>>> $B$$$D$b$"$j$,$H$&$4$6$$$^$9!#(B
>>>
>>> $B%P!<%8%g%s$O(B
>>> pacemaker-1.0.11-1.2.el6.x86_64
>>> corosync-1.3.1-1.1.el6.x86_64
>>> $B$K$J$j$^$9!#(B
>>>
>>> http://sourceforge.jp/projects/linux-ha/releases/
>>> $B$G8x3+$5$l$F$k!"(B
>>> pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
>>> $B$KF~$C$F$$$k$b$N$K$J$j$^$9!#(B
>>>
>>> $B5-21$,Dj$+$G$O$J$$$N$G$9$,!"(BScientific Linux 6.1$B$K%P%s%I%k$5$l$F$$$k$b$N$G$b(B
>>> $BF1MM$N;v>]$,H/@8$7$F$$$?$h$&$K5-21$7$F$$$^$9!#(B
>>>
>>> $B$*<j?t$G$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>
>>>> $BOBED$5$s(B
>>>>
>>>> $B$3$s$K$A$O!";3Fb$G$9!#(B
>>>>
>>>> $B;v>]$r3NG'$7$^$9$N$G!"$*<j?t$G$9$,!"(B
>>>> Pacemaker$B$H(Bcorosync$B$N%P!<%8%g%s$r$*CN$i$;$$$?$@$1$^$9$+!)(B
>>>> $B!t>l9g$K$h$C$F$O!"%P!<%8%g%s$K$h$kIT6q9g$N2DG=@-$,$"$j$^$9!#(B
>>>>
>>>> $B0J>e!"59$7$/8f4j$$$$$?$7$^$9!#(B
>>>>
>>>> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>>>>
>>>>> $B$3$s$K$A$O!#(B
>>>>> $BOBED$G$9!#(B
>>>>>
>>>>> $B$$$D$b$*@$OC$K$J$C$F$*$j$^$9!#(B
>>>>>
>>>>> $B [at] h|!"(BN + 1$B9=@.$G$N%9%W%j%C%H%V%l%$%s$N<ALd$r$5$;$F$$$?$@$$$?:]$K!"(B
>>>>> $B;3Fb$5$s$h$j%j%=!<%9$N(Bstart,monitor$B$O!"(Bstandby$B$G$O$J$/(Brestart$B$N$[$&$,(B
>>>>> $B$h$$$N$G$O$H$$$&%"%I%P%$%9$r$$$?$@$-$^$7$?!#(B
>>>>>
>>>>> $B$=$N:]$K$b>/$7?($l$F$$$?$N$G$9$,!"(Bgroup$BFb$N:G8e$N%j%=!<%90J30$K0[>o$,H/@8$7$?>l9g!"(B
>>>>> $B$&$^$/@Z$jBX$o$i$J$$$H$$$&;v>]$,!"B>$N%"%I%P%$%9$b<h$j9~$_!"8!>Z$r?J$a$F$$$C$?$H$3$m(B
>>>>> $B$d$O$jF1MM$N;v>]$,H/@8$7$^$7$?!#(B
>>>>>
>>>>> $B!t8=>u!"(Bstandby$B$K$9$k$3$H$G$9$0$K@Z$jBX$o$k$3$H$O3NG'$G$-$F$$$k$N$G!"(B
>>>>> $B!t:$$C$F$$$k$o$1$G$O$J$$$N$G$9$,!"$I$&$b$9$C$-$j$;$:!#!#!#(B
>>>>>
>>>>> $B$?$H$($P!"(B
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> Resource Group: group_all
>>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>>> prmPg (ocf::heartbeat:pgsql): Started it14
>>>>> apache (ocf::heartbeat:apache): Started it14
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> $B$H$$$&%j%=!<%9$,$"$C$?>l9g$K!"(Bapache$B$N0[>o$,H/@8$7$?>l9g$O@5>o$K(B
>>>>> $B@Z$jBX$o$k$N$G$9$,!"(Bpgsql$B$G0[>o$,H/@8$7$?>l9g$K$O!"(B
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> Resource Group: group_all
>>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>>> prmPg (ocf::heartbeat:pgsql): Stopped
>>>>> apache (ocf::heartbeat:apache): Stopped
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> $B$N>uBV$G$H$^$C$F$7$^$$$^$9!#(B
>>>>> $B$?$@$7!"(B15$BJ,8e$K0[>o$r8!CN$5$l$=$N8e$G$"$l$P@Z$jBX$($,H/@8$7$^$9!#(B
>>>>> $B!t(B15$BJ,$O(Bproperty$B$N(Bcluster-recheck-interval$B$K0MB8$9$k$b$N$H9M$($F$$$^$9!#(B
>>>>>
>>>>> $B$J$*!"(Bapache$B$G0[>o$,H/@8$7$?>l9g$O@5>o$K@Z$jBX$($,9T$o$l!"(B
>>>>> pgsql$B$H(Bapache$B$N=g=x$rF~$lBX$($?>l9g$O(Bapache$B$G@5>o$K@Z$jBX$($,(B
>>>>> $B9T$o$l$J$$>uBV$H$J$j$^$9!#(B
>>>>>
>>>>> $B>e5-$N5sF0$O(Bgroup$B%j%=!<%9$G(Brestart$B$r [at] _D$7$?>l9g$N5sF0$H$7$F@5$7$$$N$G$7$g$&$+!)(B
>>>>> $B$4B8CN$NJ}!"@_DjJ}K!$N8m$j$J$I$b4^$a$F$I$J$?$+$465<($$$?$@$1$J$$$G$7$g$&$+!)(B
>>>>>
>>>>> $B$J$*!"@_Dj$O0J2<$N$h$&$K$7$F$$$^$9!#(B
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> primitive drbd_db ocf:linbit:drbd \
>>>>> params drbd_resource="pgsql" \
>>>>> op start interval="0s" timeout="240s" on-fail="restart" \
>>>>> op monitor interval="11s" timeout="60s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
>>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>
>>>>> primitive ip_db ocf:heartbeat:IPaddr2 \
>>>>> params ip="192.168.1.175" \
>>>>> nic="eth1" \
>>>>> cidr_netmask="24" \
>>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>
>>>>> primitive prmPing ocf:pacemaker:ping \
>>>>> params \
>>>>> name="ping_set" \
>>>>> host_list="192.168.1.1 192.168.2.1" \
>>>>> multiplier="100" \
>>>>> dampen="0" \
>>>>> meta \
>>>>> migration-threshold="3" \
>>>>> failure-timeout="60s" \
>>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="100s" on-fail="ignore"
>>>>>
>>>>> primitive fs_db ocf:heartbeat:Filesystem \
>>>>> params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
>>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>>
>>>>> primitive prmPg ocf:heartbeat:pgsql \
>>>>> params pgctl="/usr/bin/pg_ctl" \
>>>>> start_opt="-p 5432" \
>>>>> psql="/usr/bin/psql" \
>>>>> pgdata="/data/" \
>>>>> pgdba="postgres" \
>>>>> pgport="5432" \
>>>>> pgdb="postgres" \
>>>>> op start interval="0s" timeout="120s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="120s" on-fail="fence"
>>>>>
>>>>> primitive apache ocf:heartbeat:apache \
>>>>> params configfile="/etc/httpd/conf/httpd.conf" \
>>>>> port="80" \
>>>>> op start interval="0s" timeout="40s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>>
>>>>> primitive prmDiskd ocf:pacemaker:diskd \
>>>>> params name="diskd_set" \
>>>>> device="/dev/sda1" \
>>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>> op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>
>>>>> primitive prmStonith1-1 stonith:external/stonith-helper \
>>>>> params \
>>>>> priority="1" \
>>>>> stonith-timeout="60s" \
>>>>> hostlist="it13" \
>>>>> dead_check_target="192.168.1.173" \
>>>>> run_standby_wait="no" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> primitive prmStonith1-2 stonith:external/ssh \
>>>>> params \
>>>>> priority="2" \
>>>>> stonith-timeout="60s" \
>>>>> hostlist="it13" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> primitive prmStonith1-3 stonith:meatware \
>>>>> params \
>>>>> priority="3" \
>>>>> stonith-timeout="600" \
>>>>> hostlist="it13" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> primitive prmStonith2-1 stonith:external/stonith-helper \
>>>>> params \
>>>>> priority="1" \
>>>>> stonith-timeout="60s" \
>>>>> hostlist="it14" \
>>>>> dead_check_target="192.168.1.174" \
>>>>> run_standby_wait="no" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> primitive prmStonith2-2 stonith:external/ssh \
>>>>> params \
>>>>> priority="2" \
>>>>> stonith-timeout="60s" \
>>>>> hostlist="it14" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> primitive prmStonith2-3 stonith:meatware \
>>>>> params \
>>>>> priority="3" \
>>>>> stonith-timeout="600" \
>>>>> hostlist="it14" \
>>>>> op start interval="0s" timeout="60s" \
>>>>> op monitor interval="3600s" timeout="60s" \
>>>>> op stop interval="0s" timeout="60s"
>>>>>
>>>>> group group_all fs_db ip_db prmPg apache
>>>>>
>>>>> group grpStonith1 \
>>>>> prmStonith1-1 \
>>>>> prmStonith1-2 \
>>>>> prmStonith1-3
>>>>>
>>>>> group grpStonith2 \
>>>>> prmStonith2-1 \
>>>>> prmStonith2-2 \
>>>>> prmStonith2-3
>>>>>
>>>>> ms ms_drbd_db drbd_db \
>>>>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>>>>>
>>>>> clone clnPing prmPing \
>>>>> meta clone-max="2" clone-node-max="1"
>>>>>
>>>>> clone clnDiskd prmDiskd \
>>>>> meta clone-max="2" clone-node-max="1"
>>>>>
>>>>> location group_all-location group_all \
>>>>> rule 200: #uname eq it13 \
>>>>> rule 100: #uname eq it14 \
>>>>> rule -INFINITY: defined ping_set and ping_set lt 200 \
>>>>> rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>>
>>>>> location master-location_db ms_drbd_db \
>>>>> rule 200: #uname eq it13 \
>>>>> rule 100: #uname eq it14 \
>>>>> rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
>>>>> rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>>
>>>>> location rsc_location-grpStonith1-1 grpStonith1 \
>>>>> rule -INFINITY: #uname eq it13
>>>>>
>>>>> location rsc_location-grpStonith2-1 grpStonith2 \
>>>>> rule -INFINITY: #uname eq it14
>>>>>
>>>>> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
>>>>> colocation clnPing-colocation INFINITY: group_all clnPing
>>>>> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
>>>>> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
>>>>> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
>>>>> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
>>>>>
>>>>> property no-quorum-policy="freeze" \
>>>>> stonith-enabled="true" \
>>>>> startup-fencing="false" \
>>>>> stonith-timeout="430s"
>>>>>
>>>>> rsc_defaults resource-stickiness="INFINITY" \
>>>>> migration-threshold="1"
>>>>>
>>>>> --------------------------------------------------------------------
>>>>>
>>>>> $B$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>>>
>>>>> _______________________________________________
>>>>> Linux-ha-japan mailing list
>>>>> Linux-ha-japan [at] lists
>>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>
>>>>
>>>> _______________________________________________
>>>> Linux-ha-japan mailing list
>>>> Linux-ha-japan [at] lists
>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>
>>> _______________________________________________
>>> Linux-ha-japan mailing list
>>> Linux-ha-japan [at] lists
>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>
>>
>> _______________________________________________
>> Linux-ha-japan mailing list
>> Linux-ha-japan [at] lists
>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


renayama19661014 at ybb

Jan 18, 2012, 9:32 PM

Post #6 of 7 (224 views)
Permalink
Re: group内のリソースで発生した異常について [In reply to]

和田さん

こんにちは、山内です。

池田さんも回答されえていますが。。。。

stonithを簡易にして、リソースもDummyなどで代用して現象を確認しました。


1) 該当バージョンでの起動後の様子
============
Last updated: Thu Jan 19 23:10:17 2012
Stack: openais
Current DC: rh61-coro1 - partition with quorum
Version: 1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87
2 Nodes configured, 2 expected votes
6 Resources configured.
============

Online: [ rh61-coro1 rh61-coro2 ]

Resource Group: group_all
fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
prmPg (ocf::heartbeat:Dummy2): Started rh61-coro1
apache (ocf::heartbeat:Dummy): Started rh61-coro1
Resource Group: grpStonith1
prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
Resource Group: grpStonith2
prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
Master/Slave Set: ms_drbd_db
Masters: [ rh61-coro1 ]
Slaves: [ rh61-coro2 ]
Clone Set: clnDiskd
Started: [ rh61-coro1 rh61-coro2 ]
Clone Set: clnPing
Started: [ rh61-coro1 rh61-coro2 ]

Migration summary:
* Node rh61-coro1:
* Node rh61-coro2:

2)該当バージョンでのpgsql(Dummy2)故障時の様子

============
Last updated: Thu Jan 19 23:11:02 2012
Stack: openais
Current DC: rh61-coro1 - partition with quorum
Version: 1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87
2 Nodes configured, 2 expected votes
6 Resources configured.
============

Online: [ rh61-coro1 rh61-coro2 ]

Resource Group: group_all
fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
prmPg (ocf::heartbeat:Dummy2): Stopped
apache (ocf::heartbeat:Dummy): Stopped
Resource Group: grpStonith1
prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
Resource Group: grpStonith2
prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
Master/Slave Set: ms_drbd_db
Masters: [ rh61-coro1 ]
Slaves: [ rh61-coro2 ]
Clone Set: clnDiskd
Started: [ rh61-coro1 rh61-coro2 ]
Clone Set: clnPing
Started: [ rh61-coro1 rh61-coro2 ]

Migration summary:
* Node rh61-coro1:
prmPg: migration-threshold=1 fail-count=1
* Node rh61-coro2:

Failed actions:
prmPg_monitor_10000 (node=rh61-coro1, call=28, rc=1, status=complete): unknown error

3) 1.0.12での起動後の様子

============
Last updated: Thu Jan 19 23:20:48 2012
Stack: openais
Current DC: rh61-coro1 - partition with quorum
Version: 1.0.12-unknown
2 Nodes configured, 2 expected votes
6 Resources configured.
============

Online: [ rh61-coro1 rh61-coro2 ]

Resource Group: group_all
fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
prmPg (ocf::heartbeat:Dummy2): Started rh61-coro1
apache (ocf::heartbeat:Dummy): Started rh61-coro1
Resource Group: grpStonith1
prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
Resource Group: grpStonith2
prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
Master/Slave Set: ms_drbd_db
Masters: [ rh61-coro1 ]
Slaves: [ rh61-coro2 ]
Clone Set: clnDiskd
Started: [ rh61-coro1 rh61-coro2 ]
Clone Set: clnPing
Started: [ rh61-coro1 rh61-coro2 ]

Migration summary:
* Node rh61-coro2:
* Node rh61-coro1:

4) 1.0.12でのpgsql(Dummy2)故障時の様子
============
Last updated: Thu Jan 19 23:21:55 2012
Stack: openais
Current DC: rh61-coro1 - partition with quorum
Version: 1.0.12-unknown
2 Nodes configured, 2 expected votes
6 Resources configured.
============

Online: [ rh61-coro1 rh61-coro2 ]

Resource Group: group_all
fs_db (ocf::heartbeat:Dummy): Started rh61-coro2
ip_db (ocf::heartbeat:Dummy): Started rh61-coro2
prmPg (ocf::heartbeat:Dummy2): Started rh61-coro2
apache (ocf::heartbeat:Dummy): Started rh61-coro2
Resource Group: grpStonith1
prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
Resource Group: grpStonith2
prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
Master/Slave Set: ms_drbd_db
Masters: [ rh61-coro2 ]
Slaves: [ rh61-coro1 ]
Clone Set: clnDiskd
Started: [ rh61-coro1 rh61-coro2 ]
Clone Set: clnPing
Started: [ rh61-coro1 rh61-coro2 ]

Migration summary:
* Node rh61-coro2:
* Node rh61-coro1:
prmPg: migration-threshold=1 fail-count=1

Failed actions:
prmPg_monitor_10000 (node=rh61-coro1, call=28, rc=1, status=complete): unknown error


結論としては不具合で、以下の修正が反映される必要があります。
http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5432bf2773a5


今の所、日本語コミュニティからは修正版はリリースされていませんので、以下からzipでソースコードを取得して再インストールするか
* https://github.com/ClusterLabs/pacemaker-1.0/tree/Pacemaker-1.0.12
本家側に該当のrpmがあればそちらを利用する必要があります。

今回は試していませんが、また、1.0.11では、以前も報告されていますが、以下のMLの修正をcrmに適用することでご要望の動作可能と思われます。
* http://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-September/000941.html

以上です。



--- On Thu, 2012/1/19, Junko IKEDA <tsukishima.ha [at] gmail> wrote:

> 和田様
>
> NTTデータ先端技術の池田です。
>
> Pacemaker 1.0.11でMaster/Slaveリソースとgroupリソースに
> 順序制約および同居制約を設定した場合
> groupリソースに含まれるリソース故障を契機にした
> Master/Slaveリソースのフェイルオーバが失敗します。
>
> 参考
> http://linux-ha.sourceforge.jp/wp/archives/2468/2
>
> 解決方法
> http://linux-ha.sourceforge.jp/wp/archives/2468/3
>
> 解決方法を参考にlocation設定を追記して動作を確認していただくことは可能でしょうか。
>
> 以上よろしくお願いいたします。
>
> 池田淳子
>
> 2012年1月19日13:22 和田 伸一朗 <wada.shinichiro [at] jp>:
> > 山内さん
> >
> > こんにちは。
> > 和田です。
> >
> >> #似た事象が1.0.11では出ていますので、不具合を踏んでしまっている可能性が大きいです。
> >
> > う。。。そうですか。。
> >
> >> 確認して結果を御連絡します。
> >
> > お手数おかけいたしますが、よろしくお願い致します。
> >
> > 簡単ですが、以上です。
> >
> >> 和田さん
> >>
> >> こんにちは、山内です。
> >>
> >> 御連絡ありがとうございます。
> >>
> >> 確認して結果を御連絡します。
> >>
> >> #似た事象が1.0.11では出ていますので、不具合を踏んでしまっている可能性が大きいです。
> >>
> >> 以上です。
> >>
> >> --- On Thu, 2012/1/19, 和田 伸一朗<wada.shinichiro [at] jp>  wrote:
> >>
> >>> 山内さん
> >>>
> >>> こんにちは。
> >>> 和田です。
> >>>
> >>> いつもありがとうございます。
> >>>
> >>> バージョンは
> >>> pacemaker-1.0.11-1.2.el6.x86_64
> >>> corosync-1.3.1-1.1.el6.x86_64
> >>> になります。
> >>>
> >>> http://sourceforge.jp/projects/linux-ha/releases/
> >>> で公開されてる、
> >>> pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
> >>> に入っているものになります。
> >>>
> >>> 記憶が定かではないのですが、Scientific Linux 6.1にバンドルされているものでも
> >>> 同様の事象が発生していたように記憶しています。
> >>>
> >>> お手数ですが、よろしくお願い致します。
> >>>
> >>>> 和田さん
> >>>>
> >>>> こんにちは、山内です。
> >>>>
> >>>> 事象を確認しますので、お手数ですが、
> >>>> Pacemakerとcorosyncのバージョンをお知らせいただけますか?
> >>>> #場合によっては、バージョンによる不具合の可能性があります。
> >>>>
> >>>> 以上、宜しく御願いいたします。
> >>>>
> >>>> --- On Thu, 2012/1/19, 和田 伸一朗<wada.shinichiro [at] jp>   wrote:
> >>>>
> >>>>> こんにちは。
> >>>>> 和田です。
> >>>>>
> >>>>> いつもお世話になっております。
> >>>>>
> >>>>> 先日、N + 1構成でのスプリットブレインの質問をさせていただいた際に、
> >>>>> 山内さんよりリソースのstart,monitorは、standbyではなくrestartのほうが
> >>>>> よいのではというアドバイスをいただきました。
> >>>>>
> >>>>> その際にも少し触れていたのですが、group内の最後のリソース以外に異常が発生した場合、
> >>>>> うまく切り替わらないという事象が、他のアドバイスも取り込み、検証を進めていったところ
> >>>>> やはり同様の事象が発生しました。
> >>>>>
> >>>>> #現状、standbyにすることですぐに切り替わることは確認できているので、
> >>>>> #困っているわけではないのですが、どうもすっきりせず。。。
> >>>>>
> >>>>> たとえば、
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>>     Resource Group: group_all
> >>>>>         fs_db      (ocf::heartbeat:Filesystem):    Started it14
> >>>>>         ip_db      (ocf::heartbeat:IPaddr2):       Started it14
> >>>>>         prmPg      (ocf::heartbeat:pgsql): Started it14
> >>>>>         apache     (ocf::heartbeat:apache):        Started it14
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>> というリソースがあった場合に、apacheの異常が発生した場合は正常に
> >>>>> 切り替わるのですが、pgsqlで異常が発生した場合には、
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>>     Resource Group: group_all
> >>>>>         fs_db      (ocf::heartbeat:Filesystem):    Started it14
> >>>>>         ip_db      (ocf::heartbeat:IPaddr2):       Started it14
> >>>>>         prmPg      (ocf::heartbeat:pgsql): Stopped
> >>>>>         apache     (ocf::heartbeat:apache):        Stopped
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>> の状態でとまってしまいます。
> >>>>> ただし、15分後に異常を検知されその後であれば切り替えが発生します。
> >>>>> #15分はpropertyのcluster-recheck-intervalに依存するものと考えています。
> >>>>>
> >>>>> なお、apacheで異常が発生した場合は正常に切り替えが行われ、
> >>>>> pgsqlとapacheの順序を入れ替えた場合はapacheで正常に切り替えが
> >>>>> 行われない状態となります。
> >>>>>
> >>>>> 上記の挙動はgroupリソースでrestartを設定した場合の挙動として正しいのでしょうか?
> >>>>> ご存知の方、設定方法の誤りなども含めてどなたかご教示いただけないでしょうか?
> >>>>>
> >>>>> なお、設定は以下のようにしています。
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>> primitive drbd_db ocf:linbit:drbd \
> >>>>>            params drbd_resource="pgsql" \
> >>>>>            op start interval="0s" timeout="240s" on-fail="restart" \
> >>>>>            op monitor interval="11s" timeout="60s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
> >>>>>            op stop interval="0s" timeout="100s" on-fail="fence"
> >>>>>
> >>>>> primitive ip_db ocf:heartbeat:IPaddr2 \
> >>>>>            params ip="192.168.1.175" \
> >>>>>                    nic="eth1" \
> >>>>>                    cidr_netmask="24" \
> >>>>>            op start interval="0s" timeout="90s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="100s" on-fail="fence"
> >>>>>
> >>>>> primitive prmPing ocf:pacemaker:ping \
> >>>>>            params \
> >>>>>                    name="ping_set" \
> >>>>>                    host_list="192.168.1.1 192.168.2.1" \
> >>>>>                    multiplier="100" \
> >>>>>                    dampen="0" \
> >>>>>            meta \
> >>>>>                    migration-threshold="3" \
> >>>>>                    failure-timeout="60s" \
> >>>>>            op start interval="0s" timeout="90s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="100s" on-fail="ignore"
> >>>>>
> >>>>> primitive fs_db ocf:heartbeat:Filesystem \
> >>>>>            params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
> >>>>>            op start interval="0s" timeout="60s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="60s" on-fail="fence"
> >>>>>
> >>>>> primitive prmPg ocf:heartbeat:pgsql \
> >>>>>            params pgctl="/usr/bin/pg_ctl" \
> >>>>>            start_opt="-p 5432" \
> >>>>>            psql="/usr/bin/psql" \
> >>>>>            pgdata="/data/" \
> >>>>>            pgdba="postgres" \
> >>>>>            pgport="5432" \
> >>>>>            pgdb="postgres" \
> >>>>>            op start interval="0s" timeout="120s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="120s" on-fail="fence"
> >>>>>
> >>>>> primitive apache ocf:heartbeat:apache \
> >>>>>            params configfile="/etc/httpd/conf/httpd.conf" \
> >>>>>            port="80" \
> >>>>>            op start interval="0s" timeout="40s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="60s" on-fail="fence"
> >>>>>
> >>>>> primitive prmDiskd ocf:pacemaker:diskd \
> >>>>>            params name="diskd_set" \
> >>>>>            device="/dev/sda1" \
> >>>>>            op start interval="0s" timeout="60s" on-fail="restart" \
> >>>>>            op monitor interval="10s" timeout="60s" on-fail="restart" \
> >>>>>            op stop interval="0s" timeout="60s" on-fail="ignore"
> >>>>>
> >>>>> primitive prmStonith1-1 stonith:external/stonith-helper \
> >>>>>        params \
> >>>>>            priority="1" \
> >>>>>            stonith-timeout="60s" \
> >>>>>            hostlist="it13" \
> >>>>>            dead_check_target="192.168.1.173" \
> >>>>>            run_standby_wait="no" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> primitive prmStonith1-2 stonith:external/ssh \
> >>>>>        params \
> >>>>>            priority="2" \
> >>>>>            stonith-timeout="60s" \
> >>>>>            hostlist="it13" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> primitive prmStonith1-3 stonith:meatware \
> >>>>>        params \
> >>>>>            priority="3" \
> >>>>>            stonith-timeout="600" \
> >>>>>            hostlist="it13" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> primitive prmStonith2-1 stonith:external/stonith-helper \
> >>>>>        params \
> >>>>>            priority="1" \
> >>>>>            stonith-timeout="60s" \
> >>>>>            hostlist="it14" \
> >>>>>            dead_check_target="192.168.1.174" \
> >>>>>            run_standby_wait="no" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> primitive prmStonith2-2 stonith:external/ssh \
> >>>>>        params \
> >>>>>            priority="2" \
> >>>>>            stonith-timeout="60s" \
> >>>>>            hostlist="it14" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> primitive prmStonith2-3 stonith:meatware \
> >>>>>        params \
> >>>>>            priority="3" \
> >>>>>            stonith-timeout="600" \
> >>>>>            hostlist="it14" \
> >>>>>        op start interval="0s" timeout="60s" \
> >>>>>        op monitor interval="3600s" timeout="60s" \
> >>>>>        op stop interval="0s" timeout="60s"
> >>>>>
> >>>>> group group_all fs_db ip_db prmPg apache
> >>>>>
> >>>>> group grpStonith1 \
> >>>>>        prmStonith1-1 \
> >>>>>        prmStonith1-2 \
> >>>>>        prmStonith1-3
> >>>>>
> >>>>> group grpStonith2 \
> >>>>>        prmStonith2-1 \
> >>>>>        prmStonith2-2 \
> >>>>>        prmStonith2-3
> >>>>>
> >>>>> ms ms_drbd_db drbd_db \
> >>>>>            meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> >>>>>
> >>>>> clone clnPing prmPing \
> >>>>>            meta clone-max="2" clone-node-max="1"
> >>>>>
> >>>>> clone clnDiskd prmDiskd \
> >>>>>            meta clone-max="2" clone-node-max="1"
> >>>>>
> >>>>> location group_all-location group_all \
> >>>>>            rule 200: #uname eq it13 \
> >>>>>            rule 100: #uname eq it14 \
> >>>>>            rule -INFINITY: defined ping_set and ping_set lt 200 \
> >>>>>            rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
> >>>>>
> >>>>> location master-location_db ms_drbd_db \
> >>>>>            rule 200: #uname eq it13 \
> >>>>>            rule 100: #uname eq it14 \
> >>>>>            rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
> >>>>>            rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
> >>>>>
> >>>>> location rsc_location-grpStonith1-1 grpStonith1 \
> >>>>>        rule -INFINITY: #uname eq it13
> >>>>>
> >>>>> location rsc_location-grpStonith2-1 grpStonith2 \
> >>>>>        rule -INFINITY: #uname eq it14
> >>>>>
> >>>>> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
> >>>>> colocation clnPing-colocation INFINITY: group_all clnPing
> >>>>> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
> >>>>> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
> >>>>> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
> >>>>> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
> >>>>>
> >>>>> property no-quorum-policy="freeze" \
> >>>>>        stonith-enabled="true" \
> >>>>>            startup-fencing="false" \
> >>>>>            stonith-timeout="430s"
> >>>>>
> >>>>> rsc_defaults resource-stickiness="INFINITY" \
> >>>>>            migration-threshold="1"
> >>>>>
> >>>>> --------------------------------------------------------------------
> >>>>>
> >>>>> よろしくお願い致します。
> >>>>>
> >>>>> _______________________________________________
> >>>>> Linux-ha-japan mailing list
> >>>>> Linux-ha-japan [at] lists
> >>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >>>>>
> >>>>
> >>>> _______________________________________________
> >>>> Linux-ha-japan mailing list
> >>>> Linux-ha-japan [at] lists
> >>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >>>
> >>> _______________________________________________
> >>> Linux-ha-japan mailing list
> >>> Linux-ha-japan [at] lists
> >>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >>>
> >>
> >> _______________________________________________
> >> Linux-ha-japan mailing list
> >> Linux-ha-japan [at] lists
> >> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
> >
> > _______________________________________________
> > Linux-ha-japan mailing list
> > Linux-ha-japan [at] lists
> > http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan


wada.shinichiro at jp

Jan 18, 2012, 10:41 PM

Post #7 of 7 (210 views)
Permalink
Re: group内のリソースで発生した異常について [In reply to]

$B;3Fb$5$s!"CSED$5$s(B

$B$3$s$K$A$O!#(B
$BOBED$G$9!#(B

$B$^$:!"K\5sF0$K$D$$$F(B1.0.11$B$NIT6q9g$G$"$j!"(B1.0.12$B$G2~A1$5$l$k$H$$$&E@$K(B
$B$D$$$FN;2r$7$^$7$?!#(B
$B!t8!>Z$^$G$7$F$$$?$@$-!"$"$j$,$H$&$4$6$$$^$9!#!d;3Fb$5$s(B

$B$^$?!"$465<($$$?$@$$$?BP=hJ}K!$G$&$^$/@Z$jBX$o$k$3$H$r3NG'$7$^$7$?!#(B
$B!d;3Fb$5$s!"CSED$5$s(B

$B;29M$^$G$KF0:n3NG'$7$?@_Dj$r0J2<$K5-:\$7$F$*$-$^$9!#(B
$B$*K;$7$$$H$3$m$$$m$$$m$H$4BP1~$$$?$@$-!"$"$j$,$H$&$4$6$$$^$7$?!#(B

-------------------------------------------------------------------------

primitive drbd_db ocf:linbit:drbd \
params drbd_resource="pgsql" \
op start interval="0s" timeout="240s" on-fail="restart" \
op monitor interval="11s" timeout="60s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
op stop interval="0s" timeout="100s" on-fail="fence"

primitive ip_db ocf:heartbeat:IPaddr2 \
params ip="192.168.1.175" \
nic="eth1" \
cidr_netmask="24" \
op start interval="0s" timeout="90s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="100s" on-fail="fence"

primitive prmPing ocf:pacemaker:ping \
params \
name="ping_set" \
host_list="192.168.1.1 192.168.2.1" \
multiplier="100" \
dampen="0" \
meta \
migration-threshold="3" \
failure-timeout="60s" \
op start interval="0s" timeout="90s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="100s" on-fail="ignore"

primitive fs_db ocf:heartbeat:Filesystem \
params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
op start interval="0s" timeout="60s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="60s" on-fail="fence"

primitive prmPg ocf:heartbeat:pgsql \
params pgctl="/usr/bin/pg_ctl" \
start_opt="-p 5432" \
psql="/usr/bin/psql" \
pgdata="/data/" \
pgdba="postgres" \
pgport="5432" \
pgdb="postgres" \
op start interval="0s" timeout="120s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="120s" on-fail="fence"

primitive apache ocf:heartbeat:apache \
params configfile="/etc/httpd/conf/httpd.conf" \
port="80" \
op start interval="0s" timeout="40s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="60s" on-fail="fence"

primitive prmDiskd ocf:pacemaker:diskd \
params name="diskd_set" \
device="/dev/sda1" \
op start interval="0s" timeout="60s" on-fail="restart" \
op monitor interval="10s" timeout="60s" on-fail="restart" \
op stop interval="0s" timeout="60s" on-fail="ignore"

primitive prmStonith1-1 stonith:external/stonith-helper \
params \
priority="1" \
stonith-timeout="60s" \
hostlist="it13" \
dead_check_target="192.168.1.173" \
run_standby_wait="no" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

primitive prmStonith1-2 stonith:external/ssh \
params \
priority="2" \
stonith-timeout="60s" \
hostlist="it13" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

primitive prmStonith1-3 stonith:meatware \
params \
priority="3" \
stonith-timeout="600" \
hostlist="it13" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

primitive prmStonith2-1 stonith:external/stonith-helper \
params \
priority="1" \
stonith-timeout="60s" \
hostlist="it14" \
dead_check_target="192.168.1.174" \
run_standby_wait="no" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

primitive prmStonith2-2 stonith:external/ssh \
params \
priority="2" \
stonith-timeout="60s" \
hostlist="it14" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

primitive prmStonith2-3 stonith:meatware \
params \
priority="3" \
stonith-timeout="600" \
hostlist="it14" \
op start interval="0s" timeout="60s" \
op monitor interval="3600s" timeout="60s" \
op stop interval="0s" timeout="60s"

group group_all fs_db ip_db prmPg apache

group grpStonith1 \
prmStonith1-1 \
prmStonith1-2 \
prmStonith1-3

group grpStonith2 \
prmStonith2-1 \
prmStonith2-2 \
prmStonith2-3

ms ms_drbd_db drbd_db \
meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"

clone clnPing prmPing \
meta clone-max="2" clone-node-max="1"

clone clnDiskd prmDiskd \
meta clone-max="2" clone-node-max="1"

location group_all-location group_all \
rule 200: #uname eq it13 \
rule 100: #uname eq it14 \
rule -INFINITY: defined ping_set and ping_set lt 200 \
rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS

location master-location_db ms_drbd_db \
rule 200: #uname eq it13 \
rule 100: #uname eq it14 \
rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS \
rule role=master -INFINITY: defined fail-count-fs_db \
rule role=master -INFINITY: defined fail-count-ip_db \
rule role=master -INFINITY: defined fail-count-prmPg \
rule role=master -INFINITY: defined fail-count-apache

location rsc_location-grpStonith1-1 grpStonith1 \
rule -INFINITY: #uname eq it13

location rsc_location-grpStonith2-1 grpStonith2 \
rule -INFINITY: #uname eq it14

colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
colocation clnPing-colocation INFINITY: group_all clnPing
colocation clnDiskd-colocation INFINITY: group_all clnDiskd
order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
order order_clnPing_after_all 0: clnPing group_all symmetrical=false
order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false

property no-quorum-policy="freeze" \
stonith-enabled="true" \
startup-fencing="false" \
stonith-timeout="430s"

rsc_defaults resource-stickiness="INFINITY" \
migration-threshold="1"

-------------------------------------------------------------------------

$B0J>e$G$9!#(B

> $BOBED$5$s(B
>
> $B$3$s$K$A$O!";3Fb$G$9!#(B
>
> $BCSED$5$s$b2sEz$5$l$($F$$$^$9$,!#!#!#!#(B
>
> stonith$B$r4J0W$K$7$F!"%j%=!<%9$b(BDummy$B$J$I$GBeMQ$7$F8=>]$r3NG'$7$^$7$?!#(B
>
>
> 1) $B3:Ev%P!<%8%g%s$G$N5/F08e$NMM;R(B
> ============
> Last updated: Thu Jan 19 23:10:17 2012
> Stack: openais
> Current DC: rh61-coro1 - partition with quorum
> Version: 1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87
> 2 Nodes configured, 2 expected votes
> 6 Resources configured.
> ============
>
> Online: [ rh61-coro1 rh61-coro2 ]
>
> Resource Group: group_all
> fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
> ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
> prmPg (ocf::heartbeat:Dummy2): Started rh61-coro1
> apache (ocf::heartbeat:Dummy): Started rh61-coro1
> Resource Group: grpStonith1
> prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
> Resource Group: grpStonith2
> prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
> Master/Slave Set: ms_drbd_db
> Masters: [ rh61-coro1 ]
> Slaves: [ rh61-coro2 ]
> Clone Set: clnDiskd
> Started: [ rh61-coro1 rh61-coro2 ]
> Clone Set: clnPing
> Started: [ rh61-coro1 rh61-coro2 ]
>
> Migration summary:
> * Node rh61-coro1:
> * Node rh61-coro2:
>
> 2)$B3:Ev%P!<%8%g%s$G$N(Bpgsql(Dummy2)$B8N>c;~$NMM;R(B
>
> ============
> Last updated: Thu Jan 19 23:11:02 2012
> Stack: openais
> Current DC: rh61-coro1 - partition with quorum
> Version: 1.0.11-1554a83db0d3c3e546cfd3aaff6af1184f79ee87
> 2 Nodes configured, 2 expected votes
> 6 Resources configured.
> ============
>
> Online: [ rh61-coro1 rh61-coro2 ]
>
> Resource Group: group_all
> fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
> ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
> prmPg (ocf::heartbeat:Dummy2): Stopped
> apache (ocf::heartbeat:Dummy): Stopped
> Resource Group: grpStonith1
> prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
> Resource Group: grpStonith2
> prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
> Master/Slave Set: ms_drbd_db
> Masters: [ rh61-coro1 ]
> Slaves: [ rh61-coro2 ]
> Clone Set: clnDiskd
> Started: [ rh61-coro1 rh61-coro2 ]
> Clone Set: clnPing
> Started: [ rh61-coro1 rh61-coro2 ]
>
> Migration summary:
> * Node rh61-coro1:
> prmPg: migration-threshold=1 fail-count=1
> * Node rh61-coro2:
>
> Failed actions:
> prmPg_monitor_10000 (node=rh61-coro1, call=28, rc=1, status=complete): unknown error
>
> 3) 1.0.12$B$G$N5/F08e$NMM;R(B
>
> ============
> Last updated: Thu Jan 19 23:20:48 2012
> Stack: openais
> Current DC: rh61-coro1 - partition with quorum
> Version: 1.0.12-unknown
> 2 Nodes configured, 2 expected votes
> 6 Resources configured.
> ============
>
> Online: [ rh61-coro1 rh61-coro2 ]
>
> Resource Group: group_all
> fs_db (ocf::heartbeat:Dummy): Started rh61-coro1
> ip_db (ocf::heartbeat:Dummy): Started rh61-coro1
> prmPg (ocf::heartbeat:Dummy2): Started rh61-coro1
> apache (ocf::heartbeat:Dummy): Started rh61-coro1
> Resource Group: grpStonith1
> prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
> Resource Group: grpStonith2
> prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
> Master/Slave Set: ms_drbd_db
> Masters: [ rh61-coro1 ]
> Slaves: [ rh61-coro2 ]
> Clone Set: clnDiskd
> Started: [ rh61-coro1 rh61-coro2 ]
> Clone Set: clnPing
> Started: [ rh61-coro1 rh61-coro2 ]
>
> Migration summary:
> * Node rh61-coro2:
> * Node rh61-coro1:
>
> 4) 1.0.12$B$G$N(Bpgsql(Dummy2)$B8N>c;~$NMM;R(B
> ============
> Last updated: Thu Jan 19 23:21:55 2012
> Stack: openais
> Current DC: rh61-coro1 - partition with quorum
> Version: 1.0.12-unknown
> 2 Nodes configured, 2 expected votes
> 6 Resources configured.
> ============
>
> Online: [ rh61-coro1 rh61-coro2 ]
>
> Resource Group: group_all
> fs_db (ocf::heartbeat:Dummy): Started rh61-coro2
> ip_db (ocf::heartbeat:Dummy): Started rh61-coro2
> prmPg (ocf::heartbeat:Dummy2): Started rh61-coro2
> apache (ocf::heartbeat:Dummy): Started rh61-coro2
> Resource Group: grpStonith1
> prmStonith1-2 (stonith:external/ssh): Started rh61-coro2
> Resource Group: grpStonith2
> prmStonith2-2 (stonith:external/ssh): Started rh61-coro1
> Master/Slave Set: ms_drbd_db
> Masters: [ rh61-coro2 ]
> Slaves: [ rh61-coro1 ]
> Clone Set: clnDiskd
> Started: [ rh61-coro1 rh61-coro2 ]
> Clone Set: clnPing
> Started: [ rh61-coro1 rh61-coro2 ]
>
> Migration summary:
> * Node rh61-coro2:
> * Node rh61-coro1:
> prmPg: migration-threshold=1 fail-count=1
>
> Failed actions:
> prmPg_monitor_10000 (node=rh61-coro1, call=28, rc=1, status=complete): unknown error
>
>
> $B7kO@$H$7$F$OIT6q9g$G!"0J2<$N=$@5$,H?1G$5$l$kI,MW$,$"$j$^$9!#(B
> http://hg.clusterlabs.org/pacemaker/stable-1.0/rev/5432bf2773a5
>
>
> $B:#$N=j!"F|K\8l%3%_%e%K%F%#$+$i$O=$@5HG$O%j%j!<%9$5$l$F$$$^$;$s$N$G!"0J2<$+$i(Bzip$B$G%=!<%9%3!<%I$r<hF@$7$F:F%$%s%9%H!<%k$9$k$+(B
> * https://github.com/ClusterLabs/pacemaker-1.0/tree/Pacemaker-1.0.12
> $BK\2HB&$K3:Ev$N(Brpm$B$,$"$l$P$=$A$i$rMxMQ$9$kI,MW$,$"$j$^$9!#(B
>
> $B:#2s$O;n$7$F$$$^$;$s$,!"$^$?!"(B1.0.11$B$G$O!"0JA0$bJs9p$5$l$F$$$^$9$,!"0J2<$N(BML$B$N=$@5$r(Bcrm$B$KE,MQ$9$k$3$H$G$4MWK>$NF0:n2DG=$H;W$o$l$^$9!#(B
> * http://sourceforge.jp/projects/linux-ha/lists/archive/japan/2011-September/000941.html
>
> $B0J>e$G$9!#(B
>
>
>
> --- On Thu, 2012/1/19, Junko IKEDA<tsukishima.ha [at] gmail> wrote:
>
>> $BOBEDMM(B
>>
>> NTT$B%G!<%?@hC<5;=Q$NCSED$G$9!#(B
>>
>> Pacemaker 1.0.11$B$G(BMaster/Slave$B%j%=!<%9$H(Bgroup$B%j%=!<%9$K(B
>> $B=g=x@)Ls$*$h$SF15o@)Ls$r [at] _D$7$?>l9g(B
>> group$B%j%=!<%9$K4^$^$l$k%j%=!<%98N>c$r7@5!$K$7$?(B
>> Master/Slave$B%j%=!<%9$N%U%'%$%k%*!<%P$,<:GT$7$^$9!#(B
>>
>> $B;29M(B
>> http://linux-ha.sourceforge.jp/wp/archives/2468/2
>>
>> $B2r7hJ}K!(B
>> http://linux-ha.sourceforge.jp/wp/archives/2468/3
>>
>> $B2r7hJ}K!$r;29M$K(Blocation$B [at] _D$rDI5-$7$FF0:n$r3NG'$7$F$$$?$@$/$3$H$O2DG=$G$7$g$&$+!#(B
>>
>> $B0J>e$h$m$7$/$*4j$$$$$?$7$^$9!#(B
>>
>> $BCSED=_;R(B
>>
>> 2012$BG/(B1$B7n(B19$BF|(B13:22 $BOBED!!?-0lO/(B<wada.shinichiro [at] jp>:
>>> $B;3Fb$5$s(B
>>>
>>> $B$3$s$K$A$O!#(B
>>> $BOBED$G$9!#(B
>>>
>>>> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B
>>>
>>> $B$&!#!#!#$=$&$G$9$+!#!#(B
>>>
>>>> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B
>>>
>>> $B$*<j?t$*$+$1$$$?$7$^$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>
>>> $B4JC1$G$9$,!"0J>e$G$9!#(B
>>>
>>>> $BOBED$5$s(B
>>>>
>>>> $B$3$s$K$A$O!";3Fb$G$9!#(B
>>>>
>>>> $B8fO"Mm$"$j$,$H$&$4$6$$$^$9!#(B
>>>>
>>>> $B3NG'$7$F7k2L$r8fO"Mm$7$^$9!#(B
>>>>
>>>> $B!t;w$?;v>]$,(B1.0.11$B$G$O=P$F$$$^$9$N$G!"IT6q9g$rF'$s$G$7$^$C$F$$$k2DG=@-$,Bg$-$$$G$9!#(B
>>>>
>>>> $B0J>e$G$9!#(B
>>>>
>>>> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>>>>
>>>>> $B;3Fb$5$s(B
>>>>>
>>>>> $B$3$s$K$A$O!#(B
>>>>> $BOBED$G$9!#(B
>>>>>
>>>>> $B$$$D$b$"$j$,$H$&$4$6$$$^$9!#(B
>>>>>
>>>>> $B%P!<%8%g%s$O(B
>>>>> pacemaker-1.0.11-1.2.el6.x86_64
>>>>> corosync-1.3.1-1.1.el6.x86_64
>>>>> $B$K$J$j$^$9!#(B
>>>>>
>>>>> http://sourceforge.jp/projects/linux-ha/releases/
>>>>> $B$G8x3+$5$l$F$k!"(B
>>>>> pacemaker-1.0.11-1.2.2.el6.x86_64.repo.tar.gz
>>>>> $B$KF~$C$F$$$k$b$N$K$J$j$^$9!#(B
>>>>>
>>>>> $B5-21$,Dj$+$G$O$J$$$N$G$9$,!"(BScientific Linux 6.1$B$K%P%s%I%k$5$l$F$$$k$b$N$G$b(B
>>>>> $BF1MM$N;v>]$,H/@8$7$F$$$?$h$&$K5-21$7$F$$$^$9!#(B
>>>>>
>>>>> $B$*<j?t$G$9$,!"$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>>>
>>>>>> $BOBED$5$s(B
>>>>>>
>>>>>> $B$3$s$K$A$O!";3Fb$G$9!#(B
>>>>>>
>>>>>> $B;v>]$r3NG'$7$^$9$N$G!"$*<j?t$G$9$,!"(B
>>>>>> Pacemaker$B$H(Bcorosync$B$N%P!<%8%g%s$r$*CN$i$;$$$?$@$1$^$9$+!)(B
>>>>>> $B!t>l9g$K$h$C$F$O!"%P!<%8%g%s$K$h$kIT6q9g$N2DG=@-$,$"$j$^$9!#(B
>>>>>>
>>>>>> $B0J>e!"59$7$/8f4j$$$$$?$7$^$9!#(B
>>>>>>
>>>>>> --- On Thu, 2012/1/19, $BOBED!!?-0lO/(B<wada.shinichiro [at] jp> wrote:
>>>>>>
>>>>>>> $B$3$s$K$A$O!#(B
>>>>>>> $BOBED$G$9!#(B
>>>>>>>
>>>>>>> $B$$$D$b$*@$OC$K$J$C$F$*$j$^$9!#(B
>>>>>>>
>>>>>>> $B [at] h|!"(BN + 1$B9=@.$G$N%9%W%j%C%H%V%l%$%s$N<ALd$r$5$;$F$$$?$@$$$?:]$K!"(B
>>>>>>> $B;3Fb$5$s$h$j%j%=!<%9$N(Bstart,monitor$B$O!"(Bstandby$B$G$O$J$/(Brestart$B$N$[$&$,(B
>>>>>>> $B$h$$$N$G$O$H$$$&%"%I%P%$%9$r$$$?$@$-$^$7$?!#(B
>>>>>>>
>>>>>>> $B$=$N:]$K$b>/$7?($l$F$$$?$N$G$9$,!"(Bgroup$BFb$N:G8e$N%j%=!<%90J30$K0[>o$,H/@8$7$?>l9g!"(B
>>>>>>> $B$&$^$/@Z$jBX$o$i$J$$$H$$$&;v>]$,!"B>$N%"%I%P%$%9$b<h$j9~$_!"8!>Z$r?J$a$F$$$C$?$H$3$m(B
>>>>>>> $B$d$O$jF1MM$N;v>]$,H/@8$7$^$7$?!#(B
>>>>>>>
>>>>>>> $B!t8=>u!"(Bstandby$B$K$9$k$3$H$G$9$0$K@Z$jBX$o$k$3$H$O3NG'$G$-$F$$$k$N$G!"(B
>>>>>>> $B!t:$$C$F$$$k$o$1$G$O$J$$$N$G$9$,!"$I$&$b$9$C$-$j$;$:!#!#!#(B
>>>>>>>
>>>>>>> $B$?$H$($P!"(B
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> Resource Group: group_all
>>>>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>>>>> prmPg (ocf::heartbeat:pgsql): Started it14
>>>>>>> apache (ocf::heartbeat:apache): Started it14
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> $B$H$$$&%j%=!<%9$,$"$C$?>l9g$K!"(Bapache$B$N0[>o$,H/@8$7$?>l9g$O@5>o$K(B
>>>>>>> $B@Z$jBX$o$k$N$G$9$,!"(Bpgsql$B$G0[>o$,H/@8$7$?>l9g$K$O!"(B
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> Resource Group: group_all
>>>>>>> fs_db (ocf::heartbeat:Filesystem): Started it14
>>>>>>> ip_db (ocf::heartbeat:IPaddr2): Started it14
>>>>>>> prmPg (ocf::heartbeat:pgsql): Stopped
>>>>>>> apache (ocf::heartbeat:apache): Stopped
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> $B$N>uBV$G$H$^$C$F$7$^$$$^$9!#(B
>>>>>>> $B$?$@$7!"(B15$BJ,8e$K0[>o$r8!CN$5$l$=$N8e$G$"$l$P@Z$jBX$($,H/@8$7$^$9!#(B
>>>>>>> $B!t(B15$BJ,$O(Bproperty$B$N(Bcluster-recheck-interval$B$K0MB8$9$k$b$N$H9M$($F$$$^$9!#(B
>>>>>>>
>>>>>>> $B$J$*!"(Bapache$B$G0[>o$,H/@8$7$?>l9g$O@5>o$K@Z$jBX$($,9T$o$l!"(B
>>>>>>> pgsql$B$H(Bapache$B$N=g=x$rF~$lBX$($?>l9g$O(Bapache$B$G@5>o$K@Z$jBX$($,(B
>>>>>>> $B9T$o$l$J$$>uBV$H$J$j$^$9!#(B
>>>>>>>
>>>>>>> $B>e5-$N5sF0$O(Bgroup$B%j%=!<%9$G(Brestart$B$r [at] _D$7$?>l9g$N5sF0$H$7$F@5$7$$$N$G$7$g$&$+!)(B
>>>>>>> $B$4B8CN$NJ}!"@_DjJ}K!$N8m$j$J$I$b4^$a$F$I$J$?$+$465<($$$?$@$1$J$$$G$7$g$&$+!)(B
>>>>>>>
>>>>>>> $B$J$*!"@_Dj$O0J2<$N$h$&$K$7$F$$$^$9!#(B
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> primitive drbd_db ocf:linbit:drbd \
>>>>>>> params drbd_resource="pgsql" \
>>>>>>> op start interval="0s" timeout="240s" on-fail="restart" \
>>>>>>> op monitor interval="11s" timeout="60s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" role="Master" \
>>>>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>
>>>>>>> primitive ip_db ocf:heartbeat:IPaddr2 \
>>>>>>> params ip="192.168.1.175" \
>>>>>>> nic="eth1" \
>>>>>>> cidr_netmask="24" \
>>>>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>
>>>>>>> primitive prmPing ocf:pacemaker:ping \
>>>>>>> params \
>>>>>>> name="ping_set" \
>>>>>>> host_list="192.168.1.1 192.168.2.1" \
>>>>>>> multiplier="100" \
>>>>>>> dampen="0" \
>>>>>>> meta \
>>>>>>> migration-threshold="3" \
>>>>>>> failure-timeout="60s" \
>>>>>>> op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="100s" on-fail="ignore"
>>>>>>>
>>>>>>> primitive fs_db ocf:heartbeat:Filesystem \
>>>>>>> params device="/dev/drbd/by-res/pgsql" directory="/data" fstype="ext4" \
>>>>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>>>>
>>>>>>> primitive prmPg ocf:heartbeat:pgsql \
>>>>>>> params pgctl="/usr/bin/pg_ctl" \
>>>>>>> start_opt="-p 5432" \
>>>>>>> psql="/usr/bin/psql" \
>>>>>>> pgdata="/data/" \
>>>>>>> pgdba="postgres" \
>>>>>>> pgport="5432" \
>>>>>>> pgdb="postgres" \
>>>>>>> op start interval="0s" timeout="120s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="120s" on-fail="fence"
>>>>>>>
>>>>>>> primitive apache ocf:heartbeat:apache \
>>>>>>> params configfile="/etc/httpd/conf/httpd.conf" \
>>>>>>> port="80" \
>>>>>>> op start interval="0s" timeout="40s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="60s" on-fail="fence"
>>>>>>>
>>>>>>> primitive prmDiskd ocf:pacemaker:diskd \
>>>>>>> params name="diskd_set" \
>>>>>>> device="/dev/sda1" \
>>>>>>> op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>> op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>> op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>
>>>>>>> primitive prmStonith1-1 stonith:external/stonith-helper \
>>>>>>> params \
>>>>>>> priority="1" \
>>>>>>> stonith-timeout="60s" \
>>>>>>> hostlist="it13" \
>>>>>>> dead_check_target="192.168.1.173" \
>>>>>>> run_standby_wait="no" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> primitive prmStonith1-2 stonith:external/ssh \
>>>>>>> params \
>>>>>>> priority="2" \
>>>>>>> stonith-timeout="60s" \
>>>>>>> hostlist="it13" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> primitive prmStonith1-3 stonith:meatware \
>>>>>>> params \
>>>>>>> priority="3" \
>>>>>>> stonith-timeout="600" \
>>>>>>> hostlist="it13" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> primitive prmStonith2-1 stonith:external/stonith-helper \
>>>>>>> params \
>>>>>>> priority="1" \
>>>>>>> stonith-timeout="60s" \
>>>>>>> hostlist="it14" \
>>>>>>> dead_check_target="192.168.1.174" \
>>>>>>> run_standby_wait="no" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> primitive prmStonith2-2 stonith:external/ssh \
>>>>>>> params \
>>>>>>> priority="2" \
>>>>>>> stonith-timeout="60s" \
>>>>>>> hostlist="it14" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> primitive prmStonith2-3 stonith:meatware \
>>>>>>> params \
>>>>>>> priority="3" \
>>>>>>> stonith-timeout="600" \
>>>>>>> hostlist="it14" \
>>>>>>> op start interval="0s" timeout="60s" \
>>>>>>> op monitor interval="3600s" timeout="60s" \
>>>>>>> op stop interval="0s" timeout="60s"
>>>>>>>
>>>>>>> group group_all fs_db ip_db prmPg apache
>>>>>>>
>>>>>>> group grpStonith1 \
>>>>>>> prmStonith1-1 \
>>>>>>> prmStonith1-2 \
>>>>>>> prmStonith1-3
>>>>>>>
>>>>>>> group grpStonith2 \
>>>>>>> prmStonith2-1 \
>>>>>>> prmStonith2-2 \
>>>>>>> prmStonith2-3
>>>>>>>
>>>>>>> ms ms_drbd_db drbd_db \
>>>>>>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
>>>>>>>
>>>>>>> clone clnPing prmPing \
>>>>>>> meta clone-max="2" clone-node-max="1"
>>>>>>>
>>>>>>> clone clnDiskd prmDiskd \
>>>>>>> meta clone-max="2" clone-node-max="1"
>>>>>>>
>>>>>>> location group_all-location group_all \
>>>>>>> rule 200: #uname eq it13 \
>>>>>>> rule 100: #uname eq it14 \
>>>>>>> rule -INFINITY: defined ping_set and ping_set lt 200 \
>>>>>>> rule -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>>>>
>>>>>>> location master-location_db ms_drbd_db \
>>>>>>> rule 200: #uname eq it13 \
>>>>>>> rule 100: #uname eq it14 \
>>>>>>> rule role=master -INFINITY: defined ping_set and ping_set lt 200 \
>>>>>>> rule role=master -INFINITY: defined diskd_set and diskd_set eq SUCCESS
>>>>>>>
>>>>>>> location rsc_location-grpStonith1-1 grpStonith1 \
>>>>>>> rule -INFINITY: #uname eq it13
>>>>>>>
>>>>>>> location rsc_location-grpStonith2-1 grpStonith2 \
>>>>>>> rule -INFINITY: #uname eq it14
>>>>>>>
>>>>>>> colocation db_on_drbd INFINITY: group_all ms_drbd_db:Master
>>>>>>> colocation clnPing-colocation INFINITY: group_all clnPing
>>>>>>> colocation clnDiskd-colocation INFINITY: group_all clnDiskd
>>>>>>> order order_db_after_drbd INFINITY: ms_drbd_db:promote group_all:start
>>>>>>> order order_clnPing_after_all 0: clnPing group_all symmetrical=false
>>>>>>> order order_clnDiskd_after_all 0: clnDiskd group_all symmetrical=false
>>>>>>>
>>>>>>> property no-quorum-policy="freeze" \
>>>>>>> stonith-enabled="true" \
>>>>>>> startup-fencing="false" \
>>>>>>> stonith-timeout="430s"
>>>>>>>
>>>>>>> rsc_defaults resource-stickiness="INFINITY" \
>>>>>>> migration-threshold="1"
>>>>>>>
>>>>>>> --------------------------------------------------------------------
>>>>>>>
>>>>>>> $B$h$m$7$/$*4j$$CW$7$^$9!#(B
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Linux-ha-japan mailing list
>>>>>>> Linux-ha-japan [at] lists
>>>>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Linux-ha-japan mailing list
>>>>>> Linux-ha-japan [at] lists
>>>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>
>>>>> _______________________________________________
>>>>> Linux-ha-japan mailing list
>>>>> Linux-ha-japan [at] lists
>>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>
>>>>
>>>> _______________________________________________
>>>> Linux-ha-japan mailing list
>>>> Linux-ha-japan [at] lists
>>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>
>>> _______________________________________________
>>> Linux-ha-japan mailing list
>>> Linux-ha-japan [at] lists
>>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>> _______________________________________________
>> Linux-ha-japan mailing list
>> Linux-ha-japan [at] lists
>> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>
> _______________________________________________
> Linux-ha-japan mailing list
> Linux-ha-japan [at] lists
> http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan [at] lists
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan

Linux-HA japanese RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.