
Mailing List Archive: Linux-HA: Users

problem with pind

 

 



antonio.trujillo.sspa at juntadeandalucia

Apr 12, 2012, 5:59 AM

Post #1 of 5
problem with pind

I'm trying to configure a cluster and I have a problem with pingd.
My config is:
crm(live)configure# show
node proxy-00
node proxy-01
primitive ip-segura ocf:heartbeat:IPaddr2 \
params ip="10.104.16.123" nic="lan" cidr_netmask="19" \
op monitor interval="10" \
meta target-role="Started"
primitive pingd ocf:pacemaker:pingd \
params host_list="10.104.16.157" \
meta target-role="Started"
property $id="cib-bootstrap-options" \
dc-version="1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c" \
cluster-infrastructure="openais" \
stonith-enabled="false" \
no-quorum-policy="ignore" \
expected-quorum-votes="2"

crm(live)# status
============
Last updated: Thu Apr 12 14:54:21 2012
Last change: Thu Apr 12 14:40:00 2012
Stack: openais
Current DC: proxy-00 - partition WITHOUT quorum
Version: 1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c
2 Nodes configured, 2 expected votes
2 Resources configured.
============

Online: [ proxy-00 ]
OFFLINE: [ proxy-01 ]

ip-segura (ocf::heartbeat:IPaddr2): Started proxy-00

Failed actions:
pingd:0_monitor_0 (node=proxy-00, call=5, rc=2, status=complete):
invalid parameter
pingd_monitor_0 (node=proxy-00, call=8, rc=2, status=complete):
invalid parameter

crm(live)resource# start pingd
crm(live)resource# status
ip-segura (ocf::heartbeat:IPaddr2) Started
pingd (ocf::pacemaker:pingd) Stopped

and in the system log I got:

Apr 12 14:55:18 proxy-00 crm_resource: [27941]: ERROR: unpack_rsc_op:
Hard error - pingd:0_last_failure_0 failed with rc=2: Preventing pingd:0
from re-starting on proxy-00
Apr 12 14:55:18 proxy-00 crm_resource: [27941]: ERROR: unpack_rsc_op:
Hard error - pingd_last_failure_0 failed with rc=2: Preventing pingd
from re-starting on proxy-00

I have stopped node 2 in order to simplify the problem.

I can't find any reference to this error.
Can you help me, please?




--
trujo <antonio.trujillo.sspa [at] juntadeandalucia>
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


andreas at hastexo

Apr 13, 2012, 2:18 AM

Post #2 of 5
Re: problem with pind

On 04/12/2012 02:59 PM, Trujillo Carmona, Antonio wrote:
>
> I'm trying to configure a cluster and I have a problem with pingd.
> My config is:
> crm(live)configure# show
> node proxy-00
> node proxy-01
> primitive ip-segura ocf:heartbeat:IPaddr2 \
> params ip="10.104.16.123" nic="lan" cidr_netmask="19" \
> op monitor interval="10" \
> meta target-role="Started"
> primitive pingd ocf:pacemaker:pingd \

Use ocf:pacemaker:ping instead.

> params host_list="10.104.16.157" \

and you have to define a monitor operation.

Without constraints that let the cluster react to connectivity changes,
a ping resource is useless. This may help:

http://www.hastexo.com/resources/hints-and-kinks/network-connectivity-check-pacemaker

Regards,
Andreas

--
Need help with Pacemaker?
http://www.hastexo.com/now
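
For reference, the three changes suggested above (use ocf:pacemaker:ping, define a monitor operation, add a constraint) could look roughly like the following in crm shell syntax. This is only a sketch and was not posted in the thread: the IDs p_ping, cl_ping and l_ip_needs_ping, the multiplier and dampen values, and the monitor interval and timeout are assumptions chosen for illustration. Only host_list and ip-segura come from the original config. It also assumes the ping agent's default attribute name, pingd.

```
# Sketch only: IDs and timing values are assumptions, not from the thread.
# Connectivity check using the ping agent, with an explicit monitor operation.
primitive p_ping ocf:pacemaker:ping \
        params host_list="10.104.16.157" multiplier="1000" dampen="5s" \
        op monitor interval="15s" timeout="60s"
# Run the check on every node so each one publishes its own score.
clone cl_ping p_ping
# Keep ip-segura off any node whose "pingd" attribute is missing or zero.
location l_ip_needs_ping ip-segura \
        rule -inf: not_defined pingd or pingd lte 0
```

The existing pingd primitive, with its recorded failures, would presumably have to be cleaned up and deleted first, for example with "crm resource cleanup pingd" followed by "crm configure delete pingd".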



mohamed.s at alcatel-lucent

Apr 13, 2012, 3:51 AM

Post #3 of 5
Re: problem with pind

Hi,

The Pacemaker_Explained.pdf document says:

"setting of migration-threshold=2 and failure-timeout=60s would cause the resource to move to a new node after 2 failures, and allow it to move back (depending on the stickiness and constraint scores) after one minute."

Can you please help me understand what will happen in the following scenarios in a 2-node active/passive configuration?

1 - If a resource fails twice within 60s, it will move to the other node.
This is clear.

2 - If a resource fails once and there is no further failure within 60s, will Pacemaker reset the failcount of that resource, so that failures are counted afresh?

The failcounts are not reset if the migration-threshold is not reached within the failure-timeout period. Is that a bug in pacemaker-1.0.5-4.1?

Thanks,
Raffi







andrew at beekhof

Apr 15, 2012, 5:20 PM

Post #4 of 5
Re: problem with pind

On Fri, Apr 13, 2012 at 8:51 PM, S, MOHAMED (MOHAMED)** CTR **
<mohamed.s [at] alcatel-lucent> wrote:
> Hi,
>
> The Pacemaker_Explained.pdf document says:
>
> "setting of migration-threshold=2 and failure-timeout=60s would cause the resource to move to a new node after 2 failures, and allow it to move back (depending on the stickiness and constraint scores) after one minute."
>
> Can you please help me understand what will happen in the following scenarios in a 2-node active/passive configuration?
>
> 1 - If a resource fails twice within 60s, it will move to the other node.
> This is clear.
>
> 2 - If a resource fails once and there is no further failure within 60s, will Pacemaker reset the failcount of that resource, so that failures are counted afresh?

Yes, but only with recent releases of 1.1.x

>
> The failcounts are not reset if the migration-threshold is not reached within the failure-timeout period. Is that a bug in pacemaker-1.0.5-4.1?

No, it's a new feature in 1.1.x
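
As an illustration of the settings being discussed, the two meta attributes from the quoted documentation could be applied to the ip-segura resource from the first post as sketched below. Applying them to that particular resource is an assumption made for illustration; nobody in the thread posted this config. Per the answer above, expiry of the failcount relies on a recent 1.1.x release.

```
# Sketch only: reuses ip-segura from the first post as a hypothetical example.
# After 2 failures the resource moves away; failures expire after 60s.
primitive ip-segura ocf:heartbeat:IPaddr2 \
        params ip="10.104.16.123" nic="lan" cidr_netmask="19" \
        op monitor interval="10" \
        meta migration-threshold="2" failure-timeout="60s"
```

The current count can be inspected with "crm resource failcount ip-segura show proxy-00" or with "crm_mon -f". Expired failures are only acted on when the cluster next re-evaluates its state, so the reset may show up later than 60s after the failure; the cluster-recheck-interval property bounds that delay.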


mohamed.s at alcatel-lucent

Apr 15, 2012, 10:37 PM

Post #5 of 5
Re: problem with pind

Andrew,

Thanks for the answer.

It would really help users if that were mentioned in Pacemaker_Explained.pdf under section
9.3.2, "Moving Resources Due to Failure".

Thanks for the document.
Raffi

