Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

Antw: Duplicate monitor operation on a multi state resource

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


Ulrich.Windl at rz

Aug 22, 2012, 12:00 AM

Post #1 of 7 (302 views)
Permalink
Antw: Duplicate monitor operation on a multi state resource

>>> RaSca <rasca [at] miamammausalinux> schrieb am 21.08.2012 um 18:05 in Nachricht
<5033B1C0.4010302 [at] miamammausalinux>:
> Hi all,
> I'm trying to use the mysql resource agent to manage a setup with one
> master and two slaves. This is the configuration of the mysql resource
> and the master/slave one:
>
> primitive resMySQL ocf:custom:mysql \
> params binary="/usr/bin/mysqld_safe" config="/etc/my.cnf"
> datadir="/var/lib/mysql" user="mysql" replication_user="myuser"
> replication_passwd="mypassword" \
> op start interval="0" timeout="120" \
> op stop interval="0" timeout="120" \
> op promote interval="0" timeout="120" \
> op demote interval="0" timeout="120" \
> op monitor interval="10" role="Master" timeout="30" \
> op monitor interval="10" role="Slave" timeout="30"

Hi!

Amazingly the "primary key" (=ID) of the monitor operations is built using the "ineterval", not the "role". So if you have to monitor operations with the same interval, you have a resource conflict. It's documented, although it's a sick concept...

Decide whose fault it is... yours or the CRMs...

Regards,
Ulrich

> ms ms_resMySQL resMySQL \
> meta master-max="1" master-node-max="1" clone-node-max="1"
> clone-max="3" notify="true" globally-unique="false"
>
> The problem is that I see from the logs some errors like these:
>
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Operation resMySQL-monitor-10-0 is a duplicate of
> resMySQL-monitor-10
> Aug 21 15:24:53 domU-12-31-39-0C-1A-2B pengine: [3816]: ERROR:
> is_op_dup: Do not use the same (name, interval) combination more than
> once per resource
>
> and in fact, even if I manually kill the process in a node, the cluster
> isn't aware and does not react.
>
> What is wrong with this ms resource?





_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


rasca at miamammausalinux

Aug 22, 2012, 1:08 AM

Post #2 of 7 (288 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

Il giorno Mer 22 Ago 2012 09:00:55 CEST, Ulrich Windl ha scritto:
[...]
> Hi!
> Amazingly the "primary key" (=ID) of the monitor operations is built using the "ineterval", not the "role". So if you have to monitor operations with the same interval, you have a resource conflict. It's documented, although it's a sick concept...
> Decide whose fault it is... yours or the CRMs...
> Regards,
> Ulrich

Thank you Ulrich,
As far as you know, Is there a way to override the ID for each cloned
instance of the mysql resource? How can I resolve the problem?

--
RaSca
Mia Mamma Usa Linux: Niente è impossibile da capire, se lo spieghi bene!
rasca [at] miamammausalinux
http://www.miamammausalinux.org
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


lmb at suse

Aug 22, 2012, 1:11 AM

Post #3 of 7 (288 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

On 2012-08-22T10:08:14, RaSca <rasca [at] miamammausalinux> wrote:

> Thank you Ulrich,
> As far as you know, Is there a way to override the ID for each cloned
> instance of the mysql resource? How can I resolve the problem?

Just make the intervals slightly different - 31s, 30s, 29s ...


Regards,
Lars

--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


rasca at miamammausalinux

Aug 22, 2012, 1:32 AM

Post #4 of 7 (287 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

Il giorno Mer 22 Ago 2012 10:11:52 CEST, Lars Marowsky-Bree ha scritto:
> Just make the intervals slightly different - 31s, 30s, 29s ...
> Regards,
> Lars

Thank you Lars,
In fact, this is what I've done and now everything is ok. But I want to
understand one last thing: if the ID is calculated with the value of
interval then why I don't have errors even if I've got two slaves, which
means that I've got two identical intervals?

I hope to have made myself clear.

Thanks a lot,

--
RaSca
Mia Mamma Usa Linux: Niente è impossibile da capire, se lo spieghi bene!
rasca [at] miamammausalinux
http://www.miamammausalinux.org
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


lmb at suse

Aug 22, 2012, 1:35 AM

Post #5 of 7 (288 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

On 2012-08-22T10:32:57, RaSca <rasca [at] miamammausalinux> wrote:

> Thank you Lars,
> In fact, this is what I've done and now everything is ok. But I want to
> understand one last thing: if the ID is calculated with the value of
> interval then why I don't have errors even if I've got two slaves, which
> means that I've got two identical intervals?

Because they have a different resource id internally.

That the interval for the master and the slave role can't be identical
is a pacemaker/lrmd bug, I think. (Or rather, obscure enough to
configure that it might well be a bug.) It'd be trivial to just append
the role to the operation key too. (It'd cause a few monitors to be
recreated on update, but that'd be harmless.)


Regards,
Lars

--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


andrew at beekhof

Sep 4, 2012, 2:54 PM

Post #6 of 7 (255 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

On Wed, Aug 22, 2012 at 6:35 PM, Lars Marowsky-Bree <lmb [at] suse> wrote:
> On 2012-08-22T10:32:57, RaSca <rasca [at] miamammausalinux> wrote:
>
>> Thank you Lars,
>> In fact, this is what I've done and now everything is ok. But I want to
>> understand one last thing: if the ID is calculated with the value of
>> interval then why I don't have errors even if I've got two slaves, which
>> means that I've got two identical intervals?
>
> Because they have a different resource id internally.
>
> That the interval for the master and the slave role can't be identical
> is a pacemaker/lrmd bug, I think.

A design limitation in the PE that we never got around to doing anything about.

> (Or rather, obscure enough to
> configure that it might well be a bug.) It'd be trivial to just append
> the role to the operation key too. (It'd cause a few monitors to be
> recreated on update, but that'd be harmless.)

Not really that trivial actually. Much of the crmd and pengine would
be affected.

>
>
> Regards,
> Lars
>
> --
> Architect Storage/HA
> SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


lmb at suse

Sep 5, 2012, 12:14 AM

Post #7 of 7 (254 views)
Permalink
Re: Antw: Duplicate monitor operation on a multi state resource [In reply to]

On 2012-09-05T07:54:46, Andrew Beekhof <andrew [at] beekhof> wrote:

> > (Or rather, obscure enough to configure that it might well be a
> > bug.) It'd be trivial to just append the role to the operation key
> > too. (It'd cause a few monitors to be recreated on update, but
> > that'd be harmless.)
> Not really that trivial actually. Much of the crmd and pengine would
> be affected.

Yes, but what I meant was not so much the internal code but the upgrade
case. It's not a high priority, but one of these days ... ;-)


Regards,
Lars

--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.