Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

hb 2.0.8: monitor operation - restart of clone resources [wd-vc]

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


Rainer.Brunold at allianz

Jul 8, 2008, 5:34 AM

Post #1 of 5 (113 views)
Permalink
hb 2.0.8: monitor operation - restart of clone resources [wd-vc]

Hello,

I'm running heartbeat-2.0.8-0.19 on sles 10 sp1 and have some problems with
clone resources when the OCF resources script reports OCF_NOT_RUNNING.

For a JBoss installation we have configured a two node cluster with several
clone resources where each clone can have a maximum of two instances and
one per node. So we will have exact one instance per server.

The monitor operation for this clone runs every ten seconds and check if
the instance is up and running. If it detects a failure it does a restart.

When I kill the instance on one of the two nodes it happens frequently that
the resource on the other node also get's restarted even it was up and
running before.

Am I wrong with my cluster design or is here something going wrong ?

Thanks,
Rainer

Allianz Elementar Versicherungs-Aktiengesellschaft
A-1130 Wien, Hietzinger Kai 101-105
FN 34004g, Handelsgericht Wien
UID: ATU 1536 4406; DVR: 0003565
http://www.allianz.at

********************************************************
Dieses E-Mail und allfaellig daran angeschlossene Anhaenge
enthalten Informationen, die vertraulich und
ausschliesslich fuer den (die) bezeichneten Adressaten
bestimmt sind.
Wenn Sie nicht der genannte Adressat sind, darf dieses
E-Mail samt allfaelliger Anhaenge von Ihnen weder anderen
Personen zugaenglich gemacht noch in anderer Weise
verwertet werden.
Wenn Sie nicht der beabsichtigte Empfaenger sind, bitten
wir Sie, dieses E-Mail und saemtliche angeschlossene
Anhaenge zu loeschen.

Please note: This email and any files transmitted with it is
intended only for the named recipients and may contain
confidential and/or privileged information. If you are not the
intended recipient, please do not read, copy, use or disclose
the contents of this communication to others and notify the
sender immediately. Then please delete the email and any
copies of it. Thank you.
********************************************************

_______________________________________________
Linux-HA mailing list
Linux-HA[at]lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


beekhof at gmail

Jul 8, 2008, 5:40 AM

Post #2 of 5 (108 views)
Permalink
Re: hb 2.0.8: monitor operation - restart of clone resources [wd-vc] [In reply to]

On Tue, Jul 8, 2008 at 14:34, <Rainer.Brunold[at]allianz.at> wrote:
>
> Hello,
>
> I'm running heartbeat-2.0.8-0.19 on sles 10 sp1 and have some problems with
> clone resources when the OCF resources script reports OCF_NOT_RUNNING.
>
> For a JBoss installation we have configured a two node cluster with several
> clone resources where each clone can have a maximum of two instances and
> one per node. So we will have exact one instance per server.
>
> The monitor operation for this clone runs every ten seconds and check if
> the instance is up and running. If it detects a failure it does a restart.
>
> When I kill the instance on one of the two nodes it happens frequently that
> the resource on the other node also get's restarted even it was up and
> running before.
>
> Am I wrong with my cluster design or is here something going wrong ?

No, this is a bug that has been fixed since then.
It will part of a refresh of the Heartbeat package for SP2 in a week or two.

If you have a support contract with SUSE/Novell you could request
early access to this version.
_______________________________________________
Linux-HA mailing list
Linux-HA[at]lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Rainer.Brunold at allianz

Jul 8, 2008, 6:17 AM

Post #3 of 5 (108 views)
Permalink
Re: hb 2.0.8: monitor operation - restart of clone resources [wd-vc] [In reply to]

>> Hello,
>>
>> I'm running heartbeat-2.0.8-0.19 on sles 10 sp1 and have some problems
with
>> clone resources when the OCF resources script reports OCF_NOT_RUNNING.
>>
>> For a JBoss installation we have configured a two node cluster with
several
>> clone resources where each clone can have a maximum of two instances and
>> one per node. So we will have exact one instance per server.
>>
>> The monitor operation for this clone runs every ten seconds and check if
>> the instance is up and running. If it detects a failure it does a
restart.
>>
>> When I kill the instance on one of the two nodes it happens frequently
that
>> the resource on the other node also get's restarted even it was up and
>> running before.
>>
>> Am I wrong with my cluster design or is here something going wrong ?
>
>No, this is a bug that has been fixed since then.
>It will part of a refresh of the Heartbeat package for SP2 in a week or
two.
>
>If you have a support contract with SUSE/Novell you could request
>early access to this version.

Thank you Andrew,

I have full access to Novell's bugzilla. Can you give me the bug number ?

Rainer

Allianz Elementar Versicherungs-Aktiengesellschaft
A-1130 Wien, Hietzinger Kai 101-105
FN 34004g, Handelsgericht Wien
UID: ATU 1536 4406; DVR: 0003565
http://www.allianz.at

********************************************************
Dieses E-Mail und allfaellig daran angeschlossene Anhaenge
enthalten Informationen, die vertraulich und
ausschliesslich fuer den (die) bezeichneten Adressaten
bestimmt sind.
Wenn Sie nicht der genannte Adressat sind, darf dieses
E-Mail samt allfaelliger Anhaenge von Ihnen weder anderen
Personen zugaenglich gemacht noch in anderer Weise
verwertet werden.
Wenn Sie nicht der beabsichtigte Empfaenger sind, bitten
wir Sie, dieses E-Mail und saemtliche angeschlossene
Anhaenge zu loeschen.

Please note: This email and any files transmitted with it is
intended only for the named recipients and may contain
confidential and/or privileged information. If you are not the
intended recipient, please do not read, copy, use or disclose
the contents of this communication to others and notify the
sender immediately. Then please delete the email and any
copies of it. Thank you.
********************************************************

_______________________________________________
Linux-HA mailing list
Linux-HA[at]lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


beekhof at gmail

Jul 8, 2008, 8:10 AM

Post #4 of 5 (108 views)
Permalink
Re: hb 2.0.8: monitor operation - restart of clone resources [wd-vc] [In reply to]

On Tue, Jul 8, 2008 at 15:17, <Rainer.Brunold[at]allianz.at> wrote:
>>No, this is a bug that has been fixed since then.
>>It will part of a refresh of the Heartbeat package for SP2 in a week or
> two.
>>
>>If you have a support contract with SUSE/Novell you could request
>>early access to this version.
>
> Thank you Andrew,
>
> I have full access to Novell's bugzilla. Can you give me the bug number ?

I'd love to, but I can't seem to find it
_______________________________________________
Linux-HA mailing list
Linux-HA[at]lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


beekhof at gmail

Jul 9, 2008, 11:43 AM

Post #5 of 5 (94 views)
Permalink
Re: hb 2.0.8: monitor operation - restart of clone resources [wd-vc] [In reply to]

On Tue, Jul 8, 2008 at 17:10, Andrew Beekhof <beekhof[at]gmail.com> wrote:
> On Tue, Jul 8, 2008 at 15:17, <Rainer.Brunold[at]allianz.at> wrote:
>>>No, this is a bug that has been fixed since then.
>>>It will part of a refresh of the Heartbeat package for SP2 in a week or
>> two.
>>>
>>>If you have a support contract with SUSE/Novell you could request
>>>early access to this version.
>>
>> Thank you Andrew,
>>
>> I have full access to Novell's bugzilla. Can you give me the bug number ?
>
> I'd love to, but I can't seem to find it

aha! found it. https://bugzilla.novell.com/show_bug.cgi?id=347004
_______________________________________________
Linux-HA mailing list
Linux-HA[at]lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact lists@gossamer-threads.com
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.