Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

Strange issues with my ha setup

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


shadus at gmail

Aug 6, 2009, 8:01 AM

Post #1 of 5 (922 views)
Permalink
Strange issues with my ha setup

I've setup ha a couple times in the past and had no serious issues but they
were simple setups... this is fairly simple also at least i thought so until
it exploded :) This is just simple heartbeat.

If i start heartbeat on mach2.domain.tld it brings up the ip addresses its
preferred for... right until i start mach1's heartbeat at which point it
takes them all down and mach1's ip addresses never come up. I'm seeing this
on mach1.

ResourceManager[7853]: 2009/08/06_10:03:23 ERROR: Cannot locate resource
script mach2.domain.tld
ResourceManager[7853]: 2009/08/06_10:03:24 info: Retrying failed stop
operation [mach2.domain.tld]

Furthermore the ip addresses on mach1 never come up due to the above error
at least in part.

Ideas?

ha.cf
--------
logfile /var/log/ha-log
logfacility local0
udpport 694
keepalive 1
warntime 3
deadtime 6
initdead 30
bcast eth0
auto_failback on
node mach1.domain.tld
node mach2.domain.tld

haresources
-------
mach1.domain.tld 12.34.56.78/21 12.34.56.77/21 12.34.56.75/21 named
mach2.domain.tld 12.34.56.79/21 12.34.56.76/21 12.34.56.74/21
12.34.56.73/21named

hosts
-------
127.0.0.1 localhost.localdomain localhost
12.23.56.70 mach1.domain.tld
12.23.56.71 mach2.domain.tld

resolv.conf
------
search domain.tld
nameserver 127.0.0.1
nameserver 12.34.56.78
nameserver 12.34.56.79



--

Shadus
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


dejanmm at fastmail

Oct 1, 2009, 1:02 AM

Post #2 of 5 (706 views)
Permalink
Re: Strange issues with my ha setup [In reply to]

Hi,

On Wed, Sep 30, 2009 at 09:09:24PM -0400, Shadus wrote:
> I posted this to the list a month or so ago, but had no responses and
> it kind of dropped off the radar because of more important issues with
> a san/vmware cluster, but now I need to revisit it and I've come up
> with little to be able to help me figure out where the problem is
> originating precisely. Any advice or help would be greatly
> appreciated.
>
> I've setup ha a couple times in the past and had no serious issues but
> they were simple setups... this is fairly simple also at least i
> thought so until it exploded :)  This is just simple heartbeat.
>
> If i start heartbeat on mach2.domain.tld it brings up the ip addresses
> its preferred for... right until i start mach1's heartbeat at which
> point it takes them all down and mach1's ip addresses never come up.
> I'm seeing this on mach1.
>
> ResourceManager[7853]:  2009/08/06_10:03:23 ERROR: Cannot locate
> resource script mach2.domain.tld

heartbeat thinks that your node is a resource. Check the format
of haresources. They look fine as pasted here, but I can't see
any other explanation.

Thanks,

Dejan

> ResourceManager[7853]:  2009/08/06_10:03:24 info: Retrying failed stop
> operation [mach2.domain.tld]
>
> Furthermore the ip addresses on mach1 never come up due to the above
> error at least in part.
>
> Ideas?
>
> ha.cf
> --------
> logfile /var/log/ha-log
> logfacility local0
> udpport 694
> keepalive 1
> warntime 3
> deadtime 6
> initdead 30
> bcast eth0
> auto_failback on
> node mach1.domain.tld
> node mach2.domain.tld
>
> haresources
> -------
> mach1.domain.tld 12.34.56.78/21 12.34.56.77/21 12.34.56.75/21 named
> mach2.domain.tld 12.34.56.79/21 12.34.56.76/21 12.34.56.74/21
> 55.23.41.73/24 named
>
> hosts
> -------
> 127.0.0.1 localhost.localdomain localhost
> 12.23.56.70 mach1.domain.tld
> 12.23.56.71 mach2.domain.tld
>
> resolv.conf
> ------
> search domain.tld
> nameserver 127.0.0.1
> nameserver 12.34.56.78
> nameserver 12.34.56.79
>
> --
> Shadus
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


timothy.carr at foxtrail

Oct 1, 2009, 6:02 AM

Post #3 of 5 (692 views)
Permalink
Re: Strange issues with my ha setup [In reply to]

Hi

Which version of Linux Heartbeat are you using ?

What has worked well for me is Linux HA v2. Ensure that you setup the hosts
correct like you have setup in your hosts file. I make use of "hb_gui" to
setup my resources
Make use of the VIP resource agent for your virtual ip addressing. I make
use of 2 nic's for the public / private setup

Hope this helps

Tim


On Thu, Oct 1, 2009 at 3:09 AM, Shadus <shadus [at] gmail> wrote:

> I posted this to the list a month or so ago, but had no responses and
> it kind of dropped off the radar because of more important issues with
> a san/vmware cluster, but now I need to revisit it and I've come up
> with little to be able to help me figure out where the problem is
> originating precisely. Any advice or help would be greatly
> appreciated.
>
> I've setup ha a couple times in the past and had no serious issues but
> they were simple setups... this is fairly simple also at least i
> thought so until it exploded :) This is just simple heartbeat.
>
> If i start heartbeat on mach2.domain.tld it brings up the ip addresses
> its preferred for... right until i start mach1's heartbeat at which
> point it takes them all down and mach1's ip addresses never come up.
> I'm seeing this on mach1.
>
> ResourceManager[7853]: 2009/08/06_10:03:23 ERROR: Cannot locate
> resource script mach2.domain.tld
> ResourceManager[7853]: 2009/08/06_10:03:24 info: Retrying failed stop
> operation [mach2.domain.tld]
>
> Furthermore the ip addresses on mach1 never come up due to the above
> error at least in part.
>
> Ideas?
>
> ha.cf
> --------
> logfile /var/log/ha-log
> logfacility local0
> udpport 694
> keepalive 1
> warntime 3
> deadtime 6
> initdead 30
> bcast eth0
> auto_failback on
> node mach1.domain.tld
> node mach2.domain.tld
>
> haresources
> -------
> mach1.domain.tld 12.34.56.78/21 12.34.56.77/21 12.34.56.75/21 named
> mach2.domain.tld 12.34.56.79/21 12.34.56.76/21 12.34.56.74/21
> 55.23.41.73/24 named
>
> hosts
> -------
> 127.0.0.1 localhost.localdomain localhost
> 12.23.56.70 mach1.domain.tld
> 12.23.56.71 mach2.domain.tld
>
> resolv.conf
> ------
> search domain.tld
> nameserver 127.0.0.1
> nameserver 12.34.56.78
> nameserver 12.34.56.79
>
> --
> Shadus
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>



--
Timothy Carr
Technical Specialist
University of Cape Town
Cell: +27834572568
Fax: +27865472190
Gtalk: timothy.carr [at] foxtrail
Skype: timothy.carr.foxtrail
Sent from Cape Town, Western Cape, South Africa
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


Richard.Marshall at Arbella

Oct 1, 2009, 6:29 AM

Post #4 of 5 (701 views)
Permalink
Re: Strange issues with my ha setup [In reply to]

We use SLES 10.1 (SP1) HA version 2.8.
Things work OK, but when something goes wrong it very difficult to
isolate the issue, HA log entries are cryptic and trying to find what
they mean is next to impossible.


_________________________________________________
Richard Marshall | Senior Technical Specialist | Arbella Insurance Group


1900 Crown Colony Drive | Quincy, MA 02269 | ': 617.328.2921| 7:
617.515.2491 | *: Richard.Marshall [at] Arbella


-----Original Message-----
From: linux-ha-bounces [at] lists
[mailto:linux-ha-bounces [at] lists] On Behalf Of Timothy Carr
Sent: Thursday, October 01, 2009 9:03 AM
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] Strange issues with my ha setup

Hi

Which version of Linux Heartbeat are you using ?

What has worked well for me is Linux HA v2. Ensure that you setup the
hosts correct like you have setup in your hosts file. I make use of
"hb_gui" to setup my resources Make use of the VIP resource agent for
your virtual ip addressing. I make use of 2 nic's for the public /
private setup

Hope this helps

Tim


On Thu, Oct 1, 2009 at 3:09 AM, Shadus <shadus [at] gmail> wrote:

> I posted this to the list a month or so ago, but had no responses and
> it kind of dropped off the radar because of more important issues with

> a san/vmware cluster, but now I need to revisit it and I've come up
> with little to be able to help me figure out where the problem is
> originating precisely. Any advice or help would be greatly
> appreciated.
>
> I've setup ha a couple times in the past and had no serious issues but

> they were simple setups... this is fairly simple also at least i
> thought so until it exploded :) This is just simple heartbeat.
>
> If i start heartbeat on mach2.domain.tld it brings up the ip addresses

> its preferred for... right until i start mach1's heartbeat at which
> point it takes them all down and mach1's ip addresses never come up.
> I'm seeing this on mach1.
>
> ResourceManager[7853]: 2009/08/06_10:03:23 ERROR: Cannot locate
> resource script mach2.domain.tld
> ResourceManager[7853]: 2009/08/06_10:03:24 info: Retrying failed stop

> operation [mach2.domain.tld]
>
> Furthermore the ip addresses on mach1 never come up due to the above
> error at least in part.
>
> Ideas?
>
> ha.cf
> --------
> logfile /var/log/ha-log
> logfacility local0
> udpport 694
> keepalive 1
> warntime 3
> deadtime 6
> initdead 30
> bcast eth0
> auto_failback on
> node mach1.domain.tld
> node mach2.domain.tld
>
> haresources
> -------
> mach1.domain.tld 12.34.56.78/21 12.34.56.77/21 12.34.56.75/21 named
> mach2.domain.tld 12.34.56.79/21 12.34.56.76/21 12.34.56.74/21
> 55.23.41.73/24 named
>
> hosts
> -------
> 127.0.0.1 localhost.localdomain localhost 12.23.56.70 mach1.domain.tld
> 12.23.56.71 mach2.domain.tld
>
> resolv.conf
> ------
> search domain.tld
> nameserver 127.0.0.1
> nameserver 12.34.56.78
> nameserver 12.34.56.79
>
> --
> Shadus
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>



--
Timothy Carr
Technical Specialist
University of Cape Town
Cell: +27834572568
Fax: +27865472190
Gtalk: timothy.carr [at] foxtrail
Skype: timothy.carr.foxtrail
Sent from Cape Town, Western Cape, South Africa
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
This email message is intended only for the addressee(s) and contains information that may be confidential.
If you are not the intended recipient please notify the sender by reply email and immediately delete this message.
Use, disclosure or reproduction of this email by anyone other than the intended recipient(s) is strictly prohibited.

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


timothy.carr at foxtrail

Oct 1, 2009, 2:43 PM

Post #5 of 5 (683 views)
Permalink
Re: Strange issues with my ha setup [In reply to]

Had the same problem as well at first but thats why you need to play with
the applications for a while to iron out your understanding before running
in prd :)
I been running IBM DB2 with Linux HA and it works well but not after weeks
of headaches. Documentation is not 100%, i'll give you that but its
sufficient to get you to the end :)

Tim



On Thu, Oct 1, 2009 at 3:29 PM, Marshall, Richard <
Richard.Marshall [at] arbella> wrote:

> We use SLES 10.1 (SP1) HA version 2.8.
> Things work OK, but when something goes wrong it very difficult to
> isolate the issue, HA log entries are cryptic and trying to find what
> they mean is next to impossible.
>
>
> _________________________________________________
> Richard Marshall | Senior Technical Specialist | Arbella Insurance Group
>
>
> 1900 Crown Colony Drive | Quincy, MA 02269 | ': 617.328.2921| 7:
> 617.515.2491 | *: Richard.Marshall [at] Arbella
>
>
> -----Original Message-----
> From: linux-ha-bounces [at] lists
> [mailto:linux-ha-bounces [at] lists] On Behalf Of Timothy Carr
> Sent: Thursday, October 01, 2009 9:03 AM
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] Strange issues with my ha setup
>
> Hi
>
> Which version of Linux Heartbeat are you using ?
>
> What has worked well for me is Linux HA v2. Ensure that you setup the
> hosts correct like you have setup in your hosts file. I make use of
> "hb_gui" to setup my resources Make use of the VIP resource agent for
> your virtual ip addressing. I make use of 2 nic's for the public /
> private setup
>
> Hope this helps
>
> Tim
>
>
> On Thu, Oct 1, 2009 at 3:09 AM, Shadus <shadus [at] gmail> wrote:
>
> > I posted this to the list a month or so ago, but had no responses and
> > it kind of dropped off the radar because of more important issues with
>
> > a san/vmware cluster, but now I need to revisit it and I've come up
> > with little to be able to help me figure out where the problem is
> > originating precisely. Any advice or help would be greatly
> > appreciated.
> >
> > I've setup ha a couple times in the past and had no serious issues but
>
> > they were simple setups... this is fairly simple also at least i
> > thought so until it exploded :) This is just simple heartbeat.
> >
> > If i start heartbeat on mach2.domain.tld it brings up the ip addresses
>
> > its preferred for... right until i start mach1's heartbeat at which
> > point it takes them all down and mach1's ip addresses never come up.
> > I'm seeing this on mach1.
> >
> > ResourceManager[7853]: 2009/08/06_10:03:23 ERROR: Cannot locate
> > resource script mach2.domain.tld
> > ResourceManager[7853]: 2009/08/06_10:03:24 info: Retrying failed stop
>
> > operation [mach2.domain.tld]
> >
> > Furthermore the ip addresses on mach1 never come up due to the above
> > error at least in part.
> >
> > Ideas?
> >
> > ha.cf
> > --------
> > logfile /var/log/ha-log
> > logfacility local0
> > udpport 694
> > keepalive 1
> > warntime 3
> > deadtime 6
> > initdead 30
> > bcast eth0
> > auto_failback on
> > node mach1.domain.tld
> > node mach2.domain.tld
> >
> > haresources
> > -------
> > mach1.domain.tld 12.34.56.78/21 12.34.56.77/21 12.34.56.75/21 named
> > mach2.domain.tld 12.34.56.79/21 12.34.56.76/21 12.34.56.74/21
> > 55.23.41.73/24 named
> >
> > hosts
> > -------
> > 127.0.0.1 localhost.localdomain localhost 12.23.56.70 mach1.domain.tld
> > 12.23.56.71 mach2.domain.tld
> >
> > resolv.conf
> > ------
> > search domain.tld
> > nameserver 127.0.0.1
> > nameserver 12.34.56.78
> > nameserver 12.34.56.79
> >
> > --
> > Shadus
> > _______________________________________________
> > Linux-HA mailing list
> > Linux-HA [at] lists
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
>
>
>
> --
> Timothy Carr
> Technical Specialist
> University of Cape Town
> Cell: +27834572568
> Fax: +27865472190
> Gtalk: timothy.carr [at] foxtrail
> Skype: timothy.carr.foxtrail
> Sent from Cape Town, Western Cape, South Africa
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
> This email message is intended only for the addressee(s) and contains
> information that may be confidential.
> If you are not the intended recipient please notify the sender by reply
> email and immediately delete this message.
> Use, disclosure or reproduction of this email by anyone other than the
> intended recipient(s) is strictly prohibited.
>
> _______________________________________________
> Linux-HA mailing list
> Linux-HA [at] lists
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
>



--
Timothy Carr
Technical Specialist
University of Cape Town
Cell: +27834572568
Fax: +27865472190
Gtalk: timothy.carr [at] foxtrail
Skype: timothy.carr.foxtrail
Sent from Cape Town, Western Cape, South Africa
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.