Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux-HA: Users

Re: tengine (was Re: Re: Linux-HA Digest, Vol 24, Issue 45)

 

 

Linux-HA users RSS feed   Index | Next | Previous | View Threaded


veberhardt at amadeus

Nov 15, 2005, 12:36 AM

Post #1 of 3 (565 views)
Permalink
Re: tengine (was Re: Re: Linux-HA Digest, Vol 24, Issue 45)

>>
>> Hi!
>>
>> I run heartbeat-2.0.1-1 on a 3node linux 2.6 cluster having configured
>> 2 resource groups (I've attached the cibadmin -Q to this email)
>>
>> Two things I experience:
>>
>> 1. is that the tengine process goes sometimes probably in to a loop
>> using as much CPU as it gets.
>
snip...
>
> Can you use gdb to attach to the process and post a backtrace when this
> happens?
>
> thanks
> -Guochun


Sorry for the subject 8-(

Some additional info: I did a failover node3 -> node2 by stopping the
heartbeat on node3. After node3:/../heartbeat start the resource moved
back to node3 but there is no "tengine" process there running!?

Here is the gdb backtrace:

muctam02:~> sudo gdb /usr/lib/heartbeat/tengine 31593
GNU gdb 6.3
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you
are
welcome to change it and/or distribute copies of it under certain
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for
details.
This GDB was configured as "i586-suse-linux"...Using host libthread_db
library "/lib/tls/libthread_db.so.1".

Attaching to program: /usr/lib/heartbeat/tengine, process 31593
Reading symbols from /usr/lib/libplumb.so.1...done.
Loaded symbols for /usr/lib/libplumb.so.1
Reading symbols from /usr/lib/libpils.so.1...done.
Loaded symbols for /usr/lib/libpils.so.1
Reading symbols from /usr/lib/libcrmcommon.so.0...done.
Loaded symbols for /usr/lib/libcrmcommon.so.0
Reading symbols from /usr/lib/libcib.so.0...done.
Loaded symbols for /usr/lib/libcib.so.0
Reading symbols from /usr/lib/libapphb.so.0...done.
Loaded symbols for /usr/lib/libapphb.so.0
Reading symbols from /usr/lib/libhbclient.so.1...done.
Loaded symbols for /usr/lib/libhbclient.so.1
Reading symbols from /usr/lib/libstonithd.so.0...done.
Loaded symbols for /usr/lib/libstonithd.so.0
Reading symbols from /opt/gnome/lib/libglib-2.0.so.0...done.
Loaded symbols for /opt/gnome/lib/libglib-2.0.so.0
Reading symbols from /usr/lib/libbz2.so.1...done.
Loaded symbols for /usr/lib/libbz2.so.1
Reading symbols from /lib/libz.so.1...done.
Loaded symbols for /lib/libz.so.1
Reading symbols from /lib/tls/libc.so.6...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/libuuid.so.1...done.
Loaded symbols for /lib/libuuid.so.1
Reading symbols from /lib/tls/librt.so.1...done.
Loaded symbols for /lib/tls/librt.so.1
Reading symbols from /usr/lib/libltdl.so.3...done.
Loaded symbols for /usr/lib/libltdl.so.3
Reading symbols from /lib/libdl.so.2...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
Reading symbols from /lib/tls/libpthread.so.0...done.
Loaded symbols for /lib/tls/libpthread.so.0
0xffffe410 in ?? ()
(gdb) bt
#0 0xffffe410 in ?? ()
#1 0xbfffef48 in ?? ()
#2 0x000003e8 in ?? ()
#3 0x00000004 in ?? ()
#4 0x401ee91d in poll () from /lib/tls/libc.so.6
#5 0x400c2c73 in g_main_context_acquire ()
from /opt/gnome/lib/libglib-2.0.so.0
#6 0x400c3cf7 in g_main_loop_run () from /opt/gnome/lib/libglib-2.0.so.0
#7 0x0804f8db in init_start () at main.c:182
#8 0x0804f690 in main (argc=1, argv=0xbffff0d4) at main.c:103


Thanks
Valentin


alanr at unix

Nov 15, 2005, 12:51 AM

Post #2 of 3 (524 views)
Permalink
Re: Re: tengine (was Re: Re: Linux-HA Digest, Vol 24, Issue 45) [In reply to]

Valentin Eberhardt wrote:
>
> >>
> >> Hi!
> >>
> >> I run heartbeat-2.0.1-1 on a 3node linux 2.6 cluster having configured
> >> 2 resource groups (I've attached the cibadmin -Q to this email)
> >>
> >> Two things I experience:
> >>
> >> 1. is that the tengine process goes sometimes probably in to a loop
> >> using as much CPU as it gets.
> >
> snip...
> >
> > Can you use gdb to attach to the process and post a backtrace when this
> > happens?
> >
> > thanks
> > -Guochun
>
>
> Sorry for the subject 8-(
>
> Some additional info: I did a failover node3 -> node2 by stopping the
> heartbeat on node3. After node3:/../heartbeat start the resource moved
> back to node3 but there is no "tengine" process there running!?


tengine and pengine only run on the DC (designated controller)

--
Alan Robertson <alanr [at] unix>

"Openness is the foundation and preservative of friendship... Let me
claim from you at all times your undisguised opinions." - William
Wilberforce
_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems


gshi at ncsa

Nov 15, 2005, 8:57 PM

Post #3 of 3 (517 views)
Permalink
Re: Re: tengine (was Re: Re: Linux-HA Digest, Vol 24, Issue 45) [In reply to]

This back trace does not help much.
I have created a bug for it
(http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=961). You can
register an account in bugzilla and
add yourself into cc list. Once Andrew is back from vacation, he shall
be able to work on the problem or give a clue what might have been wrong
in message layer.

thanks
-Guochun

Valentin Eberhardt wrote:

>
> >>
> >> Hi!
> >>
> >> I run heartbeat-2.0.1-1 on a 3node linux 2.6 cluster having configured
> >> 2 resource groups (I've attached the cibadmin -Q to this email)
> >>
> >> Two things I experience:
> >>
> >> 1. is that the tengine process goes sometimes probably in to a loop
> >> using as much CPU as it gets.
> >
> snip...
> >
> > Can you use gdb to attach to the process and post a backtrace when this
> > happens?
> >
> > thanks
> > -Guochun
>
>
> Sorry for the subject 8-(
>
> Some additional info: I did a failover node3 -> node2 by stopping the
> heartbeat on node3. After node3:/../heartbeat start the resource moved
> back to node3 but there is no "tengine" process there running!?
>
> Here is the gdb backtrace:
>
> muctam02:~> sudo gdb /usr/lib/heartbeat/tengine 31593
> GNU gdb 6.3
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and
> you are
> welcome to change it and/or distribute copies of it under certain
> conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB. Type "show warranty" for
> details.
> This GDB was configured as "i586-suse-linux"...Using host libthread_db
> library "/lib/tls/libthread_db.so.1".
>
> Attaching to program: /usr/lib/heartbeat/tengine, process 31593
> Reading symbols from /usr/lib/libplumb.so.1...done.
> Loaded symbols for /usr/lib/libplumb.so.1
> Reading symbols from /usr/lib/libpils.so.1...done.
> Loaded symbols for /usr/lib/libpils.so.1
> Reading symbols from /usr/lib/libcrmcommon.so.0...done.
> Loaded symbols for /usr/lib/libcrmcommon.so.0
> Reading symbols from /usr/lib/libcib.so.0...done.
> Loaded symbols for /usr/lib/libcib.so.0
> Reading symbols from /usr/lib/libapphb.so.0...done.
> Loaded symbols for /usr/lib/libapphb.so.0
> Reading symbols from /usr/lib/libhbclient.so.1...done.
> Loaded symbols for /usr/lib/libhbclient.so.1
> Reading symbols from /usr/lib/libstonithd.so.0...done.
> Loaded symbols for /usr/lib/libstonithd.so.0
> Reading symbols from /opt/gnome/lib/libglib-2.0.so.0...done.
> Loaded symbols for /opt/gnome/lib/libglib-2.0.so.0
> Reading symbols from /usr/lib/libbz2.so.1...done.
> Loaded symbols for /usr/lib/libbz2.so.1
> Reading symbols from /lib/libz.so.1...done.
> Loaded symbols for /lib/libz.so.1
> Reading symbols from /lib/tls/libc.so.6...done.
> Loaded symbols for /lib/tls/libc.so.6
> Reading symbols from /lib/libuuid.so.1...done.
> Loaded symbols for /lib/libuuid.so.1
> Reading symbols from /lib/tls/librt.so.1...done.
> Loaded symbols for /lib/tls/librt.so.1
> Reading symbols from /usr/lib/libltdl.so.3...done.
> Loaded symbols for /usr/lib/libltdl.so.3
> Reading symbols from /lib/libdl.so.2...done.
> Loaded symbols for /lib/libdl.so.2
> Reading symbols from /lib/ld-linux.so.2...done.
> Loaded symbols for /lib/ld-linux.so.2
> Reading symbols from /lib/tls/libpthread.so.0...done.
> Loaded symbols for /lib/tls/libpthread.so.0
> 0xffffe410 in ?? ()
> (gdb) bt
> #0 0xffffe410 in ?? ()
> #1 0xbfffef48 in ?? ()
> #2 0x000003e8 in ?? ()
> #3 0x00000004 in ?? ()
> #4 0x401ee91d in poll () from /lib/tls/libc.so.6
> #5 0x400c2c73 in g_main_context_acquire ()
> from /opt/gnome/lib/libglib-2.0.so.0
> #6 0x400c3cf7 in g_main_loop_run () from /opt/gnome/lib/libglib-2.0.so.0
> #7 0x0804f8db in init_start () at main.c:182
> #8 0x0804f690 in main (argc=1, argv=0xbffff0d4) at main.c:103
>
>
> Thanks
> Valentin
>
>------------------------------------------------------------------------
>
>_______________________________________________
>Linux-HA mailing list
>Linux-HA [at] lists
>http://lists.linux-ha.org/mailman/listinfo/linux-ha
>See also: http://linux-ha.org/ReportingProblems
>

_______________________________________________
Linux-HA mailing list
Linux-HA [at] lists
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Linux-HA users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.