Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Linux: Kernel

BUG: spinlock recursion on CPU, kernel 2.6.16.20 & 2.6.16-1.2122_FC5[smp]

 

 

Linux kernel RSS feed   Index | Next | Previous | View Threaded


konstantin at antselovich

Jun 15, 2006, 10:05 AM

Post #1 of 3 (294 views)
Permalink
BUG: spinlock recursion on CPU, kernel 2.6.16.20 & 2.6.16-1.2122_FC5[smp]

pls CC me, I'm not on the list.

## Added on Jun-15-2006
## I posted this msg to linux-scsi about a week ago, but nobody replied
## or made any comments.


Hi List,

Kernel crashes when I try to host-swap a hdd. When I take a drive out
I see messages on console that drive is taken out and that it's marked
as 'Failed' in md arrays. When I push it back in kernel crashes.
(also https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=194349)

2x2.4 Xeon, Adaptec AIC-7902B U320 (aic79xx), SuperMicro 6023-P8 Chassis
http://www.supermicro.com/products/system/2U/6023/SYS-6023P-8.cfm

Jun 10 00:57:03 192.168.0.6 kernel: netconsole: local IP 192.168.0.201
Jun 10 00:57:35 192.168.0.201 scsi0: Someone reset channel A
Jun 10 00:57:35 192.168.0.201 BUG: spinlock recursion on CPU#2, swapper/0
Jun 10 00:57:35 192.168.0.201 lock: c266f600, .magic: dead4ead, .owner:
swapper/0, .owner_cpu: 2
Jun 10 00:57:35 192.168.0.201 [<c01cc02d>]
Jun 10 00:57:35 192.168.0.201 _raw_spin_lock+0x33/0xd2
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c02e5bf7>]
Jun 10 00:57:35 192.168.0.201 _spin_lock_irqsave+0x9/0xd
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<f88b7768>]
Jun 10 00:57:35 192.168.0.201 ahd_freeze_simq+0x12/0x43 [aic79xx]
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<f88aed4d>]
Jun 10 00:57:35 192.168.0.201 ahd_reset_channel+0x471/0x4b5 [aic79xx]
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c02e0000>]
Jun 10 00:57:35 192.168.0.201 unix_stream_recvmsg+0x331/0x49a
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<f88af93c>]
Jun 10 00:57:35 192.168.0.201 ahd_handle_scsiint+0x349/0x15c7 [aic79xx]
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c02e5bf7>]
Jun 10 00:57:35 192.168.0.201 _spin_lock_irqsave+0x9/0xd
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c0209ab3>]
Jun 10 00:57:35 192.168.0.201 __add_entropy_words+0x58/0x184
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c011cf16>]
Jun 10 00:57:35 192.168.0.201 __wake_up+0x2a/0x3d
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c011f382>]
Jun 10 00:57:35 192.168.0.201 nr_uninterruptible+0x44/0x63
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<f88b907d>]
Jun 10 00:57:35 192.168.0.201 ahd_linux_isr+0x160/0x17b [aic79xx]
Jun 10 00:57:35 192.168.0.201
Jun 10 00:57:35 192.168.0.201 [<c01421ca>]

dmesg, lspci are attached

Rgds,
Konstantin

--
Konstantin Antselovich
mailto: konstantin [at] antselovich
http://konstantin.antselovich.com
Attachments: dmesg.2006-06-10.txt (37.1 KB)
  lspci.2006-06-10.txt (7.45 KB)
  oops.2006-06-10.txt (2.72 KB)


76306.1226 at compuserve

Jun 19, 2006, 2:57 AM

Post #2 of 3 (269 views)
Permalink
Re: BUG: spinlock recursion on CPU, kernel 2.6.16.20 & 2.6.16-1.2122_FC5[smp] [In reply to]

In-Reply-To: <4491933C.7060100 [at] antselovich>

On Thu, 15 Jun 2006 10:05:00 -0700, Konstantin Antselovich wrote:

> Jun 10 00:57:35 192.168.0.201 BUG: spinlock recursion on CPU#2, swapper/0
> Jun 10 00:57:35 192.168.0.201 lock: c266f600, .magic: dead4ead, .owner:
> swapper/0, .owner_cpu: 2
> Jun 10 00:57:35 192.168.0.201 [<c01cc02d>]
> Jun 10 00:57:35 192.168.0.201 _raw_spin_lock+0x33/0xd2
> Jun 10 00:57:35 192.168.0.201
> Jun 10 00:57:35 192.168.0.201 [<c02e5bf7>]
> Jun 10 00:57:35 192.168.0.201 _spin_lock_irqsave+0x9/0xd
> Jun 10 00:57:35 192.168.0.201
> Jun 10 00:57:35 192.168.0.201 [<f88b7768>]
> Jun 10 00:57:35 192.168.0.201 ahd_freeze_simq+0x12/0x43 [aic79xx]

Please try 2.6.17. The spinlock was removed.

--
Chuck
"You can't read a newspaper if you can't read." --George W. Bush
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo [at] vger
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/


konstantin at antselovich

Jun 19, 2006, 2:46 PM

Post #3 of 3 (260 views)
Permalink
Re: Infinite interrupt loop, INTSTAT = 8 ( WAS: BUG: spinlock recursion on CPU, kernel 2.6.16.20 & 2.6.16-1.2122_FC5[smp]) [In reply to]

on 06/19/2006 02:57 AM Chuck Ebbert said:
> In-Reply-To: <4491933C.7060100 [at] antselovich>
>
> On Thu, 15 Jun 2006 10:05:00 -0700, Konstantin Antselovich wrote:
>
>> Jun 10 00:57:35 192.168.0.201 BUG: spinlock recursion on CPU#2, swapper/0

<skip>

> Please try 2.6.17. The spinlock was removed.
>

Thanks! Yes, spinlock recursion bug has gone away in 2.6.17.

But other some problems evolved: when I take a HDD out then push it
right back, kernel freezes and the following messages are logged:

---(see detailed log in attachment)---
Jun 19 13:52:05 192.168.0.201 Infinite interrupt loop, INTSTAT = 8
Jun 19 13:52:05 192.168.0.201 scsi0: At time of recovery, card was paused
Jun 19 13:52:05 192.168.0.201 >>>>>>>>>>>>>>>>>> Dump Card State Begins
<<<<<<<<<<<<<<<<<
Jun 19 13:52:05 192.168.0.201 scsi0: Dumping Card State at program
address 0x0 Mode 0x33
Jun 19 13:52:05 192.168.0.201 Card was paused
------

because SCSI Card is paused for quite a while, other HDDs are put
offline, md1 on which / partition resides (3 disks raid5) array brakes
as 2nd HDD is lost.

After some more scsi messages are logged, other issue with networking
pops up (see below). At that point top shows 100% CPU is taken by event
kernel thread, machine locks and I had to push restart.


----
Jun 19 13:53:18 192.168.0.6 kernel: BUG: warning at
include/net/dst.h:154/dst_release()
Jun 19 13:53:18 192.168.0.6 kernel: <c0293b7e> __kfree_skb+0x30/0xce
<c0293c52> skb_queue_purge+0xa/0x17
Jun 19 13:53:18 192.168.0.6 kernel: <c02a51bb>
pfifo_fast_reset+0x14/0x2f <c02a495a> qdisc_reset+0x10/0x11
Jun 19 13:53:18 192.168.0.6 kernel: <c02a49f8> dev_deactivate+0x26/0x76
<c029ee51> linkwatch_run_queue+0x121/0x141
Jun 19 13:53:18 192.168.0.6 kernel: <c029ee93>
linkwatch_event+0x22/0x27 <c012ee7a> run_workqueue+0x7f/0xba
Jun 19 13:53:18 192.168.0.6 kernel: <c029ee71> linkwatch_event+0x0/0x27
<c012f682> worker_thread+0x0/0x106
Jun 19 13:53:18 192.168.0.6 kernel: <c012f757> worker_thread+0xd5/0x106
<c011bf79> default_wake_function+0x0/0xc
Jun 19 13:53:18 192.168.0.6 kernel: <c0131bfd> kthread+0x9d/0xc9
<c0131b60> kthread+0x0/0xc9
Jun 19 13:53:18 192.168.0.6 kernel: <c0102005> kernel_thread_helper+0x5/0xb
----


Let me know if more info is needed.

Thanks,
Konstantin


--
Konstantin Antselovich
mailto: konstantin [at] antselovich
http://konstantin.antselovich.com
Attachments: smicro-2.6.17.log (38.0 KB)

Linux kernel RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.