Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: Apache: Users

Apache mysteriously stops answering requests

 

 

Apache users RSS feed   Index | Next | Previous | View Threaded


bnelson at rightnow

Dec 11, 2001, 1:01 PM

Post #1 of 3 (658 views)
Permalink
Apache mysteriously stops answering requests

I have several servers that I have installed Apache 1.3.22 on and I'm seeing
some interesting problems. The httpd daemon(s) start up just fine and start
answering requests just fine as well. The problem is that after some amount
of time ( it's not consistent ) the daemons stop answering http requests.
Sometimes it stays up for 16-17 hours, other times only for 3-4 hours.
Sometimes it'll serve up to ~180,00 hits before it dies, other times it'll
serve as few as ~50,000.

The httpd daemons are running, but when I try to connect to the machine
using lynx I get the following errors:

<lynx snip>
Looking up localhost.
Making HTTP connection to localhost.
Sending HTTP request.
HTTP request sent; waiting for response.
Alert!: Unexpected network read error; connection aborted.
Can't Access `http://localhost/'
Alert!: Unable to access document.
</lynx snip>

The machine is listening on port 80, however :

<shell snip>
root [at] websc0:/var/log/httpd# telnet localhost 80
Trying 127.0.0.1...
Connected to localhost.localdomain.
Escape character is '^]'.
GET / HTTP/1.0

Connection closed by foreign host.
root [at] websc0:/var/log/httpd#
</shell snip>

netstat -an also show that the machine is listening on 80.

Nothing unusual shows up in the error_log or the access logs. When I attach
strace to the running httpd process I get the following over and over again:
<strace snip>
select(0, NULL, NULL, NULL, {0, 790000}) = 0 (Timeout)
time(NULL) = 1008090657
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090658
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090659
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090660
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090661
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090662
wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
time(NULL) = 1008090663
select(7, NULL, [4 6], NULL, {0, 0}) = 2 (out [4 6], left {0, 0})
</strace snip>

vmstat also shows that we are not exhausting our resources.
<vmstat snip>
root [at] websc0:~# vmstat 1
procs memory swap io system
cpu
r b w swpd free buff cache si so bi bo in cs us sy
id
2 0 0 0 1801224 175736 51956 0 0 0 2 27 20 2 1
28
0 0 0 0 1801224 175736 51956 0 0 0 0 119 51 0 2
98
0 0 0 0 1801224 175736 51956 0 0 0 23 120 24 0 2
98
0 0 0 0 1801224 175736 51956 0 0 0 0 113 21 0 4
96
0 0 0 0 1801224 175736 51956 0 0 0 4 178 115 5 5
90
0 0 0 0 1801224 175736 51956 0 0 0 0 122 57 0 3
97
0 0 0 0 1801224 175736 51956 0 0 0 5 119 58 0 3
97
</vmstat snip>

I have run Apache 1.3.12 on this same machine with the same httpd.conf and
it stays up with no problems at all. I have many other web servers running
1.3.12 and they all run very reliably.

The machine is a dual PIII 700, 2G RAM, running linux kernel 2.2.14.

Has anyone else seen anything similar or heard of anything similar??? I'd
like to upgrade all of my machines to 1.3.22, but not until I can get it
running reliably.

--Ben

---------------------------------------------------------------------
The official User-To-User support forum of the Apache HTTP Server Project.
See <URL:http://httpd.apache.org/userslist.html> for more info.
To unsubscribe, e-mail: users-unsubscribe [at] httpd
For additional commands, e-mail: users-help [at] httpd


derek at rivertower

Dec 11, 2001, 2:13 PM

Post #2 of 3 (615 views)
Permalink
Re: Apache mysteriously stops answering requests [In reply to]

What O/S are you using?

I've also had (hopefully not have :) ) this problem over the last few months.
I'm using sun os 5.8 and I think the problem is either with SYSV shared
memory or posix threads (I think only LWP threads work correctly under 5.8).

I'm very interested in seeing how you get along...

Derek

On Tuesday 11 December 2001 20:01, you wrote:
> I have several servers that I have installed Apache 1.3.22 on and I'm
> seeing some interesting problems. The httpd daemon(s) start up just fine
> and start answering requests just fine as well. The problem is that after
> some amount of time ( it's not consistent ) the daemons stop answering http
> requests. Sometimes it stays up for 16-17 hours, other times only for 3-4
> hours. Sometimes it'll serve up to ~180,00 hits before it dies, other times
> it'll serve as few as ~50,000.
>
> The httpd daemons are running, but when I try to connect to the machine
> using lynx I get the following errors:
>
> <lynx snip>
> Looking up localhost.
> Making HTTP connection to localhost.
> Sending HTTP request.
> HTTP request sent; waiting for response.
> Alert!: Unexpected network read error; connection aborted.
> Can't Access `http://localhost/'
> Alert!: Unable to access document.
> </lynx snip>
>
> The machine is listening on port 80, however :
>
> <shell snip>
> root [at] websc0:/var/log/httpd# telnet localhost 80
> Trying 127.0.0.1...
> Connected to localhost.localdomain.
> Escape character is '^]'.
> GET / HTTP/1.0
>
> Connection closed by foreign host.
> root [at] websc0:/var/log/httpd#
> </shell snip>
>
> netstat -an also show that the machine is listening on 80.
>
> Nothing unusual shows up in the error_log or the access logs. When I
> attach strace to the running httpd process I get the following over and
> over again: <strace snip>
> select(0, NULL, NULL, NULL, {0, 790000}) = 0 (Timeout)
> time(NULL) = 1008090657
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090658
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090659
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090660
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090661
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090662
> wait4(-1, 0xbffffa9c, WNOHANG, NULL) = 0
> select(0, NULL, NULL, NULL, {1, 0}) = 0 (Timeout)
> time(NULL) = 1008090663
> select(7, NULL, [4 6], NULL, {0, 0}) = 2 (out [4 6], left {0, 0})
> </strace snip>
>
> vmstat also shows that we are not exhausting our resources.
> <vmstat snip>
> root [at] websc0:~# vmstat 1
> procs memory swap io system
> cpu
> r b w swpd free buff cache si so bi bo in cs us sy
> id
> 2 0 0 0 1801224 175736 51956 0 0 0 2 27 20 2
> 1 28
> 0 0 0 0 1801224 175736 51956 0 0 0 0 119 51 0
> 2 98
> 0 0 0 0 1801224 175736 51956 0 0 0 23 120 24 0
> 2 98
> 0 0 0 0 1801224 175736 51956 0 0 0 0 113 21 0
> 4 96
> 0 0 0 0 1801224 175736 51956 0 0 0 4 178 115 5
> 5 90
> 0 0 0 0 1801224 175736 51956 0 0 0 0 122 57 0
> 3 97
> 0 0 0 0 1801224 175736 51956 0 0 0 5 119 58 0
> 3 97
> </vmstat snip>
>
> I have run Apache 1.3.12 on this same machine with the same httpd.conf and
> it stays up with no problems at all. I have many other web servers running
> 1.3.12 and they all run very reliably.
>
> The machine is a dual PIII 700, 2G RAM, running linux kernel 2.2.14.
>
> Has anyone else seen anything similar or heard of anything similar??? I'd
> like to upgrade all of my machines to 1.3.22, but not until I can get it
> running reliably.
>
> --Ben
>
> ---------------------------------------------------------------------
> The official User-To-User support forum of the Apache HTTP Server Project.
> See <URL:http://httpd.apache.org/userslist.html> for more info.
> To unsubscribe, e-mail: users-unsubscribe [at] httpd
> For additional commands, e-mail: users-help [at] httpd

---------------------------------------------------------------------
The official User-To-User support forum of the Apache HTTP Server Project.
See <URL:http://httpd.apache.org/userslist.html> for more info.
To unsubscribe, e-mail: users-unsubscribe [at] httpd
For additional commands, e-mail: users-help [at] httpd


apache at w3suite

Dec 11, 2001, 5:04 PM

Post #3 of 3 (589 views)
Permalink
Re[2]: Apache mysteriously stops answering requests [In reply to]

> On Tuesday 11 December 2001 20:01, you wrote:
>> I have several servers that I have installed Apache 1.3.22 on and I'm
>> seeing some interesting problems. The httpd daemon(s) start up just fine
>> and start answering requests just fine as well. The problem is that after
>> some amount of time ( it's not consistent ) the daemons stop answering http
>> requests. Sometimes it stays up for 16-17 hours, other times only for 3-4
>> hours. Sometimes it'll serve up to ~180,00 hits before it dies, other times
>> it'll serve as few as ~50,000.
+1 but I use apache 1.3.22 with php on Mandrake 8.1 with selfbulid
2.4.13 Kernel.

>> Nothing unusual shows up in the error_log or the access logs. When I
>> attach strace to the running httpd process I get the following over and
>> over again: <strace snip>
There is also nothing of this in my logfiles

>> vmstat also shows that we are not exhausting our resources.
+1


So what next, tell what additional informations do you need

-Manuel


---------------------------------------------------------------------
The official User-To-User support forum of the Apache HTTP Server Project.
See <URL:http://httpd.apache.org/userslist.html> for more info.
To unsubscribe, e-mail: users-unsubscribe [at] httpd
For additional commands, e-mail: users-help [at] httpd

Apache users RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.