[Varnish] #678: varnishd stops accepting requests
Varnish
varnish-bugs at varnish-cache.org
Wed Apr 14 13:56:00 CEST 2010
#678: varnishd stops accepting requests
----------------------+-----------------------------------------------------
Reporter: ahongens | Owner: phk
Type: defect | Status: new
Priority: high | Milestone:
Component: varnishd | Version: trunk
Severity: major | Keywords: hang stop responding
----------------------+-----------------------------------------------------
I have four balancers that ran 2.0.5 fine for months, and now I've
upgraded them to 2.1.0, and sometimes one (at random which one) seems to
hang.
Varnishstat shows no requests coming in, and when I run varnishlog I only
see a lot of lines like this:
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
8045 SessionClose - dropped
8045 StatSess - (null) (null) 1271244853 0 0 0 0 0 0 0
I don't see anything else that is strange.. The only thing I see in my
cacti monitoring is that responses go down, and active tcp connections go
up (probably as a result). Load of the balancers prior to the problem is a
normal 0.2-0.5.
After restaring, in my syslog I see (strange the time is off by 2 hours
though, time was 13:34, all other daemons log ok)
Apr 14 11:34:23 nmt-nlb-04 varnishd[54351]: Manager got SIGINT
Apr 14 11:34:23 nmt-nlb-04 varnishd[54351]: Stopping Child
Apr 14 11:34:37 nmt-nlb-04 varnishd[65642]: child (65643) Started
Apr 14 11:34:37 nmt-nlb-04 varnishd[65642]: Child (65643) said Closed fds:
4 5 6 7 11 12 14 15
Apr 14 11:34:37 nmt-nlb-04 varnishd[65642]: Child (65643) said Child
starts
Apr 14 11:34:37 nmt-nlb-04 varnishd[65642]: Child (65643) said managed to
mmap 8589934592 bytes of 8589934592
I cannot reproduce it.
--
Ticket URL: <http://www.varnish-cache.org/ticket/678>
Varnish <http://varnish-cache.org/>
The Varnish HTTP Accelerator
More information about the varnish-bugs
mailing list