Varnish restarting sporadically... losing entire cache...

Ben Nowacky bnowacky at competitorgroup.com
Thu Jun 24 20:13:38 CEST 2010


Looks like the child processes are dying regularly, which is causing the restarts. I increased the ping_interval as Flavio recommended, and the cli_timeout as well. It took longer to die (12 minutes this time), but it finally pooped out. This was in /var/log/messages:

Jun 24 18:13:23 srv860 varnishd[25646]: Child (26549) died signal=11
Jun 24 18:13:23 srv860 varnishd[25646]: child (27359) Started
Jun 24 18:13:23 srv860 varnishd[25646]: Child (27359) said 
Jun 24 18:13:23 srv860 varnishd[25646]: Child (27359) said Child starts
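
To see how often the child is dying and on which signal (signal 11 is SIGSEGV on Linux), log lines like the ones above can be tallied with a small script. This is only a rough sketch: it assumes the syslog line format shown in this message, and the names DIED, child_deaths and sample are made up for illustration, not part of Varnish.

```python
import re
from collections import Counter

# Matches manager log lines of the form seen in this thread, e.g.
#   Jun 24 18:13:23 srv860 varnishd[25646]: Child (26549) died signal=11
# The layout is an assumption based on the excerpt above.
DIED = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]{8})\s+\S+\s+varnishd\[\d+\]:\s+"
    r"Child \((?P<child>\d+)\) died signal=(?P<signal>\d+)"
)


def child_deaths(lines):
    """Return (timestamp, child_pid, signal) for each 'died signal=' line."""
    events = []
    for line in lines:
        m = DIED.match(line)
        if m:
            events.append(
                (m.group("stamp"), int(m.group("child")), int(m.group("signal")))
            )
    return events


# Sample input copied from the log excerpt in this message.
sample = [
    "Jun 24 18:13:23 srv860 varnishd[25646]: Child (26549) died signal=11",
    "Jun 24 18:13:23 srv860 varnishd[25646]: child (27359) Started",
    "Jun 24 18:13:23 srv860 varnishd[25646]: Child (27359) said Child starts",
]

events = child_deaths(sample)
by_signal = Counter(sig for _, _, sig in events)
print(events)
print(dict(by_signal))
```

To run it against the real log, pass an open file instead of the sample list, e.g. child_deaths(open("/var/log/messages")).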


Any other suggestions? 


On Jun 24, 2010, at 10:58 AM, Caunter, Stefan wrote:

> Check dmesg too; the child is probably dying. I found a problem with
> persistent storage and had to go back to file.
> 
> Stefan Caunter :: Senior Systems Administrator :: TOPS
> e: scaunter at topscms.com  ::  m: (416) 561-4871
> www.thestar.com www.topscms.com
> 
> 
> -----Original Message-----
> From: varnish-misc-bounces at varnish-cache.org
> [mailto:varnish-misc-bounces at varnish-cache.org] On Behalf Of Ben Nowacky
> Sent: June-24-10 1:51 PM
> To: Flavio Torres
> Cc: varnish-misc at varnish-cache.org
> Subject: Re: Varnish restarting sporadically... losing entire cache...
> 
> Thanks Flavio! Here are the errors that I see in /var/log/messages.
> Is this what you were seeing? 
> 
> Jun 24 17:38:23 srv860 varnishd[15625]: Child (22165) Panic message:
> Assert error in SMP_FreeObj(), storage_persistent.c line 802:
> Condition(sg->nfixed > 0) not true. thread = (cache-timeout) ident =
> Linux,2.6.18-128.4.1.el5PAE,i686,-spersistent,-hclassic,epoll Backtrace:
> 0x806ca7c: pan_ic+cc   0x808851e: SMP_FreeObj+13e   0x8064b5f:
> HSH_Deref+21f   0x80618d1: exp_timer+321   0x806f1fd: wrk_bgthread+cd
> 0x44249b: /lib/libpthread.so.0 [0x44249b]   0x39942e:
> /lib/libc.so.6(clone+0x5e) [0x39942e]  
> Jun 24 17:38:23 srv860 varnishd[15625]: child (22984) Started
> Jun 24 17:38:23 srv860 varnishd[15625]: Child (22984) said 
> Jun 24 17:38:23 srv860 varnishd[15625]: Child (22984) said Child starts
> Jun 24 17:38:23 srv860 varnishd[15625]: Child (22984) said Dropped 0
> segments to make free_reserve
> Jun 24 17:38:23 srv860 varnishd[15625]: Child (22984) said Silo
> completely loaded
> On Jun 24, 2010, at 10:51 AM, Flavio Torres wrote:
> 




