Thanks, I'll do as you suggest and see what happens.

If it's wedged, then I should do what, attach with gdb and get a  


>> Update:  I ran varnishd in foreground with -d and I'm seeing these
>> periodically, which would explain the cache invalidation...
>> Child not responding to ping
>> Cache child died pid=23899 status=0x9
> This is the manager process not getting a reply from the child
> process and restarting it, assuming that it is not serving
> requests either.
> You need to find out why the child process does not reply to pings.
> The first thing to do is to increase the manager's timeout by increasing
> the "cli_timeout" parameter to see if the child process is wedged
> or just slow.
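
For what it's worth, the "status=0x9" in the log looks like a raw wait status. Assuming the conventional Unix encoding (low 7 bits are the terminating signal, bit 0x80 is the core-dump flag, the next 8 bits are the exit code), a small sketch to decode it. The function name decode_status is my own, not anything from Varnish:

```shell
# Decode a raw wait status such as the one printed in "status=0x9".
# Assumption: conventional Unix layout of the status word:
#   low 7 bits  = terminating signal (0 means normal exit)
#   bit 0x80    = core dumped
#   bits 8..15  = exit code
decode_status() {
    status=$(( $1 ))
    sig=$(( status & 0x7f ))
    core=$(( (status & 0x80) >> 7 ))
    code=$(( (status >> 8) & 0xff ))
    if [ "$sig" -ne 0 ]; then
        echo "killed by signal $sig core=$core"
    else
        echo "exited with code $code"
    fi
}

decode_status 0x9
```

If that reading is right, 0x9 means the child was killed with signal 9 (SIGKILL), which would fit the manager killing and restarting it after the missed ping. To try the suggestion, the parameter can be given at startup as "-p cli_timeout=<seconds>" or changed at runtime with "param.set cli_timeout <seconds>" in the CLI.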
