varnish nuking agressively all my objects.

Aurélien Lemaire aurelien.lemaire at smile.fr
Thu Sep 15 11:29:36 CEST 2011


Good day folks,

First of all, varnish is an outstanding piece of software that my company and i 
are addicted to. So thanks to all the coders.

Here is my problem :
I allocated varnish 1G of RAM on a website that can have more than 2 Go of 
possible cacheable objects . Not to worry though as any proxy-cache system 
should smartly nuke old objects to make place to new one to live peacefully 
within its allocated RAM. And that's where Varnish behave unexpectedly : each 
time it need to nuke SOME objects : it nukes ALMOST ALL of them (often ~80% of 
my 35k objects) which is quite aggressive ; thus i lost almost all my cache....IRK !

3 Munin graphs attached to see the problem clearly : big drop each time a nuking 
happens.

To make sure my pbr is about varnish nuking system : i increased from 1G to 
3G(more than the max possible 2G cacheable objects) on another varnish of this 
platefom (this website is delivered by multiple front/varnish server all stricly 
similar and independant) and this issue disappeared (no more nuking : no lost of 
~80%of my objects)

Here is my env :
Debian 5.0.8 64 bits on 2.6.32-5-openvz-amd64 kernel
Varnish 2.1.3 SVN 5049:5055(debian package 2.1.3-8)
200 varnish 's worker threads running constantly (no issue on workers)
30req/s average with 60/s in peak

Daemon run as such :
/usr/sbin/varnishd -P /var/run/varnishd.pid -a :80 -T localhost:6082 -S 
/etc/varnish/secret -f /etc/varnish/serverx.vcl -w 100,1024 -s 
file,/var/lib/varnish/serverx/varnish_storage.bin,3G

Here a quick varnishstat -1 :

    client_conn          17177518         9.26 Client connections accepted
    client_drop                 0         0.00 Connection dropped, no sess/wrk
    client_req           31277524        16.86 Client requests received
    cache_hit            20909485        11.27 Cache hits
    cache_hitpass         2558712         1.38 Cache hits for pass
    cache_miss            5422557         2.92 Cache misses
    backend_conn           175175         0.09 Backend conn. success
    backend_unhealthy           0         0.00 Backend conn. not attempted
    backend_busy                0         0.00 Backend conn. too many
    backend_fail                0         0.00 Backend conn. failures
    backend_reuse        10192538         5.49 Backend conn. reuses
    backend_toolate         98629         0.05 Backend conn. was closed
    backend_recycle      10291261         5.55 Backend conn. recycles
    backend_unused              0         0.00 Backend conn. unused
    fetch_head             459553         0.25 Fetch head
    fetch_length          9907522         5.34 Fetch with Length
    fetch_chunked              33         0.00 Fetch chunked
    fetch_eof                   0         0.00 Fetch EOF
    fetch_bad                   0         0.00 Fetch had bad headers
    fetch_close                 7         0.00 Fetch wanted close
    fetch_oldhttp               0         0.00 Fetch pre HTTP/1.1 closed
    fetch_zero                638         0.00 Fetch zero len
    fetch_failed                0         0.00 Fetch failed
    n_sess_mem               1189          .   N struct sess_mem
    n_sess                    890          .   N struct sess
    n_object                32678          .   N struct object
    n_vampireobject             0          .   N unresurrected objects
    n_objectcore            32794          .   N struct objectcore
    n_objecthead            30698          .   N struct objecthead
    n_smf                   68810          .   N struct smf
    n_smf_frag               3002          .   N small free smf
    n_smf_large               490          .   N large free smf
    n_vbe_conn                  8          .   N struct vbe_conn
    n_wrk                     200          .   N worker threads
    n_wrk_create              200         0.00 N worker threads created
    n_wrk_failed                0         0.00 N worker threads not created
    n_wrk_max                4133         0.00 N worker threads limited
    n_wrk_queue                 0         0.00 N queued work requests
    n_wrk_overflow              5         0.00 N overflowed work requests
    n_wrk_drop                  0         0.00 N dropped work requests
    n_backend                   1          .   N backends
    n_expired             2291302          .   N expired objects
    *n_lru_nuked           3098052          .   N LRU nuked objects*
    n_lru_saved                 0          .   N LRU saved objects
    n_lru_moved          19894617          .   N LRU moved objects
    n_deathrow                  0          .   N objects on deathrow
    losthdr                     0         0.00 HTTP header overflows
    n_objsendfile               0         0.00 Objects sent with sendfile
    n_objwrite           23874235        12.87 Objects sent with write
    n_objoverflow               0         0.00 Objects overflowing workspace
    s_sess               17177512         9.26 Total Sessions
    s_req                31277524        16.86 Total Requests
    s_pipe                      0         0.00 Total pipe
    s_pass                4945241         2.67 Total pass
    s_fetch              10367753         5.59 Total fetch
    s_hdrbytes        10732679592      5784.04 Total header bytes
    s_bodybytes      316828226025    170744.51 Total body bytes
    sess_closed           7071576         3.81 Session Closed
    sess_pipeline           13905         0.01 Session Pipeline
    sess_readahead           8085         0.00 Session Read Ahead
    sess_linger          24485521        13.20 Session Linger
    sess_herd            23385907        12.60 Session herd
    shm_records        1766166234       951.82 SHM records
    shm_writes          128900505        69.47 SHM writes
    shm_flushes             13215         0.01 SHM flushes due to overflow
    shm_cont                78095         0.04 SHM MTX contention
    shm_cycles                769         0.00 SHM cycles through buffer
    sm_nreq              18413160         9.92 allocator requests
    sm_nobj                 65318          .   outstanding allocations
    sm_balloc           824430592          .   bytes allocated
    sm_bfree            249311232          .   bytes free
    sma_nreq                    0         0.00 SMA allocator requests
    sma_nobj                    0          .   SMA outstanding allocations
    sma_nbytes                  0          .   SMA outstanding bytes
    sma_balloc                  0          .   SMA bytes allocated
    sma_bfree                   0          .   SMA bytes free
    sms_nreq                   61         0.00 SMS allocator requests
    sms_nobj                    0          .   SMS outstanding allocations
    sms_nbytes                  0          .   SMS outstanding bytes
    sms_balloc              16409          .   SMS bytes allocated
    sms_bfree               16409          .   SMS bytes freed
    backend_req          10367752         5.59 Backend requests made
    n_vcl                       1         0.00 N vcl total
    n_vcl_avail                 1         0.00 N vcl available
    n_vcl_discard               0         0.00 N vcl discarded
    n_purge                     1          .   N total active purges
    n_purge_add             12314         0.01 N new purges added
    n_purge_retire          12313         0.01 N old purges deleted
    n_purge_obj_test      2448163         1.32 N objects tested
    n_purge_re_test      62064275        33.45 N regexps tested against
    n_purge_dups             9524         0.01 N duplicate purges removed
    hcb_nolock           28886624        15.57 HCB Lookups without lock
    hcb_lock              4243837         2.29 HCB Lookups with lock
    hcb_insert            4243834         2.29 HCB Inserts
    esi_parse                   0         0.00 Objects ESI parsed (unlock)
    esi_errors                  0         0.00 ESI parse errors (unlock)
    accept_fail                 0         0.00 Accept failures
    client_drop_late            0         0.00 Connection dropped late
    uptime                1855569         1.00 Client uptime


Is it normal varnish behaviour ? sounds like a bug to me.
Am i missing some tuning (lru_interval)  to soften the nuking algo ?
Do you need more info ?
helps appreciated here  ;-)

Regards, Aurelien Lemaire
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.varnish-cache.org/lists/pipermail/varnish-misc/attachments/20110915/b944cf73/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: varnish_expunge-week.png
Type: image/png
Size: 21317 bytes
Desc: not available
URL: <https://www.varnish-cache.org/lists/pipermail/varnish-misc/attachments/20110915/b944cf73/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: varnish_memory_usage-week.png
Type: image/png
Size: 27158 bytes
Desc: not available
URL: <https://www.varnish-cache.org/lists/pipermail/varnish-misc/attachments/20110915/b944cf73/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: varnish_objects-week.png
Type: image/png
Size: 23910 bytes
Desc: not available
URL: <https://www.varnish-cache.org/lists/pipermail/varnish-misc/attachments/20110915/b944cf73/attachment-0005.png>


More information about the varnish-misc mailing list