Re: Random “http first read error: EOF” errors

Sun Apr 2 19:39:51 CEST 2017

Btw, I need to also note that traffic is routed to Varnish from load
balancer:

LB -> Varnish -> LB -> Backend pool

Time does not matter.. It occurs during both peak and regular hours. Even
during peak hours we do not reach the "local ports" limit. Unfortunately
there is no any clue in the logs. There is no evidence that Apache restarts
on the backend pool during occurence of the issue.

On Sat, Apr 1, 2017 at 9:44 PM, Andrei <lagged at gmail.com> wrote:

> If it's during peak hours are you sure there aren't any rate limits being
> reached? Perhaps net.ipv4.ip_local_port_range might need a bump? Are
> Apache or syslog logging anything around those times? No silly periodic
> (Apache) graceful restarts? Just a few thoughts :)
>
> On Sat, Apr 1, 2017 at 2:14 PM, Hazar Güney <hazarguney at gmail.com> wrote:
>
>> We see this error a few times in a day on a highly busy production
>> environment. Unfortunately there is too much traffic on the server to keep
>> tcpdump/ngrep running and we cannot re-produce it on test environment :(
>>
>> I have started tcpdump on a test environment of another implementation
>> and will let you as soon as the issue gets triggerred again.
>>
>> On Fri, Mar 31, 2017 at 4:17 PM, Andrei <lagged at gmail.com> wrote:
>>
>>> Can you provide a tcpdump/ngrep of the requests between
>>> Client/Varnish/Apache along with the varnishlog entry to see if that
>>> uncovers anything?
>>>
>>> On Fri, Mar 31, 2017 at 7:25 AM, Hazar Güney <hazarguney at gmail.com>
>>> wrote:
>>>
>>>> Any idea?
>>>>
>>>> On Thu, Mar 30, 2017 at 3:41 PM, Hazar Güney <hazarguney at gmail.com>
>>>> wrote:
>>>>
>>>>> It did not work either:
>>>>>
>>>>>     *   << BeReq    >> 127418176
>>>>>     -   Begin          bereq 127418175 fetch
>>>>>     -   Timestamp      Start: 1490877149.450124 0.000000 0.000000
>>>>>     -   BereqMethod    GET
>>>>>     -   BereqURL       XXXX
>>>>>     -   BereqProtocol  HTTP/1.1
>>>>>     -   BereqHeader    Accept: text/css,*/*;q=0.1
>>>>>     -   BereqHeader    User-Agent: Mozilla/5.0 (iPhone; CPU iPhone OS
>>>>> 10_2 like Mac OS X) AppleWebKit/602.3.12 (KHTML, like Gecko) Version/10.0
>>>>> Mobile/14C92 Safari/602.1
>>>>>     -   BereqHeader    Accept-Language: tr-tr
>>>>>     -   BereqHeader    Referer: XXXX
>>>>>     -   BereqHeader    Host: XXXX
>>>>>     -   BereqHeader    RIP: XXXX
>>>>>     -   BereqHeader    X-Forwarded-For: XXXX
>>>>>     -   BereqHeader    Accept-Encoding: gzip
>>>>>     -   BereqHeader    X-Varnish: 127418176
>>>>>     -   VCL_call       BACKEND_FETCH
>>>>>     -   BereqHeader    connection: Close
>>>>>     -   VCL_return     fetch
>>>>>     -   BackendOpen    25 reload_2017-03-30T14:53:46.st2 10.35.78.11
>>>>> 80 172.17.0.2 59152
>>>>>     -   BackendStart   10.35.78.11 80
>>>>>     -   Timestamp      Bereq: 1490877149.450594 0.000470 0.000470
>>>>>     -   FetchError     http first read error: EOF
>>>>>     -   BackendClose   25 reload_2017-03-30T14:53:46.st2
>>>>>     -   Timestamp      Beresp: 1490877149.451184 0.001060 0.000590
>>>>>     -   Timestamp      Error: 1490877149.451189 0.001065 0.000005
>>>>>     -   BerespProtocol HTTP/1.1
>>>>>     -   BerespStatus   503
>>>>>     -   BerespReason   Service Unavailable
>>>>>     -   BerespReason   Backend fetch failed
>>>>>     -   BerespHeader   Date: Thu, 30 Mar 2017 12:32:29 GMT
>>>>>     -   BerespHeader   Server: Varnish
>>>>>     -   VCL_call       BACKEND_ERROR
>>>>>     -   BereqHeader    X-Varnish-Backend-5xx: 1
>>>>>     -   VCL_return     retry
>>>>>     -   Timestamp      Retry: 1490877149.451205 0.001081 0.000016
>>>>>     -   Link           bereq 127298071 retry
>>>>>     -   End
>>>>>
>>>>> On Thu, Mar 30, 2017 at 2:34 PM, Guillaume Quintard <
>>>>> guillaume at varnish-software.com> wrote:
>>>>>
>>>>>> It does, I'm suspecting that the connection reuse is creating some
>>>>>> issues, probably because Apache is doing some non-standard stuff (protip:
>>>>>> always blame Apache).
>>>>>>
>>>>>> --
>>>>>> Guillaume Quintard
>>>>>>
>>>>>> On Thu, Mar 30, 2017 at 1:17 PM, Hazar Güney <hazarguney at gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> "Connection: close" supersedes keep-alive behavior, is that correct?
>>>>>>>
>>>>>>> On Thu, Mar 30, 2017 at 2:08 PM, Guillaume Quintard <
>>>>>>> guillaume at varnish-software.com> wrote:
>>>>>>>
>>>>>>>> Can you try something: add 'set bereq.http.connection = "Close"; '
>>>>>>>> at the beginning of vcl_backend_fetch and see if that helps?
>>>>>>>>
>>>>>>>> --
>>>>>>>> Guillaume Quintard
>>>>>>>>
>>>>>>>> On Thu, Mar 30, 2017 at 1:04 PM, Hazar Güney <hazarguney at gmail.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> MaxKeepAliveRequests 20
>>>>>>>>> KeepAliveTimeout 2
>>>>>>>>>
>>>>>>>>> Version is "4.1.3 revision 5e3b6d2". We have also seen "straight
>>>>>>>>> insufficient bytes" error with POST requests to a specific php script
>>>>>>>>> hosted by another backend and fixed it by using "pipe" instead of "pass"
>>>>>>>>> but this specific backend gives "http first read error: EOF" error. Another
>>>>>>>>> example from today:
>>>>>>>>>
>>>>>>>>> *   << BeReq    >> 126635444
>>>>>>>>> -   Begin          bereq 126635443 fetch
>>>>>>>>> -   Timestamp      Start: 1490870598.921499 0.000000 0.000000
>>>>>>>>> -   BereqMethod    GET
>>>>>>>>> -   BereqURL       XXXX
>>>>>>>>> -   BereqProtocol  HTTP/1.1
>>>>>>>>> -   BereqHeader    Host: XXXX
>>>>>>>>> -   BereqHeader    User-Agent: Mozilla/5.0 (Windows NT 10.0;
>>>>>>>>> Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87
>>>>>>>>> Safari/537.36
>>>>>>>>> -   BereqHeader    Accept: image/webp,image/*,*/*;q=0.8
>>>>>>>>> -   BereqHeader    Referer: XXXX
>>>>>>>>> -   BereqHeader    Accept-Language: tr-TR,tr;q=0.8,en-US;q=0.6,en;
>>>>>>>>> q=0.4
>>>>>>>>> -   BereqHeader    RIP: XXXX
>>>>>>>>> -   BereqHeader    X-Forwarded-For: XXXX
>>>>>>>>> -   BereqHeader    Accept-Encoding: gzip
>>>>>>>>> -   BereqHeader    X-Varnish: 126635444
>>>>>>>>> -   VCL_call       BACKEND_FETCH
>>>>>>>>> -   VCL_return     fetch
>>>>>>>>> -   BackendOpen    35 reload_2017-03-20T11:32:44.st2 10.35.78.11
>>>>>>>>> 80 172.17.0.2 48896
>>>>>>>>> -   BackendStart   10.35.78.11 80
>>>>>>>>> -   Timestamp      Bereq: 1490870598.922050 0.000552 0.000552
>>>>>>>>> *-   FetchError     http first read error: EOF*
>>>>>>>>> -   BackendClose   35 reload_2017-03-20T11:32:44.st2
>>>>>>>>> -   Timestamp      Beresp: 1490870598.922622 0.001124 0.000572
>>>>>>>>> -   Timestamp      Error: 1490870598.922627 0.001129 0.000005
>>>>>>>>> -   BerespProtocol HTTP/1.1
>>>>>>>>> -   BerespStatus   503
>>>>>>>>> -   BerespReason   Service Unavailable
>>>>>>>>> -   BerespReason   Backend fetch failed
>>>>>>>>> -   BerespHeader   Date: Thu, 30 Mar 2017 10:43:18 GMT
>>>>>>>>> -   BerespHeader   Server: Varnish
>>>>>>>>> -   VCL_call       BACKEND_ERROR
>>>>>>>>> -   BereqHeader    X-Varnish-Backend-5xx: 1
>>>>>>>>> -   VCL_return     retry
>>>>>>>>> -   Timestamp      Retry: 1490870598.922657 0.001159 0.000030
>>>>>>>>> -   Link           bereq 126832283 retry
>>>>>>>>> -   End
>>>>>>>>>
>>>>>>>>> On Wed, Mar 29, 2017 at 12:03 PM, Mattias Geniar <
>>>>>>>>> mattias at nucleus.be> wrote:
>>>>>>>>>
>>>>>>>>>> > Backend is Apache.
>>>>>>>>>>
>>>>>>>>>> In older Varnish versions, you could sometimes see a similar
>>>>>>>>>> error;
>>>>>>>>>>
>>>>>>>>>> >   11 FetchError   c straight insufficient bytes
>>>>>>>>>>
>>>>>>>>>> The error message you’re seeing might be related, as it mentions
>>>>>>>>>> the EOF.
>>>>>>>>>>
>>>>>>>>>> This happens when the backend sends a Content-Length header that
>>>>>>>>>> doesn’t match the _actual_ content length it’s sending. In Apache, this was
>>>>>>>>>> commonly caused by a mod_deflate misconfiguration.
>>>>>>>>>>
>>>>>>>>>> For testing, could you try disabling Gzip either in your backend
>>>>>>>>>> or strip the Accept-Encoding header in Varnish to force a plain text
>>>>>>>>>> response?
>>>>>>>>>>
>>>>>>>>>> Mattias
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>
>>>> _______________________________________________
>>>> varnish-misc mailing list
>>>> varnish-misc at varnish-cache.org
>>>> https://www.varnish-cache.org/lists/mailman/listinfo/varnish-misc
>>>>
>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.varnish-cache.org/lists/pipermail/varnish-misc/attachments/20170402/d1f0b33c/attachment.html>