<div dir="ltr">Thank you Sam for your response. I already logged requests to cached Arabic URLs and here is the result of one request:<br>===========================================================================================<br>
Cookie: SESScfc90a62c81b7bfc6f292320b1d0b8ca=t7t650vu5qu02916unbtil9o66; SESS50745c6a3729e7f46278f7d281511580=qjc658f7cthp6dvj65rt6a8c64; SESS8348e9a0e0f6133hash*%ntrol: max-age=0%c9c2n9td5uuvj0hp73; SESSb323fb39997d18c5bde4c32f7bc0ffe1=0r5ve4k3i2ubmqu<br>
▒±␊: 0 ┼: ┐␊␊⎻-▒┌␋┴␊ 806 <br>===========================================================================================<br><br>I tried opening the log file with less, vim, and tail but all what am getting is either binary (less) or stuff like above (tail). <br>
I even tried limiting the accepted charset header sent by the browser to UTF-8 but failed. Here is my config for limiting the charset under sub vcl_rcv { } :<br>======================================<br> if (req.http.Accept-Charset) {<br>
remove req.http.Accept-Charset;<br> set req.http.Accept-Charset = "utf-8";<br> }<br>======================================<br><br>I also tried including C header files as follows:<br>===================================<br>
C{<br>#include <string.h><br>#include <locale.h><br>#include <wctype.h><br>#include <wchar.h><br>#include <curses.h><br>}C<br>===================================<br>but it did not give me any result.<br>
<br>I am thinking of recompiling with ncurses wchar enabled. Any ideas? <br><br><br><div class="gmail_quote">2010/5/24 Sam Crawford <span dir="ltr"><<a href="mailto:samcrawford@gmail.com">samcrawford@gmail.com</a>></span><br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">It's not one that I'm familiar with, but if it were me, I'd try<br>
running varnishlog whilst putting a request for one of these URLs<br>
through. See how varnish prints it out in the RxURL field. This might<br>
give you some clues as how to specify it in the rules.<br>
<br>
Thanks,<br>
<br>
Sam<br>
<br>
<br>
2010/5/23 Angie T. Muhammad <<a href="mailto:angie.tawfik@gmail.com">angie.tawfik@gmail.com</a>>:<br>
<div><div></div><div class="h5">> Hello Varnish team<br>
><br>
> I have varnish v. 2.1.2 on production and test servers . We are running a<br>
> bilingual news website.<br>
> On my test server I am trying to parse non-English URLs like follows:<br>
><br>
> .......................<br>
> else if (req.url == "/تقارير") {<br>
> set beresp.http.X-Cacheable = "Yes";<br>
> set beresp.ttl = 60m;<br>
> return(deliver);<br>
> }<br>
> .......................<br>
><br>
> The word in bold red is in Arabic and it is a right-to-left language. The<br>
> link can not be made in English and has no English equivalent. In case you<br>
> are wondering, the word means "reports". My sole problem now is that varnish<br>
> applies all other if-statements with full English URLs but not this one with<br>
> Arabiv. Even if I try regex say: req.url ~ "^/تقارير" instead of the ==<br>
> sign, it starts with no errors but does not apply the rule.<br>
><br>
> I tried the following:<br>
> 1- Reversing the letters of the arabic word, so تقارير would be ريراقت but<br>
> it did not work<br>
> 2- Copying the link directly into /etc/varnish/default.vcl, it produces<br>
> something like: %D9%88%D8%B3%D9%88%D9%85%D8%A7%D8%AA<br>
> Such html address handling prevents varnish from starting<br>
><br>
> Any ideas? Your help is really appreciated.<br>
><br>
><br>
> --<br>
> All the best,<br>
> Angie<br>
><br>
</div></div>> _______________________________________________<br>
> varnish-misc mailing list<br>
> <a href="mailto:varnish-misc@varnish-cache.org">varnish-misc@varnish-cache.org</a><br>
> <a href="http://lists.varnish-cache.org/mailman/listinfo/varnish-misc" target="_blank">http://lists.varnish-cache.org/mailman/listinfo/varnish-misc</a><br>
><br>
</blockquote></div><br><br clear="all"><br>-- <br>All the best,<br>Angie<br>
</div>