[squid-users] Squid4 has extremely low hit ratio due to lacks of ignore-no-cache
Eliezer Croitoru
eliezer at ngtech.co.il
Sun Oct 25 19:17:42 UTC 2015
Hey Yuri,
I am not sure if you think that Squid version 4 with extreme low hit
ratio is bad or not but I can understand your sight about things.
Usually I am redirecting to this page:
http://wiki.squid-cache.org/Features/StoreID/CollisionRisks#Several_real_world_examples
But this time I can proudly say that the squid project is doing things
the right way while it might not be understood by some.
Before you or anyone declares that there is a low hit ratio due to
something that is missing I will try to put some sense into how things
looks in the real world.
Small thing from a nice day of mine:
I was sitting talking with a friend of mine, a MD to be exact and while
we were talking I was just comforting him about the wonders of Computers.
He was complaining on how the software in the office moves so slow and
he needs to wait for the software to response with results. So I
hesitated a bit but then I asked him "What would have happen if some MD
here in the office will receive the wrong content\results on a patient
from the software? he described it to me terrified from the question 'He
can get the wrong decision!' and then I described to him how he is in
such a good place when he doesn't need to fear from such scenarios.
In this same office Squid is being used for many things and it's crucial
that besides the option to cache content the possibility to validate
cache properly will be set right.
I do understand that there is a need for caches and sometimes it is
crucial in order to give the application more CPU cycles or more RAM but
sometimes the hunger for cache can consume the actual requirement for
the content integrity and it must be re-validated from time to time.
I have seen couple times how a cache in a DB or other levels results
with a very bad and unwanted result while I do understand some of the
complexity and caution that the programmers take when building all sort
of systems with cache in them.
If you do want to understand more about the subject pick your favorite
scripting language and just try to implement a simple object caching.
You would then see how complex the task can be and you can maybe then
understand why caches are not such a simple thing and specially why
ignore-no-cache should not be used in any environment if it is possible.
While I do advise you to not use it I would hint you and others on
another approach to the subject.
If you are greedy and you have hunger for cache for specific
sites\traffic and you would like to be able to benefit from over-caching
there is a solution for that!
- You can alter\hack squid code to meet your needs
- You can write an ICAP service that will be able to alter the response
headers so squid would think it is cachable by default.
- You can write an ECAP module that will be able to alter the response
headers ...
- Write your own cache service with your algorithms in it.
Take in account that the squid project tries to be as fault tolerance as
possible due to it being a very sensitive piece of software in very big
production systems.
Squid doesn't try to meet the requirement of "Maximum Cache" and it is
not squid that as a caching proxy makes a reduction of any cache percentage!
The reason that the content is not cachable is due to all these
application that describe their content as not cachable!
For a second of sanity from the the squid project, try to contact
google\youtube admins\support\operators\forces\what-ever to understand
how would you be able to benefit from a local cache.
If and when you do manage to contact them let them know I was looking
for a contact and I never managed to find one of these available to me
on the phone or email. You cannot say anything like that on the squid
project, the squid project can be contacted using an email and if
required you can get a hold of the man behind the software(while he is a
human).
And I will try to write it in a geeky way:
deny_info 302:https://support.google.com/youtube/
big_system_that_doesnt_want_to_be_cached
Eliezer
* P.S If you do want to write an ICAP service or an ECAP module to
replace the "ignore-no-cache" I can give you some code that will might
help you as a starter.
On 25/10/2015 17:17, Yuri Voinov wrote:
>
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA256
>
> Hi gents,
>
> Pay attention to whether someone from the test SQUID 4 as extremely low
> of cache hits from the new version? Particularly with respect to sites
> HTTPS directive "no cache"? After replacing the Squid 3.4 to 4 squid
> cache hit collapsed from 85 percent or more on the level of 5-15
> percent. I believe this is due to the exclusion of support guidelines
> ignore-no-cache, which eliminates the possibility of aggressive caching
> and reduces the value of caching proxy to almost zero.
>
> This HTTP caches normally. However, due to the widespread use of HTTPS
> trends - caching dramatically decreased to unacceptable levels.
>
> Noticed there anyone else this effect? And what is now with caching?
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v2
>
> iQEcBAEBCAAGBQJWLPKOAAoJENNXIZxhPexGCx4H/j0R2aAxOPp5K1kYwHPgkBF1
> oH/7nqKRWLbRJ32tqkRtQIE4zbyqqNjmGamRoa59UCK/xs6H3Z8t8Y2Bbkx6umDH
> lwUWjlksVxATVAxbjIWowkmjU4FVc20dM0p6quvz1A9LqdcZHu5x4AzLGLs2re4b
> Dy7urAjn8dA5jgvQ05rTBLkqgOeDUlakyBaMlHaK8VUJ829H3YreSWpbobjCKAIz
> /Bu5pLSRXDvdPqEzOa4MRwSirggntKHET1ThxwVN9xDa1wCc3SW4cRoKmqobmSv/
> F7ryEkTFC05AcCiGb7ArEjGQf7R7zi4PXybOoUIypEyhipvd5hv2PKdw3Dha4OY=
> =m/MT
> -----END PGP SIGNATURE-----
>
>
> _______________________________________________
> squid-users mailing list
> squid-users at lists.squid-cache.org
> http://lists.squid-cache.org/listinfo/squid-users
>
More information about the squid-users
mailing list