<div dir="ltr">Hi Amos,<div><br></div><div>Thanks for responding</div><div><br></div><div><i><b>"<span style="font-size:12.8px">You would be better off taking the first use of any domain by a client,</span></b></i></div><i><b><span style="font-size:12.8px">then ignoring other requests for it until there is some long period</span><br style="font-size:12.8px"></b></i><div><i><b><span style="font-size:12.8px">between two of them. The opposite of what session helpers do.</span>"</b></i></div><div class="gmail_extra"><br></div><div class="gmail_extra">Could you please elaborate a little on the above logic.</div><div class="gmail_extra"><br></div><div class="gmail_extra">My understanding, if not wrong, is to take domain/host of first client GET request & don't consider the same if it matches with the subsequent GET requests.</div><div class="gmail_extra"><br></div><div class="gmail_extra">In this case there is possibility of multiple unique domains/hosts for single page (Eg. other domain Ads, analytics etc)?</div><div class="gmail_extra"><br></div><div class="gmail_extra"><div class="gmail_quote"><br></div><div class="gmail_quote">On Sat, Oct 10, 2015 at 10:57 AM, Amos Jeffries <span dir="ltr"><<a href="mailto:squid3@treenet.co.nz" target="_blank">squid3@treenet.co.nz</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On 10/10/2015 1:47 a.m., Ambadas H wrote:<br>

> Hi,<br>

><br>

> I am using below setup:<br>

> Squid proxy 3.5.4.<br>

> CentOS 7.1<br>

><br>

> I am trying to analyze the most used websites by the users via Squid proxy.<br>

> I just require the first GET request for that particular browsed page page<br>

> & not the proceeding GETs of that same page.<br>

><br>

> Eg:<br>

</span>> 1) user enters *<a href="http://google.com" rel="noreferrer" target="_blank">http://google.com</a> <<a href="http://google.com" rel="noreferrer" target="_blank">http://google.com</a>>* in client (mozilla)<br>

<span class="">> 2) client gets page containing some other urls<br>

> 3) client initiates multiple GETs for same requested page without users<br>

> knowledge<br>

><br>

> I myself tried a logic where I assumed if "Referer" header is present, then<br>

> its not the first GET but a proceeding one for same requested page.<br>

><br>

> I know i cant rely on "Referer" header to be always present as its not<br>

> mandatory. But<br>

> I want to know if my logic is correct? & also if there's any alternative<br>

> solution?<br>

<br>

</span>Your assumption is wrong. Referer header (when it exists) is tracking a<br>

whole browsing session, not a particular website or page.<br>

<br>

You would be better off taking the first use of any domain by a client,<br>

then ignoring other requests for it until there is some long period<br>

between two of them. The opposite of what session helpers do.<br>

<br>

Amos<br>

<br>

_______________________________________________<br>

squid-users mailing list<br>

<a href="mailto:squid-users@lists.squid-cache.org">squid-users@lists.squid-cache.org</a><br>

<a href="http://lists.squid-cache.org/listinfo/squid-users" rel="noreferrer" target="_blank">http://lists.squid-cache.org/listinfo/squid-users</a><br>

</blockquote></div><br></div></div>