<div dir="ltr">Thank you Amos for your detailed explanation. That helped. Appreciate it. </div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Nov 20, 2014 at 8:28 PM, Amos Jeffries <span dir="ltr"><<a href="mailto:squid3@treenet.co.nz" target="_blank">squid3@treenet.co.nz</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">-----BEGIN PGP SIGNED MESSAGE-----<br>

Hash: SHA1<br>

<br>

</span><span class="">On 21/11/2014 3:31 a.m., Swapneel Patnekar wrote:<br>

> Dear Amos,<br>

><br>

> Thank you for your input.<br>

><br>

> The reason I had this requirement is to build a analytics app on<br>

> top of Squid which will only show typed URL's so that the end user<br>

> has the precise information w.r.t evaluating web access.<br>

><br>

> Currently, since everything gets logged into the access.log, there<br>

> is no way to differentiate between typed/bookmark accessed URL's<br>

> and URL's which are references for the main URL i.e CDN URL's for<br>

> js, css etc.<br>

><br>

> Sorry if I am pressing here, but do you have any<br>

> suggestions/alternate solutions for how I could achieve this ?<br>

<br>

</span>No sorry. What you ae asking for is not possible. The information you<br>

are seeking is private to the browser. It rarely goes anywhere else,<br>

and applications that attempt to make it get a CVE security<br>

vulnerability of "information leakage" classification thrown at them<br>

for doing it.<br>

<br>

You will have to go with imprecise data and heuristic algorithms<br>

estimating actions based on the logged URLs and their relative timing.<br>

<br>

<br>

It may help if you discard the idea that you can point at a single URL<br>

and say that is "the page". There are no such things as "pages" in<br>

HTTP. "Page" is one of several abstract concepts representing groups<br>

of URL fetches. In other words a "page" is a whole collection of<br>

transactions in the HTTP log.<br>

 Each time a user views a "page". Some *part* of the URLs forming that<br>

page are fetched. Some parts of the collection may not be fetched. Or<br>

different parts fetched at different "page" views. It depends on what<br>

the browser needs to fetch at the time of each different viewing<br>

(since it has its own cache).<br>

<span class=""><br>

Amos<br>

-----BEGIN PGP SIGNATURE-----<br>

Version: GnuPG v2.0.22 (MingW32)<br>

<br>

</span>iQEcBAEBAgAGBQJUbgGXAAoJELJo5wb/XPRj24QH+wXf8G02wlscUiMO2s99q4I2<br>

ROM1PA16Ye6EieD4I0p+vydIint4nIh10qiuuV+BnsMPgDK95ADdWgl6p2HFWYyK<br>

ZwxzYZ7k4HW9cvJujkGKyJEgIv9UEQmduzljJu35yig50fNIbdYhbMNch4lmjZHg<br>

CYs3TTglE0VOXh5+vYxKLRBR7CzZcYcHzQ2epj5kuSkCfFQSRTyZTsO2bZsTmv38<br>

CufUrkzH6i0XnhYpp9tTwFdXBNPJs2yJI5I0UcDbPBI44IAH+bMCq0HJ83dV1CJA<br>

c4+4EXE+vnWuc67HzmsSWx9KveQvx/2sddex3HqLtWwSXYp328JqHp22oTErwXE=<br>

=dTiW<br>

-----END PGP SIGNATURE-----<br>

</blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature"><div dir="ltr">Best, <div>Swapneel Patnekar</div><div>+91 98866 37820</div></div></div>

</div>