Regenerated documentation after the ChangeLog import
[privoxy.git] / doc / webserver / user-manual / actions-file.html
1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
2 "http://www.w3.org/TR/html4/loose.dtd">
3
4 <html>
5 <head>
6   <title>Actions Files</title>
7   <meta name="GENERATOR" content=
8   "Modular DocBook HTML Stylesheet Version 1.79">
9   <link rel="HOME" title="Privoxy 3.0.20 User Manual" href="index.html">
10   <link rel="PREVIOUS" title="The Main Configuration File" href=
11   "config.html">
12   <link rel="NEXT" title="Filter Files" href="filter-file.html">
13   <link rel="STYLESHEET" type="text/css" href="../p_doc.css">
14   <meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
15   <link rel="STYLESHEET" type="text/css" href="p_doc.css">
16 </head>
17
18 <body class="SECT1" bgcolor="#EEEEEE" text="#000000" link="#0000FF" vlink=
19 "#840084" alink="#0000FF">
20   <div class="NAVHEADER">
21     <table summary="Header navigation table" width="100%" border="0"
22     cellpadding="0" cellspacing="0">
23       <tr>
24         <th colspan="3" align="center">Privoxy 3.0.20 User Manual</th>
25       </tr>
26
27       <tr>
28         <td width="10%" align="left" valign="bottom"><a href="config.html"
29         accesskey="P">Prev</a></td>
30
31         <td width="80%" align="center" valign="bottom"></td>
32
33         <td width="10%" align="right" valign="bottom"><a href=
34         "filter-file.html" accesskey="N">Next</a></td>
35       </tr>
36     </table>
37     <hr align="left" width="100%">
38   </div>
39
40   <div class="SECT1">
41     <h1 class="SECT1"><a name="ACTIONS-FILE" id="ACTIONS-FILE">8. Actions
42     Files</a></h1>
43
44     <p>The actions files are used to define what <span class=
45     "emphasis"><i class="EMPHASIS">actions</i></span> <span class=
46     "APPLICATION">Privoxy</span> takes for which URLs, and thus determines
47     how ad images, cookies and various other aspects of HTTP content and
48     transactions are handled, and on which sites (or even parts thereof).
49     There are a number of such actions, with a wide range of functionality.
50     Each action does something a little different. These actions give us a
51     veritable arsenal of tools with which to exert our control, preferences
52     and independence. Actions can be combined so that their effects are
53     aggregated when applied against a given set of URLs.</p>
54
55     <p>There are three action files included with <span class=
56     "APPLICATION">Privoxy</span> with differing purposes:</p>
57
58     <ul>
59       <li>
60         <p><tt class="FILENAME">match-all.action</tt> - is used to define
61         which <span class="QUOTE">"actions"</span> relating to
62         banner-blocking, images, pop-ups, content modification, cookie
63         handling etc should be applied by default. It should be the first
64         actions file loaded</p>
65       </li>
66
67       <li>
68         <p><tt class="FILENAME">default.action</tt> - defines many exceptions
69         (both positive and negative) from the default set of actions that's
70         configured in <tt class="FILENAME">match-all.action</tt>. It is a set
71         of rules that should work reasonably well as-is for most users. This
72         file is only supposed to be edited by the developers. It should be
73         the second actions file loaded.</p>
74       </li>
75
76       <li>
77         <p><tt class="FILENAME">user.action</tt> - is intended to be for
78         local site preferences and exceptions. As an example, if your ISP or
79         your bank has specific requirements, and need special handling, this
80         kind of thing should go here. This file will not be upgraded.</p>
81       </li>
82
83       <li>
84         <p><span class="GUIBUTTON">Edit</span> <span class="GUIBUTTON">Set to
85         Cautious</span> <span class="GUIBUTTON">Set to Medium</span>
86         <span class="GUIBUTTON">Set to Advanced</span></p>
87
88         <p>These have increasing levels of aggressiveness <span class=
89         "emphasis"><i class="EMPHASIS">and have no influence on your browsing
90         unless you select them explicitly in the editor</i></span>. A default
91         installation should be pre-set to <tt class="LITERAL">Cautious</tt>.
92         New users should try this for a while before adjusting the settings
93         to more aggressive levels. The more aggressive the settings, then the
94         more likelihood there is of problems such as sites not working as
95         they should.</p>
96
97         <p>The <span class="GUIBUTTON">Edit</span> button allows you to turn
98         each action on/off individually for fine-tuning. The <span class=
99         "GUIBUTTON">Cautious</span> button changes the actions list to
100         low/safe settings which will activate ad blocking and a minimal set
101         of <span class="APPLICATION">Privoxy</span>'s features, and
102         subsequently there will be less of a chance for accidental problems.
103         The <span class="GUIBUTTON">Medium</span> button sets the list to a
104         medium level of other features and a low level set of privacy
105         features. The <span class="GUIBUTTON">Advanced</span> button sets the
106         list to a high level of ad blocking and medium level of privacy. See
107         the chart below. The latter three buttons over-ride any changes via
108         with the <span class="GUIBUTTON">Edit</span> button. More fine-tuning
109         can be done in the lower sections of this internal page.</p>
110
111         <p>While the actions file editor allows to enable these settings in
112         all actions files, they are only supposed to be enabled in the first
113         one to make sure you don't unintentionally overrule earlier
114         rules.</p>
115
116         <p>The default profiles, and their associated actions, as pre-defined
117         in <tt class="FILENAME">default.action</tt> are:</p>
118
119         <div class="TABLE">
120           <a name="AEN2823" id="AEN2823"></a>
121
122           <p><b>Table 1. Default Configurations</b></p>
123
124           <table border="1" frame="border" rules="all" class="CALSTABLE">
125             <col width="1*" title="C1">
126             <col width="1*" title="C2">
127             <col width="1*" title="C3">
128             <col width="1*" title="C4">
129
130             <thead>
131               <tr>
132                 <th>Feature</th>
133
134                 <th>Cautious</th>
135
136                 <th>Medium</th>
137
138                 <th>Advanced</th>
139               </tr>
140             </thead>
141
142             <tbody>
143               <tr>
144                 <td>Ad-blocking Aggressiveness</td>
145
146                 <td>medium</td>
147
148                 <td>high</td>
149
150                 <td>high</td>
151               </tr>
152
153               <tr>
154                 <td>Ad-filtering by size</td>
155
156                 <td>no</td>
157
158                 <td>yes</td>
159
160                 <td>yes</td>
161               </tr>
162
163               <tr>
164                 <td>Ad-filtering by link</td>
165
166                 <td>no</td>
167
168                 <td>no</td>
169
170                 <td>yes</td>
171               </tr>
172
173               <tr>
174                 <td>Pop-up killing</td>
175
176                 <td>blocks only</td>
177
178                 <td>blocks only</td>
179
180                 <td>blocks only</td>
181               </tr>
182
183               <tr>
184                 <td>Privacy Features</td>
185
186                 <td>low</td>
187
188                 <td>medium</td>
189
190                 <td>medium/high</td>
191               </tr>
192
193               <tr>
194                 <td>Cookie handling</td>
195
196                 <td>none</td>
197
198                 <td>session-only</td>
199
200                 <td>kill</td>
201               </tr>
202
203               <tr>
204                 <td>Referer forging</td>
205
206                 <td>no</td>
207
208                 <td>yes</td>
209
210                 <td>yes</td>
211               </tr>
212
213               <tr>
214                 <td>GIF de-animation</td>
215
216                 <td>no</td>
217
218                 <td>yes</td>
219
220                 <td>yes</td>
221               </tr>
222
223               <tr>
224                 <td>Fast redirects</td>
225
226                 <td>no</td>
227
228                 <td>no</td>
229
230                 <td>yes</td>
231               </tr>
232
233               <tr>
234                 <td>HTML taming</td>
235
236                 <td>no</td>
237
238                 <td>no</td>
239
240                 <td>yes</td>
241               </tr>
242
243               <tr>
244                 <td>JavaScript taming</td>
245
246                 <td>no</td>
247
248                 <td>no</td>
249
250                 <td>yes</td>
251               </tr>
252
253               <tr>
254                 <td>Web-bug killing</td>
255
256                 <td>no</td>
257
258                 <td>yes</td>
259
260                 <td>yes</td>
261               </tr>
262
263               <tr>
264                 <td>Image tag reordering</td>
265
266                 <td>no</td>
267
268                 <td>yes</td>
269
270                 <td>yes</td>
271               </tr>
272             </tbody>
273           </table>
274         </div>
275       </li>
276     </ul>
277
278     <p>The list of actions files to be used are defined in the main
279     configuration file, and are processed in the order they are defined (e.g.
280     <tt class="FILENAME">default.action</tt> is typically processed before
281     <tt class="FILENAME">user.action</tt>). The content of these can all be
282     viewed and edited from <a href="http://config.privoxy.org/show-status"
283     target="_top">http://config.privoxy.org/show-status</a>. The over-riding
284     principle when applying actions, is that the last action that matches a
285     given URL wins. The broadest, most general rules go first (defined in
286     <tt class="FILENAME">default.action</tt>), followed by any exceptions
287     (typically also in <tt class="FILENAME">default.action</tt>), which are
288     then followed lastly by any local preferences (typically in <span class=
289     "emphasis"><i class="EMPHASIS">user</i></span><tt class=
290     "FILENAME">.action</tt>). Generally, <tt class=
291     "FILENAME">user.action</tt> has the last word.</p>
292
293     <p>An actions file typically has multiple sections. If you want to use
294     <span class="QUOTE">"aliases"</span> in an actions file, you have to
295     place the (optional) <a href="actions-file.html#ALIASES">alias
296     section</a> at the top of that file. Then comes the default set of rules
297     which will apply universally to all sites and pages (be <span class=
298     "emphasis"><i class="EMPHASIS">very careful</i></span> with using such a
299     universal set in <tt class="FILENAME">user.action</tt> or any other
300     actions file after <tt class="FILENAME">default.action</tt>, because it
301     will override the result from consulting any previous file). And then
302     below that, exceptions to the defined universal policies. You can regard
303     <tt class="FILENAME">user.action</tt> as an appendix to <tt class=
304     "FILENAME">default.action</tt>, with the advantage that it is a separate
305     file, which makes preserving your personal settings across <span class=
306     "APPLICATION">Privoxy</span> upgrades easier.</p>
307
308     <p>Actions can be used to block anything you want, including ads,
309     banners, or just some obnoxious URL whose content you would rather not
310     see. Cookies can be accepted or rejected, or accepted only during the
311     current browser session (i.e. not written to disk), content can be
312     modified, some JavaScripts tamed, user-tracking fooled, and much more.
313     See below for a <a href="actions-file.html#ACTIONS">complete list of
314     actions</a>.</p>
315
316     <div class="SECT2">
317       <h2 class="SECT2"><a name="AEN2922" id="AEN2922">8.1. Finding the Right
318       Mix</a></h2>
319
320       <p>Note that some <a href="actions-file.html#ACTIONS">actions</a>, like
321       cookie suppression or script disabling, may render some sites unusable
322       that rely on these techniques to work properly. Finding the right mix
323       of actions is not always easy and certainly a matter of personal taste.
324       And, things can always change, requiring refinements in the
325       configuration. In general, it can be said that the more <span class=
326       "QUOTE">"aggressive"</span> your default settings (in the top section
327       of the actions file) are, the more exceptions for <span class=
328       "QUOTE">"trusted"</span> sites you will have to make later. If, for
329       example, you want to crunch all cookies per default, you'll have to
330       make exceptions from that rule for sites that you regularly use and
331       that require cookies for actually useful purposes, like maybe your
332       bank, favorite shop, or newspaper.</p>
333
334       <p>We have tried to provide you with reasonable rules to start from in
335       the distribution actions files. But there is no general rule of thumb
336       on these things. There just are too many variables, and sites are
337       constantly changing. Sooner or later you will want to change the rules
338       (and read this chapter again :).</p>
339     </div>
340
341     <div class="SECT2">
342       <h2 class="SECT2"><a name="AEN2929" id="AEN2929">8.2. How to
343       Edit</a></h2>
344
345       <p>The easiest way to edit the actions files is with a browser by using
346       our browser-based editor, which can be reached from <a href=
347       "http://config.privoxy.org/show-status" target=
348       "_top">http://config.privoxy.org/show-status</a>. Note: the config file
349       option <a href=
350       "config.html#ENABLE-EDIT-ACTIONS">enable-edit-actions</a> must be
351       enabled for this to work. The editor allows both fine-grained control
352       over every single feature on a per-URL basis, and easy choosing from
353       wholesale sets of defaults like <span class="QUOTE">"Cautious"</span>,
354       <span class="QUOTE">"Medium"</span> or <span class=
355       "QUOTE">"Advanced"</span>. Warning: the <span class=
356       "QUOTE">"Advanced"</span> setting is more aggressive, and will be more
357       likely to cause problems for some sites. Experienced users only!</p>
358
359       <p>If you prefer plain text editing to GUIs, you can of course also
360       directly edit the the actions files with your favorite text editor.
361       Look at <tt class="FILENAME">default.action</tt> which is richly
362       commented with many good examples.</p>
363     </div>
364
365     <div class="SECT2">
366       <h2 class="SECT2"><a name="ACTIONS-APPLY" id="ACTIONS-APPLY">8.3. How
367       Actions are Applied to Requests</a></h2>
368
369       <p>Actions files are divided into sections. There are special sections,
370       like the <span class="QUOTE">"<a href=
371       "actions-file.html#ALIASES">alias</a>"</span> sections which will be
372       discussed later. For now let's concentrate on regular sections: They
373       have a heading line (often split up to multiple lines for readability)
374       which consist of a list of actions, separated by whitespace and
375       enclosed in curly braces. Below that, there is a list of URL and tag
376       patterns, each on a separate line.</p>
377
378       <p>To determine which actions apply to a request, the URL of the
379       request is compared to all URL patterns in each <span class=
380       "QUOTE">"action file"</span>. Every time it matches, the list of
381       applicable actions for the request is incrementally updated, using the
382       heading of the section in which the pattern is located. The same is
383       done again for tags and tag patterns later on.</p>
384
385       <p>If multiple applying sections set the same action differently, the
386       last match wins. If not, the effects are aggregated. E.g. a URL might
387       match a regular section with a heading line of <tt class="LITERAL">{
388       +<a href="actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a>
389       }</tt>, then later another one with just <tt class="LITERAL">{
390       +<a href="actions-file.html#BLOCK">block</a> }</tt>, resulting in
391       <span class="emphasis"><i class="EMPHASIS">both</i></span> actions to
392       apply. And there may well be cases where you will want to combine
393       actions together. Such a section then might look like:</p>
394
395       <table border="0" bgcolor="#E0E0E0" width="100%">
396         <tr>
397           <td>
398             <pre class="SCREEN">
399   { +<tt class="LITERAL">handle-as-image</tt>  +<tt class=
400 "LITERAL">block{Banner ads.}</tt> }
401   # Block these as if they were images. Send no block page.
402    banners.example.com
403    media.example.com/.*banners
404    .example.com/images/ads/
405 </pre>
406           </td>
407         </tr>
408       </table>
409
410       <p>You can trace this process for URL patterns and any given URL by
411       visiting <a href="http://config.privoxy.org/show-url-info" target=
412       "_top">http://config.privoxy.org/show-url-info</a>.</p>
413
414       <p>Examples and more detail on this is provided in the Appendix,
415       <a href="appendix.html#ACTIONSANAT">Troubleshooting: Anatomy of an
416       Action</a> section.</p>
417     </div>
418
419     <div class="SECT2">
420       <h2 class="SECT2"><a name="AF-PATTERNS" id="AF-PATTERNS">8.4.
421       Patterns</a></h2>
422
423       <p>As mentioned, <span class="APPLICATION">Privoxy</span> uses
424       <span class="QUOTE">"patterns"</span> to determine what <span class=
425       "emphasis"><i class="EMPHASIS">actions</i></span> might apply to which
426       sites and pages your browser attempts to access. These <span class=
427       "QUOTE">"patterns"</span> use wild card type <span class=
428       "emphasis"><i class="EMPHASIS">pattern</i></span> matching to achieve a
429       high degree of flexibility. This allows one expression to be expanded
430       and potentially match against many similar patterns.</p>
431
432       <p>Generally, an URL pattern has the form <tt class=
433       "LITERAL">&lt;domain&gt;&lt;port&gt;/&lt;path&gt;</tt>, where the
434       <tt class="LITERAL">&lt;domain&gt;</tt>, the <tt class=
435       "LITERAL">&lt;port&gt;</tt> and the <tt class=
436       "LITERAL">&lt;path&gt;</tt> are optional. (This is why the special
437       <tt class="LITERAL">/</tt> pattern matches all URLs). Note that the
438       protocol portion of the URL pattern (e.g. <tt class=
439       "LITERAL">http://</tt>) should <span class="emphasis"><i class=
440       "EMPHASIS">not</i></span> be included in the pattern. This is assumed
441       already!</p>
442
443       <p>The pattern matching syntax is different for the domain and path
444       parts of the URL. The domain part uses a simple globbing type matching
445       technique, while the path part uses more flexible <a href=
446       "http://en.wikipedia.org/wiki/Regular_expressions" target=
447       "_top"><span class="QUOTE">"Regular Expressions"</span></a> (POSIX
448       1003.2).</p>
449
450       <p>The port part of a pattern is a decimal port number preceded by a
451       colon (<tt class="LITERAL">:</tt>). If the domain part contains a
452       numerical IPv6 address, it has to be put into angle brackets
453       (<tt class="LITERAL">&lt;</tt>, <tt class="LITERAL">&gt;</tt>).</p>
454
455       <div class="VARIABLELIST">
456         <dl>
457           <dt><tt class="LITERAL">www.example.com/</tt></dt>
458
459           <dd>
460             <p>is a domain-only pattern and will match any request to
461             <tt class="LITERAL">www.example.com</tt>, regardless of which
462             document on that server is requested. So ALL pages in this domain
463             would be covered by the scope of this action. Note that a simple
464             <tt class="LITERAL">example.com</tt> is different and would NOT
465             match.</p>
466           </dd>
467
468           <dt><tt class="LITERAL">www.example.com</tt></dt>
469
470           <dd>
471             <p>means exactly the same. For domain-only patterns, the trailing
472             <tt class="LITERAL">/</tt> may be omitted.</p>
473           </dd>
474
475           <dt><tt class="LITERAL">www.example.com/index.html</tt></dt>
476
477           <dd>
478             <p>matches all the documents on <tt class=
479             "LITERAL">www.example.com</tt> whose name starts with <tt class=
480             "LITERAL">/index.html</tt>.</p>
481           </dd>
482
483           <dt><tt class="LITERAL">www.example.com/index.html$</tt></dt>
484
485           <dd>
486             <p>matches only the single document <tt class=
487             "LITERAL">/index.html</tt> on <tt class=
488             "LITERAL">www.example.com</tt>.</p>
489           </dd>
490
491           <dt><tt class="LITERAL">/index.html$</tt></dt>
492
493           <dd>
494             <p>matches the document <tt class="LITERAL">/index.html</tt>,
495             regardless of the domain, i.e. on <span class=
496             "emphasis"><i class="EMPHASIS">any</i></span> web server
497             anywhere.</p>
498           </dd>
499
500           <dt><tt class="LITERAL">/</tt></dt>
501
502           <dd>
503             <p>Matches any URL because there's no requirement for either the
504             domain or the path to match anything.</p>
505           </dd>
506
507           <dt><tt class="LITERAL">:8000/</tt></dt>
508
509           <dd>
510             <p>Matches any URL pointing to TCP port 8000.</p>
511           </dd>
512
513           <dt><tt class="LITERAL">&lt;2001:db8::1&gt;/</tt></dt>
514
515           <dd>
516             <p>Matches any URL with the host address <tt class=
517             "LITERAL">2001:db8::1</tt>. (Note that the real URL uses plain
518             brackets, not angle brackets.)</p>
519           </dd>
520
521           <dt><tt class="LITERAL">index.html</tt></dt>
522
523           <dd>
524             <p>matches nothing, since it would be interpreted as a domain
525             name and there is no top-level domain called <tt class=
526             "LITERAL">.html</tt>. So its a mistake.</p>
527           </dd>
528         </dl>
529       </div>
530
531       <div class="SECT3">
532         <h3 class="SECT3"><a name="AEN3041" id="AEN3041">8.4.1. The Domain
533         Pattern</a></h3>
534
535         <p>The matching of the domain part offers some flexible options: if
536         the domain starts or ends with a dot, it becomes unanchored at that
537         end. For example:</p>
538
539         <div class="VARIABLELIST">
540           <dl>
541             <dt><tt class="LITERAL">.example.com</tt></dt>
542
543             <dd>
544               <p>matches any domain with first-level domain <tt class=
545               "LITERAL">com</tt> and second-level domain <tt class=
546               "LITERAL">example</tt>. For example <tt class=
547               "LITERAL">www.example.com</tt>, <tt class=
548               "LITERAL">example.com</tt> and <tt class=
549               "LITERAL">foo.bar.baz.example.com</tt>. Note that it wouldn't
550               match if the second-level domain was <tt class=
551               "LITERAL">another-example</tt>.</p>
552             </dd>
553
554             <dt><tt class="LITERAL">www.</tt></dt>
555
556             <dd>
557               <p>matches any domain that <span class="emphasis"><i class=
558               "EMPHASIS">STARTS</i></span> with <tt class="LITERAL">www.</tt>
559               (It also matches the domain <tt class="LITERAL">www</tt> but
560               most of the time that doesn't matter.)</p>
561             </dd>
562
563             <dt><tt class="LITERAL">.example.</tt></dt>
564
565             <dd>
566               <p>matches any domain that <span class="emphasis"><i class=
567               "EMPHASIS">CONTAINS</i></span> <tt class=
568               "LITERAL">.example.</tt>. And, by the way, also included would
569               be any files or documents that exist within that domain since
570               no path limitations are specified. (Correctly speaking: It
571               matches any FQDN that contains <tt class="LITERAL">example</tt>
572               as a domain.) This might be <tt class=
573               "LITERAL">www.example.com</tt>, <tt class=
574               "LITERAL">news.example.de</tt>, or <tt class=
575               "LITERAL">www.example.net/cgi/testing.pl</tt> for instance. All
576               these cases are matched.</p>
577             </dd>
578           </dl>
579         </div>
580
581         <p>Additionally, there are wild-cards that you can use in the domain
582         names themselves. These work similarly to shell globbing type
583         wild-cards: <span class="QUOTE">"*"</span> represents zero or more
584         arbitrary characters (this is equivalent to the <a href=
585         "http://en.wikipedia.org/wiki/Regular_expressions" target=
586         "_top"><span class="QUOTE">"Regular Expression"</span></a> based
587         syntax of <span class="QUOTE">".*"</span>), <span class=
588         "QUOTE">"?"</span> represents any single character (this is
589         equivalent to the regular expression syntax of a simple <span class=
590         "QUOTE">"."</span>), and you can define <span class=
591         "QUOTE">"character classes"</span> in square brackets which is
592         similar to the same regular expression technique. All of this can be
593         freely mixed:</p>
594
595         <div class="VARIABLELIST">
596           <dl>
597             <dt><tt class="LITERAL">ad*.example.com</tt></dt>
598
599             <dd>
600               <p>matches <span class="QUOTE">"adserver.example.com"</span>,
601               <span class="QUOTE">"ads.example.com"</span>, etc but not
602               <span class="QUOTE">"sfads.example.com"</span></p>
603             </dd>
604
605             <dt><tt class="LITERAL">*ad*.example.com</tt></dt>
606
607             <dd>
608               <p>matches all of the above, and then some.</p>
609             </dd>
610
611             <dt><tt class="LITERAL">.?pix.com</tt></dt>
612
613             <dd>
614               <p>matches <tt class="LITERAL">www.ipix.com</tt>, <tt class=
615               "LITERAL">pictures.epix.com</tt>, <tt class=
616               "LITERAL">a.b.c.d.e.upix.com</tt> etc.</p>
617             </dd>
618
619             <dt><tt class="LITERAL">www[1-9a-ez].example.c*</tt></dt>
620
621             <dd>
622               <p>matches <tt class="LITERAL">www1.example.com</tt>,
623               <tt class="LITERAL">www4.example.cc</tt>, <tt class=
624               "LITERAL">wwwd.example.cy</tt>, <tt class=
625               "LITERAL">wwwz.example.com</tt> etc., but <span class=
626               "emphasis"><i class="EMPHASIS">not</i></span> <tt class=
627               "LITERAL">wwww.example.com</tt>.</p>
628             </dd>
629           </dl>
630         </div>
631
632         <p>While flexible, this is not the sophistication of full regular
633         expression based syntax.</p>
634       </div>
635
636       <div class="SECT3">
637         <h3 class="SECT3"><a name="AEN3117" id="AEN3117">8.4.2. The Path
638         Pattern</a></h3>
639
640         <p><span class="APPLICATION">Privoxy</span> uses <span class=
641         "QUOTE">"modern"</span> POSIX 1003.2 <a href=
642         "http://en.wikipedia.org/wiki/Regular_expressions" target=
643         "_top"><span class="QUOTE">"Regular Expressions"</span></a> for
644         matching the path portion (after the slash), and is thus more
645         flexible.</p>
646
647         <p>There is an <a href="appendix.html#REGEX">Appendix</a> with a
648         brief quick-start into regular expressions, you also might want to
649         have a look at your operating system's documentation on regular
650         expressions (try <tt class="LITERAL">man re_format</tt>).</p>
651
652         <p>Note that the path pattern is automatically left-anchored at the
653         <span class="QUOTE">"/"</span>, i.e. it matches as if it would start
654         with a <span class="QUOTE">"^"</span> (regular expression speak for
655         the beginning of a line).</p>
656
657         <p>Please also note that matching in the path is <span class=
658         "emphasis"><i class="EMPHASIS">CASE INSENSITIVE</i></span> by
659         default, but you can switch to case sensitive at any point in the
660         pattern by using the <span class="QUOTE">"(?-i)"</span> switch:
661         <tt class="LITERAL">www.example.com/(?-i)PaTtErN.*</tt> will match
662         only documents whose path starts with <tt class=
663         "LITERAL">PaTtErN</tt> in <span class="emphasis"><i class=
664         "EMPHASIS">exactly</i></span> this capitalization.</p>
665
666         <div class="VARIABLELIST">
667           <dl>
668             <dt><tt class="LITERAL">.example.com/.*</tt></dt>
669
670             <dd>
671               <p>Is equivalent to just <span class=
672               "QUOTE">".example.com"</span>, since any documents within that
673               domain are matched with or without the <span class=
674               "QUOTE">".*"</span> regular expression. This is redundant</p>
675             </dd>
676
677             <dt><tt class="LITERAL">.example.com/.*/index.html$</tt></dt>
678
679             <dd>
680               <p>Will match any page in the domain of <span class=
681               "QUOTE">"example.com"</span> that is named <span class=
682               "QUOTE">"index.html"</span>, and that is part of some path. For
683               example, it matches <span class=
684               "QUOTE">"www.example.com/testing/index.html"</span> but NOT
685               <span class="QUOTE">"www.example.com/index.html"</span> because
686               the regular expression called for at least two <span class=
687               "QUOTE">"/'s"</span>, thus the path requirement. It also would
688               match <span class=
689               "QUOTE">"www.example.com/testing/index_html"</span>, because of
690               the special meta-character <span class="QUOTE">"."</span>.</p>
691             </dd>
692
693             <dt><tt class="LITERAL">.example.com/(.*/)?index\.html$</tt></dt>
694
695             <dd>
696               <p>This regular expression is conditional so it will match any
697               page named <span class="QUOTE">"index.html"</span> regardless
698               of path which in this case can have one or more <span class=
699               "QUOTE">"/'s"</span>. And this one must contain exactly
700               <span class="QUOTE">".html"</span> (but does not have to end
701               with that!).</p>
702             </dd>
703
704             <dt><tt class=
705             "LITERAL">.example.com/(.*/)(ads|banners?|junk)</tt></dt>
706
707             <dd>
708               <p>This regular expression will match any path of <span class=
709               "QUOTE">"example.com"</span> that contains any of the words
710               <span class="QUOTE">"ads"</span>, <span class=
711               "QUOTE">"banner"</span>, <span class="QUOTE">"banners"</span>
712               (because of the <span class="QUOTE">"?"</span>) or <span class=
713               "QUOTE">"junk"</span>. The path does not have to end in these
714               words, just contain them.</p>
715             </dd>
716
717             <dt><tt class=
718             "LITERAL">.example.com/(.*/)(ads|banners?|junk)/.*\.(jpe?g|gif|png)$</tt></dt>
719
720             <dd>
721               <p>This is very much the same as above, except now it must end
722               in either <span class="QUOTE">".jpg"</span>, <span class=
723               "QUOTE">".jpeg"</span>, <span class="QUOTE">".gif"</span> or
724               <span class="QUOTE">".png"</span>. So this one is limited to
725               common image formats.</p>
726             </dd>
727           </dl>
728         </div>
729
730         <p>There are many, many good examples to be found in <tt class=
731         "FILENAME">default.action</tt>, and more tutorials below in <a href=
732         "appendix.html#REGEX">Appendix on regular expressions</a>.</p>
733       </div>
734
735       <div class="SECT3">
736         <h3 class="SECT3"><a name="TAG-PATTERN" id="TAG-PATTERN">8.4.3. The
737         Tag Pattern</a></h3>
738
739         <p>Tag patterns are used to change the applying actions based on the
740         request's tags. Tags can be created with either the <a href=
741         "actions-file.html#CLIENT-HEADER-TAGGER">client-header-tagger</a> or
742         the <a href=
743         "actions-file.html#SERVER-HEADER-TAGGER">server-header-tagger</a>
744         action.</p>
745
746         <p>Tag patterns have to start with <span class="QUOTE">"TAG:"</span>,
747         so <span class="APPLICATION">Privoxy</span> can tell them apart from
748         URL patterns. Everything after the colon including white space, is
749         interpreted as a regular expression with path pattern syntax, except
750         that tag patterns aren't left-anchored automatically (<span class=
751         "APPLICATION">Privoxy</span> doesn't silently add a <span class=
752         "QUOTE">"^"</span>, you have to do it yourself if you need it).</p>
753
754         <p>To match all requests that are tagged with <span class=
755         "QUOTE">"foo"</span> your pattern line should be <span class=
756         "QUOTE">"TAG:^foo$"</span>, <span class="QUOTE">"TAG:foo"</span>
757         would work as well, but it would also match requests whose tags
758         contain <span class="QUOTE">"foo"</span> somewhere. <span class=
759         "QUOTE">"TAG: foo"</span> wouldn't work as it requires white
760         space.</p>
761
762         <p>Sections can contain URL and tag patterns at the same time, but
763         tag patterns are checked after the URL patterns and thus always
764         overrule them, even if they are located before the URL patterns.</p>
765
766         <p>Once a new tag is added, Privoxy checks right away if it's matched
767         by one of the tag patterns and updates the action settings
768         accordingly. As a result tags can be used to activate other tagger
769         actions, as long as these other taggers look for headers that haven't
770         already be parsed.</p>
771
772         <p>For example you could tag client requests which use the <tt class=
773         "LITERAL">POST</tt> method, then use this tag to activate another
774         tagger that adds a tag if cookies are sent, and then use a block
775         action based on the cookie tag. This allows the outcome of one
776         action, to be input into a subsequent action. However if you'd
777         reverse the position of the described taggers, and activated the
778         method tagger based on the cookie tagger, no method tags would be
779         created. The method tagger would look for the request line, but at
780         the time the cookie tag is created, the request line has already been
781         parsed.</p>
782
783         <p>While this is a limitation you should be aware of, this kind of
784         indirection is seldom needed anyway and even the example doesn't make
785         too much sense.</p>
786       </div>
787     </div>
788
789     <div class="SECT2">
790       <h2 class="SECT2"><a name="ACTIONS" id="ACTIONS">8.5. Actions</a></h2>
791
792       <p>All actions are disabled by default, until they are explicitly
793       enabled somewhere in an actions file. Actions are turned on if preceded
794       with a <span class="QUOTE">"+"</span>, and turned off if preceded with
795       a <span class="QUOTE">"-"</span>. So a <tt class="LITERAL">+action</tt>
796       means <span class="QUOTE">"do that action"</span>, e.g. <tt class=
797       "LITERAL">+block</tt> means <span class="QUOTE">"please block URLs that
798       match the following patterns"</span>, and <tt class=
799       "LITERAL">-block</tt> means <span class="QUOTE">"don't block URLs that
800       match the following patterns, even if <tt class="LITERAL">+block</tt>
801       previously applied."</span></p>
802
803       <p>Again, actions are invoked by placing them on a line, enclosed in
804       curly braces and separated by whitespace, like in <tt class=
805       "LITERAL">{+some-action -some-other-action{some-parameter}}</tt>,
806       followed by a list of URL patterns, one per line, to which they apply.
807       Together, the actions line and the following pattern lines make up a
808       section of the actions file.</p>
809
810       <p>Actions fall into three categories:</p>
811
812       <ul>
813         <li>
814           <p>Boolean, i.e the action can only be <span class=
815           "QUOTE">"enabled"</span> or <span class="QUOTE">"disabled"</span>.
816           Syntax:</p>
817
818           <table border="0" bgcolor="#E0E0E0" width="90%">
819             <tr>
820               <td>
821                 <pre class="SCREEN">
822   +<tt class="REPLACEABLE"><i>name</i></tt>        # enable action <tt class=
823 "REPLACEABLE"><i>name</i></tt>
824   -<tt class=
825 "REPLACEABLE"><i>name</i></tt>        # disable action <tt class="REPLACEABLE"><i>name</i></tt>
826 </pre>
827               </td>
828             </tr>
829           </table>
830
831           <p>Example: <tt class="LITERAL">+handle-as-image</tt></p>
832         </li>
833
834         <li>
835           <p>Parameterized, where some value is required in order to enable
836           this type of action. Syntax:</p>
837
838           <table border="0" bgcolor="#E0E0E0" width="90%">
839             <tr>
840               <td>
841                 <pre class="SCREEN">
842   +<tt class="REPLACEABLE"><i>name</i></tt>{<tt class=
843 "REPLACEABLE"><i>param</i></tt>}  # enable action and set parameter to <tt class="REPLACEABLE"><i>param</i></tt>,
844                # overwriting parameter from previous match if necessary
845   -<tt class=
846 "REPLACEABLE"><i>name</i></tt>         # disable action. The parameter can be omitted
847 </pre>
848               </td>
849             </tr>
850           </table>
851
852           <p>Note that if the URL matches multiple positive forms of a
853           parameterized action, the last match wins, i.e. the params from
854           earlier matches are simply ignored.</p>
855
856           <p>Example: <tt class="LITERAL">+hide-user-agent{Mozilla/5.0 (X11;
857           U; FreeBSD i386; en-US; rv:1.8.1.4) Gecko/20070602
858           Firefox/2.0.0.4}</tt></p>
859         </li>
860
861         <li>
862           <p>Multi-value. These look exactly like parameterized actions, but
863           they behave differently: If the action applies multiple times to
864           the same URL, but with different parameters, <span class=
865           "emphasis"><i class="EMPHASIS">all</i></span> the parameters from
866           <span class="emphasis"><i class="EMPHASIS">all</i></span> matches
867           are remembered. This is used for actions that can be executed for
868           the same request repeatedly, like adding multiple headers, or
869           filtering through multiple filters. Syntax:</p>
870
871           <table border="0" bgcolor="#E0E0E0" width="90%">
872             <tr>
873               <td>
874                 <pre class="SCREEN">
875   +<tt class="REPLACEABLE"><i>name</i></tt>{<tt class=
876 "REPLACEABLE"><i>param</i></tt>}   # enable action and add <tt class=
877 "REPLACEABLE"><i>param</i></tt> to the list of parameters
878   -<tt class="REPLACEABLE"><i>name</i></tt>{<tt class=
879 "REPLACEABLE"><i>param</i></tt>}   # remove the parameter <tt class=
880 "REPLACEABLE"><i>param</i></tt> from the list of parameters
881                 # If it was the last one left, disable the action.
882   <tt class=
883 "REPLACEABLE"><i>-name</i></tt>          # disable this action completely and remove all parameters from the list
884 </pre>
885               </td>
886             </tr>
887           </table>
888
889           <p>Examples: <tt class="LITERAL">+add-header{X-Fun-Header: Some
890           text}</tt> and <tt class=
891           "LITERAL">+filter{html-annoyances}</tt></p>
892         </li>
893       </ul>
894
895       <p>If nothing is specified in any actions file, no <span class=
896       "QUOTE">"actions"</span> are taken. So in this case <span class=
897       "APPLICATION">Privoxy</span> would just be a normal, non-blocking,
898       non-filtering proxy. You must specifically enable the privacy and
899       blocking features you need (although the provided default actions files
900       will give a good starting point).</p>
901
902       <p>Later defined action sections always over-ride earlier ones of the
903       same type. So exceptions to any rules you make, should come in the
904       latter part of the file (or in a file that is processed later when
905       using multiple actions files such as <tt class=
906       "FILENAME">user.action</tt>). For multi-valued actions, the actions are
907       applied in the order they are specified. Actions files are processed in
908       the order they are defined in <tt class="FILENAME">config</tt> (the
909       default installation has three actions files). It also quite possible
910       for any given URL to match more than one <span class=
911       "QUOTE">"pattern"</span> (because of wildcards and regular
912       expressions), and thus to trigger more than one set of actions! Last
913       match wins.</p>
914
915       <p>The list of valid <span class="APPLICATION">Privoxy</span> actions
916       are:</p>
917
918       <div class="SECT3">
919         <h4 class="SECT3"><a name="ADD-HEADER" id="ADD-HEADER">8.5.1.
920         add-header</a></h4>
921
922         <div class="VARIABLELIST">
923           <dl>
924             <dt>Typical use:</dt>
925
926             <dd>
927               <p>Confuse log analysis, custom applications</p>
928             </dd>
929
930             <dt>Effect:</dt>
931
932             <dd>
933               <p>Sends a user defined HTTP header to the web server.</p>
934             </dd>
935
936             <dt>Type:</dt>
937
938             <dd>
939               <p>Multi-value.</p>
940             </dd>
941
942             <dt>Parameter:</dt>
943
944             <dd>
945               <p>Any string value is possible. Validity of the defined HTTP
946               headers is not checked. It is recommended that you use the
947               <span class="QUOTE">"<tt class="LITERAL">X-</tt>"</span> prefix
948               for custom headers.</p>
949             </dd>
950
951             <dt>Notes:</dt>
952
953             <dd>
954               <p>This action may be specified multiple times, in order to
955               define multiple headers. This is rarely needed for the typical
956               user. If you don't know what <span class="QUOTE">"HTTP
957               headers"</span> are, you definitely don't need to worry about
958               this one.</p>
959
960               <p>Headers added by this action are not modified by other
961               actions.</p>
962             </dd>
963
964             <dt>Example usage:</dt>
965
966             <dd>
967               <table border="0" bgcolor="#E0E0E0" width="90%">
968                 <tr>
969                   <td>
970                     <pre class="SCREEN">
971 +add-header{X-User-Tracking: sucks}
972 </pre>
973                   </td>
974                 </tr>
975               </table>
976             </dd>
977           </dl>
978         </div>
979       </div>
980
981       <div class="SECT3">
982         <h4 class="SECT3"><a name="BLOCK" id="BLOCK">8.5.2. block</a></h4>
983
984         <div class="VARIABLELIST">
985           <dl>
986             <dt>Typical use:</dt>
987
988             <dd>
989               <p>Block ads or other unwanted content</p>
990             </dd>
991
992             <dt>Effect:</dt>
993
994             <dd>
995               <p>Requests for URLs to which this action applies are blocked,
996               i.e. the requests are trapped by <span class=
997               "APPLICATION">Privoxy</span> and the requested URL is never
998               retrieved, but is answered locally with a substitute page or
999               image, as determined by the <tt class="LITERAL"><a href=
1000               "actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a></tt>,
1001               <tt class="LITERAL"><a href=
1002               "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker</a></tt>,
1003               and <tt class="LITERAL"><a href=
1004               "actions-file.html#HANDLE-AS-EMPTY-DOCUMENT">handle-as-empty-document</a></tt>
1005               actions.</p>
1006             </dd>
1007
1008             <dt>Type:</dt>
1009
1010             <dd>
1011               <p>Parameterized.</p>
1012             </dd>
1013
1014             <dt>Parameter:</dt>
1015
1016             <dd>
1017               <p>A block reason that should be given to the user.</p>
1018             </dd>
1019
1020             <dt>Notes:</dt>
1021
1022             <dd>
1023               <p><span class="APPLICATION">Privoxy</span> sends a special
1024               <span class="QUOTE">"BLOCKED"</span> page for requests to
1025               blocked pages. This page contains the block reason given as
1026               parameter, a link to find out why the block action applies, and
1027               a click-through to the blocked content (the latter only if the
1028               force feature is available and enabled).</p>
1029
1030               <p>A very important exception occurs if <span class=
1031               "emphasis"><i class="EMPHASIS">both</i></span> <tt class=
1032               "LITERAL">block</tt> and <tt class="LITERAL"><a href=
1033               "actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a></tt>,
1034               apply to the same request: it will then be replaced by an
1035               image. If <tt class="LITERAL"><a href=
1036               "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker</a></tt>
1037               (see below) also applies, the type of image will be determined
1038               by its parameter, if not, the standard checkerboard pattern is
1039               sent.</p>
1040
1041               <p>It is important to understand this process, in order to
1042               understand how <span class="APPLICATION">Privoxy</span> deals
1043               with ads and other unwanted content. Blocking is a core
1044               feature, and one upon which various other features depend.</p>
1045
1046               <p>The <tt class="LITERAL"><a href=
1047               "actions-file.html#FILTER">filter</a></tt> action can perform a
1048               very similar task, by <span class="QUOTE">"blocking"</span>
1049               banner images and other content through rewriting the relevant
1050               URLs in the document's HTML source, so they don't get requested
1051               in the first place. Note that this is a totally different
1052               technique, and it's easy to confuse the two.</p>
1053             </dd>
1054
1055             <dt>Example usage (section):</dt>
1056
1057             <dd>
1058               <table border="0" bgcolor="#E0E0E0" width="90%">
1059                 <tr>
1060                   <td>
1061                     <pre class="SCREEN">
1062 {+block{No nasty stuff for you.}}
1063 # Block and replace with "blocked" page
1064  .nasty-stuff.example.com
1065
1066 {+block{Doubleclick banners.} +handle-as-image}
1067 # Block and replace with image
1068  .ad.doubleclick.net
1069  .ads.r.us/banners/
1070
1071 {+block{Layered ads.} +handle-as-empty-document}
1072 # Block and then ignore
1073  adserver.example.net/.*\.js$
1074 </pre>
1075                   </td>
1076                 </tr>
1077               </table>
1078             </dd>
1079           </dl>
1080         </div>
1081       </div>
1082
1083       <div class="SECT3">
1084         <h4 class="SECT3"><a name="CHANGE-X-FORWARDED-FOR" id=
1085         "CHANGE-X-FORWARDED-FOR">8.5.3. change-x-forwarded-for</a></h4>
1086
1087         <div class="VARIABLELIST">
1088           <dl>
1089             <dt>Typical use:</dt>
1090
1091             <dd>
1092               <p>Improve privacy by not forwarding the source of the request
1093               in the HTTP headers.</p>
1094             </dd>
1095
1096             <dt>Effect:</dt>
1097
1098             <dd>
1099               <p>Deletes the <span class="QUOTE">"X-Forwarded-For:"</span>
1100               HTTP header from the client request, or adds a new one.</p>
1101             </dd>
1102
1103             <dt>Type:</dt>
1104
1105             <dd>
1106               <p>Parameterized.</p>
1107             </dd>
1108
1109             <dt>Parameter:</dt>
1110
1111             <dd>
1112               <ul>
1113                 <li>
1114                   <p><span class="QUOTE">"block"</span> to delete the
1115                   header.</p>
1116                 </li>
1117
1118                 <li>
1119                   <p><span class="QUOTE">"add"</span> to create the header
1120                   (or append the client's IP address to an already existing
1121                   one).</p>
1122                 </li>
1123               </ul>
1124             </dd>
1125
1126             <dt>Notes:</dt>
1127
1128             <dd>
1129               <p>It is safe and recommended to use <tt class=
1130               "LITERAL">block</tt>.</p>
1131
1132               <p>Forwarding the source address of the request may make sense
1133               in some multi-user setups but is also a privacy risk.</p>
1134             </dd>
1135
1136             <dt>Example usage:</dt>
1137
1138             <dd>
1139               <table border="0" bgcolor="#E0E0E0" width="90%">
1140                 <tr>
1141                   <td>
1142                     <pre class="SCREEN">
1143 +change-x-forwarded-for{block}
1144 </pre>
1145                   </td>
1146                 </tr>
1147               </table>
1148             </dd>
1149           </dl>
1150         </div>
1151       </div>
1152
1153       <div class="SECT3">
1154         <h4 class="SECT3"><a name="CLIENT-HEADER-FILTER" id=
1155         "CLIENT-HEADER-FILTER">8.5.4. client-header-filter</a></h4>
1156
1157         <div class="VARIABLELIST">
1158           <dl>
1159             <dt>Typical use:</dt>
1160
1161             <dd>
1162               <p>Rewrite or remove single client headers.</p>
1163             </dd>
1164
1165             <dt>Effect:</dt>
1166
1167             <dd>
1168               <p>All client headers to which this action applies are filtered
1169               on-the-fly through the specified regular expression based
1170               substitutions.</p>
1171             </dd>
1172
1173             <dt>Type:</dt>
1174
1175             <dd>
1176               <p>Parameterized.</p>
1177             </dd>
1178
1179             <dt>Parameter:</dt>
1180
1181             <dd>
1182               <p>The name of a client-header filter, as defined in one of the
1183               <a href="filter-file.html">filter files</a>.</p>
1184             </dd>
1185
1186             <dt>Notes:</dt>
1187
1188             <dd>
1189               <p>Client-header filters are applied to each header on its own,
1190               not to all at once. This makes it easier to diagnose problems,
1191               but on the downside you can't write filters that only change
1192               header x if header y's value is z. You can do that by using
1193               tags though.</p>
1194
1195               <p>Client-header filters are executed after the other header
1196               actions have finished and use their output as input.</p>
1197
1198               <p>If the request URI gets changed, <span class=
1199               "APPLICATION">Privoxy</span> will detect that and use the new
1200               one. This can be used to rewrite the request destination behind
1201               the client's back, for example to specify a Tor exit relay for
1202               certain requests.</p>
1203
1204               <p>Please refer to the <a href="filter-file.html">filter file
1205               chapter</a> to learn which client-header filters are available
1206               by default, and how to create your own.</p>
1207             </dd>
1208
1209             <dt>Example usage (section):</dt>
1210
1211             <dd>
1212               <table border="0" bgcolor="#E0E0E0" width="90%">
1213                 <tr>
1214                   <td>
1215                     <pre class="SCREEN">
1216 # Hide Tor exit notation in Host and Referer Headers
1217 {+client-header-filter{hide-tor-exit-notation}}
1218 /
1219
1220 </pre>
1221                   </td>
1222                 </tr>
1223               </table>
1224             </dd>
1225           </dl>
1226         </div>
1227       </div>
1228
1229       <div class="SECT3">
1230         <h4 class="SECT3"><a name="CLIENT-HEADER-TAGGER" id=
1231         "CLIENT-HEADER-TAGGER">8.5.5. client-header-tagger</a></h4>
1232
1233         <div class="VARIABLELIST">
1234           <dl>
1235             <dt>Typical use:</dt>
1236
1237             <dd>
1238               <p>Block requests based on their headers.</p>
1239             </dd>
1240
1241             <dt>Effect:</dt>
1242
1243             <dd>
1244               <p>Client headers to which this action applies are filtered
1245               on-the-fly through the specified regular expression based
1246               substitutions, the result is used as tag.</p>
1247             </dd>
1248
1249             <dt>Type:</dt>
1250
1251             <dd>
1252               <p>Parameterized.</p>
1253             </dd>
1254
1255             <dt>Parameter:</dt>
1256
1257             <dd>
1258               <p>The name of a client-header tagger, as defined in one of the
1259               <a href="filter-file.html">filter files</a>.</p>
1260             </dd>
1261
1262             <dt>Notes:</dt>
1263
1264             <dd>
1265               <p>Client-header taggers are applied to each header on its own,
1266               and as the header isn't modified, each tagger <span class=
1267               "QUOTE">"sees"</span> the original.</p>
1268
1269               <p>Client-header taggers are the first actions that are
1270               executed and their tags can be used to control every other
1271               action.</p>
1272             </dd>
1273
1274             <dt>Example usage (section):</dt>
1275
1276             <dd>
1277               <table border="0" bgcolor="#E0E0E0" width="90%">
1278                 <tr>
1279                   <td>
1280                     <pre class="SCREEN">
1281 # Tag every request with the User-Agent header
1282 {+client-header-tagger{user-agent}}
1283 /
1284
1285 # Tagging itself doesn't change the action
1286 # settings, sections with TAG patterns do:
1287 #
1288 # If it's a download agent, use a different forwarding proxy,
1289 # show the real User-Agent and make sure resume works.
1290 {+forward-override{forward-socks5 10.0.0.2:2222 .} \
1291  -hide-if-modified-since      \
1292  -overwrite-last-modified     \
1293  -hide-user-agent             \
1294  -filter                      \
1295  -deanimate-gifs              \
1296 }
1297 TAG:^User-Agent: NetBSD-ftp/
1298 TAG:^User-Agent: Novell ZYPP Installer
1299 TAG:^User-Agent: RPM APT-HTTP/
1300 TAG:^User-Agent: fetch libfetch/
1301 TAG:^User-Agent: Ubuntu APT-HTTP/
1302 TAG:^User-Agent: MPlayer/
1303
1304 </pre>
1305                   </td>
1306                 </tr>
1307               </table>
1308
1309               <table border="0" bgcolor="#E0E0E0" width="90%">
1310                 <tr>
1311                   <td>
1312                     <pre class="SCREEN">
1313 # Tag all requests with the Range header set
1314 {+client-header-tagger{range-requests}}
1315 /
1316
1317 # Disable filtering for the tagged requests.
1318 #
1319 # With filtering enabled Privoxy would remove the Range headers
1320 # to be able to filter the whole response. The downside is that
1321 # it prevents clients from resuming downloads or skipping over
1322 # parts of multimedia files.
1323 {-filter -deanimate-gifs}
1324 TAG:^RANGE-REQUEST$
1325
1326 </pre>
1327                   </td>
1328                 </tr>
1329               </table>
1330             </dd>
1331           </dl>
1332         </div>
1333       </div>
1334
1335       <div class="SECT3">
1336         <h4 class="SECT3"><a name="CONTENT-TYPE-OVERWRITE" id=
1337         "CONTENT-TYPE-OVERWRITE">8.5.6. content-type-overwrite</a></h4>
1338
1339         <div class="VARIABLELIST">
1340           <dl>
1341             <dt>Typical use:</dt>
1342
1343             <dd>
1344               <p>Stop useless download menus from popping up, or change the
1345               browser's rendering mode</p>
1346             </dd>
1347
1348             <dt>Effect:</dt>
1349
1350             <dd>
1351               <p>Replaces the <span class="QUOTE">"Content-Type:"</span> HTTP
1352               server header.</p>
1353             </dd>
1354
1355             <dt>Type:</dt>
1356
1357             <dd>
1358               <p>Parameterized.</p>
1359             </dd>
1360
1361             <dt>Parameter:</dt>
1362
1363             <dd>
1364               <p>Any string.</p>
1365             </dd>
1366
1367             <dt>Notes:</dt>
1368
1369             <dd>
1370               <p>The <span class="QUOTE">"Content-Type:"</span> HTTP server
1371               header is used by the browser to decide what to do with the
1372               document. The value of this header can cause the browser to
1373               open a download menu instead of displaying the document by
1374               itself, even if the document's format is supported by the
1375               browser.</p>
1376
1377               <p>The declared content type can also affect which rendering
1378               mode the browser chooses. If XHTML is delivered as <span class=
1379               "QUOTE">"text/html"</span>, many browsers treat it as yet
1380               another broken HTML document. If it is send as <span class=
1381               "QUOTE">"application/xml"</span>, browsers with XHTML support
1382               will only display it, if the syntax is correct.</p>
1383
1384               <p>If you see a web site that proudly uses XHTML buttons, but
1385               sets <span class="QUOTE">"Content-Type: text/html"</span>, you
1386               can use <span class="APPLICATION">Privoxy</span> to overwrite
1387               it with <span class="QUOTE">"application/xml"</span> and
1388               validate the web master's claim inside your XHTML-supporting
1389               browser. If the syntax is incorrect, the browser will complain
1390               loudly.</p>
1391
1392               <p>You can also go the opposite direction: if your browser
1393               prints error messages instead of rendering a document falsely
1394               declared as XHTML, you can overwrite the content type with
1395               <span class="QUOTE">"text/html"</span> and have it rendered as
1396               broken HTML document.</p>
1397
1398               <p>By default <tt class="LITERAL">content-type-overwrite</tt>
1399               only replaces <span class="QUOTE">"Content-Type:"</span>
1400               headers that look like some kind of text. If you want to
1401               overwrite it unconditionally, you have to combine it with
1402               <tt class="LITERAL"><a href=
1403               "actions-file.html#FORCE-TEXT-MODE">force-text-mode</a></tt>.
1404               This limitation exists for a reason, think twice before
1405               circumventing it.</p>
1406
1407               <p>Most of the time it's easier to replace this action with a
1408               custom <tt class="LITERAL"><a href=
1409               "actions-file.html#SERVER-HEADER-FILTER">server-header
1410               filter</a></tt>. It allows you to activate it for every
1411               document of a certain site and it will still only replace the
1412               content types you aimed at.</p>
1413
1414               <p>Of course you can apply <tt class=
1415               "LITERAL">content-type-overwrite</tt> to a whole site and then
1416               make URL based exceptions, but it's a lot more work to get the
1417               same precision.</p>
1418             </dd>
1419
1420             <dt>Example usage (sections):</dt>
1421
1422             <dd>
1423               <table border="0" bgcolor="#E0E0E0" width="90%">
1424                 <tr>
1425                   <td>
1426                     <pre class="SCREEN">
1427 # Check if www.example.net/ really uses valid XHTML
1428 { +content-type-overwrite{application/xml} }
1429 www.example.net/
1430
1431 # but leave the content type unmodified if the URL looks like a style sheet
1432 {-content-type-overwrite}
1433 www.example.net/.*\.css$
1434 www.example.net/.*style
1435 </pre>
1436                   </td>
1437                 </tr>
1438               </table>
1439             </dd>
1440           </dl>
1441         </div>
1442       </div>
1443
1444       <div class="SECT3">
1445         <h4 class="SECT3"><a name="CRUNCH-CLIENT-HEADER" id=
1446         "CRUNCH-CLIENT-HEADER">8.5.7. crunch-client-header</a></h4>
1447
1448         <div class="VARIABLELIST">
1449           <dl>
1450             <dt>Typical use:</dt>
1451
1452             <dd>
1453               <p>Remove a client header <span class=
1454               "APPLICATION">Privoxy</span> has no dedicated action for.</p>
1455             </dd>
1456
1457             <dt>Effect:</dt>
1458
1459             <dd>
1460               <p>Deletes every header sent by the client that contains the
1461               string the user supplied as parameter.</p>
1462             </dd>
1463
1464             <dt>Type:</dt>
1465
1466             <dd>
1467               <p>Parameterized.</p>
1468             </dd>
1469
1470             <dt>Parameter:</dt>
1471
1472             <dd>
1473               <p>Any string.</p>
1474             </dd>
1475
1476             <dt>Notes:</dt>
1477
1478             <dd>
1479               <p>This action allows you to block client headers for which no
1480               dedicated <span class="APPLICATION">Privoxy</span> action
1481               exists. <span class="APPLICATION">Privoxy</span> will remove
1482               every client header that contains the string you supplied as
1483               parameter.</p>
1484
1485               <p>Regular expressions are <span class="emphasis"><i class=
1486               "EMPHASIS">not supported</i></span> and you can't use this
1487               action to block different headers in the same request, unless
1488               they contain the same string.</p>
1489
1490               <p><tt class="LITERAL">crunch-client-header</tt> is only meant
1491               for quick tests. If you have to block several different
1492               headers, or only want to modify parts of them, you should use a
1493               <tt class="LITERAL"><a href=
1494               "actions-file.html#CLIENT-HEADER-FILTER">client-header
1495               filter</a></tt>.</p>
1496
1497               <div class="WARNING">
1498                 <table class="WARNING" border="1" width="90%">
1499                   <tr>
1500                     <td align="center"><b>Warning</b></td>
1501                   </tr>
1502
1503                   <tr>
1504                     <td align="left">
1505                       <p>Don't block any header without understanding the
1506                       consequences.</p>
1507                     </td>
1508                   </tr>
1509                 </table>
1510               </div>
1511             </dd>
1512
1513             <dt>Example usage (section):</dt>
1514
1515             <dd>
1516               <table border="0" bgcolor="#E0E0E0" width="90%">
1517                 <tr>
1518                   <td>
1519                     <pre class="SCREEN">
1520 # Block the non-existent "Privacy-Violation:" client header
1521 { +crunch-client-header{Privacy-Violation:} }
1522 /
1523
1524 </pre>
1525                   </td>
1526                 </tr>
1527               </table>
1528             </dd>
1529           </dl>
1530         </div>
1531       </div>
1532
1533       <div class="SECT3">
1534         <h4 class="SECT3"><a name="CRUNCH-IF-NONE-MATCH" id=
1535         "CRUNCH-IF-NONE-MATCH">8.5.8. crunch-if-none-match</a></h4>
1536
1537         <div class="VARIABLELIST">
1538           <dl>
1539             <dt>Typical use:</dt>
1540
1541             <dd>
1542               <p>Prevent yet another way to track the user's steps between
1543               sessions.</p>
1544             </dd>
1545
1546             <dt>Effect:</dt>
1547
1548             <dd>
1549               <p>Deletes the <span class="QUOTE">"If-None-Match:"</span> HTTP
1550               client header.</p>
1551             </dd>
1552
1553             <dt>Type:</dt>
1554
1555             <dd>
1556               <p>Boolean.</p>
1557             </dd>
1558
1559             <dt>Parameter:</dt>
1560
1561             <dd>
1562               <p>N/A</p>
1563             </dd>
1564
1565             <dt>Notes:</dt>
1566
1567             <dd>
1568               <p>Removing the <span class="QUOTE">"If-None-Match:"</span>
1569               HTTP client header is useful for filter testing, where you want
1570               to force a real reload instead of getting status code
1571               <span class="QUOTE">"304"</span> which would cause the browser
1572               to use a cached copy of the page.</p>
1573
1574               <p>It is also useful to make sure the header isn't used as a
1575               cookie replacement (unlikely but possible).</p>
1576
1577               <p>Blocking the <span class="QUOTE">"If-None-Match:"</span>
1578               header shouldn't cause any caching problems, as long as the
1579               <span class="QUOTE">"If-Modified-Since:"</span> header isn't
1580               blocked or missing as well.</p>
1581
1582               <p>It is recommended to use this action together with
1583               <tt class="LITERAL"><a href=
1584               "actions-file.html#HIDE-IF-MODIFIED-SINCE">hide-if-modified-since</a></tt>
1585               and <tt class="LITERAL"><a href=
1586               "actions-file.html#OVERWRITE-LAST-MODIFIED">overwrite-last-modified</a></tt>.</p>
1587             </dd>
1588
1589             <dt>Example usage (section):</dt>
1590
1591             <dd>
1592               <table border="0" bgcolor="#E0E0E0" width="90%">
1593                 <tr>
1594                   <td>
1595                     <pre class="SCREEN">
1596 # Let the browser revalidate cached documents but don't
1597 # allow the server to use the revalidation headers for user tracking.
1598 {+hide-if-modified-since{-60} \
1599  +overwrite-last-modified{randomize} \
1600  +crunch-if-none-match}
1601 /
1602 </pre>
1603                   </td>
1604                 </tr>
1605               </table>
1606             </dd>
1607           </dl>
1608         </div>
1609       </div>
1610
1611       <div class="SECT3">
1612         <h4 class="SECT3"><a name="CRUNCH-INCOMING-COOKIES" id=
1613         "CRUNCH-INCOMING-COOKIES">8.5.9. crunch-incoming-cookies</a></h4>
1614
1615         <div class="VARIABLELIST">
1616           <dl>
1617             <dt>Typical use:</dt>
1618
1619             <dd>
1620               <p>Prevent the web server from setting HTTP cookies on your
1621               system</p>
1622             </dd>
1623
1624             <dt>Effect:</dt>
1625
1626             <dd>
1627               <p>Deletes any <span class="QUOTE">"Set-Cookie:"</span> HTTP
1628               headers from server replies.</p>
1629             </dd>
1630
1631             <dt>Type:</dt>
1632
1633             <dd>
1634               <p>Boolean.</p>
1635             </dd>
1636
1637             <dt>Parameter:</dt>
1638
1639             <dd>
1640               <p>N/A</p>
1641             </dd>
1642
1643             <dt>Notes:</dt>
1644
1645             <dd>
1646               <p>This action is only concerned with <span class=
1647               "emphasis"><i class="EMPHASIS">incoming</i></span> HTTP
1648               cookies. For <span class="emphasis"><i class=
1649               "EMPHASIS">outgoing</i></span> HTTP cookies, use <tt class=
1650               "LITERAL"><a href=
1651               "actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a></tt>.
1652               Use <span class="emphasis"><i class="EMPHASIS">both</i></span>
1653               to disable HTTP cookies completely.</p>
1654
1655               <p>It makes <span class="emphasis"><i class="EMPHASIS">no sense
1656               at all</i></span> to use this action in conjunction with the
1657               <tt class="LITERAL"><a href=
1658               "actions-file.html#SESSION-COOKIES-ONLY">session-cookies-only</a></tt>
1659               action, since it would prevent the session cookies from being
1660               set. See also <tt class="LITERAL"><a href=
1661               "actions-file.html#FILTER-CONTENT-COOKIES">filter-content-cookies</a></tt>.</p>
1662             </dd>
1663
1664             <dt>Example usage:</dt>
1665
1666             <dd>
1667               <table border="0" bgcolor="#E0E0E0" width="90%">
1668                 <tr>
1669                   <td>
1670                     <pre class="SCREEN">
1671 +crunch-incoming-cookies
1672 </pre>
1673                   </td>
1674                 </tr>
1675               </table>
1676             </dd>
1677           </dl>
1678         </div>
1679       </div>
1680
1681       <div class="SECT3">
1682         <h4 class="SECT3"><a name="CRUNCH-SERVER-HEADER" id=
1683         "CRUNCH-SERVER-HEADER">8.5.10. crunch-server-header</a></h4>
1684
1685         <div class="VARIABLELIST">
1686           <dl>
1687             <dt>Typical use:</dt>
1688
1689             <dd>
1690               <p>Remove a server header <span class=
1691               "APPLICATION">Privoxy</span> has no dedicated action for.</p>
1692             </dd>
1693
1694             <dt>Effect:</dt>
1695
1696             <dd>
1697               <p>Deletes every header sent by the server that contains the
1698               string the user supplied as parameter.</p>
1699             </dd>
1700
1701             <dt>Type:</dt>
1702
1703             <dd>
1704               <p>Parameterized.</p>
1705             </dd>
1706
1707             <dt>Parameter:</dt>
1708
1709             <dd>
1710               <p>Any string.</p>
1711             </dd>
1712
1713             <dt>Notes:</dt>
1714
1715             <dd>
1716               <p>This action allows you to block server headers for which no
1717               dedicated <span class="APPLICATION">Privoxy</span> action
1718               exists. <span class="APPLICATION">Privoxy</span> will remove
1719               every server header that contains the string you supplied as
1720               parameter.</p>
1721
1722               <p>Regular expressions are <span class="emphasis"><i class=
1723               "EMPHASIS">not supported</i></span> and you can't use this
1724               action to block different headers in the same request, unless
1725               they contain the same string.</p>
1726
1727               <p><tt class="LITERAL">crunch-server-header</tt> is only meant
1728               for quick tests. If you have to block several different
1729               headers, or only want to modify parts of them, you should use a
1730               custom <tt class="LITERAL"><a href=
1731               "actions-file.html#SERVER-HEADER-FILTER">server-header
1732               filter</a></tt>.</p>
1733
1734               <div class="WARNING">
1735                 <table class="WARNING" border="1" width="90%">
1736                   <tr>
1737                     <td align="center"><b>Warning</b></td>
1738                   </tr>
1739
1740                   <tr>
1741                     <td align="left">
1742                       <p>Don't block any header without understanding the
1743                       consequences.</p>
1744                     </td>
1745                   </tr>
1746                 </table>
1747               </div>
1748             </dd>
1749
1750             <dt>Example usage (section):</dt>
1751
1752             <dd>
1753               <table border="0" bgcolor="#E0E0E0" width="90%">
1754                 <tr>
1755                   <td>
1756                     <pre class="SCREEN">
1757 # Crunch server headers that try to prevent caching
1758 { +crunch-server-header{no-cache} }
1759 /
1760 </pre>
1761                   </td>
1762                 </tr>
1763               </table>
1764             </dd>
1765           </dl>
1766         </div>
1767       </div>
1768
1769       <div class="SECT3">
1770         <h4 class="SECT3"><a name="CRUNCH-OUTGOING-COOKIES" id=
1771         "CRUNCH-OUTGOING-COOKIES">8.5.11. crunch-outgoing-cookies</a></h4>
1772
1773         <div class="VARIABLELIST">
1774           <dl>
1775             <dt>Typical use:</dt>
1776
1777             <dd>
1778               <p>Prevent the web server from reading any HTTP cookies from
1779               your system</p>
1780             </dd>
1781
1782             <dt>Effect:</dt>
1783
1784             <dd>
1785               <p>Deletes any <span class="QUOTE">"Cookie:"</span> HTTP
1786               headers from client requests.</p>
1787             </dd>
1788
1789             <dt>Type:</dt>
1790
1791             <dd>
1792               <p>Boolean.</p>
1793             </dd>
1794
1795             <dt>Parameter:</dt>
1796
1797             <dd>
1798               <p>N/A</p>
1799             </dd>
1800
1801             <dt>Notes:</dt>
1802
1803             <dd>
1804               <p>This action is only concerned with <span class=
1805               "emphasis"><i class="EMPHASIS">outgoing</i></span> HTTP
1806               cookies. For <span class="emphasis"><i class=
1807               "EMPHASIS">incoming</i></span> HTTP cookies, use <tt class=
1808               "LITERAL"><a href=
1809               "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a></tt>.
1810               Use <span class="emphasis"><i class="EMPHASIS">both</i></span>
1811               to disable HTTP cookies completely.</p>
1812
1813               <p>It makes <span class="emphasis"><i class="EMPHASIS">no sense
1814               at all</i></span> to use this action in conjunction with the
1815               <tt class="LITERAL"><a href=
1816               "actions-file.html#SESSION-COOKIES-ONLY">session-cookies-only</a></tt>
1817               action, since it would prevent the session cookies from being
1818               read.</p>
1819             </dd>
1820
1821             <dt>Example usage:</dt>
1822
1823             <dd>
1824               <table border="0" bgcolor="#E0E0E0" width="90%">
1825                 <tr>
1826                   <td>
1827                     <pre class="SCREEN">
1828 +crunch-outgoing-cookies
1829 </pre>
1830                   </td>
1831                 </tr>
1832               </table>
1833             </dd>
1834           </dl>
1835         </div>
1836       </div>
1837
1838       <div class="SECT3">
1839         <h4 class="SECT3"><a name="DEANIMATE-GIFS" id=
1840         "DEANIMATE-GIFS">8.5.12. deanimate-gifs</a></h4>
1841
1842         <div class="VARIABLELIST">
1843           <dl>
1844             <dt>Typical use:</dt>
1845
1846             <dd>
1847               <p>Stop those annoying, distracting animated GIF images.</p>
1848             </dd>
1849
1850             <dt>Effect:</dt>
1851
1852             <dd>
1853               <p>De-animate GIF animations, i.e. reduce them to their first
1854               or last image.</p>
1855             </dd>
1856
1857             <dt>Type:</dt>
1858
1859             <dd>
1860               <p>Parameterized.</p>
1861             </dd>
1862
1863             <dt>Parameter:</dt>
1864
1865             <dd>
1866               <p><span class="QUOTE">"last"</span> or <span class=
1867               "QUOTE">"first"</span></p>
1868             </dd>
1869
1870             <dt>Notes:</dt>
1871
1872             <dd>
1873               <p>This will also shrink the images considerably (in bytes, not
1874               pixels!). If the option <span class="QUOTE">"first"</span> is
1875               given, the first frame of the animation is used as the
1876               replacement. If <span class="QUOTE">"last"</span> is given, the
1877               last frame of the animation is used instead, which probably
1878               makes more sense for most banner animations, but also has the
1879               risk of not showing the entire last frame (if it is only a
1880               delta to an earlier frame).</p>
1881
1882               <p>You can safely use this action with patterns that will also
1883               match non-GIF objects, because no attempt will be made at
1884               anything that doesn't look like a GIF.</p>
1885             </dd>
1886
1887             <dt>Example usage:</dt>
1888
1889             <dd>
1890               <table border="0" bgcolor="#E0E0E0" width="90%">
1891                 <tr>
1892                   <td>
1893                     <pre class="SCREEN">
1894 +deanimate-gifs{last}
1895 </pre>
1896                   </td>
1897                 </tr>
1898               </table>
1899             </dd>
1900           </dl>
1901         </div>
1902       </div>
1903
1904       <div class="SECT3">
1905         <h4 class="SECT3"><a name="DOWNGRADE-HTTP-VERSION" id=
1906         "DOWNGRADE-HTTP-VERSION">8.5.13. downgrade-http-version</a></h4>
1907
1908         <div class="VARIABLELIST">
1909           <dl>
1910             <dt>Typical use:</dt>
1911
1912             <dd>
1913               <p>Work around (very rare) problems with HTTP/1.1</p>
1914             </dd>
1915
1916             <dt>Effect:</dt>
1917
1918             <dd>
1919               <p>Downgrades HTTP/1.1 client requests and server replies to
1920               HTTP/1.0.</p>
1921             </dd>
1922
1923             <dt>Type:</dt>
1924
1925             <dd>
1926               <p>Boolean.</p>
1927             </dd>
1928
1929             <dt>Parameter:</dt>
1930
1931             <dd>
1932               <p>N/A</p>
1933             </dd>
1934
1935             <dt>Notes:</dt>
1936
1937             <dd>
1938               <p>This is a left-over from the time when <span class=
1939               "APPLICATION">Privoxy</span> didn't support important HTTP/1.1
1940               features well. It is left here for the unlikely case that you
1941               experience HTTP/1.1-related problems with some server out
1942               there.</p>
1943
1944               <p>Note that enabling this action is only a workaround. It
1945               should not be enabled for sites that work without it. While it
1946               shouldn't break any pages, it has an (usually negative)
1947               performance impact.</p>
1948
1949               <p>If you come across a site where enabling this action helps,
1950               please report it, so the cause of the problem can be analyzed.
1951               If the problem turns out to be caused by a bug in <span class=
1952               "APPLICATION">Privoxy</span> it should be fixed so the
1953               following release works without the work around.</p>
1954             </dd>
1955
1956             <dt>Example usage (section):</dt>
1957
1958             <dd>
1959               <table border="0" bgcolor="#E0E0E0" width="90%">
1960                 <tr>
1961                   <td>
1962                     <pre class="SCREEN">
1963 {+downgrade-http-version}
1964 problem-host.example.com
1965 </pre>
1966                   </td>
1967                 </tr>
1968               </table>
1969             </dd>
1970           </dl>
1971         </div>
1972       </div>
1973
1974       <div class="SECT3">
1975         <h4 class="SECT3"><a name="FAST-REDIRECTS" id=
1976         "FAST-REDIRECTS">8.5.14. fast-redirects</a></h4>
1977
1978         <div class="VARIABLELIST">
1979           <dl>
1980             <dt>Typical use:</dt>
1981
1982             <dd>
1983               <p>Fool some click-tracking scripts and speed up indirect
1984               links.</p>
1985             </dd>
1986
1987             <dt>Effect:</dt>
1988
1989             <dd>
1990               <p>Detects redirection URLs and redirects the browser without
1991               contacting the redirection server first.</p>
1992             </dd>
1993
1994             <dt>Type:</dt>
1995
1996             <dd>
1997               <p>Parameterized.</p>
1998             </dd>
1999
2000             <dt>Parameter:</dt>
2001
2002             <dd>
2003               <ul>
2004                 <li>
2005                   <p><span class="QUOTE">"simple-check"</span> to just search
2006                   for the string <span class="QUOTE">"http://"</span> to
2007                   detect redirection URLs.</p>
2008                 </li>
2009
2010                 <li>
2011                   <p><span class="QUOTE">"check-decoded-url"</span> to decode
2012                   URLs (if necessary) before searching for redirection
2013                   URLs.</p>
2014                 </li>
2015               </ul>
2016             </dd>
2017
2018             <dt>Notes:</dt>
2019
2020             <dd>
2021               <p>Many sites, like yahoo.com, don't just link to other sites.
2022               Instead, they will link to some script on their own servers,
2023               giving the destination as a parameter, which will then redirect
2024               you to the final target. URLs resulting from this scheme
2025               typically look like: <span class=
2026               "QUOTE">"http://www.example.org/click-tracker.cgi?target=http%3a//www.example.net/"</span>.</p>
2027
2028               <p>Sometimes, there are even multiple consecutive redirects
2029               encoded in the URL. These redirections via scripts make your
2030               web browsing more traceable, since the server from which you
2031               follow such a link can see where you go to. Apart from that,
2032               valuable bandwidth and time is wasted, while your browser asks
2033               the server for one redirect after the other. Plus, it feeds the
2034               advertisers.</p>
2035
2036               <p>This feature is currently not very smart and is scheduled
2037               for improvement. If it is enabled by default, you will have to
2038               create some exceptions to this action. It can lead to failures
2039               in several ways:</p>
2040
2041               <p>Not every URLs with other URLs as parameters is evil. Some
2042               sites offer a real service that requires this information to
2043               work. For example a validation service needs to know, which
2044               document to validate. <tt class="LITERAL">fast-redirects</tt>
2045               assumes that every URL parameter that looks like another URL is
2046               a redirection target, and will always redirect to the last one.
2047               Most of the time the assumption is correct, but if it isn't,
2048               the user gets redirected anyway.</p>
2049
2050               <p>Another failure occurs if the URL contains other parameters
2051               after the URL parameter. The URL: <span class=
2052               "QUOTE">"http://www.example.org/?redirect=http%3a//www.example.net/&amp;foo=bar"</span>.
2053               contains the redirection URL <span class=
2054               "QUOTE">"http://www.example.net/"</span>, followed by another
2055               parameter. <tt class="LITERAL">fast-redirects</tt> doesn't know
2056               that and will cause a redirect to <span class=
2057               "QUOTE">"http://www.example.net/&amp;foo=bar"</span>. Depending
2058               on the target server configuration, the parameter will be
2059               silently ignored or lead to a <span class="QUOTE">"page not
2060               found"</span> error. You can prevent this problem by first
2061               using the <tt class="LITERAL"><a href=
2062               "actions-file.html#REDIRECT">redirect</a></tt> action to remove
2063               the last part of the URL, but it requires a little effort.</p>
2064
2065               <p>To detect a redirection URL, <tt class=
2066               "LITERAL">fast-redirects</tt> only looks for the string
2067               <span class="QUOTE">"http://"</span>, either in plain text
2068               (invalid but often used) or encoded as <span class=
2069               "QUOTE">"http%3a//"</span>. Some sites use their own URL
2070               encoding scheme, encrypt the address of the target server or
2071               replace it with a database id. In theses cases <tt class=
2072               "LITERAL">fast-redirects</tt> is fooled and the request reaches
2073               the redirection server where it probably gets logged.</p>
2074             </dd>
2075
2076             <dt>Example usage:</dt>
2077
2078             <dd>
2079               <table border="0" bgcolor="#E0E0E0" width="90%">
2080                 <tr>
2081                   <td>
2082                     <pre class="SCREEN">
2083  { +fast-redirects{simple-check} }
2084    one.example.com
2085
2086  { +fast-redirects{check-decoded-url} }
2087    another.example.com/testing
2088 </pre>
2089                   </td>
2090                 </tr>
2091               </table>
2092             </dd>
2093           </dl>
2094         </div>
2095       </div>
2096
2097       <div class="SECT3">
2098         <h4 class="SECT3"><a name="FILTER" id="FILTER">8.5.15.
2099         filter</a></h4>
2100
2101         <div class="VARIABLELIST">
2102           <dl>
2103             <dt>Typical use:</dt>
2104
2105             <dd>
2106               <p>Get rid of HTML and JavaScript annoyances, banner
2107               advertisements (by size), do fun text replacements, add
2108               personalized effects, etc.</p>
2109             </dd>
2110
2111             <dt>Effect:</dt>
2112
2113             <dd>
2114               <p>All instances of text-based type, most notably HTML and
2115               JavaScript, to which this action applies, can be filtered
2116               on-the-fly through the specified regular expression based
2117               substitutions. (Note: as of version 3.0.3 plain text documents
2118               are exempted from filtering, because web servers often use the
2119               <tt class="LITERAL">text/plain</tt> MIME type for all files
2120               whose type they don't know.)</p>
2121             </dd>
2122
2123             <dt>Type:</dt>
2124
2125             <dd>
2126               <p>Parameterized.</p>
2127             </dd>
2128
2129             <dt>Parameter:</dt>
2130
2131             <dd>
2132               <p>The name of a content filter, as defined in the <a href=
2133               "filter-file.html">filter file</a>. Filters can be defined in
2134               one or more files as defined by the <tt class=
2135               "LITERAL"><a href="config.html#FILTERFILE">filterfile</a></tt>
2136               option in the <a href="config.html">config file</a>. <tt class=
2137               "FILENAME">default.filter</tt> is the collection of filters
2138               supplied by the developers. Locally defined filters should go
2139               in their own file, such as <tt class=
2140               "FILENAME">user.filter</tt>.</p>
2141
2142               <p>When used in its negative form, and without parameters,
2143               <span class="emphasis"><i class="EMPHASIS">all</i></span>
2144               filtering is completely disabled.</p>
2145             </dd>
2146
2147             <dt>Notes:</dt>
2148
2149             <dd>
2150               <p>For your convenience, there are a number of pre-defined
2151               filters available in the distribution filter file that you can
2152               use. See the examples below for a list.</p>
2153
2154               <p>Filtering requires buffering the page content, which may
2155               appear to slow down page rendering since nothing is displayed
2156               until all content has passed the filters. (The total time until
2157               the page is completely rendered doesn't change much, but it may
2158               be perceived as slower since the page is not incrementally
2159               displayed.) This effect will be more noticeable on slower
2160               connections.</p>
2161
2162               <p><span class="QUOTE">"Rolling your own"</span> filters
2163               requires a knowledge of <a href=
2164               "http://en.wikipedia.org/wiki/Regular_expressions" target=
2165               "_top"><span class="QUOTE">"Regular Expressions"</span></a> and
2166               <a href="http://en.wikipedia.org/wiki/Html" target=
2167               "_top"><span class="QUOTE">"HTML"</span></a>. This is very
2168               powerful feature, and potentially very intrusive. Filters
2169               should be used with caution, and where an equivalent
2170               <span class="QUOTE">"action"</span> is not available.</p>
2171
2172               <p>The amount of data that can be filtered is limited to the
2173               <tt class="LITERAL"><a href=
2174               "config.html#BUFFER-LIMIT">buffer-limit</a></tt> option in the
2175               main <a href="config.html">config file</a>. The default is 4096
2176               KB (4 Megs). Once this limit is exceeded, the buffered data,
2177               and all pending data, is passed through unfiltered.</p>
2178
2179               <p>Inappropriate MIME types, such as zipped files, are not
2180               filtered at all. (Again, only text-based types except plain
2181               text). Encrypted SSL data (from HTTPS servers) cannot be
2182               filtered either, since this would violate the integrity of the
2183               secure transaction. In some situations it might be necessary to
2184               protect certain text, like source code, from filtering by
2185               defining appropriate <tt class="LITERAL">-filter</tt>
2186               exceptions.</p>
2187
2188               <p>Compressed content can't be filtered either, but if
2189               <span class="APPLICATION">Privoxy</span> is compiled with zlib
2190               support and a supported compression algorithm is used (gzip or
2191               deflate), <span class="APPLICATION">Privoxy</span> can first
2192               decompress the content and then filter it.</p>
2193
2194               <p>If you use a <span class="APPLICATION">Privoxy</span>
2195               version without zlib support, but want filtering to work on as
2196               much documents as possible, even those that would normally be
2197               sent compressed, you must use the <tt class="LITERAL"><a href=
2198               "actions-file.html#PREVENT-COMPRESSION">prevent-compression</a></tt>
2199               action in conjunction with <tt class="LITERAL">filter</tt>.</p>
2200
2201               <p>Content filtering can achieve some of the same effects as
2202               the <tt class="LITERAL"><a href=
2203               "actions-file.html#BLOCK">block</a></tt> action, i.e. it can be
2204               used to block ads and banners. But the mechanism works quite
2205               differently. One effective use, is to block ad banners based on
2206               their size (see below), since many of these seem to be somewhat
2207               standardized.</p>
2208
2209               <p><a href="contact.html">Feedback</a> with suggestions for new
2210               or improved filters is particularly welcome!</p>
2211
2212               <p>The below list has only the names and a one-line description
2213               of each predefined filter. There are <a href=
2214               "filter-file.html#PREDEFINED-FILTERS">more verbose
2215               explanations</a> of what these filters do in the <a href=
2216               "filter-file.html">filter file chapter</a>.</p>
2217             </dd>
2218
2219             <dt>Example usage (with filters from the distribution <tt class=
2220             "FILENAME">default.filter</tt> file). See <a href=
2221             "filter-file.html#PREDEFINED-FILTERS">the Predefined Filters
2222             section</a> for more explanation on each:</dt>
2223
2224             <dd>
2225               <p><a name="FILTER-JS-ANNOYANCES" id=
2226               "FILTER-JS-ANNOYANCES"></a></p>
2227
2228               <table border="0" bgcolor="#E0E0E0" width="90%">
2229                 <tr>
2230                   <td>
2231                     <pre class="SCREEN">
2232 +filter{js-annoyances}       # Get rid of particularly annoying JavaScript abuse.
2233 </pre>
2234                   </td>
2235                 </tr>
2236               </table>
2237
2238               <p><a name="FILTER-JS-EVENTS" id="FILTER-JS-EVENTS"></a></p>
2239
2240               <table border="0" bgcolor="#E0E0E0" width="90%">
2241                 <tr>
2242                   <td>
2243                     <pre class="SCREEN">
2244 +filter{js-events}           # Kill all JS event bindings and timers (Radically destructive! Only for extra nasty sites).
2245 </pre>
2246                   </td>
2247                 </tr>
2248               </table>
2249
2250               <p><a name="FILTER-HTML-ANNOYANCES" id=
2251               "FILTER-HTML-ANNOYANCES"></a></p>
2252
2253               <table border="0" bgcolor="#E0E0E0" width="90%">
2254                 <tr>
2255                   <td>
2256                     <pre class="SCREEN">
2257 +filter{html-annoyances}     # Get rid of particularly annoying HTML abuse.
2258 </pre>
2259                   </td>
2260                 </tr>
2261               </table>
2262
2263               <p><a name="FILTER-CONTENT-COOKIES" id=
2264               "FILTER-CONTENT-COOKIES"></a></p>
2265
2266               <table border="0" bgcolor="#E0E0E0" width="90%">
2267                 <tr>
2268                   <td>
2269                     <pre class="SCREEN">
2270 +filter{content-cookies}     # Kill cookies that come in the HTML or JS content.
2271 </pre>
2272                   </td>
2273                 </tr>
2274               </table>
2275
2276               <p><a name="FILTER-REFRESH-TAGS" id=
2277               "FILTER-REFRESH-TAGS"></a></p>
2278
2279               <table border="0" bgcolor="#E0E0E0" width="90%">
2280                 <tr>
2281                   <td>
2282                     <pre class="SCREEN">
2283 +filter{refresh-tags}        # Kill automatic refresh tags (for dial-on-demand setups).
2284 </pre>
2285                   </td>
2286                 </tr>
2287               </table>
2288
2289               <p><a name="FILTER-UNSOLICITED-POPUPS" id=
2290               "FILTER-UNSOLICITED-POPUPS"></a></p>
2291
2292               <table border="0" bgcolor="#E0E0E0" width="90%">
2293                 <tr>
2294                   <td>
2295                     <pre class="SCREEN">
2296 +filter{unsolicited-popups}  # Disable only unsolicited pop-up windows. Useful if your browser lacks this ability.
2297 </pre>
2298                   </td>
2299                 </tr>
2300               </table>
2301
2302               <p><a name="FILTER-ALL-POPUPS" id="FILTER-ALL-POPUPS"></a></p>
2303
2304               <table border="0" bgcolor="#E0E0E0" width="90%">
2305                 <tr>
2306                   <td>
2307                     <pre class="SCREEN">
2308 +filter{all-popups}          # Kill all popups in JavaScript and HTML. Useful if your browser lacks this ability.
2309 </pre>
2310                   </td>
2311                 </tr>
2312               </table>
2313
2314               <p><a name="FILTER-IMG-REORDER" id=
2315               "FILTER-IMG-REORDER"></a></p>
2316
2317               <table border="0" bgcolor="#E0E0E0" width="90%">
2318                 <tr>
2319                   <td>
2320                     <pre class="SCREEN">
2321 +filter{img-reorder}         # Reorder attributes in &lt;img&gt; tags to make the banners-by-* filters more effective.
2322 </pre>
2323                   </td>
2324                 </tr>
2325               </table>
2326
2327               <p><a name="FILTER-BANNERS-BY-SIZE" id=
2328               "FILTER-BANNERS-BY-SIZE"></a></p>
2329
2330               <table border="0" bgcolor="#E0E0E0" width="90%">
2331                 <tr>
2332                   <td>
2333                     <pre class="SCREEN">
2334 +filter{banners-by-size}     # Kill banners by size.
2335 </pre>
2336                   </td>
2337                 </tr>
2338               </table>
2339
2340               <p><a name="FILTER-BANNERS-BY-LINK" id=
2341               "FILTER-BANNERS-BY-LINK"></a></p>
2342
2343               <table border="0" bgcolor="#E0E0E0" width="90%">
2344                 <tr>
2345                   <td>
2346                     <pre class="SCREEN">
2347 +filter{banners-by-link}     # Kill banners by their links to known clicktrackers.
2348 </pre>
2349                   </td>
2350                 </tr>
2351               </table>
2352
2353               <p><a name="FILTER-WEBBUGS" id="FILTER-WEBBUGS"></a></p>
2354
2355               <table border="0" bgcolor="#E0E0E0" width="90%">
2356                 <tr>
2357                   <td>
2358                     <pre class="SCREEN">
2359 +filter{webbugs}             # Squish WebBugs (1x1 invisible GIFs used for user tracking).
2360 </pre>
2361                   </td>
2362                 </tr>
2363               </table>
2364
2365               <p><a name="FILTER-TINY-TEXTFORMS" id=
2366               "FILTER-TINY-TEXTFORMS"></a></p>
2367
2368               <table border="0" bgcolor="#E0E0E0" width="90%">
2369                 <tr>
2370                   <td>
2371                     <pre class="SCREEN">
2372 +filter{tiny-textforms}      # Extend those tiny textareas up to 40x80 and kill the hard wrap.
2373 </pre>
2374                   </td>
2375                 </tr>
2376               </table>
2377
2378               <p><a name="FILTER-JUMPING-WINDOWS" id=
2379               "FILTER-JUMPING-WINDOWS"></a></p>
2380
2381               <table border="0" bgcolor="#E0E0E0" width="90%">
2382                 <tr>
2383                   <td>
2384                     <pre class="SCREEN">
2385 +filter{jumping-windows}     # Prevent windows from resizing and moving themselves.
2386 </pre>
2387                   </td>
2388                 </tr>
2389               </table>
2390
2391               <p><a name="FILTER-FRAMESET-BORDERS" id=
2392               "FILTER-FRAMESET-BORDERS"></a></p>
2393
2394               <table border="0" bgcolor="#E0E0E0" width="90%">
2395                 <tr>
2396                   <td>
2397                     <pre class="SCREEN">
2398 +filter{frameset-borders}    # Give frames a border and make them resizable.
2399 </pre>
2400                   </td>
2401                 </tr>
2402               </table>
2403
2404               <p><a name="FILTER-DEMORONIZER" id=
2405               "FILTER-DEMORONIZER"></a></p>
2406
2407               <table border="0" bgcolor="#E0E0E0" width="90%">
2408                 <tr>
2409                   <td>
2410                     <pre class="SCREEN">
2411 +filter{demoronizer}         # Fix MS's non-standard use of standard charsets.
2412 </pre>
2413                   </td>
2414                 </tr>
2415               </table>
2416
2417               <p><a name="FILTER-SHOCKWAVE-FLASH" id=
2418               "FILTER-SHOCKWAVE-FLASH"></a></p>
2419
2420               <table border="0" bgcolor="#E0E0E0" width="90%">
2421                 <tr>
2422                   <td>
2423                     <pre class="SCREEN">
2424 +filter{shockwave-flash}     # Kill embedded Shockwave Flash objects.
2425 </pre>
2426                   </td>
2427                 </tr>
2428               </table>
2429
2430               <p><a name="FILTER-QUICKTIME-KIOSKMODE" id=
2431               "FILTER-QUICKTIME-KIOSKMODE"></a></p>
2432
2433               <table border="0" bgcolor="#E0E0E0" width="90%">
2434                 <tr>
2435                   <td>
2436                     <pre class="SCREEN">
2437 +filter{quicktime-kioskmode} # Make Quicktime movies saveable.
2438 </pre>
2439                   </td>
2440                 </tr>
2441               </table>
2442
2443               <p><a name="FILTER-FUN" id="FILTER-FUN"></a></p>
2444
2445               <table border="0" bgcolor="#E0E0E0" width="90%">
2446                 <tr>
2447                   <td>
2448                     <pre class="SCREEN">
2449 +filter{fun}                 # Text replacements for subversive browsing fun!
2450 </pre>
2451                   </td>
2452                 </tr>
2453               </table>
2454
2455               <p><a name="FILTER-CRUDE-PARENTAL" id=
2456               "FILTER-CRUDE-PARENTAL"></a></p>
2457
2458               <table border="0" bgcolor="#E0E0E0" width="90%">
2459                 <tr>
2460                   <td>
2461                     <pre class="SCREEN">
2462 +filter{crude-parental}      # Crude parental filtering. Note that this filter doesn't work reliably.
2463 </pre>
2464                   </td>
2465                 </tr>
2466               </table>
2467
2468               <p><a name="FILTER-IE-EXPLOITS" id=
2469               "FILTER-IE-EXPLOITS"></a></p>
2470
2471               <table border="0" bgcolor="#E0E0E0" width="90%">
2472                 <tr>
2473                   <td>
2474                     <pre class="SCREEN">
2475 +filter{ie-exploits}         # Disable some known Internet Explorer bug exploits.
2476 </pre>
2477                   </td>
2478                 </tr>
2479               </table>
2480
2481               <p><a name="FILTER-SITE-SPECIFICS" id=
2482               "FILTER-SITE-SPECIFICS"></a></p>
2483
2484               <table border="0" bgcolor="#E0E0E0" width="90%">
2485                 <tr>
2486                   <td>
2487                     <pre class="SCREEN">
2488 +filter{site-specifics}      # Cure for site-specific problems. Don't apply generally!
2489 </pre>
2490                   </td>
2491                 </tr>
2492               </table>
2493
2494               <p><a name="FILTER-NO-PING" id="FILTER-NO-PING"></a></p>
2495
2496               <table border="0" bgcolor="#E0E0E0" width="90%">
2497                 <tr>
2498                   <td>
2499                     <pre class="SCREEN">
2500 +filter{no-ping}             # Removes non-standard ping attributes in &lt;a&gt; and &lt;area&gt; tags.
2501 </pre>
2502                   </td>
2503                 </tr>
2504               </table>
2505
2506               <p><a name="FILTER-GOOGLE" id="FILTER-GOOGLE"></a></p>
2507
2508               <table border="0" bgcolor="#E0E0E0" width="90%">
2509                 <tr>
2510                   <td>
2511                     <pre class="SCREEN">
2512 +filter{google}              # CSS-based block for Google text ads. Also removes a width limitation and the toolbar advertisement.
2513 </pre>
2514                   </td>
2515                 </tr>
2516               </table>
2517
2518               <p><a name="FILTER-YAHOO" id="FILTER-YAHOO"></a></p>
2519
2520               <table border="0" bgcolor="#E0E0E0" width="90%">
2521                 <tr>
2522                   <td>
2523                     <pre class="SCREEN">
2524 +filter{yahoo}               # CSS-based block for Yahoo text ads. Also removes a width limitation.
2525 </pre>
2526                   </td>
2527                 </tr>
2528               </table>
2529
2530               <p><a name="FILTER-MSN" id="FILTER-MSN"></a></p>
2531
2532               <table border="0" bgcolor="#E0E0E0" width="90%">
2533                 <tr>
2534                   <td>
2535                     <pre class="SCREEN">
2536 +filter{msn}                 # CSS-based block for MSN text ads. Also removes tracking URLs and a width limitation.
2537 </pre>
2538                   </td>
2539                 </tr>
2540               </table>
2541
2542               <p><a name="FILTER-BLOGSPOT" id="FILTER-BLOGSPOT"></a></p>
2543
2544               <table border="0" bgcolor="#E0E0E0" width="90%">
2545                 <tr>
2546                   <td>
2547                     <pre class="SCREEN">
2548 +filter{blogspot}            # Cleans up some Blogspot blogs. Read the fine print before using this.
2549 </pre>
2550                   </td>
2551                 </tr>
2552               </table>
2553             </dd>
2554           </dl>
2555         </div>
2556       </div>
2557
2558       <div class="SECT3">
2559         <h4 class="SECT3"><a name="FORCE-TEXT-MODE" id=
2560         "FORCE-TEXT-MODE">8.5.16. force-text-mode</a></h4>
2561
2562         <div class="VARIABLELIST">
2563           <dl>
2564             <dt>Typical use:</dt>
2565
2566             <dd>
2567               <p>Force <span class="APPLICATION">Privoxy</span> to treat a
2568               document as if it was in some kind of <span class=
2569               "emphasis"><i class="EMPHASIS">text</i></span> format.</p>
2570             </dd>
2571
2572             <dt>Effect:</dt>
2573
2574             <dd>
2575               <p>Declares a document as text, even if the <span class=
2576               "QUOTE">"Content-Type:"</span> isn't detected as such.</p>
2577             </dd>
2578
2579             <dt>Type:</dt>
2580
2581             <dd>
2582               <p>Boolean.</p>
2583             </dd>
2584
2585             <dt>Parameter:</dt>
2586
2587             <dd>
2588               <p>N/A</p>
2589             </dd>
2590
2591             <dt>Notes:</dt>
2592
2593             <dd>
2594               <p>As explained <tt class="LITERAL"><a href=
2595               "actions-file.html#FILTER">above</a></tt>, <span class=
2596               "APPLICATION">Privoxy</span> tries to only filter files that
2597               are in some kind of text format. The same restrictions apply to
2598               <tt class="LITERAL"><a href=
2599               "actions-file.html#CONTENT-TYPE-OVERWRITE">content-type-overwrite</a></tt>.
2600               <tt class="LITERAL">force-text-mode</tt> declares a document as
2601               text, without looking at the <span class=
2602               "QUOTE">"Content-Type:"</span> first.</p>
2603
2604               <div class="WARNING">
2605                 <table class="WARNING" border="1" width="90%">
2606                   <tr>
2607                     <td align="center"><b>Warning</b></td>
2608                   </tr>
2609
2610                   <tr>
2611                     <td align="left">
2612                       <p>Think twice before activating this action. Filtering
2613                       binary data with regular expressions can cause file
2614                       damage.</p>
2615                     </td>
2616                   </tr>
2617                 </table>
2618               </div>
2619             </dd>
2620
2621             <dt>Example usage:</dt>
2622
2623             <dd>
2624               <table border="0" bgcolor="#E0E0E0" width="90%">
2625                 <tr>
2626                   <td>
2627                     <pre class="SCREEN">
2628 +force-text-mode
2629
2630 </pre>
2631                   </td>
2632                 </tr>
2633               </table>
2634             </dd>
2635           </dl>
2636         </div>
2637       </div>
2638
2639       <div class="SECT3">
2640         <h4 class="SECT3"><a name="FORWARD-OVERRIDE" id=
2641         "FORWARD-OVERRIDE">8.5.17. forward-override</a></h4>
2642
2643         <div class="VARIABLELIST">
2644           <dl>
2645             <dt>Typical use:</dt>
2646
2647             <dd>
2648               <p>Change the forwarding settings based on User-Agent or
2649               request origin</p>
2650             </dd>
2651
2652             <dt>Effect:</dt>
2653
2654             <dd>
2655               <p>Overrules the forward directives in the configuration
2656               file.</p>
2657             </dd>
2658
2659             <dt>Type:</dt>
2660
2661             <dd>
2662               <p>Multi-value.</p>
2663             </dd>
2664
2665             <dt>Parameter:</dt>
2666
2667             <dd>
2668               <ul>
2669                 <li>
2670                   <p><span class="QUOTE">"forward ."</span> to use a direct
2671                   connection without any additional proxies.</p>
2672                 </li>
2673
2674                 <li>
2675                   <p><span class="QUOTE">"forward 127.0.0.1:8123"</span> to
2676                   use the HTTP proxy listening at 127.0.0.1 port 8123.</p>
2677                 </li>
2678
2679                 <li>
2680                   <p><span class="QUOTE">"forward-socks4a 127.0.0.1:9050
2681                   ."</span> to use the socks4a proxy listening at 127.0.0.1
2682                   port 9050. Replace <span class=
2683                   "QUOTE">"forward-socks4a"</span> with <span class=
2684                   "QUOTE">"forward-socks4"</span> to use a socks4 connection
2685                   (with local DNS resolution) instead, use <span class=
2686                   "QUOTE">"forward-socks5"</span> for socks5 connections
2687                   (with remote DNS resolution).</p>
2688                 </li>
2689
2690                 <li>
2691                   <p><span class="QUOTE">"forward-socks4a 127.0.0.1:9050
2692                   proxy.example.org:8000"</span> to use the socks4a proxy
2693                   listening at 127.0.0.1 port 9050 to reach the HTTP proxy
2694                   listening at proxy.example.org port 8000. Replace
2695                   <span class="QUOTE">"forward-socks4a"</span> with
2696                   <span class="QUOTE">"forward-socks4"</span> to use a socks4
2697                   connection (with local DNS resolution) instead, use
2698                   <span class="QUOTE">"forward-socks5"</span> for socks5
2699                   connections (with remote DNS resolution).</p>
2700                 </li>
2701               </ul>
2702             </dd>
2703
2704             <dt>Notes:</dt>
2705
2706             <dd>
2707               <p>This action takes parameters similar to the <a href=
2708               "config.html#FORWARDING">forward</a> directives in the
2709               configuration file, but without the URL pattern. It can be used
2710               as replacement, but normally it's only used in cases where
2711               matching based on the request URL isn't sufficient.</p>
2712
2713               <div class="WARNING">
2714                 <table class="WARNING" border="1" width="90%">
2715                   <tr>
2716                     <td align="center"><b>Warning</b></td>
2717                   </tr>
2718
2719                   <tr>
2720                     <td align="left">
2721                       <p>Please read the description for the <a href=
2722                       "config.html#FORWARDING">forward</a> directives before
2723                       using this action. Forwarding to the wrong people will
2724                       reduce your privacy and increase the chances of
2725                       man-in-the-middle attacks.</p>
2726
2727                       <p>If the ports are missing or invalid, default values
2728                       will be used. This might change in the future and you
2729                       shouldn't rely on it. Otherwise incorrect syntax causes
2730                       Privoxy to exit.</p>
2731
2732                       <p>Use the <a href=
2733                       "http://config.privoxy.org/show-url-info" target=
2734                       "_top">show-url-info CGI page</a> to verify that your
2735                       forward settings do what you thought the do.</p>
2736                     </td>
2737                   </tr>
2738                 </table>
2739               </div>
2740             </dd>
2741
2742             <dt>Example usage:</dt>
2743
2744             <dd>
2745               <table border="0" bgcolor="#E0E0E0" width="90%">
2746                 <tr>
2747                   <td>
2748                     <pre class="SCREEN">
2749 # Always use direct connections for requests previously tagged as
2750 # <span class="QUOTE">"User-Agent: fetch libfetch/2.0"</span> and make sure
2751 # resuming downloads continues to work.
2752 # This way you can continue to use Tor for your normal browsing,
2753 # without overloading the Tor network with your FreeBSD ports updates
2754 # or downloads of bigger files like ISOs.
2755 # Note that HTTP headers are easy to fake and therefore their
2756 # values are as (un)trustworthy as your clients and users.
2757 {+forward-override{forward .} \
2758  -hide-if-modified-since      \
2759  -overwrite-last-modified     \
2760 }
2761 TAG:^User-Agent: fetch libfetch/2\.0$
2762
2763 </pre>
2764                   </td>
2765                 </tr>
2766               </table>
2767             </dd>
2768           </dl>
2769         </div>
2770       </div>
2771
2772       <div class="SECT3">
2773         <h4 class="SECT3"><a name="HANDLE-AS-EMPTY-DOCUMENT" id=
2774         "HANDLE-AS-EMPTY-DOCUMENT">8.5.18. handle-as-empty-document</a></h4>
2775
2776         <div class="VARIABLELIST">
2777           <dl>
2778             <dt>Typical use:</dt>
2779
2780             <dd>
2781               <p>Mark URLs that should be replaced by empty documents
2782               <span class="emphasis"><i class="EMPHASIS">if they get
2783               blocked</i></span></p>
2784             </dd>
2785
2786             <dt>Effect:</dt>
2787
2788             <dd>
2789               <p>This action alone doesn't do anything noticeable. It just
2790               marks URLs. If the <tt class="LITERAL"><a href=
2791               "actions-file.html#BLOCK">block</a></tt> action <span class=
2792               "emphasis"><i class="EMPHASIS">also applies</i></span>, the
2793               presence or absence of this mark decides whether an HTML
2794               <span class="QUOTE">"BLOCKED"</span> page, or an empty document
2795               will be sent to the client as a substitute for the blocked
2796               content. The <span class="emphasis"><i class=
2797               "EMPHASIS">empty</i></span> document isn't literally empty, but
2798               actually contains a single space.</p>
2799             </dd>
2800
2801             <dt>Type:</dt>
2802
2803             <dd>
2804               <p>Boolean.</p>
2805             </dd>
2806
2807             <dt>Parameter:</dt>
2808
2809             <dd>
2810               <p>N/A</p>
2811             </dd>
2812
2813             <dt>Notes:</dt>
2814
2815             <dd>
2816               <p>Some browsers complain about syntax errors if JavaScript
2817               documents are blocked with <span class=
2818               "APPLICATION">Privoxy's</span> default HTML page; this option
2819               can be used to silence them. And of course this action can also
2820               be used to eliminate the <span class=
2821               "APPLICATION">Privoxy</span> BLOCKED message in frames.</p>
2822
2823               <p>The content type for the empty document can be specified
2824               with <tt class="LITERAL"><a href=
2825               "actions-file.html#CONTENT-TYPE-OVERWRITE">content-type-overwrite{}</a></tt>,
2826               but usually this isn't necessary.</p>
2827             </dd>
2828
2829             <dt>Example usage:</dt>
2830
2831             <dd>
2832               <table border="0" bgcolor="#E0E0E0" width="90%">
2833                 <tr>
2834                   <td>
2835                     <pre class="SCREEN">
2836 # Block all documents on example.org that end with ".js",
2837 # but send an empty document instead of the usual HTML message.
2838 {+block{Blocked JavaScript} +handle-as-empty-document}
2839 example.org/.*\.js$
2840
2841 </pre>
2842                   </td>
2843                 </tr>
2844               </table>
2845             </dd>
2846           </dl>
2847         </div>
2848       </div>
2849
2850       <div class="SECT3">
2851         <h4 class="SECT3"><a name="HANDLE-AS-IMAGE" id=
2852         "HANDLE-AS-IMAGE">8.5.19. handle-as-image</a></h4>
2853
2854         <div class="VARIABLELIST">
2855           <dl>
2856             <dt>Typical use:</dt>
2857
2858             <dd>
2859               <p>Mark URLs as belonging to images (so they'll be replaced by
2860               images <span class="emphasis"><i class="EMPHASIS">if they do
2861               get blocked</i></span>, rather than HTML pages)</p>
2862             </dd>
2863
2864             <dt>Effect:</dt>
2865
2866             <dd>
2867               <p>This action alone doesn't do anything noticeable. It just
2868               marks URLs as images. If the <tt class="LITERAL"><a href=
2869               "actions-file.html#BLOCK">block</a></tt> action <span class=
2870               "emphasis"><i class="EMPHASIS">also applies</i></span>, the
2871               presence or absence of this mark decides whether an HTML
2872               <span class="QUOTE">"blocked"</span> page, or a replacement
2873               image (as determined by the <tt class="LITERAL"><a href=
2874               "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker</a></tt>
2875               action) will be sent to the client as a substitute for the
2876               blocked content.</p>
2877             </dd>
2878
2879             <dt>Type:</dt>
2880
2881             <dd>
2882               <p>Boolean.</p>
2883             </dd>
2884
2885             <dt>Parameter:</dt>
2886
2887             <dd>
2888               <p>N/A</p>
2889             </dd>
2890
2891             <dt>Notes:</dt>
2892
2893             <dd>
2894               <p>The below generic example section is actually part of
2895               <tt class="FILENAME">default.action</tt>. It marks all URLs
2896               with well-known image file name extensions as images and should
2897               be left intact.</p>
2898
2899               <p>Users will probably only want to use the handle-as-image
2900               action in conjunction with <tt class="LITERAL"><a href=
2901               "actions-file.html#BLOCK">block</a></tt>, to block sources of
2902               banners, whose URLs don't reflect the file type, like in the
2903               second example section.</p>
2904
2905               <p>Note that you cannot treat HTML pages as images in most
2906               cases. For instance, (in-line) ad frames require an HTML page
2907               to be sent, or they won't display properly. Forcing <tt class=
2908               "LITERAL">handle-as-image</tt> in this situation will not
2909               replace the ad frame with an image, but lead to error
2910               messages.</p>
2911             </dd>
2912
2913             <dt>Example usage (sections):</dt>
2914
2915             <dd>
2916               <table border="0" bgcolor="#E0E0E0" width="90%">
2917                 <tr>
2918                   <td>
2919                     <pre class="SCREEN">
2920 # Generic image extensions:
2921 #
2922 {+handle-as-image}
2923 /.*\.(gif|jpg|jpeg|png|bmp|ico)$
2924
2925 # These don't look like images, but they're banners and should be
2926 # blocked as images:
2927 #
2928 {+block{Nasty banners.} +handle-as-image}
2929 nasty-banner-server.example.com/junk.cgi\?output=trash
2930 </pre>
2931                   </td>
2932                 </tr>
2933               </table>
2934             </dd>
2935           </dl>
2936         </div>
2937       </div>
2938
2939       <div class="SECT3">
2940         <h4 class="SECT3"><a name="HIDE-ACCEPT-LANGUAGE" id=
2941         "HIDE-ACCEPT-LANGUAGE">8.5.20. hide-accept-language</a></h4>
2942
2943         <div class="VARIABLELIST">
2944           <dl>
2945             <dt>Typical use:</dt>
2946
2947             <dd>
2948               <p>Pretend to use different language settings.</p>
2949             </dd>
2950
2951             <dt>Effect:</dt>
2952
2953             <dd>
2954               <p>Deletes or replaces the <span class=
2955               "QUOTE">"Accept-Language:"</span> HTTP header in client
2956               requests.</p>
2957             </dd>
2958
2959             <dt>Type:</dt>
2960
2961             <dd>
2962               <p>Parameterized.</p>
2963             </dd>
2964
2965             <dt>Parameter:</dt>
2966
2967             <dd>
2968               <p>Keyword: <span class="QUOTE">"block"</span>, or any user
2969               defined value.</p>
2970             </dd>
2971
2972             <dt>Notes:</dt>
2973
2974             <dd>
2975               <p>Faking the browser's language settings can be useful to make
2976               a foreign User-Agent set with <tt class="LITERAL"><a href=
2977               "actions-file.html#HIDE-USER-AGENT">hide-user-agent</a></tt>
2978               more believable.</p>
2979
2980               <p>However some sites with content in different languages check
2981               the <span class="QUOTE">"Accept-Language:"</span> to decide
2982               which one to take by default. Sometimes it isn't possible to
2983               later switch to another language without changing the
2984               <span class="QUOTE">"Accept-Language:"</span> header first.</p>
2985
2986               <p>Therefore it's a good idea to either only change the
2987               <span class="QUOTE">"Accept-Language:"</span> header to
2988               languages you understand, or to languages that aren't wide
2989               spread.</p>
2990
2991               <p>Before setting the <span class=
2992               "QUOTE">"Accept-Language:"</span> header to a rare language,
2993               you should consider that it helps to make your requests unique
2994               and thus easier to trace. If you don't plan to change this
2995               header frequently, you should stick to a common language.</p>
2996             </dd>
2997
2998             <dt>Example usage (section):</dt>
2999
3000             <dd>
3001               <table border="0" bgcolor="#E0E0E0" width="90%">
3002                 <tr>
3003                   <td>
3004                     <pre class="SCREEN">
3005 # Pretend to use Canadian language settings.
3006 {+hide-accept-language{en-ca} \
3007 +hide-user-agent{Mozilla/5.0 (X11; U; OpenBSD i386; en-CA; rv:1.8.0.4) Gecko/20060628 Firefox/1.5.0.4} \
3008 }
3009 /
3010 </pre>
3011                   </td>
3012                 </tr>
3013               </table>
3014             </dd>
3015           </dl>
3016         </div>
3017       </div>
3018
3019       <div class="SECT3">
3020         <h4 class="SECT3"><a name="HIDE-CONTENT-DISPOSITION" id=
3021         "HIDE-CONTENT-DISPOSITION">8.5.21. hide-content-disposition</a></h4>
3022
3023         <div class="VARIABLELIST">
3024           <dl>
3025             <dt>Typical use:</dt>
3026
3027             <dd>
3028               <p>Prevent download menus for content you prefer to view inside
3029               the browser.</p>
3030             </dd>
3031
3032             <dt>Effect:</dt>
3033
3034             <dd>
3035               <p>Deletes or replaces the <span class=
3036               "QUOTE">"Content-Disposition:"</span> HTTP header set by some
3037               servers.</p>
3038             </dd>
3039
3040             <dt>Type:</dt>
3041
3042             <dd>
3043               <p>Parameterized.</p>
3044             </dd>
3045
3046             <dt>Parameter:</dt>
3047
3048             <dd>
3049               <p>Keyword: <span class="QUOTE">"block"</span>, or any user
3050               defined value.</p>
3051             </dd>
3052
3053             <dt>Notes:</dt>
3054
3055             <dd>
3056               <p>Some servers set the <span class=
3057               "QUOTE">"Content-Disposition:"</span> HTTP header for documents
3058               they assume you want to save locally before viewing them. The
3059               <span class="QUOTE">"Content-Disposition:"</span> header
3060               contains the file name the browser is supposed to use by
3061               default.</p>
3062
3063               <p>In most browsers that understand this header, it makes it
3064               impossible to <span class="emphasis"><i class="EMPHASIS">just
3065               view</i></span> the document, without downloading it first,
3066               even if it's just a simple text file or an image.</p>
3067
3068               <p>Removing the <span class=
3069               "QUOTE">"Content-Disposition:"</span> header helps to prevent
3070               this annoyance, but some browsers additionally check the
3071               <span class="QUOTE">"Content-Type:"</span> header, before they
3072               decide if they can display a document without saving it first.
3073               In these cases, you have to change this header as well, before
3074               the browser stops displaying download menus.</p>
3075
3076               <p>It is also possible to change the server's file name
3077               suggestion to another one, but in most cases it isn't worth the
3078               time to set it up.</p>
3079
3080               <p>This action will probably be removed in the future, use
3081               server-header filters instead.</p>
3082             </dd>
3083
3084             <dt>Example usage:</dt>
3085
3086             <dd>
3087               <table border="0" bgcolor="#E0E0E0" width="90%">
3088                 <tr>
3089                   <td>
3090                     <pre class="SCREEN">
3091 # Disarm the download link in Sourceforge's patch tracker
3092 { -filter \
3093  +content-type-overwrite{text/plain}\
3094  +hide-content-disposition{block} }
3095  .sourceforge.net/tracker/download\.php
3096 </pre>
3097                   </td>
3098                 </tr>
3099               </table>
3100             </dd>
3101           </dl>
3102         </div>
3103       </div>
3104
3105       <div class="SECT3">
3106         <h4 class="SECT3"><a name="HIDE-IF-MODIFIED-SINCE" id=
3107         "HIDE-IF-MODIFIED-SINCE">8.5.22. hide-if-modified-since</a></h4>
3108
3109         <div class="VARIABLELIST">
3110           <dl>
3111             <dt>Typical use:</dt>
3112
3113             <dd>
3114               <p>Prevent yet another way to track the user's steps between
3115               sessions.</p>
3116             </dd>
3117
3118             <dt>Effect:</dt>
3119
3120             <dd>
3121               <p>Deletes the <span class="QUOTE">"If-Modified-Since:"</span>
3122               HTTP client header or modifies its value.</p>
3123             </dd>
3124
3125             <dt>Type:</dt>
3126
3127             <dd>
3128               <p>Parameterized.</p>
3129             </dd>
3130
3131             <dt>Parameter:</dt>
3132
3133             <dd>
3134               <p>Keyword: <span class="QUOTE">"block"</span>, or a user
3135               defined value that specifies a range of hours.</p>
3136             </dd>
3137
3138             <dt>Notes:</dt>
3139
3140             <dd>
3141               <p>Removing this header is useful for filter testing, where you
3142               want to force a real reload instead of getting status code
3143               <span class="QUOTE">"304"</span>, which would cause the browser
3144               to use a cached copy of the page.</p>
3145
3146               <p>Instead of removing the header, <tt class=
3147               "LITERAL">hide-if-modified-since</tt> can also add or subtract
3148               a random amount of time to/from the header's value. You specify
3149               a range of minutes where the random factor should be chosen
3150               from and <span class="APPLICATION">Privoxy</span> does the
3151               rest. A negative value means subtracting, a positive value
3152               adding.</p>
3153
3154               <p>Randomizing the value of the <span class=
3155               "QUOTE">"If-Modified-Since:"</span> makes it less likely that
3156               the server can use the time as a cookie replacement, but you
3157               will run into caching problems if the random range is too
3158               high.</p>
3159
3160               <p>It is a good idea to only use a small negative value and let
3161               <tt class="LITERAL"><a href=
3162               "actions-file.html#OVERWRITE-LAST-MODIFIED">overwrite-last-modified</a></tt>
3163               handle the greater changes.</p>
3164
3165               <p>It is also recommended to use this action together with
3166               <tt class="LITERAL"><a href=
3167               "actions-file.html#CRUNCH-IF-NONE-MATCH">crunch-if-none-match</a></tt>,
3168               otherwise it's more or less pointless.</p>
3169             </dd>
3170
3171             <dt>Example usage (section):</dt>
3172
3173             <dd>
3174               <table border="0" bgcolor="#E0E0E0" width="90%">
3175                 <tr>
3176                   <td>
3177                     <pre class="SCREEN">
3178 # Let the browser revalidate but make tracking based on the time less likely.
3179 {+hide-if-modified-since{-60} \
3180  +overwrite-last-modified{randomize} \
3181  +crunch-if-none-match}
3182 /
3183 </pre>
3184                   </td>
3185                 </tr>
3186               </table>
3187             </dd>
3188           </dl>
3189         </div>
3190       </div>
3191
3192       <div class="SECT3">
3193         <h4 class="SECT3"><a name="HIDE-FROM-HEADER" id=
3194         "HIDE-FROM-HEADER">8.5.23. hide-from-header</a></h4>
3195
3196         <div class="VARIABLELIST">
3197           <dl>
3198             <dt>Typical use:</dt>
3199
3200             <dd>
3201               <p>Keep your (old and ill) browser from telling web servers
3202               your email address</p>
3203             </dd>
3204
3205             <dt>Effect:</dt>
3206
3207             <dd>
3208               <p>Deletes any existing <span class="QUOTE">"From:"</span> HTTP
3209               header, or replaces it with the specified string.</p>
3210             </dd>
3211
3212             <dt>Type:</dt>
3213
3214             <dd>
3215               <p>Parameterized.</p>
3216             </dd>
3217
3218             <dt>Parameter:</dt>
3219
3220             <dd>
3221               <p>Keyword: <span class="QUOTE">"block"</span>, or any user
3222               defined value.</p>
3223             </dd>
3224
3225             <dt>Notes:</dt>
3226
3227             <dd>
3228               <p>The keyword <span class="QUOTE">"block"</span> will
3229               completely remove the header (not to be confused with the
3230               <tt class="LITERAL"><a href=
3231               "actions-file.html#BLOCK">block</a></tt> action).</p>
3232
3233               <p>Alternately, you can specify any value you prefer to be sent
3234               to the web server. If you do, it is a matter of fairness not to
3235               use any address that is actually used by a real person.</p>
3236
3237               <p>This action is rarely needed, as modern web browsers don't
3238               send <span class="QUOTE">"From:"</span> headers anymore.</p>
3239             </dd>
3240
3241             <dt>Example usage:</dt>
3242
3243             <dd>
3244               <table border="0" bgcolor="#E0E0E0" width="90%">
3245                 <tr>
3246                   <td>
3247                     <pre class="SCREEN">
3248 +hide-from-header{block}
3249 </pre>
3250                   </td>
3251                 </tr>
3252               </table>or
3253
3254               <table border="0" bgcolor="#E0E0E0" width="90%">
3255                 <tr>
3256                   <td>
3257                     <pre class="SCREEN">
3258 +hide-from-header{spam-me-senseless@sittingduck.example.com}
3259 </pre>
3260                   </td>
3261                 </tr>
3262               </table>
3263             </dd>
3264           </dl>
3265         </div>
3266       </div>
3267
3268       <div class="SECT3">
3269         <h4 class="SECT3"><a name="HIDE-REFERRER" id="HIDE-REFERRER">8.5.24.
3270         hide-referrer</a></h4><a name="HIDE-REFERER" id="HIDE-REFERER"></a>
3271
3272         <div class="VARIABLELIST">
3273           <dl>
3274             <dt>Typical use:</dt>
3275
3276             <dd>
3277               <p>Conceal which link you followed to get to a particular
3278               site</p>
3279             </dd>
3280
3281             <dt>Effect:</dt>
3282
3283             <dd>
3284               <p>Deletes the <span class="QUOTE">"Referer:"</span> (sic) HTTP
3285               header from the client request, or replaces it with a forged
3286               one.</p>
3287             </dd>
3288
3289             <dt>Type:</dt>
3290
3291             <dd>
3292               <p>Parameterized.</p>
3293             </dd>
3294
3295             <dt>Parameter:</dt>
3296
3297             <dd>
3298               <ul>
3299                 <li>
3300                   <p><span class="QUOTE">"conditional-block"</span> to delete
3301                   the header completely if the host has changed.</p>
3302                 </li>
3303
3304                 <li>
3305                   <p><span class="QUOTE">"conditional-forge"</span> to forge
3306                   the header if the host has changed.</p>
3307                 </li>
3308
3309                 <li>
3310                   <p><span class="QUOTE">"block"</span> to delete the header
3311                   unconditionally.</p>
3312                 </li>
3313
3314                 <li>
3315                   <p><span class="QUOTE">"forge"</span> to pretend to be
3316                   coming from the homepage of the server we are talking
3317                   to.</p>
3318                 </li>
3319
3320                 <li>
3321                   <p>Any other string to set a user defined referrer.</p>
3322                 </li>
3323               </ul>
3324             </dd>
3325
3326             <dt>Notes:</dt>
3327
3328             <dd>
3329               <p><tt class="LITERAL">conditional-block</tt> is the only
3330               parameter, that isn't easily detected in the server's log file.
3331               If it blocks the referrer, the request will look like the
3332               visitor used a bookmark or typed in the address directly.</p>
3333
3334               <p>Leaving the referrer unmodified for requests on the same
3335               host allows the server owner to see the visitor's <span class=
3336               "QUOTE">"click path"</span>, but in most cases she could also
3337               get that information by comparing other parts of the log file:
3338               for example the User-Agent if it isn't a very common one, or
3339               the user's IP address if it doesn't change between different
3340               requests.</p>
3341
3342               <p>Always blocking the referrer, or using a custom one, can
3343               lead to failures on servers that check the referrer before they
3344               answer any requests, in an attempt to prevent their content
3345               from being embedded or linked to elsewhere.</p>
3346
3347               <p>Both <tt class="LITERAL">conditional-block</tt> and
3348               <tt class="LITERAL">forge</tt> will work with referrer checks,
3349               as long as content and valid referring page are on the same
3350               host. Most of the time that's the case.</p>
3351
3352               <p><tt class="LITERAL">hide-referer</tt> is an alternate
3353               spelling of <tt class="LITERAL">hide-referrer</tt> and the two
3354               can be can be freely substituted with each other. (<span class=
3355               "QUOTE">"referrer"</span> is the correct English spelling,
3356               however the HTTP specification has a bug - it requires it to be
3357               spelled as <span class="QUOTE">"referer"</span>.)</p>
3358             </dd>
3359
3360             <dt>Example usage:</dt>
3361
3362             <dd>
3363               <table border="0" bgcolor="#E0E0E0" width="90%">
3364                 <tr>
3365                   <td>
3366                     <pre class="SCREEN">
3367 +hide-referrer{forge}
3368 </pre>
3369                   </td>
3370                 </tr>
3371               </table>or
3372
3373               <table border="0" bgcolor="#E0E0E0" width="90%">
3374                 <tr>
3375                   <td>
3376                     <pre class="SCREEN">
3377 +hide-referrer{http://www.yahoo.com/}
3378 </pre>
3379                   </td>
3380                 </tr>
3381               </table>
3382             </dd>
3383           </dl>
3384         </div>
3385       </div>
3386
3387       <div class="SECT3">
3388         <h4 class="SECT3"><a name="HIDE-USER-AGENT" id=
3389         "HIDE-USER-AGENT">8.5.25. hide-user-agent</a></h4>
3390
3391         <div class="VARIABLELIST">
3392           <dl>
3393             <dt>Typical use:</dt>
3394
3395             <dd>
3396               <p>Try to conceal your type of browser and client operating
3397               system</p>
3398             </dd>
3399
3400             <dt>Effect:</dt>
3401
3402             <dd>
3403               <p>Replaces the value of the <span class=
3404               "QUOTE">"User-Agent:"</span> HTTP header in client requests
3405               with the specified value.</p>
3406             </dd>
3407
3408             <dt>Type:</dt>
3409
3410             <dd>
3411               <p>Parameterized.</p>
3412             </dd>
3413
3414             <dt>Parameter:</dt>
3415
3416             <dd>
3417               <p>Any user-defined string.</p>
3418             </dd>
3419
3420             <dt>Notes:</dt>
3421
3422             <dd>
3423               <div class="WARNING">
3424                 <table class="WARNING" border="1" width="90%">
3425                   <tr>
3426                     <td align="center"><b>Warning</b></td>
3427                   </tr>
3428
3429                   <tr>
3430                     <td align="left">
3431                       <p>This can lead to problems on web sites that depend
3432                       on looking at this header in order to customize their
3433                       content for different browsers (which, by the way, is
3434                       <span class="emphasis"><i class=
3435                       "EMPHASIS">NOT</i></span> the right thing to do: good
3436                       web sites work browser-independently).</p>
3437                     </td>
3438                   </tr>
3439                 </table>
3440               </div>
3441
3442               <p>Using this action in multi-user setups or wherever different
3443               types of browsers will access the same <span class=
3444               "APPLICATION">Privoxy</span> is <span class=
3445               "emphasis"><i class="EMPHASIS">not recommended</i></span>. In
3446               single-user, single-browser setups, you might use it to delete
3447               your OS version information from the headers, because it is an
3448               invitation to exploit known bugs for your OS. It is also
3449               occasionally useful to forge this in order to access sites that
3450               won't let you in otherwise (though there may be a good reason
3451               in some cases).</p>
3452
3453               <p>More information on known user-agent strings can be found at
3454               <a href="http://www.user-agents.org/" target=
3455               "_top">http://www.user-agents.org/</a> and <a href=
3456               "http://en.wikipedia.org/wiki/User_agent" target=
3457               "_top">http://en.wikipedia.org/wiki/User_agent</a>.</p>
3458             </dd>
3459
3460             <dt>Example usage:</dt>
3461
3462             <dd>
3463               <table border="0" bgcolor="#E0E0E0" width="90%">
3464                 <tr>
3465                   <td>
3466                     <pre class="SCREEN">
3467 +hide-user-agent{Netscape 6.1 (X11; I; Linux 2.4.18 i686)}
3468 </pre>
3469                   </td>
3470                 </tr>
3471               </table>
3472             </dd>
3473           </dl>
3474         </div>
3475       </div>
3476
3477       <div class="SECT3">
3478         <h4 class="SECT3"><a name="LIMIT-CONNECT" id="LIMIT-CONNECT">8.5.26.
3479         limit-connect</a></h4>
3480
3481         <div class="VARIABLELIST">
3482           <dl>
3483             <dt>Typical use:</dt>
3484
3485             <dd>
3486               <p>Prevent abuse of <span class="APPLICATION">Privoxy</span> as
3487               a TCP proxy relay or disable SSL for untrusted sites</p>
3488             </dd>
3489
3490             <dt>Effect:</dt>
3491
3492             <dd>
3493               <p>Specifies to which ports HTTP CONNECT requests are
3494               allowable.</p>
3495             </dd>
3496
3497             <dt>Type:</dt>
3498
3499             <dd>
3500               <p>Parameterized.</p>
3501             </dd>
3502
3503             <dt>Parameter:</dt>
3504
3505             <dd>
3506               <p>A comma-separated list of ports or port ranges (the latter
3507               using dashes, with the minimum defaulting to 0 and the maximum
3508               to 65K).</p>
3509             </dd>
3510
3511             <dt>Notes:</dt>
3512
3513             <dd>
3514               <p>By default, i.e. if no <tt class=
3515               "LITERAL">limit-connect</tt> action applies, <span class=
3516               "APPLICATION">Privoxy</span> allows HTTP CONNECT requests to
3517               all ports. Use <tt class="LITERAL">limit-connect</tt> if
3518               fine-grained control is desired for some or all
3519               destinations.</p>
3520
3521               <p>The CONNECT methods exists in HTTP to allow access to secure
3522               websites (<span class="QUOTE">"https://"</span> URLs) through
3523               proxies. It works very simply: the proxy connects to the server
3524               on the specified port, and then short-circuits its connections
3525               to the client and to the remote server. This means
3526               CONNECT-enabled proxies can be used as TCP relays very
3527               easily.</p>
3528
3529               <p><span class="APPLICATION">Privoxy</span> relays HTTPS
3530               traffic without seeing the decoded content. Websites can
3531               leverage this limitation to circumvent <span class=
3532               "APPLICATION">Privoxy</span>'s filters. By specifying an
3533               invalid port range you can disable HTTPS entirely.</p>
3534             </dd>
3535
3536             <dt>Example usages:</dt>
3537
3538             <dd>
3539               <table border="0" bgcolor="#E0E0E0" width="90%">
3540                 <tr>
3541                   <td>
3542                     <pre class="SCREEN">
3543 +limit-connect{443}                   # Port 443 is OK.
3544 +limit-connect{80,443}                # Ports 80 and 443 are OK.
3545 +limit-connect{-3, 7, 20-100, 500-}   # Ports less than 3, 7, 20 to 100 and above 500 are OK.
3546 +limit-connect{-}                     # All ports are OK
3547 +limit-connect{,}                     # No HTTPS/SSL traffic is allowed
3548 </pre>
3549                   </td>
3550                 </tr>
3551               </table>
3552             </dd>
3553           </dl>
3554         </div>
3555       </div>
3556
3557       <div class="SECT3">
3558         <h4 class="SECT3"><a name="LIMIT-COOKIE-LIFETIME" id=
3559         "LIMIT-COOKIE-LIFETIME">8.5.27. limit-cookie-lifetime</a></h4>
3560
3561         <div class="VARIABLELIST">
3562           <dl>
3563             <dt>Typical use:</dt>
3564
3565             <dd>
3566               <p>Limit the lifetime of HTTP cookies to a couple of minutes or
3567               hours.</p>
3568             </dd>
3569
3570             <dt>Effect:</dt>
3571
3572             <dd>
3573               <p>Overwrites the expires field in Set-Cookie server headers if
3574               it's above the specified limit.</p>
3575             </dd>
3576
3577             <dt>Type:</dt>
3578
3579             <dd>
3580               <p>Parameterized.</p>
3581             </dd>
3582
3583             <dt>Parameter:</dt>
3584
3585             <dd>
3586               <p>The lifetime limit in minutes, or 0.</p>
3587             </dd>
3588
3589             <dt>Notes:</dt>
3590
3591             <dd>
3592               <p>This action reduces the lifetime of HTTP cookies coming from
3593               the server to the specified number of minutes, starting from
3594               the time the cookie passes Privoxy.</p>
3595
3596               <p>Cookies with a lifetime below the limit are not modified.
3597               The lifetime of session cookies is set to the specified
3598               limit.</p>
3599
3600               <p>The effect of this action depends on the server.</p>
3601
3602               <p>In case of servers which refresh their cookies with each
3603               response (or at least frequently), the lifetime limit set by
3604               this action is updated as well. Thus, a session associated with
3605               the cookie continues to work with this action enabled, as long
3606               as a new request is made before the last limit set is
3607               reached.</p>
3608
3609               <p>However, some servers send their cookies once, with a
3610               lifetime of several years (the year 2037 is a popular choice),
3611               and do not refresh them until a certain event in the future,
3612               for example the user logging out. In this case this action may
3613               limit the absolute lifetime of the session, even if requests
3614               are made frequently.</p>
3615
3616               <p>If the parameter is <span class="QUOTE">"0"</span>, this
3617               action behaves like <tt class="LITERAL"><a href=
3618               "actions-file.html#SESSION-COOKIES-ONLY">session-cookies-only</a></tt>.</p>
3619             </dd>
3620
3621             <dt>Example usages:</dt>
3622
3623             <dd>
3624               <table border="0" bgcolor="#E0E0E0" width="90%">
3625                 <tr>
3626                   <td>
3627                     <pre class="SCREEN">
3628 +limit-cookie-lifetime{60}
3629
3630 </pre>
3631                   </td>
3632                 </tr>
3633               </table>
3634             </dd>
3635           </dl>
3636         </div>
3637       </div>
3638
3639       <div class="SECT3">
3640         <h4 class="SECT3"><a name="PREVENT-COMPRESSION" id=
3641         "PREVENT-COMPRESSION">8.5.28. prevent-compression</a></h4>
3642
3643         <div class="VARIABLELIST">
3644           <dl>
3645             <dt>Typical use:</dt>
3646
3647             <dd>
3648               <p>Ensure that servers send the content uncompressed, so it can
3649               be passed through <tt class="LITERAL"><a href=
3650               "actions-file.html#FILTER">filter</a></tt>s.</p>
3651             </dd>
3652
3653             <dt>Effect:</dt>
3654
3655             <dd>
3656               <p>Removes the Accept-Encoding header which can be used to ask
3657               for compressed transfer.</p>
3658             </dd>
3659
3660             <dt>Type:</dt>
3661
3662             <dd>
3663               <p>Boolean.</p>
3664             </dd>
3665
3666             <dt>Parameter:</dt>
3667
3668             <dd>
3669               <p>N/A</p>
3670             </dd>
3671
3672             <dt>Notes:</dt>
3673
3674             <dd>
3675               <p>More and more websites send their content compressed by
3676               default, which is generally a good idea and saves bandwidth.
3677               But the <tt class="LITERAL"><a href=
3678               "actions-file.html#FILTER">filter</a></tt> and <tt class=
3679               "LITERAL"><a href=
3680               "actions-file.html#DEANIMATE-GIFS">deanimate-gifs</a></tt>
3681               actions need access to the uncompressed data.</p>
3682
3683               <p>When compiled with zlib support (available since
3684               <span class="APPLICATION">Privoxy</span> 3.0.7), content that
3685               should be filtered is decompressed on-the-fly and you don't
3686               have to worry about this action. If you are using an older
3687               <span class="APPLICATION">Privoxy</span> version, or one that
3688               hasn't been compiled with zlib support, this action can be used
3689               to convince the server to send the content uncompressed.</p>
3690
3691               <p>Most text-based instances compress very well, the size is
3692               seldom decreased by less than 50%, for markup-heavy instances
3693               like news feeds saving more than 90% of the original size isn't
3694               unusual.</p>
3695
3696               <p>Not using compression will therefore slow down the transfer,
3697               and you should only enable this action if you really need it.
3698               As of <span class="APPLICATION">Privoxy</span> 3.0.7 it's
3699               disabled in all predefined action settings.</p>
3700
3701               <p>Note that some (rare) ill-configured sites don't handle
3702               requests for uncompressed documents correctly. Broken PHP
3703               applications tend to send an empty document body, some IIS
3704               versions only send the beginning of the content. If you enable
3705               <tt class="LITERAL">prevent-compression</tt> per default, you
3706               might want to add exceptions for those sites. See the example
3707               for how to do that.</p>
3708             </dd>
3709
3710             <dt>Example usage (sections):</dt>
3711
3712             <dd>
3713               <table border="0" bgcolor="#E0E0E0" width="90%">
3714                 <tr>
3715                   <td>
3716                     <pre class="SCREEN">
3717 # Selectively turn off compression, and enable a filter
3718 #
3719 { +filter{tiny-textforms} +prevent-compression }
3720 # Match only these sites
3721  .google.
3722  sourceforge.net
3723  sf.net
3724
3725 # Or instead, we could set a universal default:
3726 #
3727 { +prevent-compression }
3728  / # Match all sites
3729
3730 # Then maybe make exceptions for broken sites:
3731 #
3732 { -prevent-compression }
3733 .compusa.com/
3734 </pre>
3735                   </td>
3736                 </tr>
3737               </table>
3738             </dd>
3739           </dl>
3740         </div>
3741       </div>
3742
3743       <div class="SECT3">
3744         <h4 class="SECT3"><a name="OVERWRITE-LAST-MODIFIED" id=
3745         "OVERWRITE-LAST-MODIFIED">8.5.29. overwrite-last-modified</a></h4>
3746
3747         <div class="VARIABLELIST">
3748           <dl>
3749             <dt>Typical use:</dt>
3750
3751             <dd>
3752               <p>Prevent yet another way to track the user's steps between
3753               sessions.</p>
3754             </dd>
3755
3756             <dt>Effect:</dt>
3757
3758             <dd>
3759               <p>Deletes the <span class="QUOTE">"Last-Modified:"</span> HTTP
3760               server header or modifies its value.</p>
3761             </dd>
3762
3763             <dt>Type:</dt>
3764
3765             <dd>
3766               <p>Parameterized.</p>
3767             </dd>
3768
3769             <dt>Parameter:</dt>
3770
3771             <dd>
3772               <p>One of the keywords: <span class="QUOTE">"block"</span>,
3773               <span class="QUOTE">"reset-to-request-time"</span> and
3774               <span class="QUOTE">"randomize"</span></p>
3775             </dd>
3776
3777             <dt>Notes:</dt>
3778
3779             <dd>
3780               <p>Removing the <span class="QUOTE">"Last-Modified:"</span>
3781               header is useful for filter testing, where you want to force a
3782               real reload instead of getting status code <span class=
3783               "QUOTE">"304"</span>, which would cause the browser to reuse
3784               the old version of the page.</p>
3785
3786               <p>The <span class="QUOTE">"randomize"</span> option overwrites
3787               the value of the <span class="QUOTE">"Last-Modified:"</span>
3788               header with a randomly chosen time between the original value
3789               and the current time. In theory the server could send each
3790               document with a different <span class=
3791               "QUOTE">"Last-Modified:"</span> header to track visits without
3792               using cookies. <span class="QUOTE">"Randomize"</span> makes it
3793               impossible and the browser can still revalidate cached
3794               documents.</p>
3795
3796               <p><span class="QUOTE">"reset-to-request-time"</span>
3797               overwrites the value of the <span class=
3798               "QUOTE">"Last-Modified:"</span> header with the current time.
3799               You could use this option together with <tt class=
3800               "LITERAL"><a href=
3801               "actions-file.html#HIDE-IF-MODIFIED-SINCE">hide-if-modified-since</a></tt>
3802               to further customize your random range.</p>
3803
3804               <p>The preferred parameter here is <span class=
3805               "QUOTE">"randomize"</span>. It is safe to use, as long as the
3806               time settings are more or less correct. If the server sets the
3807               <span class="QUOTE">"Last-Modified:"</span> header to the time
3808               of the request, the random range becomes zero and the value
3809               stays the same. Therefore you should later randomize it a
3810               second time with <tt class="LITERAL"><a href=
3811               "actions-file.html#HIDE-IF-MODIFIED-SINCE">hided-if-modified-since</a></tt>,
3812               just to be sure.</p>
3813
3814               <p>It is also recommended to use this action together with
3815               <tt class="LITERAL"><a href=
3816               "actions-file.html#CRUNCH-IF-NONE-MATCH">crunch-if-none-match</a></tt>.</p>
3817             </dd>
3818
3819             <dt>Example usage:</dt>
3820
3821             <dd>
3822               <table border="0" bgcolor="#E0E0E0" width="90%">
3823                 <tr>
3824                   <td>
3825                     <pre class="SCREEN">
3826 # Let the browser revalidate without being tracked across sessions
3827 { +hide-if-modified-since{-60} \
3828  +overwrite-last-modified{randomize} \
3829  +crunch-if-none-match}
3830 /
3831 </pre>
3832                   </td>
3833                 </tr>
3834               </table>
3835             </dd>
3836           </dl>
3837         </div>
3838       </div>
3839
3840       <div class="SECT3">
3841         <h4 class="SECT3"><a name="REDIRECT" id="REDIRECT">8.5.30.
3842         redirect</a></h4>
3843
3844         <div class="VARIABLELIST">
3845           <dl>
3846             <dt>Typical use:</dt>
3847
3848             <dd>
3849               <p>Redirect requests to other sites.</p>
3850             </dd>
3851
3852             <dt>Effect:</dt>
3853
3854             <dd>
3855               <p>Convinces the browser that the requested document has been
3856               moved to another location and the browser should get it from
3857               there.</p>
3858             </dd>
3859
3860             <dt>Type:</dt>
3861
3862             <dd>
3863               <p>Parameterized</p>
3864             </dd>
3865
3866             <dt>Parameter:</dt>
3867
3868             <dd>
3869               <p>An absolute URL or a single pcrs command.</p>
3870             </dd>
3871
3872             <dt>Notes:</dt>
3873
3874             <dd>
3875               <p>Requests to which this action applies are answered with a
3876               HTTP redirect to URLs of your choosing. The new URL is either
3877               provided as parameter, or derived by applying a single pcrs
3878               command to the original URL.</p>
3879
3880               <p>The syntax for pcrs commands is documented in the <a href=
3881               "filter-file.html">filter file</a> section.</p>
3882
3883               <p>This action will be ignored if you use it together with
3884               <tt class="LITERAL"><a href=
3885               "actions-file.html#BLOCK">block</a></tt>. It can be combined
3886               with <tt class="LITERAL"><a href=
3887               "actions-file.html#FAST-REDIRECTS">fast-redirects{check-decoded-url}</a></tt>
3888               to redirect to a decoded version of a rewritten URL.</p>
3889
3890               <p>Use this action carefully, make sure not to create
3891               redirection loops and be aware that using your own redirects
3892               might make it possible to fingerprint your requests.</p>
3893
3894               <p>In case of problems with your redirects, or simply to watch
3895               them working, enable <a href="config.html#DEBUG">debug
3896               128</a>.</p>
3897             </dd>
3898
3899             <dt>Example usages:</dt>
3900
3901             <dd>
3902               <table border="0" bgcolor="#E0E0E0" width="90%">
3903                 <tr>
3904                   <td>
3905                     <pre class="SCREEN">
3906 # Replace example.com's style sheet with another one
3907 { +redirect{http://localhost/css-replacements/example.com.css} }
3908  example.com/stylesheet\.css
3909
3910 # Create a short, easy to remember nickname for a favorite site
3911 # (relies on the browser accept and forward invalid URLs to <span class=
3912 "APPLICATION">Privoxy</span>)
3913 { +redirect{http://www.privoxy.org/user-manual/actions-file.html} }
3914  a
3915
3916 # Always use the expanded view for Undeadly.org articles
3917 # (Note the $ at the end of the URL pattern to make sure
3918 # the request for the rewritten URL isn't redirected as well)
3919 {+redirect{s@$@&amp;mode=expanded@}}
3920 undeadly.org/cgi\?action=article&amp;sid=\d*$
3921
3922 # Redirect Google search requests to MSN
3923 {+redirect{s@^http://[^/]*/search\?q=([^&amp;]*).*@http://search.msn.com/results.aspx?q=$1@}}
3924 .google.com/search
3925
3926 # Redirect MSN search requests to Yahoo
3927 {+redirect{s@^http://[^/]*/results\.aspx\?q=([^&amp;]*).*@http://search.yahoo.com/search?p=$1@}}
3928 search.msn.com//results\.aspx\?q=
3929
3930 # Redirect remote requests for this manual
3931 # to the local version delivered by Privoxy
3932 {+redirect{s@^http://www@http://config@}}
3933 www.privoxy.org/user-manual/
3934 </pre>
3935                   </td>
3936                 </tr>
3937               </table>
3938             </dd>
3939           </dl>
3940         </div>
3941       </div>
3942
3943       <div class="SECT3">
3944         <h4 class="SECT3"><a name="SERVER-HEADER-FILTER" id=
3945         "SERVER-HEADER-FILTER">8.5.31. server-header-filter</a></h4>
3946
3947         <div class="VARIABLELIST">
3948           <dl>
3949             <dt>Typical use:</dt>
3950
3951             <dd>
3952               <p>Rewrite or remove single server headers.</p>
3953             </dd>
3954
3955             <dt>Effect:</dt>
3956
3957             <dd>
3958               <p>All server headers to which this action applies are filtered
3959               on-the-fly through the specified regular expression based
3960               substitutions.</p>
3961             </dd>
3962
3963             <dt>Type:</dt>
3964
3965             <dd>
3966               <p>Parameterized.</p>
3967             </dd>
3968
3969             <dt>Parameter:</dt>
3970
3971             <dd>
3972               <p>The name of a server-header filter, as defined in one of the
3973               <a href="filter-file.html">filter files</a>.</p>
3974             </dd>
3975
3976             <dt>Notes:</dt>
3977
3978             <dd>
3979               <p>Server-header filters are applied to each header on its own,
3980               not to all at once. This makes it easier to diagnose problems,
3981               but on the downside you can't write filters that only change
3982               header x if header y's value is z. You can do that by using
3983               tags though.</p>
3984
3985               <p>Server-header filters are executed after the other header
3986               actions have finished and use their output as input.</p>
3987
3988               <p>Please refer to the <a href="filter-file.html">filter file
3989               chapter</a> to learn which server-header filters are available
3990               by default, and how to create your own.</p>
3991             </dd>
3992
3993             <dt>Example usage (section):</dt>
3994
3995             <dd>
3996               <table border="0" bgcolor="#E0E0E0" width="90%">
3997                 <tr>
3998                   <td>
3999                     <pre class="SCREEN">
4000 {+server-header-filter{html-to-xml}}
4001 example.org/xml-instance-that-is-delivered-as-html
4002
4003 {+server-header-filter{xml-to-html}}
4004 example.org/instance-that-is-delivered-as-xml-but-is-not
4005
4006 </pre>
4007                   </td>
4008                 </tr>
4009               </table>
4010             </dd>
4011           </dl>
4012         </div>
4013       </div>
4014
4015       <div class="SECT3">
4016         <h4 class="SECT3"><a name="SERVER-HEADER-TAGGER" id=
4017         "SERVER-HEADER-TAGGER">8.5.32. server-header-tagger</a></h4>
4018
4019         <div class="VARIABLELIST">
4020           <dl>
4021             <dt>Typical use:</dt>
4022
4023             <dd>
4024               <p>Enable or disable filters based on the Content-Type
4025               header.</p>
4026             </dd>
4027
4028             <dt>Effect:</dt>
4029
4030             <dd>
4031               <p>Server headers to which this action applies are filtered
4032               on-the-fly through the specified regular expression based
4033               substitutions, the result is used as tag.</p>
4034             </dd>
4035
4036             <dt>Type:</dt>
4037
4038             <dd>
4039               <p>Parameterized.</p>
4040             </dd>
4041
4042             <dt>Parameter:</dt>
4043
4044             <dd>
4045               <p>The name of a server-header tagger, as defined in one of the
4046               <a href="filter-file.html">filter files</a>.</p>
4047             </dd>
4048
4049             <dt>Notes:</dt>
4050
4051             <dd>
4052               <p>Server-header taggers are applied to each header on its own,
4053               and as the header isn't modified, each tagger <span class=
4054               "QUOTE">"sees"</span> the original.</p>
4055
4056               <p>Server-header taggers are executed before all other header
4057               actions that modify server headers. Their tags can be used to
4058               control all of the other server-header actions, the content
4059               filters and the crunch actions (<a href=
4060               "actions-file.html#REDIRECT">redirect</a> and <a href=
4061               "actions-file.html#BLOCK">block</a>).</p>
4062
4063               <p>Obviously crunching based on tags created by server-header
4064               taggers doesn't prevent the request from showing up in the
4065               server's log file.</p>
4066             </dd>
4067
4068             <dt>Example usage (section):</dt>
4069
4070             <dd>
4071               <table border="0" bgcolor="#E0E0E0" width="90%">
4072                 <tr>
4073                   <td>
4074                     <pre class="SCREEN">
4075 # Tag every request with the content type declared by the server
4076 {+server-header-tagger{content-type}}
4077 /
4078
4079 </pre>
4080                   </td>
4081                 </tr>
4082               </table>
4083             </dd>
4084           </dl>
4085         </div>
4086       </div>
4087
4088       <div class="SECT3">
4089         <h4 class="SECT3"><a name="SESSION-COOKIES-ONLY" id=
4090         "SESSION-COOKIES-ONLY">8.5.33. session-cookies-only</a></h4>
4091
4092         <div class="VARIABLELIST">
4093           <dl>
4094             <dt>Typical use:</dt>
4095
4096             <dd>
4097               <p>Allow only temporary <span class="QUOTE">"session"</span>
4098               cookies (for the current browser session <span class=
4099               "emphasis"><i class="EMPHASIS">only</i></span>).</p>
4100             </dd>
4101
4102             <dt>Effect:</dt>
4103
4104             <dd>
4105               <p>Deletes the <span class="QUOTE">"expires"</span> field from
4106               <span class="QUOTE">"Set-Cookie:"</span> server headers. Most
4107               browsers will not store such cookies permanently and forget
4108               them in between sessions.</p>
4109             </dd>
4110
4111             <dt>Type:</dt>
4112
4113             <dd>
4114               <p>Boolean.</p>
4115             </dd>
4116
4117             <dt>Parameter:</dt>
4118
4119             <dd>
4120               <p>N/A</p>
4121             </dd>
4122
4123             <dt>Notes:</dt>
4124
4125             <dd>
4126               <p>This is less strict than <tt class="LITERAL"><a href=
4127               "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a></tt>
4128               / <tt class="LITERAL"><a href=
4129               "actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a></tt>
4130               and allows you to browse websites that insist or rely on
4131               setting cookies, without compromising your privacy too
4132               badly.</p>
4133
4134               <p>Most browsers will not permanently store cookies that have
4135               been processed by <tt class="LITERAL">session-cookies-only</tt>
4136               and will forget about them between sessions. This makes
4137               profiling cookies useless, but won't break sites which require
4138               cookies so that you can log in for transactions. This is
4139               generally turned on for all sites, and is the recommended
4140               setting.</p>
4141
4142               <p>It makes <span class="emphasis"><i class="EMPHASIS">no sense
4143               at all</i></span> to use <tt class=
4144               "LITERAL">session-cookies-only</tt> together with <tt class=
4145               "LITERAL"><a href=
4146               "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a></tt>
4147               or <tt class="LITERAL"><a href=
4148               "actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a></tt>.
4149               If you do, cookies will be plainly killed.</p>
4150
4151               <p>Note that it is up to the browser how it handles such
4152               cookies without an <span class="QUOTE">"expires"</span> field.
4153               If you use an exotic browser, you might want to try it out to
4154               be sure.</p>
4155
4156               <p>This setting also has no effect on cookies that may have
4157               been stored previously by the browser before starting
4158               <span class="APPLICATION">Privoxy</span>. These would have to
4159               be removed manually.</p>
4160
4161               <p><span class="APPLICATION">Privoxy</span> also uses the
4162               <a href=
4163               "actions-file.html#FILTER-CONTENT-COOKIES">content-cookies
4164               filter</a> to block some types of cookies. Content cookies are
4165               not effected by <tt class=
4166               "LITERAL">session-cookies-only</tt>.</p>
4167             </dd>
4168
4169             <dt>Example usage:</dt>
4170
4171             <dd>
4172               <table border="0" bgcolor="#E0E0E0" width="90%">
4173                 <tr>
4174                   <td>
4175                     <pre class="SCREEN">
4176 +session-cookies-only
4177 </pre>
4178                   </td>
4179                 </tr>
4180               </table>
4181             </dd>
4182           </dl>
4183         </div>
4184       </div>
4185
4186       <div class="SECT3">
4187         <h4 class="SECT3"><a name="SET-IMAGE-BLOCKER" id=
4188         "SET-IMAGE-BLOCKER">8.5.34. set-image-blocker</a></h4>
4189
4190         <div class="VARIABLELIST">
4191           <dl>
4192             <dt>Typical use:</dt>
4193
4194             <dd>
4195               <p>Choose the replacement for blocked images</p>
4196             </dd>
4197
4198             <dt>Effect:</dt>
4199
4200             <dd>
4201               <p>This action alone doesn't do anything noticeable. If
4202               <span class="emphasis"><i class="EMPHASIS">both</i></span>
4203               <tt class="LITERAL"><a href=
4204               "actions-file.html#BLOCK">block</a></tt> <span class=
4205               "emphasis"><i class="EMPHASIS">and</i></span> <tt class=
4206               "LITERAL"><a href=
4207               "actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a></tt>
4208               <span class="emphasis"><i class="EMPHASIS">also</i></span>
4209               apply, i.e. if the request is to be blocked as an image,
4210               <span class="emphasis"><i class="EMPHASIS">then</i></span> the
4211               parameter of this action decides what will be sent as a
4212               replacement.</p>
4213             </dd>
4214
4215             <dt>Type:</dt>
4216
4217             <dd>
4218               <p>Parameterized.</p>
4219             </dd>
4220
4221             <dt>Parameter:</dt>
4222
4223             <dd>
4224               <ul>
4225                 <li>
4226                   <p><span class="QUOTE">"pattern"</span> to send a built-in
4227                   checkerboard pattern image. The image is visually decent,
4228                   scales very well, and makes it obvious where banners were
4229                   busted.</p>
4230                 </li>
4231
4232                 <li>
4233                   <p><span class="QUOTE">"blank"</span> to send a built-in
4234                   transparent image. This makes banners disappear completely,
4235                   but makes it hard to detect where <span class=
4236                   "APPLICATION">Privoxy</span> has blocked images on a given
4237                   page and complicates troubleshooting if <span class=
4238                   "APPLICATION">Privoxy</span> has blocked innocent images,
4239                   like navigation icons.</p>
4240                 </li>
4241
4242                 <li>
4243                   <p><span class="QUOTE">"<tt class=
4244                   "REPLACEABLE"><i>target-url</i></tt>"</span> to send a
4245                   redirect to <tt class="REPLACEABLE"><i>target-url</i></tt>.
4246                   You can redirect to any image anywhere, even in your local
4247                   filesystem via <span class="QUOTE">"file:///"</span> URL.
4248                   (But note that not all browsers support redirecting to a
4249                   local file system).</p>
4250
4251                   <p>A good application of redirects is to use special
4252                   <span class="APPLICATION">Privoxy</span>-built-in URLs,
4253                   which send the built-in images, as <tt class=
4254                   "REPLACEABLE"><i>target-url</i></tt>. This has the same
4255                   visual effect as specifying <span class=
4256                   "QUOTE">"blank"</span> or <span class=
4257                   "QUOTE">"pattern"</span> in the first place, but enables
4258                   your browser to cache the replacement image, instead of
4259                   requesting it over and over again.</p>
4260                 </li>
4261               </ul>
4262             </dd>
4263
4264             <dt>Notes:</dt>
4265
4266             <dd>
4267               <p>The URLs for the built-in images are <span class=
4268               "QUOTE">"http://config.privoxy.org/send-banner?type=<tt class=
4269               "REPLACEABLE"><i>type</i></tt>"</span>, where <tt class=
4270               "REPLACEABLE"><i>type</i></tt> is either <span class=
4271               "QUOTE">"blank"</span> or <span class=
4272               "QUOTE">"pattern"</span>.</p>
4273
4274               <p>There is a third (advanced) type, called <span class=
4275               "QUOTE">"auto"</span>. It is <span class="emphasis"><i class=
4276               "EMPHASIS">NOT</i></span> to be used in <tt class=
4277               "LITERAL">set-image-blocker</tt>, but meant for use from
4278               <a href="filter-file.html">filters</a>. Auto will select the
4279               type of image that would have applied to the referring page,
4280               had it been an image.</p>
4281             </dd>
4282
4283             <dt>Example usage:</dt>
4284
4285             <dd>
4286               <p>Built-in pattern:</p>
4287
4288               <table border="0" bgcolor="#E0E0E0" width="90%">
4289                 <tr>
4290                   <td>
4291                     <pre class="SCREEN">
4292 +set-image-blocker{pattern}
4293 </pre>
4294                   </td>
4295                 </tr>
4296               </table>
4297
4298               <p>Redirect to the BSD daemon:</p>
4299
4300               <table border="0" bgcolor="#E0E0E0" width="90%">
4301                 <tr>
4302                   <td>
4303                     <pre class="SCREEN">
4304 +set-image-blocker{http://www.freebsd.org/gifs/dae_up3.gif}
4305 </pre>
4306                   </td>
4307                 </tr>
4308               </table>
4309
4310               <p>Redirect to the built-in pattern for better caching:</p>
4311
4312               <table border="0" bgcolor="#E0E0E0" width="90%">
4313                 <tr>
4314                   <td>
4315                     <pre class="SCREEN">
4316 +set-image-blocker{http://config.privoxy.org/send-banner?type=pattern}
4317 </pre>
4318                   </td>
4319                 </tr>
4320               </table>
4321             </dd>
4322           </dl>
4323         </div>
4324       </div>
4325
4326       <div class="SECT3">
4327         <h3 class="SECT3"><a name="AEN4789" id="AEN4789">8.5.35.
4328         Summary</a></h3>
4329
4330         <p>Note that many of these actions have the potential to cause a page
4331         to misbehave, possibly even not to display at all. There are many
4332         ways a site designer may choose to design his site, and what HTTP
4333         header content, and other criteria, he may depend on. There is no way
4334         to have hard and fast rules for all sites. See the <a href=
4335         "appendix.html#ACTIONSANAT">Appendix</a> for a brief example on
4336         troubleshooting actions.</p>
4337       </div>
4338     </div>
4339
4340     <div class="SECT2">
4341       <h2 class="SECT2"><a name="ALIASES" id="ALIASES">8.6. Aliases</a></h2>
4342
4343       <p>Custom <span class="QUOTE">"actions"</span>, known to <span class=
4344       "APPLICATION">Privoxy</span> as <span class="QUOTE">"aliases"</span>,
4345       can be defined by combining other actions. These can in turn be invoked
4346       just like the built-in actions. Currently, an alias name can contain
4347       any character except space, tab, <span class="QUOTE">"="</span>,
4348       <span class="QUOTE">"{"</span> and <span class="QUOTE">"}"</span>, but
4349       we <span class="emphasis"><i class="EMPHASIS">strongly
4350       recommend</i></span> that you only use <span class="QUOTE">"a"</span>
4351       to <span class="QUOTE">"z"</span>, <span class="QUOTE">"0"</span> to
4352       <span class="QUOTE">"9"</span>, <span class="QUOTE">"+"</span>, and
4353       <span class="QUOTE">"-"</span>. Alias names are not case sensitive, and
4354       are not required to start with a <span class="QUOTE">"+"</span> or
4355       <span class="QUOTE">"-"</span> sign, since they are merely textually
4356       expanded.</p>
4357
4358       <p>Aliases can be used throughout the actions file, but they
4359       <span class="emphasis"><i class="EMPHASIS">must be defined in a special
4360       section at the top of the file!</i></span> And there can only be one
4361       such section per actions file. Each actions file may have its own alias
4362       section, and the aliases defined in it are only visible within that
4363       file.</p>
4364
4365       <p>There are two main reasons to use aliases: One is to save typing for
4366       frequently used combinations of actions, the other one is a gain in
4367       flexibility: If you decide once how you want to handle shops by
4368       defining an alias called <span class="QUOTE">"shop"</span>, you can
4369       later change your policy on shops in <span class="emphasis"><i class=
4370       "EMPHASIS">one</i></span> place, and your changes will take effect
4371       everywhere in the actions file where the <span class=
4372       "QUOTE">"shop"</span> alias is used. Calling aliases by their purpose
4373       also makes your actions files more readable.</p>
4374
4375       <p>Currently, there is one big drawback to using aliases, though:
4376       <span class="APPLICATION">Privoxy</span>'s built-in web-based action
4377       file editor honors aliases when reading the actions files, but it
4378       expands them before writing. So the effects of your aliases are of
4379       course preserved, but the aliases themselves are lost when you edit
4380       sections that use aliases with it.</p>
4381
4382       <p>Now let's define some aliases...</p>
4383
4384       <table border="0" bgcolor="#E0E0E0" width="100%">
4385         <tr>
4386           <td>
4387             <pre class="SCREEN">
4388  # Useful custom aliases we can use later.
4389  #
4390  # Note the (required!) section header line and that this section
4391  # must be at the top of the actions file!
4392  #
4393  {{alias}}
4394
4395  # These aliases just save typing later:
4396  # (Note that some already use other aliases!)
4397  #
4398  +crunch-all-cookies = +<a href=
4399 "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a> +<a href="actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a>
4400  -crunch-all-cookies = -<a href=
4401 "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a> -<a href="actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a>
4402  +block-as-image      = +block{Blocked image.} +handle-as-image
4403  allow-all-cookies   = -crunch-all-cookies -<a href=
4404 "actions-file.html#SESSION-COOKIES-ONLY">session-cookies-only</a> -<a href=
4405 "actions-file.html#FILTER-CONTENT-COOKIES">filter{content-cookies}</a>
4406
4407  # These aliases define combinations of actions
4408  # that are useful for certain types of sites:
4409  #
4410  fragile     = -<a href="actions-file.html#BLOCK">block</a> -<a href=
4411 "actions-file.html#FILTER">filter</a> -crunch-all-cookies -<a href=
4412 "actions-file.html#FAST-REDIRECTS">fast-redirects</a> -<a href=
4413 "actions-file.html#HIDE-REFERER">hide-referrer</a> -<a href=
4414 "actions-file.html#PREVENT-COMPRESSION">prevent-compression</a>
4415
4416  shop        = -crunch-all-cookies -<a href=
4417 "actions-file.html#FILTER-ALL-POPUPS">filter{all-popups}</a>
4418
4419  # Short names for other aliases, for really lazy people ;-)
4420  #
4421  c0 = +crunch-all-cookies
4422  c1 = -crunch-all-cookies
4423 </pre>
4424           </td>
4425         </tr>
4426       </table>
4427
4428       <p>...and put them to use. These sections would appear in the lower
4429       part of an actions file and define exceptions to the default actions
4430       (as specified further up for the <span class="QUOTE">"/"</span>
4431       pattern):</p>
4432
4433       <table border="0" bgcolor="#E0E0E0" width="100%">
4434         <tr>
4435           <td>
4436             <pre class="SCREEN">
4437  # These sites are either very complex or very keen on
4438  # user data and require minimal interference to work:
4439  #
4440  {fragile}
4441  .office.microsoft.com
4442  .windowsupdate.microsoft.com
4443  # Gmail is really mail.google.com, not gmail.com
4444  mail.google.com
4445
4446  # Shopping sites:
4447  # Allow cookies (for setting and retrieving your customer data)
4448  #
4449  {shop}
4450  .quietpc.com
4451  .worldpay.com   # for quietpc.com
4452  mybank.example.com
4453
4454  # These shops require pop-ups:
4455  #
4456  {-filter{all-popups} -filter{unsolicited-popups}}
4457   .dabs.com
4458   .overclockers.co.uk
4459 </pre>
4460           </td>
4461         </tr>
4462       </table>
4463
4464       <p>Aliases like <span class="QUOTE">"shop"</span> and <span class=
4465       "QUOTE">"fragile"</span> are typically used for <span class=
4466       "QUOTE">"problem"</span> sites that require more than one action to be
4467       disabled in order to function properly.</p>
4468     </div>
4469
4470     <div class="SECT2">
4471       <h2 class="SECT2"><a name="ACT-EXAMPLES" id="ACT-EXAMPLES">8.7. Actions
4472       Files Tutorial</a></h2>
4473
4474       <p>The above chapters have shown <a href="actions-file.html">which
4475       actions files there are and how they are organized</a>, how actions are
4476       <a href="actions-file.html#ACTIONS">specified</a> and <a href=
4477       "actions-file.html#ACTIONS-APPLY">applied to URLs</a>, how <a href=
4478       "actions-file.html#AF-PATTERNS">patterns</a> work, and how to define
4479       and use <a href="actions-file.html#ALIASES">aliases</a>. Now, let's
4480       look at an example <tt class="FILENAME">match-all.action</tt>,
4481       <tt class="FILENAME">default.action</tt> and <tt class=
4482       "FILENAME">user.action</tt> file and see how all these pieces come
4483       together:</p>
4484
4485       <div class="SECT3">
4486         <h3 class="SECT3"><a name="AEN4853" id="AEN4853">8.7.1.
4487         match-all.action</a></h3>
4488
4489         <p>Remember <span class="emphasis"><i class="EMPHASIS">all actions
4490         are disabled when matching starts</i></span>, so we have to
4491         explicitly enable the ones we want.</p>
4492
4493         <p>While the <tt class="FILENAME">match-all.action</tt> file only
4494         contains a single section, it is probably the most important one. It
4495         has only one pattern, <span class="QUOTE">"<tt class=
4496         "LITERAL">/</tt>"</span>, but this pattern <a href=
4497         "actions-file.html#AF-PATTERNS">matches all URLs</a>. Therefore, the
4498         set of actions used in this <span class="QUOTE">"default"</span>
4499         section <span class="emphasis"><i class="EMPHASIS">will be applied to
4500         all requests as a start</i></span>. It can be partly or wholly
4501         overridden by other actions files like <tt class=
4502         "FILENAME">default.action</tt> and <tt class=
4503         "FILENAME">user.action</tt>, but it will still be largely responsible
4504         for your overall browsing experience.</p>
4505
4506         <p>Again, at the start of matching, all actions are disabled, so
4507         there is no need to disable any actions here. (Remember: a
4508         <span class="QUOTE">"+"</span> preceding the action name enables the
4509         action, a <span class="QUOTE">"-"</span> disables!). Also note how
4510         this long line has been made more readable by splitting it into
4511         multiple lines with line continuation.</p>
4512
4513         <table border="0" bgcolor="#E0E0E0" width="100%">
4514           <tr>
4515             <td>
4516               <pre class="SCREEN">
4517 { \
4518  +<a href=
4519 "actions-file.html#CHANGE-X-FORWARDED-FOR">change-x-forwarded-for{block}</a> \
4520  +<a href="actions-file.html#HIDE-FROM-HEADER">hide-from-header{block}</a> \
4521  +<a href=
4522 "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker{pattern}</a> \
4523 }
4524 / # Match all URLs
4525
4526 </pre>
4527             </td>
4528           </tr>
4529         </table>
4530
4531         <p>The default behavior is now set.</p>
4532       </div>
4533
4534       <div class="SECT3">
4535         <h3 class="SECT3"><a name="AEN4875" id="AEN4875">8.7.2.
4536         default.action</a></h3>
4537
4538         <p>If you aren't a developer, there's no need for you to edit the
4539         <tt class="FILENAME">default.action</tt> file. It is maintained by
4540         the <span class="APPLICATION">Privoxy</span> developers and if you
4541         disagree with some of the sections, you should overrule them in your
4542         <tt class="FILENAME">user.action</tt>.</p>
4543
4544         <p>Understanding the <tt class="FILENAME">default.action</tt> file
4545         can help you with your <tt class="FILENAME">user.action</tt>,
4546         though.</p>
4547
4548         <p>The first section in this file is a special section for internal
4549         use that prevents older <span class="APPLICATION">Privoxy</span>
4550         versions from reading the file:</p>
4551
4552         <table border="0" bgcolor="#E0E0E0" width="100%">
4553           <tr>
4554             <td>
4555               <pre class="SCREEN">
4556 ##########################################################################
4557 # Settings -- Don't change! For internal Privoxy use ONLY.
4558 ##########################################################################
4559 {{settings}}
4560 for-privoxy-version=3.0.11
4561 </pre>
4562             </td>
4563           </tr>
4564         </table>
4565
4566         <p>After that comes the (optional) alias section. We'll use the
4567         example section from the above <a href=
4568         "actions-file.html#ALIASES">chapter on aliases</a>, that also
4569         explains why and how aliases are used:</p>
4570
4571         <table border="0" bgcolor="#E0E0E0" width="100%">
4572           <tr>
4573             <td>
4574               <pre class="SCREEN">
4575 ##########################################################################
4576 # Aliases
4577 ##########################################################################
4578 {{alias}}
4579
4580  # These aliases just save typing later:
4581  # (Note that some already use other aliases!)
4582  #
4583  +crunch-all-cookies = +<a href=
4584 "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a> +<a href="actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a>
4585  -crunch-all-cookies = -<a href=
4586 "actions-file.html#CRUNCH-INCOMING-COOKIES">crunch-incoming-cookies</a> -<a href="actions-file.html#CRUNCH-OUTGOING-COOKIES">crunch-outgoing-cookies</a>
4587  +block-as-image      = +block{Blocked image.} +handle-as-image
4588  mercy-for-cookies   = -crunch-all-cookies -<a href=
4589 "actions-file.html#SESSION-COOKIES-ONLY">session-cookies-only</a> -<a href=
4590 "actions-file.html#FILTER-CONTENT-COOKIES">filter{content-cookies}</a>
4591
4592  # These aliases define combinations of actions
4593  # that are useful for certain types of sites:
4594  #
4595  fragile     = -<a href="actions-file.html#BLOCK">block</a> -<a href=
4596 "actions-file.html#FILTER">filter</a> -crunch-all-cookies -<a href=
4597 "actions-file.html#FAST-REDIRECTS">fast-redirects</a> -<a href=
4598 "actions-file.html#HIDE-REFERER">hide-referrer</a>
4599  shop        = -crunch-all-cookies -<a href=
4600 "actions-file.html#FILTER-ALL-POPUPS">filter{all-popups}</a>
4601 </pre>
4602             </td>
4603           </tr>
4604         </table>
4605
4606         <p>The first of our specialized sections is concerned with
4607         <span class="QUOTE">"fragile"</span> sites, i.e. sites that require
4608         minimum interference, because they are either very complex or very
4609         keen on tracking you (and have mechanisms in place that make them
4610         unusable for people who avoid being tracked). We will simply use our
4611         pre-defined <tt class="LITERAL">fragile</tt> alias instead of stating
4612         the list of actions explicitly:</p>
4613
4614         <table border="0" bgcolor="#E0E0E0" width="100%">
4615           <tr>
4616             <td>
4617               <pre class="SCREEN">
4618 ##########################################################################
4619 # Exceptions for sites that'll break under the default action set:
4620 ##########################################################################
4621
4622 # "Fragile" Use a minimum set of actions for these sites (see alias above):
4623 #
4624 { fragile }
4625 .office.microsoft.com           # surprise, surprise!
4626 .windowsupdate.microsoft.com
4627 mail.google.com
4628 </pre>
4629             </td>
4630           </tr>
4631         </table>
4632
4633         <p>Shopping sites are not as fragile, but they typically require
4634         cookies to log in, and pop-up windows for shopping carts or item
4635         details. Again, we'll use a pre-defined alias:</p>
4636
4637         <table border="0" bgcolor="#E0E0E0" width="100%">
4638           <tr>
4639             <td>
4640               <pre class="SCREEN">
4641 # Shopping sites:
4642 #
4643 { shop }
4644 .quietpc.com
4645 .worldpay.com   # for quietpc.com
4646 .jungle.com
4647 .scan.co.uk
4648 </pre>
4649             </td>
4650           </tr>
4651         </table>
4652
4653         <p>The <tt class="LITERAL"><a href=
4654         "actions-file.html#FAST-REDIRECTS">fast-redirects</a></tt> action,
4655         which may have been enabled in <tt class=
4656         "FILENAME">match-all.action</tt>, breaks some sites. So disable it
4657         for popular sites where we know it misbehaves:</p>
4658
4659         <table border="0" bgcolor="#E0E0E0" width="100%">
4660           <tr>
4661             <td>
4662               <pre class="SCREEN">
4663 { -<a href="actions-file.html#FAST-REDIRECTS">fast-redirects</a> }
4664 login.yahoo.com
4665 edit.*.yahoo.com
4666 .google.com
4667 .altavista.com/.*(like|url|link):http
4668 .altavista.com/trans.*urltext=http
4669 .nytimes.com
4670 </pre>
4671             </td>
4672           </tr>
4673         </table>
4674
4675         <p>It is important that <span class="APPLICATION">Privoxy</span>
4676         knows which URLs belong to images, so that <span class=
4677         "emphasis"><i class="EMPHASIS">if</i></span> they are to be blocked,
4678         a substitute image can be sent, rather than an HTML page. Contacting
4679         the remote site to find out is not an option, since it would destroy
4680         the loading time advantage of banner blocking, and it would feed the
4681         advertisers information about you. We can mark any URL as an image
4682         with the <tt class="LITERAL"><a href=
4683         "actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a></tt> action,
4684         and marking all URLs that end in a known image file extension is a
4685         good start:</p>
4686
4687         <table border="0" bgcolor="#E0E0E0" width="100%">
4688           <tr>
4689             <td>
4690               <pre class="SCREEN">
4691 ##########################################################################
4692 # Images:
4693 ##########################################################################
4694
4695 # Define which file types will be treated as images, in case they get
4696 # blocked further down this file:
4697 #
4698 { +<a href="actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a> }
4699 /.*\.(gif|jpe?g|png|bmp|ico)$
4700 </pre>
4701             </td>
4702           </tr>
4703         </table>
4704
4705         <p>And then there are known banner sources. They often use scripts to
4706         generate the banners, so it won't be visible from the URL that the
4707         request is for an image. Hence we block them <span class=
4708         "emphasis"><i class="EMPHASIS">and</i></span> mark them as images in
4709         one go, with the help of our <tt class="LITERAL">+block-as-image</tt>
4710         alias defined above. (We could of course just as well use <tt class=
4711         "LITERAL">+<a href="actions-file.html#BLOCK">block</a> +<a href=
4712         "actions-file.html#HANDLE-AS-IMAGE">handle-as-image</a></tt> here.)
4713         Remember that the type of the replacement image is chosen by the
4714         <tt class="LITERAL"><a href=
4715         "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker</a></tt>
4716         action. Since all URLs have matched the default section with its
4717         <tt class="LITERAL">+<a href=
4718         "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker</a>{pattern}</tt>
4719         action before, it still applies and needn't be repeated:</p>
4720
4721         <table border="0" bgcolor="#E0E0E0" width="100%">
4722           <tr>
4723             <td>
4724               <pre class="SCREEN">
4725 # Known ad generators:
4726 #
4727 { +block-as-image }
4728 ar.atwola.com
4729 .ad.doubleclick.net
4730 .ad.*.doubleclick.net
4731 .a.yimg.com/(?:(?!/i/).)*$
4732 .a[0-9].yimg.com/(?:(?!/i/).)*$
4733 bs*.gsanet.com
4734 .qkimg.net
4735 </pre>
4736             </td>
4737           </tr>
4738         </table>
4739
4740         <p>One of the most important jobs of <span class=
4741         "APPLICATION">Privoxy</span> is to block banners. Many of these can
4742         be <span class="QUOTE">"blocked"</span> by the <tt class=
4743         "LITERAL"><a href=
4744         "actions-file.html#FILTER">filter</a>{banners-by-size}</tt> action,
4745         which we enabled above, and which deletes the references to banner
4746         images from the pages while they are loaded, so the browser doesn't
4747         request them anymore, and hence they don't need to be blocked here.
4748         But this naturally doesn't catch all banners, and some people choose
4749         not to use filters, so we need a comprehensive list of patterns for
4750         banner URLs here, and apply the <tt class="LITERAL"><a href=
4751         "actions-file.html#BLOCK">block</a></tt> action to them.</p>
4752
4753         <p>First comes many generic patterns, which do most of the work, by
4754         matching typical domain and path name components of banners. Then
4755         comes a list of individual patterns for specific sites, which is
4756         omitted here to keep the example short:</p>
4757
4758         <table border="0" bgcolor="#E0E0E0" width="100%">
4759           <tr>
4760             <td>
4761               <pre class="SCREEN">
4762 ##########################################################################
4763 # Block these fine banners:
4764 ##########################################################################
4765 { <a href="actions-file.html#BLOCK">+block{Banner ads.}</a> }
4766
4767 # Generic patterns:
4768 #
4769 ad*.
4770 .*ads.
4771 banner?.
4772 count*.
4773 /.*count(er)?\.(pl|cgi|exe|dll|asp|php[34]?)
4774 /(?:.*/)?(publicite|werbung|rekla(ma|me|am)|annonse|maino(kset|nta|s)?)/
4775
4776 # Site-specific patterns (abbreviated):
4777 #
4778 .hitbox.com
4779 </pre>
4780             </td>
4781           </tr>
4782         </table>
4783
4784         <p>It's quite remarkable how many advertisers actually call their
4785         banner servers ads.<tt class="REPLACEABLE"><i>company</i></tt>.com,
4786         or call the directory in which the banners are stored simply
4787         <span class="QUOTE">"banners"</span>. So the above generic patterns
4788         are surprisingly effective.</p>
4789
4790         <p>But being very generic, they necessarily also catch URLs that we
4791         don't want to block. The pattern <tt class="LITERAL">.*ads.</tt> e.g.
4792         catches <span class="QUOTE">"nasty-<span class="emphasis"><i class=
4793         "EMPHASIS">ads</i></span>.nasty-corp.com"</span> as intended, but
4794         also <span class="QUOTE">"downlo<span class="emphasis"><i class=
4795         "EMPHASIS">ads</i></span>.sourcefroge.net"</span> or <span class=
4796         "QUOTE">"<span class="emphasis"><i class=
4797         "EMPHASIS">ads</i></span>l.some-provider.net."</span> So here come
4798         some well-known exceptions to the <tt class="LITERAL">+<a href=
4799         "actions-file.html#BLOCK">block</a></tt> section above.</p>
4800
4801         <p>Note that these are exceptions to exceptions from the default!
4802         Consider the URL <span class=
4803         "QUOTE">"downloads.sourcefroge.net"</span>: Initially, all actions
4804         are deactivated, so it wouldn't get blocked. Then comes the defaults
4805         section, which matches the URL, but just deactivates the <tt class=
4806         "LITERAL"><a href="actions-file.html#BLOCK">block</a></tt> action
4807         once again. Then it matches <tt class="LITERAL">.*ads.</tt>, an
4808         exception to the general non-blocking policy, and suddenly <tt class=
4809         "LITERAL"><a href="actions-file.html#BLOCK">+block</a></tt> applies.
4810         And now, it'll match <tt class="LITERAL">.*loads.</tt>, where
4811         <tt class="LITERAL"><a href="actions-file.html#BLOCK">-block</a></tt>
4812         applies, so (unless it matches <span class="emphasis"><i class=
4813         "EMPHASIS">again</i></span> further down) it ends up with no
4814         <tt class="LITERAL"><a href="actions-file.html#BLOCK">block</a></tt>
4815         action applying.</p>
4816
4817         <table border="0" bgcolor="#E0E0E0" width="100%">
4818           <tr>
4819             <td>
4820               <pre class="SCREEN">
4821 ##########################################################################
4822 # Save some innocent victims of the above generic block patterns:
4823 ##########################################################################
4824
4825 # By domain:
4826 #
4827 { -<a href="actions-file.html#BLOCK">block</a> }
4828 adv[io]*.  # (for advogato.org and advice.*)
4829 adsl.      # (has nothing to do with ads)
4830 adobe.     # (has nothing to do with ads either)
4831 ad[ud]*.   # (adult.* and add.*)
4832 .edu       # (universities don't host banners (yet!))
4833 .*loads.   # (downloads, uploads etc)
4834
4835 # By path:
4836 #
4837 /.*loads/
4838
4839 # Site-specific:
4840 #
4841 www.globalintersec.com/adv # (adv = advanced)
4842 www.ugu.com/sui/ugu/adv
4843 </pre>
4844             </td>
4845           </tr>
4846         </table>
4847
4848         <p>Filtering source code can have nasty side effects, so make an
4849         exception for our friends at sourceforge.net, and all paths with
4850         <span class="QUOTE">"cvs"</span> in them. Note that <tt class=
4851         "LITERAL">-<a href="actions-file.html#FILTER">filter</a></tt>
4852         disables <span class="emphasis"><i class="EMPHASIS">all</i></span>
4853         filters in one fell swoop!</p>
4854
4855         <table border="0" bgcolor="#E0E0E0" width="100%">
4856           <tr>
4857             <td>
4858               <pre class="SCREEN">
4859 # Don't filter code!
4860 #
4861 { -<a href="actions-file.html#FILTER">filter</a> }
4862 /(.*/)?cvs
4863 bugzilla.
4864 developer.
4865 wiki.
4866 .sourceforge.net
4867 </pre>
4868             </td>
4869           </tr>
4870         </table>
4871
4872         <p>The actual <tt class="FILENAME">default.action</tt> is of course
4873         much more comprehensive, but we hope this example made clear how it
4874         works.</p>
4875       </div>
4876
4877       <div class="SECT3">
4878         <h3 class="SECT3"><a name="AEN4988" id="AEN4988">8.7.3.
4879         user.action</a></h3>
4880
4881         <p>So far we are painting with a broad brush by setting general
4882         policies, which would be a reasonable starting point for many people.
4883         Now, you might want to be more specific and have customized rules
4884         that are more suitable to your personal habits and preferences. These
4885         would be for narrowly defined situations like your ISP or your bank,
4886         and should be placed in <tt class="FILENAME">user.action</tt>, which
4887         is parsed after all other actions files and hence has the last word,
4888         over-riding any previously defined actions. <tt class=
4889         "FILENAME">user.action</tt> is also a <span class=
4890         "emphasis"><i class="EMPHASIS">safe</i></span> place for your
4891         personal settings, since <tt class="FILENAME">default.action</tt> is
4892         actively maintained by the <span class="APPLICATION">Privoxy</span>
4893         developers and you'll probably want to install updated versions from
4894         time to time.</p>
4895
4896         <p>So let's look at a few examples of things that one might typically
4897         do in <tt class="FILENAME">user.action</tt>:</p>
4898
4899         <table border="0" bgcolor="#E0E0E0" width="100%">
4900           <tr>
4901             <td>
4902               <pre class="SCREEN">
4903 # My user.action file. &lt;fred@example.com&gt;
4904 </pre>
4905             </td>
4906           </tr>
4907         </table>
4908
4909         <p>As <a href="actions-file.html#ALIASES">aliases</a> are local to
4910         the actions file that they are defined in, you can't use the ones
4911         from <tt class="FILENAME">default.action</tt>, unless you repeat them
4912         here:</p>
4913
4914         <table border="0" bgcolor="#E0E0E0" width="100%">
4915           <tr>
4916             <td>
4917               <pre class="SCREEN">
4918 # Aliases are local to the file they are defined in.
4919 # (Re-)define aliases for this file:
4920 #
4921 {{alias}}
4922 #
4923 # These aliases just save typing later, and the alias names should
4924 # be self explanatory.
4925 #
4926 +crunch-all-cookies = +crunch-incoming-cookies +crunch-outgoing-cookies
4927 -crunch-all-cookies = -crunch-incoming-cookies -crunch-outgoing-cookies
4928  allow-all-cookies  = -crunch-all-cookies -session-cookies-only
4929  allow-popups       = -filter{all-popups}
4930 +block-as-image     = +block{Blocked as image.} +handle-as-image
4931 -block-as-image     = -block
4932
4933 # These aliases define combinations of actions that are useful for
4934 # certain types of sites:
4935 #
4936 fragile     = -block -crunch-all-cookies -filter -fast-redirects -hide-referrer
4937 shop        = -crunch-all-cookies allow-popups
4938
4939 # Allow ads for selected useful free sites:
4940 #
4941 allow-ads   = -block -filter{banners-by-size} -filter{banners-by-link}
4942
4943 # Alias for specific file types that are text, but might have conflicting
4944 # MIME types. We want the browser to force these to be text documents.
4945 handle-as-text = -<a href="actions-file.html#FILTER">filter</a> +-<a href=
4946 "actions-file.html#CONTENT-TYPE-OVERWRITE">content-type-overwrite{text/plain}</a> +-<a href="actions-file.html#FORCE-TEXT-MODE">force-text-mode</a> -<a href="actions-file.html#HIDE-CONTENT-DISPOSITION">hide-content-disposition</a>
4947 </pre>
4948             </td>
4949           </tr>
4950         </table>
4951
4952         <p>Say you have accounts on some sites that you visit regularly, and
4953         you don't want to have to log in manually each time. So you'd like to
4954         allow persistent cookies for these sites. The <tt class=
4955         "LITERAL">allow-all-cookies</tt> alias defined above does exactly
4956         that, i.e. it disables crunching of cookies in any direction, and the
4957         processing of cookies to make them only temporary.</p>
4958
4959         <table border="0" bgcolor="#E0E0E0" width="100%">
4960           <tr>
4961             <td>
4962               <pre class="SCREEN">
4963 { allow-all-cookies }
4964  sourceforge.net
4965  .yahoo.com
4966  .msdn.microsoft.com
4967  .redhat.com
4968 </pre>
4969             </td>
4970           </tr>
4971         </table>
4972
4973         <p>Your bank is allergic to some filter, but you don't know which, so
4974         you disable them all:</p>
4975
4976         <table border="0" bgcolor="#E0E0E0" width="100%">
4977           <tr>
4978             <td>
4979               <pre class="SCREEN">
4980 { -<a href="actions-file.html#FILTER">filter</a> }
4981  .your-home-banking-site.com
4982 </pre>
4983             </td>
4984           </tr>
4985         </table>
4986
4987         <p>Some file types you may not want to filter for various
4988         reasons:</p>
4989
4990         <table border="0" bgcolor="#E0E0E0" width="100%">
4991           <tr>
4992             <td>
4993               <pre class="SCREEN">
4994 # Technical documentation is likely to contain strings that might
4995 # erroneously get altered by the JavaScript-oriented filters:
4996 #
4997 .tldp.org
4998 /(.*/)?selfhtml/
4999
5000 # And this stupid host sends streaming video with a wrong MIME type,
5001 # so that Privoxy thinks it is getting HTML and starts filtering:
5002 #
5003 stupid-server.example.com/
5004 </pre>
5005             </td>
5006           </tr>
5007         </table>
5008
5009         <p>Example of a simple <a href="actions-file.html#BLOCK">block</a>
5010         action. Say you've seen an ad on your favourite page on example.com
5011         that you want to get rid of. You have right-clicked the image,
5012         selected <span class="QUOTE">"copy image location"</span> and pasted
5013         the URL below while removing the leading http://, into a <tt class=
5014         "LITERAL">{ +block{} }</tt> section. Note that <tt class="LITERAL">{
5015         +handle-as-image }</tt> need not be specified, since all URLs ending
5016         in <tt class="LITERAL">.gif</tt> will be tagged as images by the
5017         general rules as set in default.action anyway:</p>
5018
5019         <table border="0" bgcolor="#E0E0E0" width="100%">
5020           <tr>
5021             <td>
5022               <pre class="SCREEN">
5023 { +<a href="actions-file.html#BLOCK">block</a>{Nasty ads.} }
5024  www.example.com/nasty-ads/sponsor\.gif
5025  another.example.net/more/junk/here/
5026 </pre>
5027             </td>
5028           </tr>
5029         </table>
5030
5031         <p>The URLs of dynamically generated banners, especially from large
5032         banner farms, often don't use the well-known image file name
5033         extensions, which makes it impossible for <span class=
5034         "APPLICATION">Privoxy</span> to guess the file type just by looking
5035         at the URL. You can use the <tt class="LITERAL">+block-as-image</tt>
5036         alias defined above for these cases. Note that objects which match
5037         this rule but then turn out NOT to be an image are typically rendered
5038         as a <span class="QUOTE">"broken image"</span> icon by the browser.
5039         Use cautiously.</p>
5040
5041         <table border="0" bgcolor="#E0E0E0" width="100%">
5042           <tr>
5043             <td>
5044               <pre class="SCREEN">
5045 { +block-as-image }
5046  .doubleclick.net
5047  .fastclick.net
5048  /Realmedia/ads/
5049  ar.atwola.com/
5050 </pre>
5051             </td>
5052           </tr>
5053         </table>
5054
5055         <p>Now you noticed that the default configuration breaks Forbes
5056         Magazine, but you were too lazy to find out which action is the
5057         culprit, and you were again too lazy to give <a href=
5058         "contact.html">feedback</a>, so you just used the <tt class=
5059         "LITERAL">fragile</tt> alias on the site, and -- <span class=
5060         "emphasis"><i class="EMPHASIS">whoa!</i></span> -- it worked. The
5061         <tt class="LITERAL">fragile</tt> aliases disables those actions that
5062         are most likely to break a site. Also, good for testing purposes to
5063         see if it is <span class="APPLICATION">Privoxy</span> that is causing
5064         the problem or not. We later find other regular sites that misbehave,
5065         and add those to our personalized list of troublemakers:</p>
5066
5067         <table border="0" bgcolor="#E0E0E0" width="100%">
5068           <tr>
5069             <td>
5070               <pre class="SCREEN">
5071 { fragile }
5072  .forbes.com
5073  webmail.example.com
5074  .mybank.com
5075 </pre>
5076             </td>
5077           </tr>
5078         </table>
5079
5080         <p>You like the <span class="QUOTE">"fun"</span> text replacements in
5081         <tt class="FILENAME">default.filter</tt>, but it is disabled in the
5082         distributed actions file. So you'd like to turn it on in your
5083         private, update-safe config, once and for all:</p>
5084
5085         <table border="0" bgcolor="#E0E0E0" width="100%">
5086           <tr>
5087             <td>
5088               <pre class="SCREEN">
5089 { +<a href="actions-file.html#FILTER-FUN">filter{fun}</a> }
5090  / # For ALL sites!
5091 </pre>
5092             </td>
5093           </tr>
5094         </table>
5095
5096         <p>Note that the above is not really a good idea: There are
5097         exceptions to the filters in <tt class="FILENAME">default.action</tt>
5098         for things that really shouldn't be filtered, like code on
5099         CVS-&gt;Web interfaces. Since <tt class="FILENAME">user.action</tt>
5100         has the last word, these exceptions won't be valid for the
5101         <span class="QUOTE">"fun"</span> filtering specified here.</p>
5102
5103         <p>You might also worry about how your favourite free websites are
5104         funded, and find that they rely on displaying banner advertisements
5105         to survive. So you might want to specifically allow banners for those
5106         sites that you feel provide value to you:</p>
5107
5108         <table border="0" bgcolor="#E0E0E0" width="100%">
5109           <tr>
5110             <td>
5111               <pre class="SCREEN">
5112 { allow-ads }
5113  .sourceforge.net
5114  .slashdot.org
5115  .osdn.net
5116 </pre>
5117             </td>
5118           </tr>
5119         </table>
5120
5121         <p>Note that <tt class="LITERAL">allow-ads</tt> has been aliased to
5122         <tt class="LITERAL">-<a href=
5123         "actions-file.html#BLOCK">block</a></tt>, <tt class=
5124         "LITERAL">-<a href=
5125         "actions-file.html#FILTER-BANNERS-BY-SIZE">filter{banners-by-size}</a></tt>,
5126         and <tt class="LITERAL">-<a href=
5127         "actions-file.html#FILTER-BANNERS-BY-LINK">filter{banners-by-link}</a></tt>
5128         above.</p>
5129
5130         <p>Invoke another alias here to force an over-ride of the MIME type
5131         <tt class="LITERAL">application/x-sh</tt> which typically would open
5132         a download type dialog. In my case, I want to look at the shell
5133         script, and then I can save it should I choose to.</p>
5134
5135         <table border="0" bgcolor="#E0E0E0" width="100%">
5136           <tr>
5137             <td>
5138               <pre class="SCREEN">
5139 { handle-as-text }
5140  /.*\.sh$
5141 </pre>
5142             </td>
5143           </tr>
5144         </table>
5145
5146         <p><tt class="FILENAME">user.action</tt> is generally the best place
5147         to define exceptions and additions to the default policies of
5148         <tt class="FILENAME">default.action</tt>. Some actions are safe to
5149         have their default policies set here though. So let's set a default
5150         policy to have a <span class="QUOTE">"blank"</span> image as opposed
5151         to the checkerboard pattern for <span class="emphasis"><i class=
5152         "EMPHASIS">ALL</i></span> sites. <span class="QUOTE">"/"</span> of
5153         course matches all URL paths and patterns:</p>
5154
5155         <table border="0" bgcolor="#E0E0E0" width="100%">
5156           <tr>
5157             <td>
5158               <pre class="SCREEN">
5159 { +<a href=
5160 "actions-file.html#SET-IMAGE-BLOCKER">set-image-blocker{blank}</a> }
5161 / # ALL sites
5162 </pre>
5163             </td>
5164           </tr>
5165         </table>
5166       </div>
5167     </div>
5168   </div>
5169
5170   <div class="NAVFOOTER">
5171     <hr align="left" width="100%">
5172
5173     <table summary="Footer navigation table" width="100%" border="0"
5174     cellpadding="0" cellspacing="0">
5175       <tr>
5176         <td width="33%" align="left" valign="top"><a href="config.html"
5177         accesskey="P">Prev</a></td>
5178
5179         <td width="34%" align="center" valign="top"><a href="index.html"
5180         accesskey="H">Home</a></td>
5181
5182         <td width="33%" align="right" valign="top"><a href="filter-file.html"
5183         accesskey="N">Next</a></td>
5184       </tr>
5185
5186       <tr>
5187         <td width="33%" align="left" valign="top">The Main Configuration
5188         File</td>
5189
5190         <td width="34%" align="center" valign="top">&nbsp;</td>
5191
5192         <td width="33%" align="right" valign="top">Filter Files</td>
5193       </tr>
5194     </table>
5195   </div>
5196 </body>
5197 </html>