X-Git-Url: http://www.privoxy.org/gitweb/?a=blobdiff_plain;f=doc%2Fsource%2Fuser-manual.sgml;h=28b581fe78700ddde5fc53bcaabd793dfede1daa;hb=6aa60a252059f209c5f3fb14502ad1deff7aba61;hp=c9f6a94d9804f85351e52dd18b96bd85814e57a3;hpb=00ff6723cacb0c08cbf3f1044e8639a89ebc23d7;p=privoxy.git
diff --git a/doc/source/user-manual.sgml b/doc/source/user-manual.sgml
index c9f6a94d..28b581fe 100644
--- a/doc/source/user-manual.sgml
+++ b/doc/source/user-manual.sgml
@@ -11,11 +11,11 @@
-
-
+
+
-
-
+
+
@@ -24,6 +24,7 @@
+Privoxy">
]>
- Copyright &my-copy; 2001 - 2006 by
+ Copyright &my-copy; 2001 - 2007 by
Privoxy Developers
-$Id: user-manual.sgml,v 2.17 2006/09/05 13:25:12 david__schmidt Exp $
+$Id: user-manual.sgml,v 2.30 2007/04/25 15:10:36 fabiankeil Exp $
@@ -96,7 +97,7 @@ Hal.
- You can find the latest version of the User Manual at Privoxy User Manual at http://www.privoxy.org/user-manual/.
Please see the Contact section on how to
contact the developers.
@@ -135,7 +136,8 @@ Hal.
Features
In addition to the core
- features of ad blocking and cookie management,
+ features of ad blocking and
+ cookie management,
Privoxy provides many supplemental
features,
that give the end-user more control, more privacy and more freedom:
@@ -175,8 +177,10 @@ Hal.
How to install the binary packages depends on your operating system:
+
+
-Red Hat, SuSE and Conectiva RPMs
+Red Hat and Fedora RPMs
RPMs can be installed with rpm -Uvh privoxy-&p-version;-1.rpm,
@@ -188,8 +192,7 @@ How to install the binary packages depends on your operating system:
Note that on Red Hat, Privoxy will
not be automatically started on system boot. You will
need to enable that using chkconfig,
- ntsysv, or similar methods. Note that SuSE will
-automatically start Privoxy in the boot process.
+ ntsysv, or similar methods.
@@ -224,7 +227,7 @@ automatically start Privoxy in the boot process.
in the same directory as you installed Privoxy in.
- Version 3.0.4 introduces full Windows service
+ Version 3.0.4 introduced full Windows service
functionality. On Windows only, the Privoxy
program has two new command line arguments to install and uninstall
Privoxy as a service.
@@ -249,7 +252,7 @@ automatically start Privoxy in the boot process.
want Privoxy to run under, and whether or not you
want it to run whenever the system starts. You can start the
Windows services console with the following
- command: services.msc If you do not take the manual step
+ command: services.msc. If you do not take the manual step
of modifying Privoxy's service settings, it will
not start. Note too that you will need to give Privoxy a user account that
actually exists, or it will not be permitted to
@@ -259,7 +262,7 @@ automatically start Privoxy in the boot process.
-Solaris, NetBSD, FreeBSD, HP-UX
+Solaris, NetBSD, HP-UX
Create a new directory, cd to it, then unzip and
@@ -334,6 +337,25 @@ automatically start Privoxy in the boot process.
+
+FreeBSD
+
+
+ Privoxy is part of FreeBSD's Ports Collection, you can build and install
+ it with cd /usr/ports/www/privoxy; make install clean.
+
+
+ If you don't use the ports, you can fetch and install
+ the package with pkg_add -r privoxy.
+
+
+ The port skeleton and the package can also be downloaded from the
+ File Release
+ Page, but if you're interested in stable releases only you don't
+ gain anything by using them.
+
+
+
Gentoo
@@ -362,7 +384,8 @@ automatically start Privoxy in the boot process.
The most convenient way to obtain the Privoxy sources
- is to download the source tarball from our project
+ is to download the source tarball from our
+ project download
page.
@@ -421,20 +444,21 @@ automatically start Privoxy in the boot process.
What's New in this Release
- There are many improvements and new features in Privoxy &p-version;
- :
+ There are many improvements and new features since Privoxy 3.0.6, the last stable release:
- Mulitiple filter files can now be specifed in config. This allows for
- locally defined filters that can be maintained separately from the filters as
- supplied by the developers.
+ Header filtering can be done with dedicated header filters now. As a result
+ the actions filter-client-headers
and filter-server-headers
+ that were introduced with Privoxy 3.0.5 to apply
+ the content filters to the headers as, well have been removed again.
-
+
+
@@ -598,7 +630,7 @@ automatically start Privoxy in the boot process.
- On the other hand, some installers may not overwrite any existing configuration
+ On the other hand, other installers may not overwrite any existing configuration
files, thinking you will want to do that. You may want to manually check
your saved files against the newer versions to see if the improvements have
merit, or whether there are new options that you may want to consider.
@@ -610,12 +642,23 @@ automatically start Privoxy in the boot process.
See the full documentation on
fast-redirects
- which has changed syntax, and may require adjustments to local configs.
+ which has changed syntax, and will require adjustments to local configs,
+ such as user.action. You must reference the new
+ syntax:
+
+
+ { +fast-redirects{check-decoded-url} }
+ .example.com
+ mybank.com
+ .google.
+
+
- The jarfile, cookie logger, is off by default now.
+ The jarfile,
+ cookie logger, is off by default now.
@@ -625,9 +668,54 @@ automatically start Privoxy in the boot process.
and you may want to review which actions are on
by
default. This is primarily a matter of emphasis, but some features
you may have been used to, may now be off
by default.
+ There are also a number of new actions and filters you may want to
+ consider, most of which are not fully incorporated into the default
+ settings as yet (see above).
-
+
+
+
+ The default actions setting is now Cautious. Previous
+ releases had a default setting of Medium. Experienced
+ users may want to adjust this, as it is fairly conservative by &my-app;
+ standards and past practices. See
+ http://config.privoxy.org/edit-actions-list?f=default. New users
+ should try the default settings for a while before turning up the volume.
+
+
+
+
+
+ The default setting has filtering turned off, which
+ subsequently means that compression is on. Remember
+ that filtering does not work on compressed pages, so if you use, or want to
+ use, filtering, you will need to force compression off. Example:
+
+
+
+ { +filter{google} +prevent-compression }
+ .google.
+
+
+ Or if you use a number of filters, or filter many sites, you may just want
+ to turn off compression for all sites in
+ default.action (or
+ user.action).
+
+
+
+
+
+
+ Also, session-cookies-only is
+ off by default now. If you've liked this feature in the past, you may want
+ to turn it back on in user.action now.
+
+
+
+
@@ -637,13 +725,14 @@ automatically start Privoxy in the boot process.
+
-Quickstart to Using Privoxy
+Quickstart to Using Privoxy
@@ -676,7 +765,8 @@ automatically start Privoxy in the boot process.
Set your browser to use Privoxy as HTTP and
- HTTPS (SSL) proxy by setting the proxy configuration for address of
+ HTTPS (SSL) proxy
+ by setting the proxy configuration for address of
127.0.0.1 and port 8118.
DO NOT activate proxying for FTP or
any protocols besides HTTP and HTTPS (SSL)! It won't work!
@@ -686,8 +776,9 @@ automatically start Privoxy in the boot process.
Flush your browser's disk and memory caches, to remove any cached ad images.
- If using Privoxy to manage cookies, you should
- remove any currently stored cookies too.
+ If using Privoxy to manage
+ cookies,
+ you should remove any currently stored cookies too.
@@ -701,31 +792,31 @@ automatically start Privoxy in the boot process.
See the Configuration section for more
configuration options, and how to customize your installation.
- next section for a quick
introduction to how Privoxy blocks ads and
- banners.]]>
+ banners.
- If you experience ads that slipped through, innocent images that are
+ If you experience ads that slip through, innocent images that are
blocked, or otherwise feel the need to fine-tune
- Privoxy's behaviour, take a look at the Privoxy's behavior, take a look at the actions files. As a quick start, you might
find the richly commented examples
helpful. You can also view and edit the actions files through the web-based user interface. The
- Appendix Anatomy of an
- Action
has hints how to debug actions that
+ Appendix Troubleshooting: Anatomy of an
+ Action
has hints on how to understand and debug actions that
misbehave
.
- For easy access to Privoxy's most important controls, drag the provided
+ For easy access to &my-app;'s most important controls, drag the provided
Bookmarklets into your browser's
personal toolbar.
@@ -770,7 +861,8 @@ automatically start Privoxy in the boot process.
First a bit of a warning ... blocking ads is much like blocking SPAM: the
more aggressive you are about it, the more likely you are to block
- things that were not intended. So there is a trade off here. If you want
+ things that were not intended. And the more likely that some things
+ may not work as intended. So there is a trade off here. If you want
extreme ad free browsing, be prepared to deal with more
problem
sites, and to spend more time adjusting the
configuration to solve these unintended consequences. In short, there is
@@ -783,7 +875,7 @@ automatically start Privoxy in the boot process.
Secondly, a brief explanation of Privoxy's
actions
. Actions
in this context, are
the directives we use to tell Privoxy to perform
- some task relating to HTTP transactions (i.e. web browsing). We tell
+ some task relating to WWW transactions (i.e. web browsing). We tell
Privoxy to take some action
. Each
action has a unique name and function. While there are many potential
actions in Privoxy's
@@ -808,13 +900,17 @@ automatically start Privoxy in the boot process.
original page's HTML content. An ad image for instance, is just an URL
embedded in the page somewhere. The image itself may be on the same server,
or a server somewhere else on the Internet. Complex web pages will have many
- such embedded URLs.
+ such embedded URLs. &my-app; can deal with each URL individually, so, for
+ instance, the main page text is not touched, but images from such-and-such
+ server are blocked.
- The actions we need to know about for ad blocking are: block, handle-as-image, and
+ linkend="handle-as-image">handle-as-image,
+ handle-as-empty-document,and
set-image-blocker:
@@ -823,12 +919,14 @@ automatically start Privoxy in the boot process.
- block - this action stops
- any contact between your browser and any URL patterns that match this
- action's configuration. It can be used for blocking ads, but also anything
- that is determined to be unwanted. By itself, it simply stops any
- communication with the remote server and sends Privoxy's
- own built-in BLOCKED page instead to let you now what has happened.
+ block - this is perhaps
+ the single most used action, and is particularly important for ad blocking.
+ This action stops any contact between your browser and any URL patterns
+ that match this action's configuration. It can be used for blocking ads,
+ but also anything that is determined to be unwanted. By itself, it simply
+ stops any communication with the remote server and sends
+ Privoxy's own built-in BLOCKED page instead to
+ let you now what has happened (with some exceptions, see below).
@@ -848,6 +946,15 @@ automatically start Privoxy in the boot process.
+
+
+ handle-as-empty-document -
+ sends an empty document instead of Privoxy's
+ normal BLOCKED HTML page. This is useful for file types that are neither
+ HTML nor images, such as blocking JavaScript files.
+
+
+
Actions Files in Use
-
+
[ Screenshot of Actions Files in Use ]
@@ -988,6 +1095,13 @@ automatically start Privoxy in the boot process.
to now go to the Actions Files Tutorial.
The ideas explained therein also apply to the web-based editor.
+
+ There are also various
+ filters that can be used for ad blocking
+ (filters are a special subset of actions). These
+ fall into the advanced
usage category, and are explained in
+ depth in later sections.
+
@@ -998,13 +1112,15 @@ automatically start Privoxy in the boot process.
-Starting Privoxy
+Starting Privoxy
Before launching Privoxy for the first time, you
will want to configure your browser(s) to use
- Privoxy as a HTTP and HTTPS proxy. The default is
+ Privoxy as a HTTP and HTTPS (SSL)
+ proxy. The default is
127.0.0.1 (or localhost) for the proxy address, and port 8118 (earlier versions
- used port 8000). This is the one configuration step that must be done!
+ used port 8000). This is the one configuration step that must be done
+!
Please note that Privoxy can only proxy HTTP and
@@ -1013,10 +1129,11 @@ automatically start Privoxy in the boot process.
- Proxy Configuration (Mozilla)
+ Proxy Configuration Showing
+ Mozilla/Netscape HTTP and HTTPS (SSL) Settings
-
+
[ Screenshot of Mozilla Proxy Configuration ]
@@ -1027,21 +1144,21 @@ automatically start Privoxy in the boot process.
- With Firefox, this can be set under:
+ With Firefox, this is typically set under:
-
-
- Tools
- |_
- Options
- |_
- General
- |_
- Connection Settings
- |_
- Manual Proxy Configuration
+ Tools -> Options -> General -> Connection Settings -> Manual Proxy Configuration
+
+
+
+
+ Or optionally on some platforms:
+
+
+
+ Edit -> Preferences -> General -> Connection Settings -> Manual Proxy Configuration
+
@@ -1054,43 +1171,48 @@ automatically start Privoxy in the boot process.
- Edit
- |_
- Preferences
- |_
- Advanced
- |_
- Proxies
- |_
- HTTP Proxy
+ Edit -> Preferences -> Advanced -> Proxies -> HTTP Proxy
+
- For Internet Explorer:
+ For Internet Explorer v.5-6:
-
-
- Tools
- |_
- Internet Properties
- |_
- Connections
- |_
- LAN Settings
+ Tools -> Internet Options -> Connections -> LAN Settings
Then, check Use Proxy
and fill in the appropriate info
(Address: 127.0.0.1, Port: 8118). Include HTTPS (SSL), if you want HTTPS
- proxy support too.
+ proxy support too (sometimes labeled Secure
). Make sure any
+ checkboxes like Use the same proxy server for all protocols
is
+ UNCHECKED. You want only HTTP and HTTPS (SSL)!
+
+
+ Proxy Configuration Showing
+ Internet Explorer HTTP and HTTPS (Secure) Settings
+
+
+
+
+
+ [ Screenshot of IE Proxy Configuration ]
+
+
+
+
+
+
After doing this, flush your browser's disk and memory caches to force a
- re-reading of all pages and to get rid of any ads that may be cached. You
- are now ready to start enjoying the benefits of using
+ re-reading of all pages and to get rid of any ads that may be cached. Remove
+ any cookies,
+ if you want Privoxy to manage that. You are now
+ ready to start enjoying the benefits of using
Privoxy!
@@ -1103,23 +1225,31 @@ automatically start Privoxy in the boot process.
-Red Hat and Conectiva
+Red Hat and Fedora
- We use a script. Note that Red Hat does not start Privoxy upon booting per
- default. It will use the file /etc/privoxy/config as
- its main configuration file.
+ A default Red Hat installation may not start &my-app; upon boot. It will use
+ the file /etc/privoxy/config as its main configuration
+ file.
# /etc/rc.d/init.d/privoxy start
+
+ Or ...
+
+
+
+ # service privoxy start
+
+
Debian
- We use a script. Note that Debian starts Privoxy upon booting per
+ We use a script. Note that Debian typically starts &my-app; upon booting per
default. It will use the file
/etc/privoxy/config as its main configuration
file.
@@ -1131,6 +1261,9 @@ automatically start Privoxy in the boot process.
+
Windows
-Click on the Privoxy Icon to start Privoxy. If no configuration file is
+Click on the &my-app; Icon to start Privoxy. If no configuration file is
specified on the command line, Privoxy will look
for a file named config.txt. Note that Windows will
- automatically start Privoxy when the system starts if you chose that option
+ automatically start &my-app; when the system starts if you chose that option
when installing.
Privoxy can run with full Windows service functionality.
- On Windows only, the Privoxy program has two new command line arguments
- to install and uninstall Privoxy as a service. See the
+ On Windows only, the &my-app; program has two new command line arguments
+ to install and uninstall &my-app; as a service. See the
Windows Installation
instructions for details.
@@ -1189,7 +1322,7 @@ Example Unix startup command:
Mac OSX
During installation, Privoxy is configured to
- start automatically when the system restarts. To start Privoxy by hand,
+ start automatically when the system restarts. To start &my-app; manually,
double-click on the StartPrivoxy.command icon in the
/Library/Privoxy folder. Or, type this command
in the Terminal:
@@ -1407,9 +1540,9 @@ must find a better place for this paragraph
Before changing to the user ID given in the --user option,
- chroot to that user's home directory, i.e. make the kernel pretend to the Privoxy
+ chroot to that user's home directory, i.e. make the kernel pretend to the &my-app;
process that the directory tree starts there. If set up carefully, this can limit
- the impact of possible vulnerabilities in Privoxy to the files contained in that hierarchy.
+ the impact of possible vulnerabilities in &my-app; to the files contained in that hierarchy.
Unix only.
@@ -1431,8 +1564,8 @@ must find a better place for this paragraph
- On MS Windows only there are two addition
- options to allow Privoxy to install and
+ On MS Windows only there are two additional
+ command-line options to allow Privoxy to install and
run as a service. See the
Window Installation section
for details.
@@ -1446,7 +1579,7 @@ for details.
-Privoxy Configuration
+Privoxy Configuration
All Privoxy configuration is stored
in text files. These files can be edited with a text editor.
@@ -1458,7 +1591,7 @@ for details.
-Controlling Privoxy with Your Web Browser
+Controlling Privoxy with Your Web Browser
Privoxy's user interface can be reached through the special
URL http://config.privoxy.org/
@@ -1572,7 +1705,7 @@ for details.
default.action (which you will most probably want
to define sooner or later) are probably best applied in
user.action, where you can preserve them across
- upgrades. standard.action is for
+ upgrades. standard.action is only for
Privoxy's internal use.
@@ -1607,7 +1740,10 @@ for details.
The syntax of all configuration files has remained the same throughout the
3.x series. There have been enhancements, but no changes that would preclude
- the use of any configuration file from one version to the next.
+ the use of any configuration file from one version to the next. (There is
+ one exception: +fast-redirects which
+ has enhanced syntax and will require updating any local configs from earlier
+ versions.)
@@ -1617,7 +1753,7 @@ for details.
in a line. If the # is preceded by a backslash, it looses
its special function. Placing a # in front of an otherwise
valid configuration line to prevent it from being interpreted is called "commenting
- out" that line.
+ out" that line. Blank lines are ignored.
@@ -1675,7 +1811,8 @@ for details.
There are a number of such actions, with a wide range of functionality.
Each action does something a little different.
These actions give us a veritable arsenal of tools with which to exert
- our control, preferences and independence.
+ our control, preferences and independence. Actions can be combined so that
+ their effects are aggregated when applied against a given set of URLs.
There
@@ -1691,9 +1828,13 @@ for details.
that sets the initial values for all actions. It is intended to
provide a base level of functionality for
Privoxy's array of features. So it is
- a set of broad rules that should work reasonably well for users everywhere.
+ a set of broad rules that should work reasonably well as-is for most users.
This is the file that the developers are keeping updated, and making available to users.
+ The user's preferences as set in standard.action,
+ e.g. either Cautious (the default),
+ Medium, or Advanced (see
+ below).
@@ -1706,12 +1847,42 @@ for details.
- standard.action - is used by the web based editor,
+ standard.action - is used only by the web based editor
+ at
+ http://config.privoxy.org/edit-actions-list?f=default,
to set various pre-defined sets of rules for the default actions section
- in default.action. These have increasing levels of
- aggressiveness and have no influence on your browsing unless
- you select them explicitly in the editor. It is not recommend
- to edit this file.
+ in default.action.
+
+
+ Edit Set to Cautious Set to Medium Set to Advanced
+
+
+ These have increasing levels of aggressiveness and have no
+ influence on your browsing unless you select them explicitly in the
+ editor. A default installation should be pre-set to
+ Cautious (versions prior to 3.0.5 were set to
+ Medium). New users should try this for a while before
+ adjusting the settings to more aggressive levels. The more aggressive
+ the settings, then the more likelihood there is of problems such as sites
+ not working as they should.
+
+
+ The Edit button allows you to turn each
+ action on/off individually for fine-tuning. The Cautious
+ button changes the actions list to low/safe settings which will activate
+ ad blocking and a minimal set of &my-app;'s features, and subsequently
+ there will be less of a chance for accidental problems. The
+ Medium button sets the list to a medium level of
+ other features and a low level set of privacy features. The
+ Advanced button sets the list to a high level of
+ ad blocking and medium level of privacy. See the chart below. The latter
+ three buttons over-ride any changes via with the
+ Edit button. More fine-tuning can be done in the
+ lower sections of this internal page.
+
+
+ It is not recommend to edit the standard.action file
+ itself.
The default profiles, and their associated actions, as pre-defined in
@@ -1729,7 +1900,7 @@ for details.
Feature
Cautious
Medium
- Adventuresome
+ Advanced
@@ -1743,31 +1914,37 @@ for details.
- Ad-blocking by URL
- yes
- yes
- yes
+ Ad-blocking Aggressiveness
+ medium
+ high
+ high
Ad-filtering by size
- yes
+ no
yes
yes
- GIF de-animation
+ Ad-filtering by link
+ no
no
- yes
yes
-
- Referer forging
- no
- yes
- yes
+ Pop-up killing
+ blocks only
+ blocks only
+ blocks only
+
+
+
+ Privacy Features
+ low
+ medium
+ medium/high
@@ -1778,69 +1955,56 @@ for details.
- Pop-up killing
- unsolicited
- unsolicited
- all
-
-
-
- Fast redirects
- no
+ Referer forging
no
yes
-
-
-
- HTML taming
- yes
- yes
yes
+
- JavaScript taming
- yes
+ GIF de-animation
+ no
yes
yes
+
- Web-bug killing
- yes
- yes
+ Fast redirects
+ no
+ no
yes
- Fun text replacements
+ HTML taming
no
no
yes
- Image tag reordering
+ JavaScript taming
no
no
yes
- Ad-filtering by link
- no
+ Web-bug killing
no
yes
+ yes
- Demoronizer
+ Image tag reordering
no
no
yes
-
@@ -1853,11 +2017,18 @@ for details.
The list of actions files to be used are defined in the main configuration
file, and are processed in the order they are defined (e.g.
- default.action is typically process before
+ default.action is typically processed before
user.action). The content of these can all be viewed and
edited from http://config.privoxy.org/show-status.
-
+ The over-riding principle when applying actions, is that the last action that
+ matches a given URL, wins. The broadest, most general rules go first
+ (defined in default.action),
+ followed by any exceptions (typically also in
+ default.action), which are then followed lastly by any
+ local preferences (typically in user.action).
+ Generally, user.action has the last word.
+
An actions file typically has multiple sections. If you want to use
@@ -1890,12 +2061,13 @@ for details.
Note that some actions, like cookie suppression
or script disabling, may render some sites unusable that rely on these
techniques to work properly. Finding the right mix of actions is not always easy and
- certainly a matter of personal taste. In general, it can be said that the more
+ certainly a matter of personal taste. And, things can always change, requiring
+ refinements in the configuration. In general, it can be said that the more
aggressive
your default settings (in the top section of the
actions file) are, the more exceptions for trusted
sites you
will have to make later. If, for example, you want to crunch all cookies per
default, you'll have to make exceptions from that rule for sites that you
- regularly use and that require cookies for actually useful puposes, like maybe
+ regularly use and that require cookies for actually useful purposes, like maybe
your bank, favorite shop, or newspaper.
@@ -1916,53 +2088,68 @@ for details.
url="http://config.privoxy.org/show-status">http://config.privoxy.org/show-status.
The editor allows both fine-grained control over every single feature on a
per-URL basis, and easy choosing from wholesale sets of defaults like
- Cautious
, Medium
or Adventuresome
.
- Warning: the Adventuresome
setting is not only more aggressive,
- but includes settings that are fun and subversive, and which some may find of
- dubious merit!
+ Cautious
, Medium
or Advanced
.
+ Warning: the Advanced
setting is more aggressive, and
+ will be more likely to cause problems for some sites. Experienced users only!
If you prefer plain text editing to GUIs, you can of course also directly edit the
- the actions files. Look at default.action which is richly
- commented.
+ the actions files with your favorite text editor. Look at
+ default.action which is richly commented with many
+ good examples.
-How Actions are Applied to URLs
+How Actions are Applied to Requests
Actions files are divided into sections. There are special sections,
like the alias
sections which will
be discussed later. For now let's concentrate on regular sections: They have a
heading line (often split up to multiple lines for readability) which consist
of a list of actions, separated by whitespace and enclosed in curly braces.
- Below that, there is a list of URL patterns, each on a separate line.
+ Below that, there is a list of URL and tag patterns, each on a separate line.
To determine which actions apply to a request, the URL of the request is
- compared to all patterns in each action file
file. Every time it matches, the list of
- applicable actions for the URL is incrementally updated, using the heading
- of the section in which the pattern is located. If multiple matches for
- the same URL set the same action differently, the last match wins. If not,
- the effects are aggregated. E.g. a URL might match a regular section with
- a heading line of {
+ compared to all URL patterns in each action file
.
+ Every time it matches, the list of applicable actions for the request is
+ incrementally updated, using the heading of the section in which the
+ pattern is located. The same is done again for tags and tag patterns later on.
+
+
+
+ If multiple applying sections set the same action differently,
+ the last match wins. If not, the effects are aggregated.
+ E.g. a URL might match a regular section with a heading line of {
+handle-as-image },
then later another one with just {
+block }, resulting
- in both actions to apply.
+ in both actions to apply. And there may well be
+ cases where you will want to combine actions together. Such a section then
+ might look like:
+
+
+ { +handle-as-image +block }
+ # Block these as if they were images. Send no block page.
+ banners.example.com
+ media.example.com/.*banners
+ .example.com/images/ads/
+
+
- You can trace this process for any given URL by visiting http://config.privoxy.org/show-url-info.
- More detail on this is provided in the Appendix,
- Anatomy of an Action.
+ Examples and more detail on this is provided in the Appendix,
+ Troubleshooting: Anatomy of an Action section.
@@ -1971,15 +2158,15 @@ for details.
Patterns
As mentioned, Privoxy uses patterns
- to determine what actions might apply to which sites and pages your browser
- attempts to access. These patterns
use wild card type
- pattern matching to achieve a high degree of
+ to determine what actions might apply to which sites and
+ pages your browser attempts to access. These patterns
use wild
+ card type pattern matching to achieve a high degree of
flexibility. This allows one expression to be expanded and potentially match
against many similar patterns.
- Generally, a Privoxy pattern has the form
+ Generally, a URL pattern has the form
<domain>/<path>, where both the
<domain> and <path> are
optional. (This is why the special / pattern matches all
@@ -1987,6 +2174,13 @@ for details.
http://) should not be included in
the pattern. This is assumed already!
+
+ The pattern matching syntax is different for the domain and path parts of
+ the URL. The domain part uses a simple globbing type matching technique,
+ while the path part uses a more flexible
+ Regular
+ Expressions (PCRE)
based syntax.
+
@@ -1994,7 +2188,9 @@ for details.
is a domain-only pattern and will match any request to www.example.com,
- regardless of which document on that server is requested.
+ regardless of which document on that server is requested. So ALL pages in
+ this domain would be covered by the scope of this action. Note that a
+ simple example.com is different and would NOT match.
@@ -2021,7 +2217,7 @@ for details.
matches the document /index.html, regardless of the domain,
- i.e. on any web server.
+ i.e. on any web server anywhere.
@@ -2030,7 +2226,8 @@ for details.
matches nothing, since it would be interpreted as a domain name and
- there is no top-level domain called .html.
+ there is no top-level domain called .html. So its
+ a mistake.
@@ -2069,8 +2266,14 @@ for details.
.example.
- matches any domain that CONTAINS .example.
- (Correctly speaking: It matches any FQDN that contains example as a domain.)
+ matches any domain that CONTAINS .example..
+ And, by the way, also included would be any files or documents that exist
+ within that domain since no path limitations are specified. (Correctly
+ speaking: It matches any FQDN that contains example as
+ a domain.) This might be www.example.com,
+ news.example.de, or
+ www.example.net/cgi/testing.pl for instance. All these
+ cases are matched.
@@ -2078,10 +2281,15 @@ for details.
Additionally, there are wild-cards that you can use in the domain names
- themselves. They work pretty similar to shell wild-cards: *
- stands for zero or more arbitrary characters, ?
stands for
- any single character, you can define character classes in square
- brackets and all of that can be freely mixed:
+ themselves. These work similarly to shell globbing type wild-cards:
+ *
represents zero or more arbitrary characters (this is
+ equivalent to the
+ Regular
+ Expression
based syntax of .*
),
+ ?
represents any single character (this is equivalent to the
+ regular expression syntax of a simple .
), and you can define
+ character classes
in square brackets which is similar to
+ the same regular expression technique. All of this can be freely mixed:
@@ -2124,6 +2332,10 @@ for details.
+
+ While flexible, this is not the sophistication of full regular expression based syntax.
+
+
@@ -2133,9 +2345,11 @@ for details.
The Path Pattern
- Privoxy uses Perl compatible regular expressions
+ Privoxy uses Perl compatible (PCRE)
+ Regular
+ Expression
based syntax
(through the PCRE library) for
- matching the path.
+ matching the path portion (after the slash), and is thus more flexible.
@@ -2160,6 +2374,134 @@ for details.
only documents whose path starts with PaTtErN in
exactly this capitalization.
+
+
+
+ .example.com/.*
+
+
+ Is equivalent to just .example.com
, since any documents
+ within that domain are matched with or without the .*
+ regular expression. This is redundant
+
+
+
+
+ .example.com/.*/index.html
+
+
+ Will match any page in the domain of example.com
that is
+ named index.html
, and that is part of some path. For
+ example, it matches www.example.com/testing/index.html
but
+ NOT www.example.com/index.html
because the regular
+ expression called for at least two /'s
, thus the path
+ requirement. It also would match
+ www.example.com/testing/index_html
, because of the
+ special meta-character .
.
+
+
+
+
+ .example.com/(.*/)?index\.html
+
+
+ This regular expression is conditional so it will match any page
+ named index.html
regardless of path which in this case can
+ have one or more /'s
. And this one must contain exactly
+ .html
(but does not have to end with that!).
+
+
+
+
+ .example.com/(.*/)(ads|banners?|junk)
+
+
+ This regular expression will match any path of example.com
+ that contains any of the words ads
, banner
,
+ banners
(because of the ?
) or junk
.
+ The path does not have to end in these words, just contain them.
+
+
+
+
+ .example.com/(.*/)(ads|banners?|junk)/.*\.(jpe?g|gif|png)$
+
+
+ This is very much the same as above, except now it must end in either
+ .jpg
, .jpeg
, .gif
or .png
. So this
+ one is limited to common image formats.
+
+
+
+
+
+
+ There are many, many good examples to be found in default.action,
+ and more tutorials below in Appendix on regular expressions.
+
+
+
+
+
+
+
+
+
+
+The Tag Pattern
+
+
+ Tag patterns are used to change the applying actions based on the
+ request's tags. Tags can be created with either the
+ client-header-tagger
+ or the server-header-tagger action.
+
+
+
+ Tag patterns have to start with TAG:
, so &my-app;
+ can tell them apart from URL patterns. Everything after the colon
+ including white space, is interpreted as a regular expression with
+ path patterns syntax, except that tag patterns aren't left-anchored
+ automatically (Privoxy doesn't silently add a ^
,
+ you have to do it yourself if you need it).
+
+
+
+ To match all requests that are tagged with foo
+ your pattern line should be TAG:^foo$
,
+ TAG:foo
would work as well, but it would also
+ match requests whose tags contain foo
somewhere.
+
+
+
+ Sections can contain URL and tag patterns at the same time,
+ but tag patterns are checked after the URL patterns and thus
+ always overrule them, even if they are located before the URL patterns.
+
+
+
+ Once a new tag is added, Privoxy checks right away if it's matched by one
+ of the tag patterns and updates the action settings accordingly. As a result
+ tags can be used to activate other tagger actions, as long as these other
+ taggers look for headers that haven't already be parsed.
+
+
+
+ For example you could tag client requests which use the POST method,
+ use this tag to activate another tagger that adds a tag if cookies
+ are send, and then block based on the cookie tag. However if you'd
+ reverse the position of the described taggers, and activated the method
+ tagger based on the cookie tagger, no method tags would be created.
+ The method tagger would look for the request line, but at the time
+ the cookie tag is created the request line has already been parsed.
+
+
+
+ While this is a limitation you should be aware of, this kind of
+ indirection is seldom needed anyway and even the example doesn't
+ make too much sense.
+
+
@@ -2193,7 +2535,7 @@ for details.
- There are three classes of actions:
+ Actions fall into three categories:
@@ -2270,12 +2612,14 @@ for details.
Later defined actions always over-ride earlier ones. So exceptions
to any rules you make, should come in the latter part of the file (or
- in a file that is processed later when using multiple actions files). For
- multi-valued actions, the actions are applied in the order they are specified.
- Actions files are processed in the order they are defined in
- config (the default installation has three actions
- files). It also quite possible for any given URL pattern to match more than
- one pattern and thus more than one set of actions!
+ in a file that is processed later when using multiple actions files such
+ as user.action). For multi-valued actions, the actions
+ are applied in the order they are specified. Actions files are processed in
+ the order they are defined in config (the default
+ installation has three actions files). It also quite possible for any given
+ URL to match more than one pattern
(because of wildcards and
+ regular expressions), and thus to trigger more than one set of actions! Last
+ match wins.
@@ -2364,7 +2708,7 @@ for details.
Typical use:
- Block ads or other obnoxious content
+ Block ads or other unwanted content
@@ -2372,10 +2716,16 @@ for details.
Effect:
- Requests for URLs to which this action applies are blocked, i.e. the requests are not
- forwarded to the remote server, but answered locally with a substitute page or image,
- as determined by the handle-as-image
- and set-image-blocker actions.
+ Requests for URLs to which this action applies are blocked, i.e. the
+ requests are trapped by &my-app; and the requested URL is never retrieved,
+ but is answered locally with a substitute page or image, as determined by
+ the handle-as-image,
+ set-image-blocker, and
+ handle-as-empty-document actions.
+
@@ -2420,7 +2770,8 @@ for details.
It is important to understand this process, in order
to understand how Privoxy deals with
- ads and other unwanted content.
+ ads and other unwanted content. Blocking is a core feature, and one
+ upon which various other features depend.
The filter
@@ -2436,12 +2787,18 @@ for details.
Example usage (section):
- {+block} # Block and replace with "blocked" page
-.nasty-stuff.example.com
-
-{+block +handle-as-image} # Block and replace with image
-.ad.doubleclick.net
-.ads.r.us
+ {+block}
+# Block and replace with "blocked" page
+ .nasty-stuff.example.com
+
+{+block +handle-as-image}
+# Block and replace with image
+ .ad.doubleclick.net
+ .ads.r.us/banners/
+
+{+block +handle-as-empty-document}
+# Block and then ignore
+ adserver.exampleclick.net/.*\.js$
@@ -2452,17 +2809,16 @@ for details.
-
-
-content-type-overwrite
+
@@ -2496,14 +2854,167 @@ new action
Notes:
- The Content-Type:
HTTP server header is used by the
- browser to decide what to do with the document. The value of this
- header can cause the browser to open a download menu instead of
- displaying the document by itself, even if the document's format is
- supported by the browser.
+ Client-header filters are applied to each header on its own, not to
+ all at once. This makes it easier to diagnose problems, but on the downside
+ you can't write filters that only change header x if header y's value is z.
+ You can do that by using tags though.
- The declared content type can also affect which rendering mode
+ Client-header filters are executed after the other header actions have finished
+ and use their output as input.
+
+
+ Please refer to the filter file chapter
+ to learn which client-header filters are available by default, and how to
+ create your own.
+
+
+
+
+
+ Example usage (section):
+
+
+
+{+client-header-filter{hide-tor-exit-notation}}
+.exit/
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+content-type-overwrite
+
+
+
+ Typical use:
+
+ Stop useless download menus from popping up, or change the browser's rendering mode
+
+
+
+
+ Effect:
+
+
+ Replaces the Content-Type:
HTTP server header.
+
+
+
+
+
+ Type:
+
+
+ Parameterized.
+
+
+
+
+ Parameter:
+
+
+ Any string.
+
+
+
+
+
+ Notes:
+
+
+ The Content-Type:
HTTP server header is used by the
+ browser to decide what to do with the document. The value of this
+ header can cause the browser to open a download menu instead of
+ displaying the document by itself, even if the document's format is
+ supported by the browser.
+
+
+ The declared content type can also affect which rendering mode
the browser chooses. If XHTML is delivered as text/html
,
many browsers treat it as yet another broken HTML document.
If it is send as application/xml
, browsers with
@@ -2511,7 +3022,7 @@ new action
If you see a web site that proudly uses XHTML buttons, but sets
- Content-Type: text/html
, you can use Privoxy
+ Content-Type: text/html
, you can use &my-app;
to overwrite it with application/xml
and validate
the web master's claim inside your XHTML-supporting browser.
If the syntax is incorrect, the browser will complain loudly.
@@ -2530,10 +3041,9 @@ new action
This limitation exists for a reason, think twice before circumventing it.
- Most of the time it's easier to enable
- filter-server-headers
- and replace this action with a custom regular expression. It allows you
- to activate it for every document of a certain site and it will still
+ Most of the time it's easier to replace this action with a custom
+ server-header filter.
+ It allows you to activate it for every document of a certain site and it will still
only replace the content types you aimed at.
@@ -2549,12 +3059,13 @@ new action
# Check if www.example.net/ really uses valid XHTML
-{+content-type-overwrite {application/xml}}
+{ +content-type-overwrite{application/xml} }
www.example.net/
+
# but leave the content type unmodified if the URL looks like a style sheet
{-content-type-overwrite}
-www.example.net/*.\.css$
-www.example.net/*.style
+www.example.net/.*\.css$
+www.example.net/.*style
@@ -2621,9 +3132,8 @@ new action
crunch-client-header is only meant for quick tests.
If you have to block several different headers, or only want to modify
- parts of them, you should enable
- filter-client-headers
- and create your own filter.
+ parts of them, you should use a
+ client-header filter.
@@ -2638,7 +3148,7 @@ new action
# Block the non-existent "Privacy-Violation:" client header
-{+crunch-client-header {Privacy-Violation:}}
+{ +crunch-client-header{Privacy-Violation:} }
/
@@ -2720,9 +3230,9 @@ new action
# Let the browser revalidate cached documents without being tracked across sessions
-{+hide-if-modified-since {-1} \
-+overwrite-last-modified {randomize} \
-+crunch-if-none-match}
+{ +hide-if-modified-since{-60} \
+ +overwrite-last-modified{randomize} \
+ +crunch-if-none-match}
/
@@ -2857,9 +3367,8 @@ new action
crunch-server-header is only meant for quick tests.
If you have to block several different headers, or only want to modify
- parts of them, you should enable
- filter-server-headers
- and create your own filter.
+ parts of them, you should use a custom
+ server-header filter.
@@ -2874,7 +3383,7 @@ new action
# Crunch server headers that try to prevent caching
-{+crunch-server-header {no-cache}}
+{ +crunch-server-header{no-cache} }
/
@@ -3175,9 +3684,9 @@ problem-host.example.com
followed by another parameter. fast-redirects doesn't know that
and will cause a redirect to http://www.example.net/&foo=bar
.
Depending on the target server configuration, the parameter will be silently ignored
- or lead to a page not found
error. It is possible to fix these redirected
- requests with filter-client-headers
- but it requires a little effort.
+ or lead to a page not found
error. You can prevent this problem by
+ first using the redirect action
+ to remove the last part of the URL, but it requires a little effort.
To detect a redirection URL, fast-redirects only
@@ -3195,10 +3704,12 @@ problem-host.example.com
Example usage:
- +fast-redirects{simple-check}
-
-
- +fast-redirects{check-decoded-url}
+
+ { +fast-redirects{simple-check} }
+ .example.com
+
+ { +fast-redirects{check-decoded-url} }
+ another.example.com/testing
@@ -3215,7 +3726,8 @@ problem-host.example.com
Typical use:
- Get rid of HTML and JavaScript annoyances, banner advertisements (by size), do fun text replacements, etc.
+ Get rid of HTML and JavaScript annoyances, banner advertisements (by size),
+ do fun text replacements, add personalized effects, etc.
@@ -3223,13 +3735,11 @@ problem-host.example.com
Effect:
- All files of text-based type, most notably HTML and JavaScript, to which this
- action applies, are filtered on-the-fly through the specified regular expression
- based substitutions. (Note: as of version 3.0.3 plain text documents
+ All instances of text-based type, most notably HTML and JavaScript, to which
+ this action applies, can be filtered on-the-fly through the specified regular
+ expression based substitutions. (Note: as of version 3.0.3 plain text documents
are exempted from filtering, because web servers often use the
- text/plain MIME type for all files whose type they
- don't know.) By default, filtering works only on the document content
- itself, not the headers.
+ text/plain MIME type for all files whose type they don't know.)
@@ -3246,7 +3756,7 @@ problem-host.example.com
Parameter:
- The name of a filter, as defined in the filter file.
+ The name of a content filter, as defined in the filter file.
Filters can be defined in one or more files as defined by the
filterfile
option in the config file.
@@ -3256,7 +3766,7 @@ problem-host.example.com
When used in its negative form,
- and without parameters, filtering is completely disabled.
+ and without parameters, all filtering is completely disabled.
@@ -3277,8 +3787,14 @@ problem-host.example.com
noticeable on slower connections.
- This is very powerful feature, and rolling your own
- filters requires a knowledge of regular expressions and HTML.
+ Rolling your own
+ filters requires a knowledge of
+ Regular
+ Expressions
and
+ HTML
.
+ This is very powerful feature, and potentially very intrusive.
+ Filters should be used with caution, and where an equivalent
+ action
is not available.
The amount of data that can be filtered is limited to the
@@ -3288,22 +3804,27 @@ problem-host.example.com
data, and all pending data, is passed through unfiltered.
- Inadequate MIME types, such as zipped files, are not filtered at all.
+ Inappropriate MIME types, such as zipped files, are not filtered at all.
(Again, only text-based types except plain text). Encrypted SSL data
(from HTTPS servers) cannot be filtered either, since this would violate
the integrity of the secure transaction. In some situations it might
be necessary to protect certain text, like source code, from filtering
- by defining appropriate -filter sections.
+ by defining appropriate -filter exceptions.
- At this time, Privoxy cannot (yet!) uncompress compressed
- documents. If you want filtering to work on all documents, even those that
- would normally be sent compressed, use the
- prevent-compression
+ Compressed content can't be filtered either, unless &my-app;
+ is compiled with zlib support (requires at least &my-app; 3.0.7),
+ in which case &my-app; will decompress the content before filtering
+ it.
+
+
+ If you use a &my-app; version without zlib support, but want filtering to work on
+ as much documents as possible, even those that would normally be sent compressed,
+ you must use the prevent-compression
action in conjunction with filter.
- Filtering can achieve some of the same effects as the
+ Content filtering can achieve some of the same effects as the
block
action, i.e. it can be used to block ads and banners. But the mechanism
works quite differently. One effective use, is to block ad banners
@@ -3350,11 +3871,11 @@ problem-host.example.com
@@ -3382,7 +3903,7 @@ problem-host.example.com
- +filter{frameset-borders} # Give frames a border and make them resizable
+ +filter{frameset-borders} # Give frames a border and make them resizeable
@@ -3394,7 +3915,7 @@ problem-host.example.com
- +filter{quicktime-kioskmode} # Make Quicktime movies saveable
+ +filter{quicktime-kioskmode} # Make Quicktime movies savable
@@ -3408,123 +3929,47 @@ problem-host.example.com
+filter{ie-exploits} # Disable some known Internet Explorer bug exploits
-
-
-
-
-
-
-
-
-
-