X-Git-Url: http://www.privoxy.org/gitweb/?p=privoxy.git;a=blobdiff_plain;f=doc%2Fsource%2Fuser-manual.sgml;h=a80299768104afd46946416e1463996f1385a33b;hp=009eac67fb5f186025419107ad6614bb10aebbcf;hb=86062c9ff866c6dbcd8e2826093719ce1a1e9809;hpb=9103f695944532335e14becb1842167473deb4e0 diff --git a/doc/source/user-manual.sgml b/doc/source/user-manual.sgml index 009eac67..a8029976 100644 --- a/doc/source/user-manual.sgml +++ b/doc/source/user-manual.sgml @@ -1,16 +1,15 @@ - +
-Junkbuster User Manual +Privoxy User Manual -$Id: user-manual.sgml,v 1.12 2001/09/28 02:57:04 hal9 Exp $ +$Id: user-manual.sgml,v 1.51 2002/03/23 15:13:11 swa Exp $ - By: Junkbuster Developers + By: Privoxy Developers - The user manual gives the users information on how to install and configure - Internet Junkbuster. Internet - Junkbuster is an application that provides privacy and - security to users of the World Wide Web. + The user manual gives users information on how to install, configure and use + Privoxy. Privoxy is a + web proxy with advanced filtering capabilities for protecting privacy, + filtering web page content, managing cookies, controlling access, and + removing ads, banners, pop-ups and other obnoxious Internet + Junk. Privoxy has a very flexible configuration + and can be customized to suit individual needs and + tastes. Privoxy has application for both + stand-alone systems and multi-user networks. -You can find the latest version of the user manual at http://ijbswa.sourceforge.net/doc/user-manual/. +You can find the latest version of the user manual at http://ijbswa.sourceforge.net/user-manual/. - - Feel free to send a note to the developers at ijbswa-developers@lists.sourceforge.net. - + + + @@ -63,28 +66,30 @@ You can find the latest version of the user manual at Introduction - Internet Junkbuster is a web proxy with advanced - filtering capabilities for protecting privacy, filtering web page content, - managing cookies and removing ads, banners, pop-ups and other obnoxious - Internet Junk. Junkbuster has a very flexible - configuration and can be customized to suit individual needs and tastes. - Internet Junkbuster has application for both - stand-alone systems and multi-user networks. + Privoxy is a web proxy with advanced + filtering capabilities for protecting privacy, filtering and modifying web + page content, managing cookies, controlling access, and removing ads, + banners, pop-ups and other obnoxious Internet Junk. + Privoxy has a very flexible configuration and + can be customized to suit individual needs and tastes. Privoxy has application for both stand-alone systems and + multi-user networks. - This documentation is included with the current development version of - Internet Junkbuster and is incomplete at this + This documentation is included with the current BETA version of + Privoxy and is mostly complete at this point. The most up to date reference for the time being is still the comments in the source files and in the individual configuration files. Development - of version 3.0 is currently underway, and includes significant changes and - enhancements over earlier verions. The target release date for stable v3.0 is - December 2001. + of version 3.0 is currently nearing completion, and includes many significant + changes and enhancements over earlier versions. The target release date for + stable v3.0 is soon ;-) - Since this is a development version, some features are in the process of - being implemented. And there are bugs! + Since this is a BETA version, not all new features are well tested. This + documentation may be slightly out of sync as a result (especially with + CVS sources). And there may be bugs, though hopefully + not many! @@ -92,53 +97,112 @@ You can find the latest version of the user manual at http://i.j.b). Browser-based tracing of rule + and filter effects. + + + + + + Blocking of annoying pop-up browser windows. - A browser based GUI configuration utility. + HTTP/1.1 compliant (most, but not all 1.1 features are supported). - Blocking of annoying pop-up browser windows (previously available as a - patch). + Support for Perl Compatible Regular Expressions in the configuration files, and + generally a more sophisticated and flexible configuration syntax over + previous versions. - Fledgling support for HTTP 1.1. + GIF de-animation. + + + + + + Web page content filtering (removes banners based on size, + invisible web-bugs, JavaScript, pop-ups, status bar abuse, + etc.) + + + + + + Bypass many click-tracking scripts (avoids script redirection). + + + + + + + Multi-threaded (POSIX and native threads). - Support for Perl Compatible Regular Expressions in the configuration files, and - generally a more sophisticated configuration syntax. + Auto-detection and re-reading of config file changes. - Web page content filtering. + User-customizable HTML templates (e.g. 404 error page). + + + + Improved cookie management features (e.g. session based cookies). + + + + + + Builds from source on most UNIX-like systems. Packages available for: Linux + (RedHat, SuSE, or Debian), Windows, Sun Solaris, Mac OSX, OS/2, HP-UX 11 and AmigaOS. + + + + + + In addition, the configuration is much more powerful and versatile over-all. + + + @@ -152,11 +216,11 @@ You can find the latest version of the user manual at Installation - Junkbuster is available as raw source code, or + Privoxy is available as raw source code, or pre-compiled binaries. See the Junkbuster Home Page - for current releases. Junkbuster is also available - via Privoxy Home Page + for binaries and current release info. Privoxy + is also available via CVS. This is the recommended approach at this time. But please be aware that CVS is constantly changing, and it may break in mysterious ways. @@ -170,8 +234,8 @@ You can find the latest version of the user manual at - Then, in either case, to build from source: + Then, in either case, to build from tarball/CVS source: - ./configure - make - su - make install + ./configure (--help to see options) + make (the make from gnu, gmake for *BSD) + su + make -n install (to see where all the files will go) + make install (to really install) @@ -221,6 +286,8 @@ You can find the latest version of the user manual at -    /usr/src/redhat/RPMS/i686/junkbuster-2.9.8-1.i686.rpm +    /usr/src/redhat/RPMS/i686/privoxy-2.9.11-1.i686.rpm -    /usr/src/redhat/SRPMS/junkbuster-2.9.8-1.src.rpm +    /usr/src/redhat/SRPMS/privoxy-2.9.11-1.src.rpm @@ -243,14 +310,14 @@ You can find the latest version of the user manual at + autoheader [suggested for CVS source] + autoconf [suggested for CVS source] ./configure make suse-dist @@ -273,10 +342,10 @@ You can find the latest version of the user manual at - rpm -Uvv /usr/src/suse/RPMS/i686/junkbuster-2.9.8-1.i686.rpm + rpm -Uvv /usr/src/packages/RPMS/i686/privoxy-2.9.11-1.i686.rpm - This will place the Junkbuster configuration - files in /etc/junkbuster/, and log files in - /var/log/junkbuster/. + This will place the Privoxy configuration + files in /etc/privoxy/, and log files in + /var/log/privoxy/. @@ -306,46 +375,53 @@ Thanx David Schmidt! --> - The OS/2 version of Junkbuster requires the EMX - runtime library to be installed. The EMX runtime library is available on - the hobbes OS/2 archive, among many other locations: - http://hobbes.nmsu.edu/cgi-bin/h-search?sh=1&button=Search&key=emxrt.zip&stype=all&sort=type&dir=%2Fpub%2Fos2%2Fdev%2Femx%2Fv0.9d - - - - Junkbuster is packaged in a WarpIN self- + Privoxy is packaged in a WarpIN self- installing archive. The self-installing program will be named depending on the release version, something like: - ijbos123.exe. In order to install it, simply run - this executable or double-click on its icon and follow the WarpIN - installation panels. A shadow of the Junkbuster + ijbos2_setup_1.2.3.exe. In order to install it, simply + run this executable or double-click on its icon and follow the WarpIN + installation panels. A shadow of the Privoxy executable will be placed in your startup folder so it will start automatically whenever OS/2 starts. - The directory you choose to install Junkbuster + The directory you choose to install Privoxy into will contain all of the configuration files. If you would like to build binary images on OS/2 yourself, you will need - a working EMX/GCC environment, plus several Unix-like tools. The Hobbes - OS/2 archive is a good place to start when building such an environment. - A set of Unix-like tools named gnupack is located here: - http://hobbes.nmsu.edu/cgi-bin/h-search?sh=1&key=gnupack&stype=all&sort=type&dir=%2Fpub%2Fos2%2Fapps - - - Once you have the source code unpacked as above, you can build the binaries - from the current/ directory: + a few Unix-like tools: autoconf, autoheader and sh. These tools will be + used to create the required config.h file, which is not part of the + source distribution because it differs based on platform. You will also + need a compiler. + The distribution has been created using IBM VisualAge compilers, but you + can use any compiler you like. GCC/EMX has the disadvantage of needing + to be single-threaded due to a limitation of EMX's implementation of the + select() socket call. + In addition to needing the source code distribution as outlined earlier, + you will want to extract the os2seutp directory from CVS: + cvs -d:pserver:anonymous@cvs.ijbswa.sourceforge.net:/cvsroot/ijbswa login + cvs -z3 -d:pserver:anonymous@cvs.ijbswa.sourceforge.net:/cvsroot/ijbswa co os2setup + + This will create a directory named os2setup/, which will contain the + Makefile.vac makefile and os2build.cmd + which is used to completely create the binary distribution. The sequence + of events for building the executable for yourself goes something like this: + + cd current + autoheader autoconf sh configure - make + cd ..\os2setup + nmake -f Makefile.vac + You will see this sequence laid out in os2build.cmd. @@ -353,8 +429,8 @@ Thanx David Schmidt! Windows -I need help on this. Not a clue here. Also for -configuration section below. +Click-click. (I need help on this. Not a clue here. Also for +configuration section below. HB.) @@ -365,7 +441,7 @@ configuration section below. - For FreeBSD (and other *BSDs?), the build will need gmake + For FreeBSD (and other *BSDs?), the build will require gmake instead of the included make. gmake is available from http://www.gnu.org. The rest should be the same as above for Linux/Unix. @@ -379,20 +455,87 @@ configuration section below. -Junkbuster Configuration +<application>Privoxy</application> Configuration + + All Privoxy configuration is kept + in text files. These files can be edited with a text editor. + Many important aspects of Privoxy can + also be controlled easily with a web browser. + + + + + + + +Controlling <application>Privoxy</application> with Your Web Browser + + Privoxy can be reached by the special + URL http://i.j.b/ (or alternately + http://ijbswa.sourceforge.net/config/), + which is an internal page. You will see the following section: + + + + + + +Please choose from the following options: + + * Show information about the current configuration + * Show the source code version numbers + * Show the client's request headers. + * Show which actions apply to a URL and why + * Toggle Privoxy on or off + * Edit the actions list + + + + + + This should be self-explanatory. Note the last item is an editor for the + actions list, which is where much of the ad, banner, cookie, + and URL blocking magic is configured as well as other advanced features of + Privoxy. This is an easy way to adjust various + aspects of Privoxy configuration. The actions + file, and other configuration files, are explained in detail below. + Privoxy will automatically detect any changes + to these files. + + + + Toggle Privoxy On or Off is handy for sites that might + have problems with your current actions and filters, or just to test if + a site misbehaves, whether it is Privoxy + causing the problem or not. Privoxy continues + to run as a proxy in this case, but all filtering is disabled. + + + + + + + + + + + + + +Configuration Files Overview - For Unix, *BSD and Linux, all configuraton files are located in - /etc/junkbuster/ by default. For MS Windows and OS/2, - these are all in the same directory as the - Junkbuster executable. The name and number of + For Unix, *BSD and Linux, all configuration files are located in + /etc/privoxy/ by default. For MS Windows, OS/2, and + AmigaOS these are all in the same directory as the + Privoxy executable. The name and number of configuration files has changed from previous versions, and is subject to change as development progresses. - The installed defaults provide a reasonable starting point. For the - time being, there are only three default configuration files (this will - change in time): + The installed defaults provide a reasonable starting point, though possibly + aggressive by some standards. For the time being, there are only three + default configuration files (this will change in time): @@ -401,22 +544,28 @@ configuration section below. The main configuration file is named config - on Linux, Unix, BSD, and OS/2, and junkbustr.txt on - Windows. + on Linux, Unix, BSD, OS/2, and AmigaOS and config.txt + on Windows. - The actionsfile file is used to define various - actions relating to images, banners, pop-ups, banners and cookies. + The ijb.action file is used to define various + actions relating to images, banners, pop-ups, access + restrictions, banners and cookies. There is a CGI based editor for this + file that can be accessed via http://i.j.b. (Other actions + files are included as well with differing levels of filtering + and blocking, e.g. ijb-basic.action.) - The re_filterfile file can be used to rewrite the raw - page content, including text as well as embedded HTML and JavaScript. + The re_filterfile file can be used to re-write the raw + page content, including viewable text as well as embedded HTML and JavaScript, + and whatever else lurks on any given web page. @@ -424,13 +573,23 @@ configuration section below. - actionsfile and re_filterfile + ijb.action and re_filterfile can use Perl style regular expressions for maximum flexibility. All files use the # character to denote a comment. Such - lines are not processed by Junkbuster. After - making any changes, restart Junkbuster in order - for the changes to take effect. + lines are not processed by Privoxy. After + making any changes, there is no need to restart + Privoxy in order for the changes to take + effect. Privoxy should detect such changes + automatically. + + + + While under development, the configuration content is subject to change. + The below documentation may not be accurate by the time you read this. + Also, what constitutes a default setting, may change, so + please check all your configuration files on important issues. + @@ -439,7 +598,7 @@ configuration section below. The Main Configuration File Again, the main configuration file is named config on - Linux/Unix/BSD and OS/2, and junkbustr.txt on Windows. + Linux/Unix/BSD and OS/2, and config.txt on Windows. Configuration lines consist of an initial keyword followed by a list of values, all separated by whitespace (any number of spaces or tabs). For example: @@ -447,20 +606,21 @@ configuration section below. - + blockfile blocklist.ini - + - Indicates that the blockfile is named blocklist.ini. + Indicates that the blockfile is named blocklist.ini. (A + default installation does not use this.) - The # indicates a comment. Any part of a + A # indicates a comment. Any part of a line following a # is ignored, except if the # is preceded by a \. @@ -471,7 +631,7 @@ configuration section below. existing configuration line, you can make it a comment and it will be treated as if it weren't there. This is called commenting out an option and can be useful to turn off features: If you comment out the - logfile line, junkbuster will not + logfile line, Privoxy will not log to a file at all. Watch for the default: section in each explanation to see what happens if the option is left unset (or commented out). @@ -483,8 +643,8 @@ configuration section below. - There are various aspects of Junkbuster behavior - that can be adjusted. + There are various aspects of Privoxy behavior + that can be tuned. @@ -494,22 +654,23 @@ configuration section below. Defining Other Configuration Files - Junkbuster can use a number of other files to tell it + Privoxy can use a number of other files to tell it what ads to block, what cookies to accept, etc. This section of the - configuration file tells Junkbuster where to find + configuration file tells Privoxy where to find all those other files. - On Windows, Junkbuster - looks for these files in the same directory as the executable. On Unix and - OS/2, Junkbuster looks for these files in the current - working directory. In either case, an absolute path name can be used to + On Windows and AmigaOS, + Privoxy looks for these files in the same + directory as the executable. On Unix and OS/2, + Privoxy looks for these files in the current + working directory. In either case, an absolute path name can be used to avoid problems. - When development goes modular and multiuser, the blocker, filter, and + When development goes modular and multi-user, the blocker, filter, and per-user config will be stored in subdirectories of confdir. For now, only confdir/templates is used for storing HTML templates for CGI results. @@ -521,11 +682,11 @@ configuration section below. - + - confdir /etc/junkbuster # No trailing /, please. + confdir /etc/privoxy # No trailing /, please. - + @@ -537,11 +698,11 @@ configuration section below. - + - logdir /var/log/junkbuster + logdir /var/log/privoxy - + @@ -551,48 +712,61 @@ configuration section below. - The actionsfile contains patterns to specify the actions to - apply to requests for each site. Default: Cookies to and from all - destinations are filtered. Popups are disabled for all sites. All sites are - filtered if re_filterfile specified. No sites are blocked. An empty image is - displayed for filtered ads and other images (formerly - tinygif). The syntax of this file is explained in detail - below. + The ijb.action file contains patterns to specify the actions + to apply to requests for each site. Default: Cookies to and from all + destinations are kept only during the current browser session (i.e. they are + not saved to disk). Pop-ups are disabled for all sites. All sites are + filtered through selected sections of re_filterfile. No sites + are blocked. The Privoxy logo is displayed for + filtered ads and other images. The syntax of this file is explained in detail + below. Other actions files + are included, and you are free to use any of them. They have varying degrees + of aggressiveness. - + - actionsfile actionsfile + actionsfile ijb.action - + - The re_filterfile file contains content modification rules. - These rules permit powerful changes on the content of Web pages, e.g., you - could disable your favourite JavaScript annoyances, rewrite the actual - content, or just have some fun replacing Microsoft with - MicroSuck wherever it appears on a Web page. Default: No - content modification, or whatever the developers are playing with :-/ + The re_filterfile file contains content modification rules + that use regular expressions. These rules permit powerful + changes on the content of Web pages, e.g., you could disable your favorite + JavaScript annoyances, re-write the actual displayed text, or just have some + fun replacing Microsoft with MicroSuck wherever + it appears on a Web page. Default: whatever the developers are playing with + :-/ + + + + Filtering requires buffering the page content, which may appear to slow down + page rendering since nothing is displayed until all content has passed + the filters. (It does not really take longer, but seems that way since + the page is not incrementally displayed.) This effect will be more noticeable + on slower connections. + - + re_filterfile re_filterfile - + The logfile is where all logging and error messages are written. The logfile can be useful for tracking down a problem with - Junkbuster (e.g., it's not blocking an ad you + Privoxy (e.g., it's not blocking an ad you think it should block) but in most cases you probably will never look at it. @@ -604,7 +778,7 @@ configuration section below. - On SuSE Linux systems, you can place a line like /var/log/junkbuster.* + On SuSE Linux systems, you can place a line like /var/log/privoxy.* +1024k 644 nobody.nogroup in /etc/logfiles, with the effect that cron.daily will automatically archive, gzip, and empty the log, when it exceeds 1M size. @@ -617,54 +791,54 @@ configuration section below. - + logfile logfile - + The jarfile defines where - Junkbuster stores the cookies it intercepts. Note + Privoxy stores the cookies it intercepts. Note that if you use a jarfile, it may grow quite large. Default: Don't store intercepted cookies. - + #jarfile jarfile - + If you specify a trustfile, - Junkbuster will only allow access to sites that + Privoxy will only allow access to sites that are named in the trustfile. You can also mark sites as trusted referrers, with the effect that access to untrusted sites will be granted, if a link from a trusted referrer was used. The link target will then be added to the trustfile. This is a very restrictive feature that typical - users most propably want to leave disabled. Default: Disabled, don't use the + users most probably want to leave disabled. Default: Disabled, don't use the trust mechanism. - + #trustfile trust - + - If you use the trust mechanism, it is a good idea to write up some online + If you use the trust mechanism, it is a good idea to write up some on-line documentation about your blocking policy and to specify the URL(s) here. They will appear on the page that your users receive when they try to access untrusted content. Use multiple times for multiple URLs. Default: Don't @@ -673,12 +847,12 @@ configuration section below. - + trust-info-url http://www.your-site.com/why_we_block.html trust-info-url http://www.your-site.com/what_we_allow.html - + @@ -695,7 +869,7 @@ configuration section below. This part of the configuration file contains options that control how - Junkbuster operates. + Privoxy operates. @@ -706,55 +880,55 @@ configuration section below. - + #admin-address fill@me.in.please - + Proxy-info-url can be set to a URL that contains more info - about this Junkbuster installation, it's + about this Privoxy installation, it's configuration and policies. It is used in many of the proxy-generated pages and its use is highly recommended in multi-user installations, since your users will want to know why certain content is blocked or modified. Default: - Don't show a link to online documentation. + Don't show a link to on-line documentation. - + proxy-info-url http://www.your-site.com/proxy.html - + Listen-address specifies the address and port where - Junkbuster will listen for connections from your - Web browser. The default is to listen on the localhost port 8000, and + Privoxy will listen for connections from your + Web browser. The default is to listen on the localhost port 8118, and this is suitable for most users. (In your web browser, under proxy configuration, list the proxy server as localhost and the - port as 8000). + port as 8118). - If you already have another service running on port 8000, or if you want to + If you already have another service running on port 8118, or if you want to serve requests from other machines (e.g. on your local network) as well, you will need to override the default. The syntax is listen-address [<ip-address>]:<port>. If you leave - out the IP adress, junkbuster will bind to all + out the IP address, Privoxy will bind to all interfaces (addresses) on your machine and may become reachable from the - internet. In that case, consider using access control lists (acl's) (see - aclfile above). + Internet. In that case, consider using access control lists (acl's) (see + aclfile above), or a firewall. - For example, suppose you are running Junkbuster on + For example, suppose you are running Privoxy on a machine which has the address 192.168.0.1 on your local private network (192.168.0.0) and has another outside connection with a different address. You want it to serve requests from inside only: @@ -762,11 +936,11 @@ configuration section below. - + - listen-address 192.168.0.1:8000 + listen-address 192.168.0.1:8118 - + @@ -777,18 +951,18 @@ configuration section below. - + - listen-address :8000 + listen-address :8118 - + If you do this, consider using ACLs (see aclfile above). Note: you will need to point your browser(s) to the address and port that you have - configured here. Default: localhost:8000 (127.0.0.1:8000). + configured here. Default: localhost:8118 (127.0.0.1:8118). @@ -798,10 +972,10 @@ configuration section below. levels of debug are probably only of interest to developers. - - - - + + + + debug 1 # GPC = show each GET/POST/CONNECT request debug 2 # CONN = show each connection status debug 4 # IO = show I/O status @@ -810,24 +984,24 @@ configuration section below. debug 32 # FRC = debug force feature debug 64 # REF = debug regular expression filter debug 128 # = debug fast redirects - debug 256 # = debug GIF deanimation + debug 256 # = debug GIF de-animation debug 512 # CLF = Common Log Format - debug 1024 # = debug kill popups + debug 1024 # = debug kill pop-ups debug 4096 # INFO = Startup banner and warnings. debug 8192 # ERROR = Non-fatal errors - - - - + + + + It is highly recommended that you enable ERROR - reporting (debug 8192), at least until the next stable release. + reporting (debug 8192), at least until v3.0 is released. The reporting of FATAL errors (i.e. ones which crash - JunkBuster) is always on and cannot be disabled. + Privoxy) is always on and cannot be disabled. @@ -842,11 +1016,11 @@ configuration section below. - + debug 15 # same as setting the first 4 listed above - + @@ -856,67 +1030,147 @@ configuration section below. - + debug 1 # URLs debug 4096 # Info debug 8192 # Errors - *we highly recommended enabling this* - + - Junkbuster normally uses + Privoxy normally uses multi-threading, a software technique that permits it to handle many different requests simultaneously. In some cases you may wish to disable this -- particularly if you're trying to debug a problem. The single-threaded option forces - Junkbuster to handle requests sequentially. + Privoxy to handle requests sequentially. Default: Multi-threaded mode. - + #single-threaded - + toggle allows you to temporarily disable all - Junkbuster's filtering. Just set toggle + Privoxy's filtering. Just set toggle 0. - The Windows version of Junkbuster puts an icon in - the system tray, which allows you to change this option without having to - edit this file. If you right-click on that icon (or select the - Options menu), one choice is Enable. Clicking - on enable toggles Junkbuster on and off. This is - useful if you want to temporarily disable - Junkbuster, e.g., to access a site that requires - cookies which you normally have blocked. + The Windows version of Privoxy puts an icon in + the system tray, which also allows you to change this option. If you + right-click on that icon (or select the Options menu), one + choice is Enable. Clicking on enable toggles + Privoxy on and off. This is useful if you want + to temporarily disable Privoxy, e.g., to access + a site that requires cookies which you would otherwise have blocked. This can also + be toggled via a web browser at the Privoxy + internal address of http://i.j.b on + any platform. - toggle 1 means Junkbuster runs + toggle 1 means Privoxy runs normally, toggle 0 means that - Junkbuster becomes a non-anonymizing non-blocking - proxy. Default: 1. + Privoxy becomes a non-anonymizing non-blocking + proxy. Default: 1 (on). - + toggle 1 - + + + + + + For content filtering, i.e. the +filter and + +deanimate-gif actions, it is necessary that + Privoxy buffers the entire document body. + This can be potentially dangerous, since a server could just keep sending + data indefinitely and wait for your RAM to exhaust. With nasty consequences. + + + + The buffer-limit option lets you set the maximum + size in Kbytes that each buffer may use. When the documents buffer exceeds + this size, it is flushed to the client unfiltered and no further attempt to + filter the rest of it is made. Remember that there may multiple threads + running, which might require increasing the buffer-limit + Kbytes each, unless you have enabled + single-threaded above. + + + + + + + buffer-limit 4069 + + + + + + + To enable the web-based ijb.action file editor set + enable-edit-actions to 1, or 0 to disable. Note + that you must have compiled Privoxy with + support for this feature, otherwise this option has no effect. This + internal page can be reached at http://i.j.b. + + + + Security note: If this is enabled, anyone who can use the proxy + can edit the actions file, and their changes will affect all users. + For shared proxies, you probably want to disable this. Default: enabled. + + + + + + + enable-edit-actions 1 + + + + + + + Allow Privoxy to be toggled on and off + remotely, using your web browser. Set enable-remote-toggleto + 1 to enable, and 0 to disable. Note that you must have compiled + Privoxy with support for this feature, + otherwise this option has no effect. + + + + Security note: If this is enabled, anyone who can use the proxy can toggle + it on or off (see http://i.j.b), and + their changes will affect all users. For shared proxies, you probably want to + disable this. Default: enabled. + + + + + + + enable-remote-toggle 1 + + @@ -970,11 +1224,11 @@ configuration section below. - + ACTION SRC_ADDR[/SRC_MASKLEN] [ DST_ADDR[/DST_MASKLEN] ] - + @@ -984,7 +1238,7 @@ configuration section below. - + ACTION = permit-access or deny-access @@ -994,7 +1248,7 @@ configuration section below. DST_ADDR = server or forwarder hostname or dotted IP address DST_MASKLEN = number of bits in the subnet mask for the target - + @@ -1004,12 +1258,12 @@ configuration section below. - IMPORTANT NOTE: If the junkbuster is using a + IMPORTANT NOTE: If Privoxy is using a forwarder (see below) or a gateway for a particular destination URL, the DST_ADDR that is examined is the address of the forwarder or the gateway and NOT the address of the ultimate target. This is necessary because it may be impossible for the local - Junkbuster to determine the address of the + Privoxy to determine the address of the ultimate target (that's often what gateways are used for). @@ -1024,26 +1278,26 @@ configuration section below. - + permit-access localhost - + A silly example to illustrate permitting any host on the class-C subnet with - Junkbuster to go anywhere: + Privoxy to go anywhere: - + - permit-access www.junkbusters.com/24 + permit-access www.privoxy.com/24 - + @@ -1053,11 +1307,11 @@ configuration section below. - + - deny-access ident.junkbusters.com + deny-access ident.privoxy.com - + @@ -1068,11 +1322,11 @@ configuration section below. - + permit-access 207.153.200.0/24 - + @@ -1082,11 +1336,11 @@ configuration section below. - + permit-access 0.0.0.0/0 - + @@ -1096,11 +1350,11 @@ configuration section below. - + permit-access .org - + @@ -1109,7 +1363,7 @@ configuration section below. - An ISP may want to provide a Junkbuster that is + An ISP may want to provide a Privoxy that is accessible by the world and yet restrict use of some of their private content to hosts on its internal network (i.e. its own subscribers). Say, for instance the ISP owns the Class-B IP address block 123.124.0.0 (a 16 @@ -1118,7 +1372,7 @@ configuration section below. - + permit-access 0.0.0.0/0 0.0.0.0/0 # other clients can go anywhere # with the following exceptions: @@ -1132,7 +1386,7 @@ configuration section below. permit 123.124.0.0/16 0.0.0.0/0 # the ISP's clients can go # anywhere - + @@ -1153,10 +1407,11 @@ configuration section below. Forwarding - This feature allows routing of HTTP requests via multiple proxies. + This feature allows chaining of HTTP requests via multiple proxies. It can be used to better protect privacy and confidentiality when accessing specific domains by routing requests to those domains - to a special purpose filtering proxy such as lpwa.com. + to a special purpose filtering proxy such as lpwa.com. Or to use + a caching proxy to speed up browsing. @@ -1166,7 +1421,7 @@ configuration section below. - Also specified here are SOCKS proxies. Junkbuster + Also specified here are SOCKS proxies. Privoxy SOCKS 4 and SOCKS 4A. The difference is that SOCKS 4A will resolve the target hostname using DNS on the SOCKS server, not our local DNS client. @@ -1177,13 +1432,13 @@ configuration section below. - + forward target_domain[:port] http_proxy_host[:port] forward-socks4 target_domain[:port] socks_proxy_host[:port] http_proxy_host[:port] forward-socks4a target_domain[:port] socks_proxy_host[:port] http_proxy_host[:port] - + @@ -1204,11 +1459,11 @@ configuration section below. - + forward .* . # implicit - + @@ -1219,17 +1474,19 @@ configuration section below. - + forward .* lpwa.com:8000 forward :443 . - + + Some users have reported difficulties related to LPWA's use of . as the last element of the domain, and have said that this can be fixed with this: @@ -1237,16 +1494,16 @@ configuration section below. - + forward lpwa. lpwa.com:8000 - + - (NOTE: the syntax for specifiying target_domain has changed since the + (NOTE: the syntax for specifying target_domain has changed since the previous paragraph was written -- it will not work now. More information is welcome.) @@ -1258,12 +1515,12 @@ configuration section below. - + forward .* caching.myisp.net:8000 forward myisp.net . - + @@ -1274,17 +1531,17 @@ configuration section below. - + forward .* proxy:8080 - + - Also, we're told they insist on getting cookies and JavaScript, so you need - to add home.com to the cookie file. We consider JavaScript a security risk. + Also, we're told they insist on getting cookies and JavaScript, so you should + allow cookies from home.com. We consider JavaScript a potential security risk. Java need not be enabled. @@ -1296,12 +1553,12 @@ configuration section below. - + - forward_socks4 .* lpwa.com:8000 firewall.my_company.com:1080 + forward-socks4 .* lpwa.com:8000 firewall.my_company.com:1080 forward my_company.com . - + @@ -1311,11 +1568,11 @@ configuration section below. - + - forward_socks4a .* . firewall.my_company.com:1080 + forward-socks4a .* . firewall.my_company.com:1080 - + @@ -1337,34 +1594,34 @@ configuration section below. host-a has a PPP connection to isp-a.com. And host-b has a PPP connection to - isp-b.com. host-a can run a Junkbuster proxy with + isp-b.com. host-a can run a Privoxy proxy with forwarding like this: - + forward .* . - forward isp-b.com host-b:8000 + forward isp-b.com host-b:8118 - + - host-b can run a Junkbuster proxy with forwarding + host-b can run a Privoxy proxy with forwarding like this: - + forward .* . - forward isp-a.com host-a:8000 + forward isp-a.com host-a:8118 - + @@ -1382,7 +1639,7 @@ configuration section below. - + forward *. ssbcache.ukc.ac.uk:3128 # Use the proxy, except for: forward .ukc.ac.uk . # Anything on the same domain as us @@ -1392,14 +1649,14 @@ configuration section below. forward localhost.localdomain . # Loopback address forward www.ukc.mirror.ac.uk . # Specific host - + - If you intend to chain Junkbuster and + If you intend to chain Privoxy and squid locally, then chain as - browser -> squid -> junkbuster is the recommended way. + browser -> squid -> privoxy is the recommended way. @@ -1408,24 +1665,27 @@ configuration section below. - + - # Define junkbuster as parent cache - cache_peer 127.0.0.1 8000 parent 0 no-query - + # Define Privoxy as parent cache + + cache_peer 127.0.0.1 parent 8118 0 no-query + # Define ACL for protocol FTP acl FTP proto FTP - # Do not forward ACL FTP to junkbuster + # Do not forward ACL FTP to privoxy always_direct allow FTP - # Do not forward ACL CONNECT (https) to junkbuster + # Do not forward ACL CONNECT (https) to privoxy always_direct allow CONNECT - # Forward the rest to junkbuster + # Forward the rest to privoxy never_direct allow all - + @@ -1442,39 +1702,39 @@ configuration section below. Removed references to Win32. HB 09/23/01 --> - Junkbuster has a number of options specific to the + Privoxy has a number of options specific to the Windows GUI interface: If activity-animation is set to 1, the - Junkbuster icon will animate when - Junkbuster is active. To turn off, set to 0. + Privoxy icon will animate when + Privoxy is active. To turn off, set to 0. - + activity-animation 1 - + If log-messages is set to 1, - Junkbuster will log messages to the console + Privoxy will log messages to the console window: - + log-messages 1 - + @@ -1491,11 +1751,11 @@ Removed references to Win32. HB 09/23/01 - + log-buffer-size 1 - + @@ -1506,27 +1766,27 @@ Removed references to Win32. HB 09/23/01 - + log-max-lines 200 - + If log-highlight-messages is set to 1, - Junkbuster will highlight portions of the log + Privoxy will highlight portions of the log messages with a bold-faced font: - + log-highlight-messages 1 - + @@ -1536,11 +1796,11 @@ Removed references to Win32. HB 09/23/01 - + log-font-name Comic Sans MS - + @@ -1550,60 +1810,60 @@ Removed references to Win32. HB 09/23/01 - + log-font-size 8 - + show-on-task-bar controls whether or not - Junkbuster will appear as a button on the Task bar + Privoxy will appear as a button on the Task bar when minimized: - + show-on-task-bar 0 - + If close-button-minimizes is set to 1, the Windows close - button will minimize Junkbuster instead of closing + button will minimize Privoxy instead of closing the program (close with the exit option on the File menu). - + close-button-minimizes 1 - + The hide-console option is specific to the MS-Win console - version of JunkBuster. If this option is used, - Junkbuster will disconnect from and hide the + version of Privoxy. If this option is used, + Privoxy will disconnect from and hide the command console. - + #hide-console - + @@ -1618,12 +1878,22 @@ Removed references to Win32. HB 09/23/01 The Actions File - The actionsfile is used to define what actions - Junkbuster takes, and thus determines how images, + The ijb.action file (formerly + actionsfile) is used to define what actions + Privoxy takes, and thus determines how images, cookies and various other aspects of HTTP content and transactions are handled. Images can be anything you want, including ads, banners, or just - some obnoxious image that you would rather not see. Cookies can be accepted - or rejected. The default file is in fact named actionsfile. + some obnoxious URL that you would rather not see. Cookies can be accepted + or rejected, or accepted only during the current browser session (i.e. + not written to disk). Changes to ijb.action should + be immediately visible to Privoxy without + the need to restart. + + + + The easiest way to edit actions file is with a browser by + loading http://i.j.b/, and then select + Edit Actions List. A text editor can also be used. @@ -1634,10 +1904,13 @@ Removed references to Win32. HB 09/23/01 url="http://i.j.b/show-url-info">http://i.j.b/show-url-info. + There are four types of lines in this file: comments (begin with a # character), actions, aliases and patterns, all of which are - explained below. + explained below, as well as the configuration file syntax that + Privoxy understands. + @@ -1692,10 +1965,10 @@ Removed references to Win32. HB 09/23/01 - Additionally, there are wildcards that you can use in the domain names - themselves. They work pretty similar to shell wildcards: * + Additionally, there are wild-cards that you can use in the domain names + themselves. They work pretty similar to shell wild-cards: * stands for zero or more arbitrary characters, ? stands for - any single character. And you can define charachter classes in square + any single character. And you can define character classes in square brackets and they can be freely mixed: @@ -1721,9 +1994,9 @@ Removed references to Win32. HB 09/23/01 - If Junkbuster was compiled with + If Privoxy was compiled with pcre support (default), Perl compatible regular expressions - can be used. See the pcre/docs/ direcory or man + can be used. See the pcre/docs/ directory or man perlre (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for details. A brief discussion of regular expressions is in the @@ -1780,12 +2053,12 @@ Removed references to Win32. HB 09/23/01 - + {+name} # enable this action {-name} # disable this action - + @@ -1793,16 +2066,16 @@ Removed references to Win32. HB 09/23/01 - Parameterized (e.g. +/-hide-user-agent): + parameterized (e.g. +/-hide-user-agent): - + {+name{param}} # enable action and set parameter to param {-name} # disable action - + @@ -1813,13 +2086,13 @@ Removed references to Win32. HB 09/23/01 - + {+name{param}} # enable action and add parameter param {-name{param}} # remove the parameter param {-name} # disable this action totally - + @@ -1829,10 +2102,10 @@ Removed references to Win32. HB 09/23/01 If nothing is specified in this file, no actions are taken. - So in this case JunkBuster would just be a + So in this case Privoxy would just be a normal, non-blocking, non-anonymizing proxy. You must specifically enable the privacy and blocking features you need (although the - provided default actionsfile file will + provided default ijb.action file will give a good starting point). @@ -1842,7 +2115,7 @@ Removed references to Win32. HB 09/23/01 - The list of valid Junkbuster actions are: + The list of valid Privoxy actions are: @@ -1855,11 +2128,11 @@ Removed references to Win32. HB 09/23/01 - + +add-header{Name: value} - + @@ -1867,15 +2140,17 @@ Removed references to Win32. HB 09/23/01 - Block this URL totally. + Block this URL totally. In a default installation, a blocked + URL will result in bright red banner that says BLOCKED, + with a reason why it is being blocked. - + +block - + @@ -1887,22 +2162,41 @@ Removed references to Win32. HB 09/23/01 This will also shrink the images considerably (in bytes, not pixels!). If the option first is given, the first frame of the animation is used as the replacement. If last is given, the last frame - of the animation is used instead, which propably makes more sense for most + of the animation is used instead, which probably makes more sense for most banner animations, but also has the risk of not showing the entire last frame (if it is only a delta to an earlier frame). - + +deanimate-gifs{last} +deanimate-gifs{first} - + - + + + + +downgrade will downgrade HTTP/1.1 client requests to + HTTP/1.0 and downgrade the responses as well. Use this action for servers + that use HTTP/1.1 protocol features that + Privoxy doesn't handle well yet. HTTP/1.1 + is only partially implemented. Default is not to downgrade requests. + + + + + + +downgrade + + + + + + Many sites, like yahoo.com, don't just link to other sites. Instead, they @@ -1913,7 +2207,7 @@ Removed references to Win32. HB 09/23/01 Sometimes, there are even multiple consecutive redirects encoded in the - URL. These redirections via scripts make your web browing more traceable, + URL. These redirections via scripts make your web browsing more traceable, since the server from which you follow such a link can see where you go to. Apart from that, valuable bandwidth and time is wasted, while your browser ask the server for one redirect after the other. Plus, it feeds the @@ -1921,34 +2215,97 @@ Removed references to Win32. HB 09/23/01 The +fast-redirects option enables interception of these - requests by Junkbuster, who will cut off all but + requests by Privoxy, who will cut off all but the last valid URL in the request and send a local redirect back to your browser without contacting the remote site. - + +fast-redirects - + - Filter the website through the re_filterfile: - + Apply the filters in the section_header + section of the re_filterfile file to the site(s). + Re_filterfile sections are grouped according to like + functionality. + + - + - +filter{filename} + +filter{section_header} - + + + + Filter sections that are pre-defined in the supplied + re_filterfile include: + + +
+ + + html-annoyances: Get rid of particularly annoying HTML abuse. + + + + + js-annoyances: Get rid of particularly annoying JavaScript abuse + + + + + no-poups: Kill all popups in JS and HTML + + + + + frameset-borders: Give frames a border + + + + + webbugs: Squish WebBugs (1x1 invisible GIFs used for user tracking) + + + + + no-refresh: Automatic refresh sucks on auto-dialup lines + + + + + fun: Text replacements for subversive browsing fun! + + + + + nimda: Remove (virus) Nimda code. + + + + + banners-by-size: Kill banners by size + + + + + crude-parental: Kill all web pages that contain the words "sex" or "warez" + + +
+
@@ -1957,11 +2314,11 @@ Removed references to Win32. HB 09/23/01 - + +hide-forwarded - + @@ -1974,12 +2331,12 @@ Removed references to Win32. HB 09/23/01 - + +hide-from{block} +hide-from{spam@sittingduck.xqq} - + @@ -1993,13 +2350,13 @@ Removed references to Win32. HB 09/23/01 - + +hide-referer{block} +hide-referer{forge} +hide-referer{http://nowhere.com} - + @@ -2013,11 +2370,11 @@ Removed references to Win32. HB 09/23/01 - + +hide-referrer{...} - + @@ -2031,24 +2388,24 @@ Removed references to Win32. HB 09/23/01 - + +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)} - + @@ -2069,41 +2426,126 @@ Removed references to Win32. HB 09/23/01 Treat this URL as an image. This only matters if it's also +blocked, - in which case a blocked image can be sent rather than a HTML page. - See +image-blocker{} below for the control over what is actually sent. + in which case a blocked image can be sent rather than a HTML page. + See +image-blocker{} below for the control over what is actually sent. + If you want invisible ads, they should be defined as + images and blocked. And also, + image-blocker should be set to blank. - + +image - + - - Decides what to do with URLs that end up tagged with {+block - +image}. There are 4 options. -image-blocker will - send a HTML blocked page, usually resulting in a - broken image icon. +image-blocker{logo} will - send a JunkBuster image. - +image-blocker{blank} will send a 1x1 transparent GIF image. - And finally, +image-blocker{http://xyz.com} will send a HTTP - temporary redirect to the specified image. This has the advantage of the - icon being being cached by the browser, which will speed up the display. + Decides what to do with URLs that end up tagged with {+block + +image}, e.g an advertizement. There are five options. + -image-blocker will send a HTML blocked page, + usually resulting in a broken image icon. + +image-blocker{logo} will send a + Privoxy logo + image. +image-blocker{blank} will send a 1x1 transparent GIF + image. And finally, +image-blocker{http://xyz.com} will send + a HTTP temporary redirect to the specified image. This has the advantage of + the icon being being cached by the browser, which will speed up the display. + +image-blocker{pattern} will send a checkboard type pattern, + which scales better than the logo (which can get blocky if the browser + enlarges it too much). - + +image-blocker{logo} +image-blocker{blank} + +image-blocker{pattern} +image-blocker{http://i.j.b/send-banner} - + + + + + + + + By default (i.e. in the absence of a +limit-connect + action), Privoxy will only allow CONNECT + requests to port 443, which is the standard port for https as a + precaution. + + + + The CONNECT methods exists in HTTP to allow access to secure websites + (https:// URLs) through proxies. It works very simply: the proxy + connects to the server on the specified port, and then short-circuits + its connections to the client and to the remote proxy. + This can be a big security hole, since CONNECT-enabled proxies can + be abused as TCP relays very easily. + + + + If you want to allow CONNECT for more ports than this, or want to forbid + CONNECT altogether, you can specify a comma separated list of ports and + port ranges (the latter using dashes, with the minimum defaulting to 0 and + max to 65K): + + + + + + + +limit-connect{443} # This is the default and need no be specified. + +limit-connect{80,443} # Ports 80 and 443 are OK. + +limit-connect{-3, 7, 20-100, 500-} # Port less than 3, 7, 20 to 100 + #and above 500 are OK. + + + + + + + + + + +no-compression prevents the website from compressing the + data. Some websites do this, which can be a problem for + Privoxy, since +filter, + +no-popup and +gif-deanimate will not work on + compressed data. This will slow down connections to those websites, + though. Default is nocompression is turned on. + + + + + + + +nocompression + + + + + + + + + If the website sets cookies, no-cookies-keep will make sure + they are erased when you exit and restart your web browser. This makes + profiling cookies useless, but won't break sites which require cookies so + that you can log in for transactions. Default: on. + + + + + + +no-cookies-keep + + @@ -2114,11 +2556,11 @@ Removed references to Win32. HB 09/23/01 - + +no-cookies-read - + @@ -2129,11 +2571,11 @@ Removed references to Win32. HB 09/23/01 - + +no-cookies-set - + @@ -2146,12 +2588,12 @@ Removed references to Win32. HB 09/23/01 - + +no-popup +no-popups - + @@ -2166,11 +2608,11 @@ Removed references to Win32. HB 09/23/01 - + +vanilla-wafer - + @@ -2182,11 +2624,11 @@ Removed references to Win32. HB 09/23/01 - + +wafer{name=value} - + @@ -2209,15 +2651,18 @@ Removed references to Win32. HB 09/23/01 - + - # Turn off all cookies + # Turn off all persistent cookies { +no-cookies-read } { +no-cookies-set } + # Allow cookies for this browser session ONLY + { +no-cookies-keep } - # Execeptions to the above, sites that need cookies + # Exceptions to the above, sites that benefit from persistent cookies { -no-cookies-read } { -no-cookies-set } + { -no-cookies-keep } .javasoft.com .sun.com .yahoo.com @@ -2225,11 +2670,11 @@ Removed references to Win32. HB 09/23/01 .redhat.com # Alternative way of saying the same thing - {-no-cookies-set -no-cookies-read} + {-no-cookies-set -no-cookies-read -no-cookies-keep} .sourceforge.net .sf.net - + @@ -2239,7 +2684,7 @@ Removed references to Win32. HB 09/23/01 - + # Turn them off! {+fast-redirects} @@ -2249,26 +2694,30 @@ Removed references to Win32. HB 09/23/01 www.ukc.ac.uk/cgi-bin/wac\.cgi\? login.yahoo.com - + - Turn on page filtering, with one exception for sourceforge: - + Turn on page filtering according to rules in the defined sections + of refilterfile, and make one exception for + sourceforge: + - + - # Run everything through the default filter file (re_filterfile): - {+filter} - - # But please don't re_filter code from sourceforge! + # Run everything through the filter file, using only the + # specified sections: + +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups}\ + +filter{webbugs} +filter{nimda} +filter{banners-by-size} + + # Then disable filtering of code from sourceforge! {-filter} .cvs.sourceforge.net - + @@ -2280,7 +2729,7 @@ Removed references to Win32. HB 09/23/01 - + # Blocklist: {+block} @@ -2328,10 +2777,20 @@ Removed references to Win32. HB 09/23/01 /.*/adlib/server\.cgi /autoads/ - + + + Note that many of these actions have the potential to cause a page to + misbehave, possibly even not to display at all. There are many ways + a site designer may choose to design his site, and what HTTP header + content he may depend on. There is no way to have hard and fast rules + for all sites. See the Appendix + for a brief example on troubleshooting actions. + + + @@ -2341,14 +2800,16 @@ Removed references to Win32. HB 09/23/01 Aliases - Custom actions, known to Junkbuster - as aliases, can be defined by combing other actions. + Custom actions, known to Privoxy + as aliases, can be defined by combining other actions. These can in turn be invoked just like the built-in actions. Currently, an alias can contain any character except space, tab, =, { or }. But please use only a- z, 0-9, +, and - -. Alias names are not case sensitive, and must be defined - before they are used. + -. Alias names are not case sensitive, and + must be defined before anything else in the + ijb.actionfile ! And there can only be one set of + aliases defined. @@ -2357,12 +2818,10 @@ Removed references to Win32. HB 09/23/01 - + - # Aliases + # Useful customer aliases we can use later. These must come first! {{alias}} - - # Useful aliases +no-cookies = +no-cookies-set +no-cookies-read -no-cookies = -no-cookies-set -no-cookies-read fragile = -block -no-cookies -filter -fast-redirects -hide-referer -no-popups @@ -2376,7 +2835,7 @@ Removed references to Win32. HB 09/23/01 c3 = +no-cookies-set -no-cookies-read #... etc. Customize to your heart's content. - + @@ -2387,13 +2846,14 @@ Removed references to Win32. HB 09/23/01 - + # These sites are very complex and require # minimal interference. {fragile} .office.microsoft.com .windowsupdate.microsoft.com + .nytimes.com # Shopping sites - still want to block ads. {shop} @@ -2407,7 +2867,7 @@ Removed references to Win32. HB 09/23/01 .dabs.com .overclockers.co.uk - + @@ -2421,16 +2881,24 @@ Removed references to Win32. HB 09/23/01 The Filter File - The filter file defines what filtering of web pages - Junkbuster does. The default filter file is - re_filterfile, located in the config directory. In this - file, any document content, whether viewable text or - embedded non-visible content, can be changed. + Any web page can be dynamically modified with the filter file. This + modification can be removal, or re-writing, of any web page content, + including tags and non-visible content. The default filter file is + re_filterfile, located in the config directory. + + + + The included example file is divided into sections. Each section begins + with the FILTER keyword, followed by the identifier + for that section, e.g. FILTER: webbugs. Each section performs + a similar type of filtering, such as html-annoyances. + This file uses regular expressions to alter or remove any string in the - target page. Some examples from the included default re_filterfile: + target page. The expressions can only operate on one line at a time. Some + examples from the included default re_filterfile: @@ -2440,107 +2908,326 @@ Removed references to Win32. HB 09/23/01 - + - # The status bar is for displaying link targets, not pointless buzzwords. - # Again, check it out on http://www.airport-cgn.de/. - s/status='.*?';*//ig + FILTER: html-annoyances + + # New browser windows should be resizeable and have a location and status + # bar. Make it so. + # + s/resizable="?(no|0)"?/resizable=1/ig s/noresize/yesresize/ig + s/location="?(no|0)"?/location=1/ig s/status="?(no|0)"?/status=1/ig + s/scrolling="?(no|0|Auto)"?/scrolling=1/ig + s/menubar="?(no|0)"?/menubar=1/ig + + # The <BLINK> tag was a crime! + # + s*<blink>|</blink>**ig + + # Is this evil? + # + #s/framespacing="?(no|0)"?//ig + #s/margin(height|width)=[0-9]*//gi - + Just for kicks, replace any occurrence of Microsoft with - MicroSuck: + MicroSuck, and have a little fun with topical buzzwords: - + + FILTER: fun + s/microsoft(?!.com)/MicroSuck/ig + + # Buzzword Bingo: + # + s/industry-leading|cutting-edge|award-winning/<font color=red><b>BINGO!</b></font>/ig - + - Kill those auto-refresh tags: + Kill those pesky little web-bugs: - + - # Kill refresh tags. I like to refresh myself. Manually. - # check it out on http://www.airport-cgn.de/ and go to the arrivals page. - # - s/<meta[^>]*http-equiv[^>]*refresh.*URL=([^>]*?)"?>/<link rev="x-refresh" href=$1>/i - s/<meta[^>]*http-equiv="?page-enter"?[^>]*content=[^>]*>/<!--no page enter for me-->/i + # webbugs: Squish WebBugs (1x1 invisible GIFs used for user tracking) + FILTER: webbugs + + s/<img\s+[^>]*?(width|height)\s*=\s*['"]?1\D[^>]*?(width|height)\s*=\s*['"]?1(\D[^>]*?)?>/<!-- Squished WebBug -->/sig - + + + + + + + + +Templates + + When Privoxy displays one of its internal + pages, such as a 404 Not Found error page, it uses the appropriate template. + On Linux, BSD, and Unix, these are located in + /etc/privoxy/templates by default. These may be + customized, if desired. + + + + + + + + -Quickstart to Using Junkbuster +Quickstart to Using <application>Privoxy</application> - Install package, then run and enjoy! Be sure your browser is set to use - the proxy which is by default at localhost, port 8000. With - Netscape (and Mozilla), - this can be set under Edit -> Preferences -> Advanced -> - Proxies -> HTTP Proxy. For Internet - Explorer: Internet Properties -> Connections -> - LAN Setting. Then, check Use Proxy and fill in the - appropriate info (Address: localhost, Port: 8000). + Install package, then run and enjoy! Privoxy + is typically started by specifying the main configuration file to be + used on the command line. Example Unix startup command: - The included default configuration files should give a reasonable starting - point, though may be somewhat aggressive in blocking junk. You will probably - want to keep an eye out for sites that require cookies, and add these to - actionsfile as needed. By default, most of these will - be blocked until you add them to the configuration. If you want the browser - to handle this, you will need to edit actionsfile and - disable this feature. If you use more than one browser, it would make more - sense to let Junkbuster handle this. In which - case, the browser(s) should be set to accept all cookies. + + + # /usr/sbin/privoxy /etc/privoxy/config + + - Browsers, like Mozilla, that support HTTP/1.1, might do better if forced - back to HTTP/1.0. For Mozilla, look under - Edit -> Preferences -> Debug -> Networking. + An init script is provided for SuSE and Redhat. + + + +For for SuSE: /etc/rc.d/privoxy start + + + +For RedHat: /etc/rc.d/init.d/privoxy start + + + + + If no configuration file is specified on the command line, + Privoxy will look for a file named + config in the current directory. Except on Win32 where + it will try config.txt. If no file is specified on the + command line and no default configuration file can be found, + Privoxy will fail to start. + + + + Be sure your browser is set to use the proxy which is by default at + localhost, port 8118. With Netscape (and + Mozilla), this can be set under Edit + -> Preferences -> Advanced -> Proxies -> HTTP Proxy. + For Internet Explorer: Tools > + Internet Properties -> Connections -> LAN Setting. Then, + check Use Proxy and fill in the appropriate info (Address: + localhost, Port: 8118). Include if HTTPS proxy support too. + + + + The included default configuration files should give a reasonable starting + point, though may be somewhat aggressive in blocking junk. You will probably + want to keep an eye out for sites that require persistent cookies, and add these to + ijb.action as needed. By default, most of these will + be accepted only during the current browser session, until you add them to + the configuration. If you want the browser to handle this instead, you will + need to edit ijb.action and disable this feature. If you + use more than one browser, it would make more sense to let + Privoxy handle this. In which case, the + browser(s) should be set to accept all cookies. + + + + If a particular site shows problems loading properly, try adding it + to the {fragile} section of + ijb.action. This will turn off most actions for + this site. + + + + Privoxy is HTTP/1.1 compliant, but not all 1.1 + features are as yet implemented. If browsers that support HTTP/1.1 (like + Mozilla or recent versions of I.E.) experience + problems, you might try to force HTTP/1.0 compatibility. For Mozilla, look + under Edit -> Preferences -> Debug -> Networking. + Or set the +downgrade config option in + ijb.action. + + + + After running Privoxy for a while, you can + start to fine tune the configuration to suit your personal, or site, + preferences and requirements. There are many, many aspects that can + be customized. Actions (as specified in ijb.action) + can be adjusted by pointing your browser to + http://i.j.b/, + and then follow the link to edit the actions list. + (This is an internal page and does not require Internet access.) + + + + In fact, various aspects of Privoxy + configuration can be viewed from this page, including + current configuration parameters, source code version numbers, + the browser's request headers, and actions that apply + to a given URL. In addition to the ijb.action file + editor mentioned above, Privoxy can also + be turned on and off from this page. If you encounter problems, please verify it is a - Junkbuster bug, by disabling - Junkbuster, and then trying the same page. - Before reporting it as a bug, see if there is not a configuration + Privoxy bug, by disabling + Privoxy, and then trying the same page. + Also, try another browser if possible to eliminate browser or site + problems. Before reporting it as a bug, see if there is not a configuration option that is enabled that is causing the page not to load. You can then add an exception for that page or site. If a bug, please report it to the developers (see below). + + + + + +Command Line Options + + Privoxy may be invoked with the following + command-line options: + + + + + + + + --version + + + Print version info and exit, Unix only. + + + + + --help + + + Print a short usage info and exit, Unix only. + + + + + --no-daemon + + + Don't become a daemon, i.e. don't fork and become process group + leader, don't detach from controlling tty. Unix only. + + + + + --pidfile FILE + + + + On startup, write the process ID to FILE. Delete the + FILE on exit. Failiure to create or delete the + FILE is non-fatal. If no FILE + option is given, no PID file will be used. Unix only. + + + + + --user USER[.GROUP] + + + + After (optionally) writing the PID file, assume the user ID of + USER, and if included the GID of GROUP. Exit if the + privileges are not sufficient to do so. Unix only. + + + + + configfile + + + If no configfile is included on the command line, + Privoxy will look for a file named + config in the current directory (except on Win32 + where it will look for config.txt instead). Specify + full path to avoid confusion. + + + + + + + + + + + -Contact the Developers + +Contacting the Developers, Bug Reporting and Feature +Requests - - Feature requests and other questions should be posted to the Feature - request page at SourceForge. There is also an archive there. +We value your feedback. However, to provide you with the best support, +please note: + + + + Use the Sourceforge support forum to get + help. + + Submit bugs only thru our Sourceforge bug + forum. +Make sure that the bug has not already been submitted. Please try to +verify that it is a Privoxy bug, and not +a browser or site bug first. If you are using your own custom configuration, +please try the stock configs to see if the problem is a configuration +related bug. And if not using the latest development snapshot, please +try the latest one. Or even better, CVS sources. + + + + Submit feature requests only thru our Sourceforge feature request forum. + + + + + + + +For any other issues, feel free to use the mailing lists. @@ -2550,14 +3237,6 @@ communication (bugs, feature requests, etc.) Archives are available here too. - - Please report bugs, using the form at - Sourceforge. - Please try to verify that it is a Junkbuster bug, - and not a browser or site bug first. Also, check to make sure this is not - already a known bug. - - @@ -2567,7 +3246,7 @@ communication (bugs, feature requests, etc.) License - Internet Junkbuster is free software; you can + Privoxy is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. @@ -2592,14 +3271,15 @@ communication (bugs, feature requests, etc.) History - Junkbuster was originally written by Anonymous + Junkbuster was originally written by Anonymous Coders and JunkBusters + url="http://www.junkbusters.com/ht/en/ijbfaq.html">Junkbuster's Corporation, and was released as free open-source software under the GNU GPL. Stefan Waldherr made many improvements, and started the SourceForge project to - rekindle development. The last stable release was v2.0.2, which has now + url="http://sourceforge.net/projects/ijbswa/">SourceForge project + Privoxy
to rekindle development. There are now several active + developers contributing. The last stable release was v2.0.2, which has now grown whiskers ;-). @@ -2609,7 +3289,44 @@ communication (bugs, feature requests, etc.) See also -To be filled. What should go here :/ + + + + +   http://sourceforge.net/projects/ijbswa + + + + +   http://ijbswa.sourceforge.net/ + + + + +   http://i.j.b/ + + + + +   http://www.junkbusters.com/ht/en/cookies.html + + + + +   http://www.waldherr.org/junkbuster/ + + + + +   http://privacy.net/analyze/ + + + + +  http://www.squid-cache.org/ + + + @@ -2623,11 +3340,11 @@ communication (bugs, feature requests, etc.) Regular Expressions - Junkbuster can use regular expressions + Privoxy can use regular expressions in various config files. Assuming support for pcre (Perl Compatible Regular Expressions) is compiled in, which is the default. Such configuration directives do not require regular expressions, but they can be - used to increase flexibility by matching a pattern with wildcards against + used to increase flexibility by matching a pattern with wild-cards against URLs. @@ -2642,18 +3359,18 @@ communication (bugs, feature requests, etc.) expression against another to see if it matches or not. One of the expressions is a literal string of readable characters (letter, numbers, etc), and the other is a complex string of literal - characters combined with wildcards, and other special characters, called - metacharacters. The metacharacters have special meanings and + characters combined with wild-cards, and other special characters, called + meta-characters. The meta-characters have special meanings and are used to build the complex pattern to be matched against. Perl Compatible Regular Expressions is an enhanced form of the regular expression language with backward compatibility. - To make a simple analogy, we do something similar when we use wildcard + To make a simple analogy, we do something similar when we use wild-card characters when listing files with the dir command in DOS. *.* matches all filenames. The special - character here is the asterik which matches any and all characters. We can be + character here is the asterisk which matches any and all characters. We can be more specific and use ? to match just individual characters. So dir file?.text would match file1.txt, file2.txt, etc. We are pattern @@ -2700,7 +3417,7 @@ communication (bugs, feature requests, etc.) \ - The escape character denotes that the following character should be taken literally. This is used where one of the special characters (e.g. .) needs to be taken literally and - not as a special metacharacter. + not as a special meta-character. @@ -2713,7 +3430,7 @@ communication (bugs, feature requests, etc.) - () - Pararentheses are used to group a sub-expression, + () - parentheses are used to group a sub-expression, or multiple sub-expressions. @@ -2736,7 +3453,7 @@ communication (bugs, feature requests, etc.) These are just some of the ones you are likely to use when matching URLs with - Junkbuster, and is a long way from a definitive + Privoxy, and is a long way from a definitive list. This is enough to get us started with a few simple examples which may be more illuminating: @@ -2825,7 +3542,7 @@ communication (bugs, feature requests, etc.) s/microsoft(?!.com)/MicroSuck/i - This is - a substitution. MicroSuck will replace any occurence of + a substitution. MicroSuck will replace any occurrence of microsoft. The i at the end of the expression means ignore case. The (?!.com) means the match should fail if microsoft is followed by @@ -2835,7 +3552,7 @@ communication (bugs, feature requests, etc.) We are barely scratching the surface of regular expressions here so that you - can understand the default Junkbuster + can understand the default Privoxy configuration files, and maybe use this knowledge to customize your own installation. There is much, much more that can be done with regular expressions. Now that you know enough to get started, you can learn more on @@ -2849,6 +3566,360 @@ communication (bugs, feature requests, etc.) + + + + + +<application>Privoxy</application>'s Internal Pages + + + Since Privoxy proxies each requested + web page, it is easy for Privoxy to + trap certain URLs. In this way, we can talk directly to + Privoxy, and see how it is + configured, see how our rules are being applied, change these + rules and other configuration options, and even turn + Privoxy's filtering off, all with + a web browser. + + + + + The URLs listed below are the special ones that allow direct access + to Privoxy. Of course, + Privoxy must be running to access these. If + not, you will get a friendly error message. Internet access is not + necessary either. + + + + + + + + Privoxy main page: + +
+ + http://ijbswa.sourceforge.net/config/ + +
+ + Alternately, this may be reached at http://i.j.b/, + but this variation may not work as reliably as the above in some + configurations. + +
+ + + + Show information about the current configuration: + +
+ + http://ijbswa.sourceforge.net/config/show-status + +
+
+ + + + Show the source code version numbers: + +
+ + http://ijbswa.sourceforge.net/config/show-version + +
+
+ + + + Show the client's request headers: + +
+ + http://ijbswa.sourceforge.net/config/show-request + +
+
+ + + + Show which actions apply to a URL and why: + +
+ + http://ijbswa.sourceforge.net/config/show-url-info + +
+
+ + + + Toggle Privoxy on or off: + +
+ + http://ijbswa.sourceforge.net/config/toggle + +
+ + Short cuts. Turn off, then on: + +
+ + http://ijbswa.sourceforge.net/config/toggle?set=disable + +
+
+ + http://ijbswa.sourceforge.net/config/toggle?set=enable + +
+
+ + + + Edit the actions list file: + +
+ + http://ijbswa.sourceforge.net/config/edit-actions + +
+
+ +
+
+ + + These may be bookmarked for quick reference. + + + +
+ + + + +Anatomy of an Action + + + The way Privoxy applies actions + to any given URL can be complex, and not always so easy to understand what + is happening. And sometimes we need to be able to see + just what Privoxy is doing. Especially, + if something Privoxy is doing is causing + us a problem inadvertantly. It can be a little daunting to look at + the actions files themselves, since they tend to be filled with + regular expressions whose consequences are not always + so obvious. Privoxy provides the + http://ijbswa.sourceforge.net/config/show-url-info + page that can show us very specifically how actions + are being applied to any given URL. This is a big help for troubleshooting. + + + + First, enter one URL (or partial URL) at the prompt, and then + Privoxy will tell us + how current configuration will handle it. This will not + help with filtering effects from the re_filterfile! It + also will not tell you about any other URLs that may be embedded within the + URL you are testing. For instance, images such as ads are expressed as URLs + within the raw page source of HTML pages. So you will only get info for the + actual URL that is pasted into the prompt area -- not any sub-URLs. If you + want to know about embedded URLs like ads, you will have to dig those out of + the HTML source. Use your browser's View Page Source option + for this. + + + + Let's look at an example, google.com, + one section at a time: + + + + + System default actions: + + { -add-header -block -deanimate-gifs -downgrade -fast-redirects -filter + -hide-forwarded -hide-from -hide-referer -hide-user-agent -image + -image-blocker -limit-connect -no-compression -no-cookies-keep + -no-cookies-read -no-cookies-set -no-popups -vanilla-wafer -wafer } + + + + + + This is the top section, and only tells us of the compiled in defaults. This + is basically what Privoxy would do if there + were not any actions defined, i.e. it does nothing. Every action + is disabled. This is not particularly informative for our purposes here. OK, + next section: + + + + + + Matches for http://google.com: + + { -add-header -block +deanimate-gifs -downgrade +fast-redirects + +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups} + +filter{webbugs} +filter{nimda} +filter{banners-by-size} +filter{hal} + +filter{fun} +hide-forwarded +hide-from{block} +hide-referer{forge} + -hide-user-agent -image +image-blocker{blank} +no-compression + +no-cookies-keep -no-cookies-read -no-cookies-set +no-popups + -vanilla-wafer -wafer } + / + + { -no-cookies-keep -no-cookies-read -no-cookies-set } + .google.com + + { -fast-redirects } + .google.com + + + + + + This is much more informative, and tells us how we have defined our + actions, and which ones match for our example, + google.com. The first grouping shows our default + settings, which would apply to all URLs. If you look at your actions + file, this would be the section just below the aliases section + near the top. This applies to all URLs as signified by the single forward + slash -- /. + + + + + These are the default actions we have enabled. But we can define additional + actions that would be exceptions to these general rules, and then list + specific URLs that these exceptions would apply to. Last match wins. + Just below this then are two explict matches for .google.com. + The first is negating our various cookie blocking actions (i.e. we will allow + cookies here). The second is allowing fast-redirects. Note + that there is a leading dot here -- .google.com. This will + match any hosts and sub-domains, in the google.com domain also, such as + www.google.com. So, apparently, we have these actions defined + somewhere in the lower part of our actions file, and + google.com is referenced in these sections. + + + + + And now we pull it altogether in the bottom section and summarize how + Privoxy is appying all its actions + to google.com: + + + + + + + Final results: + + -add-header -block -deanimate-gifs -downgrade -fast-redirects + +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups} + +filter{webbugs} +filter{nimda} +filter{banners-by-size} +filter{hal} + +filter{fun} +hide-forwarded +hide-from{block} +hide-referer{forge} + -hide-user-agent -image +image-blocker{blank} -limit-connect +no-compression + -no-cookies-keep -no-cookies-read -no-cookies-set +no-popups -vanilla-wafer + -wafer + + + + + + Now another example, ad.doubleclick.net: + + + + + + { +block +image } + .ad.doubleclick.net + + { +block +image } + ad*. + + { +block +image } + .doubleclick.net + + + + + + We'll just show the interesting part here, the explicit matches. It is + matched three different times. Each as an +block +image, + which is the expanded form of one of our aliases that had been defined as: + +imageblock. (Aliases are defined in the + first section of the actions file and typically used to combine more + than one action.) + + + + Any one of these would have done the trick and blocked this as an unwanted + image. This is unnecessarily redundant since the last case effectively + would also cover the first. No point in taking chances with these guys + though ;-) Note that if you want an ad or obnoxious + URL to be invisible, it should be defined as ad.doubleclick.net + is done here -- as both a +block and an + +image. The custom alias +imageblock does this + for us. + + + + One last example. Let's try http://www.rhapsodyk.net/adsl/HOWTO/. + This one is giving us problems. We are getting a blank page. Hmmm... + + + + + + Matches for http://www.rhapsodyk.net/adsl/HOWTO/: + + { -add-header -block +deanimate-gifs -downgrade +fast-redirects + +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups} + +filter{webbugs} +filter{nimda} +filter{banners-by-size} +filter{hal} + +filter{fun} +hide-forwarded +hide-from{block} +hide-referer{forge} + -hide-user-agent -image +image-blocker{blank} +no-compression + +no-cookies-keep -no-cookies-read -no-cookies-set +no-popups + -vanilla-wafer -wafer } + / + + { +block +image } + /ads + + + + + + Ooops, the /adsl/ is matching /ads! But + we did not want this at all! Now we see why we get the blank page. We could + now add a new action below this that explictly does not + block (-block) pages with adsl. There are various ways to + handle such exceptions. Example: + + + + + + { -block } + /adsl + + + + + + Now the page displays ;-) + + + + +