From: hal9
Date: Sun, 13 Oct 2002 21:56:52 +0000 (+0000)
Subject: Adding demoronizer filter. This should include all the common abuses. I have
X-Git-Tag: v_3_0_1~104
X-Git-Url: http://www.privoxy.org/gitweb/?a=commitdiff_plain;h=e26ca2c9d29a0bd202edb29c19a8e5adbeda89b6;p=privoxy.git

Adding demoronizer filter. This should include all the common abuses. I have
left a few of the rare cases commented out (never found these in the wild).
---

diff --git a/default.filter b/default.filter
index 6795017e..58e10f3a 100644
--- a/default.filter
+++ b/default.filter
@@ -1,8 +1,8 @@
 # ********************************************************************
 #
-# File : $Source: /cvsroot/ijbswa//current/default.filter,v $
+# File : $Source: /cvsroot/ijbswa/current/default.filter,v $
 #
-# $Id: default.filter,v 1.11.2.6 2002/08/23 14:12:26 oes Exp $
+# $Id: default.filter,v 1.11.2.7 2002/09/25 15:09:39 oes Exp $
 #
 # Purpose : Rules to process the content of web pages
 #
@@ -356,10 +356,50 @@ s%^.*(?Blocked< s+^.*warez.*$+No Warez

You're not searching for illegal stuff, are you?

+is
+#################################################################################
+#
+# demoronizer: Correct Microsoft's abuse of standardized character sets, which
+# leave the browser to (mis)-interpret unknown characters, with
+# sometimes bizarre results on non-MS platforms.
+#
+# credit: ripped from the demoroniser.pl script by:
+# John Walker -- January 1998, http://www.fourmilab.ch/webtools/demoroniser
+#
+#################################################################################
+FILTER: demoronizer fixing MS's non-standard use of std charsets.
+
+s/(&\#[0-2]\d\d)\s/$1; /g
+# per Robert Lynch: http://slate.msn.com//?id=2067547, just a guess.
+# Must come before x94 below.
+s/\xE2\x80\x94/ -- /g
+s/\x82/,/g
+#s-\x83-f-g
+s/\x84/,,/g
+s/\x85/.../g
+#s/\x88/^/g
+#s-\x89- °/°°-g
+s/\x8B/~-g
+#s-\x99-TM-g
+# per Robert Lynch.
+s/\x9B/>/g # 155
+
+
 ##############################################################################
 #
 # Revisions :
 # $Log: default.filter,v $
+# Revision 1.11.2.7 2002/09/25 15:09:39 oes
+# Preserve original quoting style in tags wherever possible. Fixes Bug #605956
+#
 # Revision 1.11.2.6 2002/08/23 14:12:26 oes
 # Proofed frameset-borders against "fremaborder=0 border=0"
 #
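
For readers who want to try the substitutions outside Privoxy, here is a rough Python sketch of the active rules that are legible in the patch above. It is an illustration only, not part of the commit: the `demoronize` helper and the `RULES` table are hypothetical names, the commented-out rules are omitted, and the `\x8B` rule is left out because its replacement text is not recoverable from this copy of the diff.

```python
import re

# Substitutions copied from the active, legible rules in the patch.
# Patterns are bytes because the filter works on raw page content.
RULES = [
    # Add the missing ';' to a numeric entity that is followed by whitespace.
    (re.compile(rb"(&#[0-2]\d\d)\s"), rb"\1; "),
    # UTF-8 encoded em dash becomes a double hyphen.
    (re.compile(rb"\xE2\x80\x94"), b" -- "),
    (re.compile(rb"\x82"), b","),
    (re.compile(rb"\x84"), b",,"),
    (re.compile(rb"\x85"), b"..."),
    (re.compile(rb"\x9B"), b">"),
]


def demoronize(page: bytes) -> bytes:
    """Apply every rule globally, in list order (hypothetical helper)."""
    for pattern, replacement in RULES:
        page = pattern.sub(replacement, page)
    return page
```

Under these assumptions, `demoronize(b"wait\x85")` returns `b"wait..."`. The list keeps the order used in the filter, since the patch notes that the three-byte sequence has to be handled before the later single-byte rules.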