Matching devious non-ASCII spam strings

Dan Jacobson

To match the name of a famous spam drug for spamassassin, I use
/v[il1\xA0-\xFF]agra/i
because spammers are now using all the accented versions of "i".
I suppose I will have to do the same for the a's etc. too.
I have just used the entire range I see in "man iso_8859_1" and more,
rather than whittle it down. I don't suppose there is a much better way?
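
One possible alternative to listing every high byte, sketched here in Python rather than Perl so it can be run on its own: fold accented letters back to their base letter first, then match with a small character class. This assumes the message has already been decoded to Unicode text (a spamassassin body rule on raw ISO-8859-1 bytes would not see it that way), and the names strip_accents, looks_like_drug and DRUG are made up for illustration.

```python
import re
import unicodedata


def strip_accents(text: str) -> str:
    """Decompose accented letters (NFKD) and drop the combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Same idea as the original pattern, minus the \xA0-\xFF range:
# after folding, only the ASCII look-alikes i, l and 1 need listing.
DRUG = re.compile(r"v[il1]agra", re.IGNORECASE)


def looks_like_drug(text: str) -> bool:
    """True if the accent-folded text contains the obfuscated drug name."""
    return DRUG.search(strip_accents(text)) is not None
```

Because every letter is folded, accented a's are covered without a second character class. The trade-off is that look-alikes which are not accented letters, such as the inverted exclamation mark at \xA1, do not decompose to "i" and would still have to be added to the class by hand.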
 
Chris Mattern

Dan said:
To match the name of a famous spam drug for spamassassin, I use
/v[il1\xA0-\xFF]agra/i
because spammers are now using all the accented versions of "i".
I suppose I will have to do the same for the a's etc. too.
I have just used the entire range I see in "man iso_8859_1" and more,
rather than whittle it down. I don't suppose there is a much better way?

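The Bayesian filters (SpamAssassin has one built in, trained with sa-learn)
deal with that kind of crud automatically: an obfuscated spelling is just
another token that shows up in spam and not in real mail.
Let them do the work.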
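
To make that concrete, here is a toy naive Bayes scorer in Python. It is only a sketch of the general idea and not SpamAssassin's actual algorithm or tokenizer. The class name TinyBayes and its methods are invented for this example, and it ignores class priors and everything else a real filter does.

```python
import math
import re
from collections import Counter

TOKEN = re.compile(r"\S+")


class TinyBayes:
    """Toy naive Bayes spam scorer (illustrative, not SpamAssassin's)."""

    def __init__(self):
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.totals = {"spam": 0, "ham": 0}

    def train(self, label: str, text: str) -> None:
        """Count whitespace-separated, lowercased tokens under a label."""
        tokens = TOKEN.findall(text.lower())
        self.counts[label].update(tokens)
        self.totals[label] += len(tokens)

    def spam_log_odds(self, text: str) -> float:
        """Positive means 'more like spam', negative 'more like ham'."""
        # +1 reserves a bucket for unseen tokens and avoids dividing by zero.
        vocab = len(set(self.counts["spam"]) | set(self.counts["ham"])) + 1
        score = 0.0
        for tok in TOKEN.findall(text.lower()):
            # Add-one smoothing so an unseen token does not zero things out.
            p_spam = (self.counts["spam"][tok] + 1) / (self.totals["spam"] + vocab)
            p_ham = (self.counts["ham"][tok] + 1) / (self.totals["ham"] + vocab)
            score += math.log(p_spam / p_ham)
        return score
```

Once a couple of spams containing an accented spelling have been fed to train("spam", ...), that exact spelling scores as spammy with nobody writing a regex for it, and a token never seen in either class scores zero.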

--
Christopher Mattern

"Which one you figure tracked us?"
"The ugly one, sir."
"...Could you be more specific?"
 
