Tobias DiPasquale on 22 Oct 2004 12:28:02 -0000


[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: [PLUG] Spam programs


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Oct 22, 2004, at 7:53 AM, Jeff Abrahamson wrote:
I am running bogofilter with a database of 23,272 spams and 43034
non-spam messages.

I would recommend using a database composed of an order of magnitude less spam and ham (on the order of 2000 apiece). This has proven to be the most accurate in terms of individual precision. Pick the 2000 spammiest spams and 2000 hammiest hams and create a database using those and see if your false negative rate doesn't go down.


- --
Tobias DiPasquale
202A 04C4 2CE6 B985 8520  88D6 CD25 1A6C B9B5 1595
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (Darwin)

iD8DBQFBePylzSUabLm1FZURAul6AJ9W+Ps4v+JR77iGN7guRZep3ZrUXgCgptgA
w5WTID09j5n5v51HHdxyb8k=
=4Exl
-----END PGP SIGNATURE-----

___________________________________________________________________________
Philadelphia Linux Users Group         --        http://www.phillylinux.org
Announcements - http://lists.phillylinux.org/mailman/listinfo/plug-announce
General Discussion  --   http://lists.phillylinux.org/mailman/listinfo/plug