A disclaimer, thanks, and links
Disclaimer
I hate to say it, but I have to say it: Use clean_corpus at your
own risk. If this script damages your corpus, your hard disk, your computer,
or your sanity, hides your keys, or runs away with your spouse, send us a note,
but don't sue us. Use it at your own risk or don't use it at all.
It is the user's (your) responsibility to back up the corpus. Nothing should go wrong or damage your corpus, since POPFile makes the actual changes, but if for some reason accuracy really drops, the old corpus can be restored from the backup. Simply copy the popfile.db file inside your POPFile installation to a safe place.
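If you would rather not copy the file by hand every time, the backup step above can be sketched in a few lines of Python. This is only an illustration, not part of clean_corpus: the function name backup_corpus, the timestamped file name, and both directory arguments are my own assumptions, so adjust them to your setup. It is probably safest to shut POPFile down before copying, so the database is not being written to while you copy it.

```python
import shutil
import time
from pathlib import Path


def backup_corpus(popfile_dir, backup_dir):
    """Copy popfile.db from popfile_dir into backup_dir under a timestamped name.

    Both directories are supplied by the caller; nothing here knows where
    your POPFile installation actually lives.
    """
    source = Path(popfile_dir) / "popfile.db"
    if not source.is_file():
        raise FileNotFoundError(f"No popfile.db found in {popfile_dir}")

    target_dir = Path(backup_dir)
    target_dir.mkdir(parents=True, exist_ok=True)

    # A timestamp in the name keeps older backups from being overwritten.
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = target_dir / f"popfile-{stamp}.db"

    # copy2 also preserves the file's modification time.
    shutil.copy2(source, target)
    return target
```

Call it with the path of your POPFile installation and the folder where the backups should go, for example backup_corpus("path/to/POPFile", "path/to/backups"). To restore, stop POPFile and copy the backup file back over popfile.db.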
If you have any problems running clean_corpus, if something goes
wrong, or doesn't work the way it's supposed to work, drop us a line. Write an
email to cleancorpus@myriad-online.com.
If you want to discuss clean_corpus with other users or if you want to
publicly state how amazingly good (or bad) this script is, the
POPFile Extensions forum
would be a good place.
Thank you!
The creation of this script was only possible because:
- John Graham-Cumming created POPFile. Thank you!
- Olivier Guillion came up with a rule set. Thank you!
- Joseph Connors was kind enough to do extensive beta testing and come up with some really good ideas. Thank you!
- Scott W. Leighton wrote his skeleton.pl script on which clean_corpus was based in former versions. Thank you!
- Brian Ellis had the idea to incorporate a probability rule (and was patient enough to draw pictures and give me candy until I finally grasped it).
- Well, I'm not going to thank myself (as some people have actually suggested), but I guess that I must mention that I am the one who wrote the clean_corpus script. Here is a link to my homepage.
Links
The first link, of course, is the one to POPFile. I'm sure you already have this among your bookmarks, but a little promotion won't hurt.
The second link is to Scott W. Leighton's page of POPFile utilities. This is a truly great collection of little tools to tweak POPFile and to get information about your corpus.
This next link features the best skin for POPFile there is: windows.css by Joseph Connors.
Last but not least, check out Olivier's company. It has nothing to do with POPFile, but you may be into music.