Been thinking about where I am in the process of creating this service and I thought I’d put it down for posterity
I now have 8 Erlang nodes working together to create the core of the system, in reality 6 of them are the core and the other two are additional processing power. I had hoped to get 6 nodes before the first of the year and I had all 8 of them up and running on the 19th of December. All of the nodes are using Erlang’s distributed processing to communicate with each other.
Three of the nodes are exposed to the Internet and accepting SMTP email all of the other nodes are helpers that process the mail and tag them for spam traits.
Last week I got the Bayesian filters working and they are working amazingly well. I was catching about 80% of the email messages as spam, now I’m up to about 93%. I’m still using two of my older domains where the real spam percentage should be 99%. The filters are still learning, and I expect them to get more accurate as the days and weeks go on.
The SMTP client portion of the software is also complete, at least it is ready for BETA testing. I still need to implement retries, but I have them figured out.
Speaking of that I consider the software to be in a pre-BETA state right now. I’m hoping to start by BETA testing at the first of the year when I intend to put my personal email onto the system.
Before I get that far I need a rudimentary user interface, which is what I am working on right now. As soon as I can reclassify email messages that are bad to good and vice versa and I can forward email that has been quarantined I will move to BETA testing.
I still have some rather large components to write to make the SMTP server completely RFC compliant, but everything that has been written so far is compatible with the RFCs.
That is pretty much so it, hopefully my BETA will go off without any hitches and I’ll possibly open the BETA testing up to more domains then just the ones that I own.