While checking the Akismet spam filter to respond to one of the commenters on another thread, PG discovered a statistical summary of the last twelve months of TPV from a ham/spam perspective. He thought some visitors might find it interesting. (Click on the graph for a larger version)
PG really enjoys reading 99% of the comments that appear on TPV, but he had no idea there were almost 11 thousand comments he didn’t see because Akismet zapped them. That’s 36% of the total comments that were submitted to the blog.
Since PG opened up Excel to calculate that 36% spam percentage, he played with the numbers a little bit more.
Absent the spam filter, PG isn’t certain how long it would have taken him to clean up the spam by manually deleting spam comments.
However, Excel at hand, if each spam comment took him an average of 15 seconds to identify and delete, that would total almost 45 hours in addition to the time he already spends on TPV that he would have to devote to keeping the conversational space tidied up.
If it took 30 seconds per spam post, that’s almost 90 hours. 60 seconds per spam would total about one month of 8-hour Monday-Friday work days.
PG’s calculations only assume time spent on the 10,744 spam comments that Akismet caught during the last twelve months. However, in order to identify the chaff or mostly-chaff comments, PG would also have to at least briefly examine the wheat comments before determining he wouldn’t need to delete them.
The total number of wheat and chaff comments would have been almost 30,000. Presumably, without Akismet cleaning the chaff, spammers might well have been incented to drop more comments into TPV, thereby consuming more human filtering time.
PG needs to figure out a way to make a donation to Akismet.