KMcGrail at PCCC
Mar 21, 2012, 7:11 AM
Post #4 of 4
On 3/21/2012 6:17 AM, Axb wrote:
> the MSGID_MULTIPLE_AT hits mostly ham using newish MS MUAs and seems
> like pointless bloat.
> Hits every other message and give a recipient a fuzzy feeling when
> rcpt see this hit so often
> (sic: all my messages are spam because of MSGID_MULTIPLE_AT !!)
> The few spam hits from the axb corpus don't warrant this to be kept.
> Many moons ago it was a usefull spam trait, but that was way before
> Microsoft started using it in their MUAs
> Could we remove this rule?
> +1 for removal?
> -1 for removal?
First, for the sake of debate, I think using any of the current QA in a
debate with such a low corpus is a bit unscientific.
Second, checking my corpus, I only have the following emails with
grep -E ^Message-?id * | wc -l
Personal: 1933 (much larger corpus)
All inboxes on a server with 400+ users: 2929
This is a VERY small percentage of total emails.
Checking the above for those that hit the rule and we get:
grep -E ^Message-?id * | grep -E "<[^>]*\@[^>]*\@"
Corporate: 4 all HAM (4 from the same Verizon user)
Personal: 21 all HAM (17 from Verizon and 4 from non verizon with a lot
of overlapping users and all but 4 from August of last year.)
All Inboxes: Only 15 (14 from Verizon)
Overall, I'm wondering what is adding the Message-ID for you? I am not
sure it's an MUA. I think it's an MTA in the chain. I rarely see a
Message-ID header and use Sendmail.
However, the rule is scored as a test rule: score MSGID_MULTIPLE_AT
0.001 so this is an efficiency issue not a scoring/FP issue.
We have enough votes to kill it so I've moved it to a demoted sandbox.
svn commit rulesrc/sandbox/kmcgrail/20_demoted_tests.cf
rules/20_head_tests.cf rules/30_text_pt_br.cf rules/50_scores.cf -m
'Demoted MSGID_MULTIPLE_AT from default rules to sandbox'
Transmitting file data ....
Committed revision 1303397.