Login | Register For Free | Help
Search for: (Advanced)

Mailing List Archive: SpamAssassin: devel

Fwd: My corpus in ruleqa?

 

 

SpamAssassin devel RSS feed   Index | Next | Previous | View Threaded


jarif at iki

Jul 23, 2012, 3:01 PM

Post #1 of 6 (243 views)
Permalink
Fwd: My corpus in ruleqa?

And now my maildir for this list just disappeared to some bit heaven..

Have to repost, as I may have lost everything here...


-------- Original Message --------
Subject: My corpus in ruleqa?
Date: Mon, 23 Jul 2012 20:32:54 +0300
From: Jari Fredriksson <jarif [at] iki>
To: SpamAssassin Developers <dev [at] spamassassin>


I have the corpus on a new machine, lots faster than the old. The old
masscheck had only 2 cores in use, current has 8 cores and 12 gigs of
RAM. That results in the logs sent to ruleqa in ½ hours from starting
the masscheck, while the old lasted nearly 2 hours.

I do not see myself in ruleqa for today, while the job seems to have
been run fine. What happened to my logs? Were they sent too early to get
to the ruleqa sets?

--jarif
Attachments: signature.asc (0.26 KB)


jarif at iki

Aug 10, 2012, 6:43 AM

Post #2 of 6 (182 views)
Permalink
Re: Fwd: My corpus in ruleqa? [In reply to]

24.07.2012 01:01, Jari Fredriksson kirjoitti:
> And now my maildir for this list just disappeared to some bit heaven..
>
> Have to repost, as I may have lost everything here...
>
>
> -------- Original Message --------
> Subject: My corpus in ruleqa?
> Date: Mon, 23 Jul 2012 20:32:54 +0300
> From: Jari Fredriksson <jarif [at] iki>
> To: SpamAssassin Developers <dev [at] spamassassin>
>
>
> I have the corpus on a new machine, lots faster than the old. The old
> masscheck had only 2 cores in use, current has 8 cores and 12 gigs of
> RAM. That results in the logs sent to ruleqa in ½ hours from starting
> the masscheck, while the old lasted nearly 2 hours.
>
> I do not see myself in ruleqa for today, while the job seems to have
> been run fine. What happened to my logs? Were they sent too early to get
> to the ruleqa sets?
>
> --jarif
>

20120808-r1370707-n shows some strange entry from, but then again later
sendings have not registered. So I'm not good with this yet.

Hopefully this gets solved, I see people are really busy with the issue
right now.

--

Noise proves nothing. Often a hen who has merely laid an egg cackles
as if she laid an asteroid.
-- Mark Twain
Attachments: signature.asc (0.26 KB)


KMcGrail at PCCC

Aug 10, 2012, 6:45 AM

Post #3 of 6 (179 views)
Permalink
Re: Fwd: My corpus in ruleqa? [In reply to]

On 8/10/2012 9:43 AM, Jari Fredriksson wrote:
> 20120808-r1370707-n shows some strange entry from, but then again later
> sendings have not registered. So I'm not good with this yet.
>
> Hopefully this gets solved, I see people are really busy with the issue
> right now.
I think for your corpus we are in a wait and see until after the weekend
corpus runs. However, yes, we are watching the issue and working to
improve the masscheck!


axb.lists at gmail

Aug 10, 2012, 7:04 AM

Post #4 of 6 (179 views)
Permalink
Re: Fwd: My corpus in ruleqa? [In reply to]

On 08/10/2012 03:45 PM, Kevin A. McGrail wrote:
> On 8/10/2012 9:43 AM, Jari Fredriksson wrote:
>> 20120808-r1370707-n shows some strange entry from, but then again later
>> sendings have not registered. So I'm not good with this yet.
>>
>> Hopefully this gets solved, I see people are really busy with the issue
>> right now.
> I think for your corpus we are in a wait and see until after the weekend
> corpus runs. However, yes, we are watching the issue and working to
> improve the masscheck!

It takes 2-4 hours till the masscheck logs like ham-jarif.* spam-jarif.
show up in ruleaq site

Have you tried running it manually, and see if it's crashing or
complaining? Console output shows quite clearly when't there a hiccup

Axb


jarif at iki

Aug 10, 2012, 7:30 AM

Post #5 of 6 (183 views)
Permalink
Re: Fwd: My corpus in ruleqa? [In reply to]

10.08.2012 17:04, Axb kirjoitti:
> On 08/10/2012 03:45 PM, Kevin A. McGrail wrote:
>> On 8/10/2012 9:43 AM, Jari Fredriksson wrote:
>>> 20120808-r1370707-n shows some strange entry from, but then again later
>>> sendings have not registered. So I'm not good with this yet.
>>>
>>> Hopefully this gets solved, I see people are really busy with the issue
>>> right now.
>> I think for your corpus we are in a wait and see until after the weekend
>> corpus runs. However, yes, we are watching the issue and working to
>> improve the masscheck!
>
> It takes 2-4 hours till the masscheck logs like ham-jarif.*
> spam-jarif. show up in ruleaq site
>
> Have you tried running it manually, and see if it's crashing or
> complaining? Console output shows quite clearly when't there a hiccup
>
> Axb
I reveive a mail from it's output, no errors visible. Here is todays:

Removing duplicates from HAM SPAM ... done.
Removing unwanted HAM mail from corpus
Removing unwanted SPAM mail from corpus
Syncing nightly_mass_check
+ ./mass-check --hamlog=ham-jarif.log --spamlog=spam-jarif.log -j 2 --progress --reuse ham:dir:/home/jarif/Maildir/.Confirmed-HAM spam:dir:/home/jarif/Maildir/.Confirmed-SPAM
status: starting scan stage now: 2012-08-10 12:07:46
status: completed scan stage, 13349 messages now: 2012-08-10 12:08:59
status: starting run stage now: 2012-08-10 12:08:59
status: 10% ham: 1172 spam: 163 date: 2011-01-26 now: 2012-08-10 12:23:47
status: 20% ham: 2345 spam: 325 date: 2011-04-20 now: 2012-08-10 12:47:39
status: 30% ham: 3519 spam: 486 date: 2011-07-01 now: 2012-08-10 13:13:26
status: 40% ham: 4694 spam: 646 date: 2011-09-13 now: 2012-08-10 13:40:21
status: 50% ham: 5868 spam: 807 date: 2012-06-14 now: 2012-08-10 14:07:22
status: 60% ham: 7043 spam: 967 date: 2012-01-11 now: 2012-08-10 14:34:13
status: 70% ham: 8218 spam: 1127 date: 2012-03-13 now: 2012-08-10 15:00:35
status: 80% ham: 9391 spam: 1289 date: 2012-05-09 now: 2012-08-10 15:26:21
status: 90% ham: 10566 spam: 1449 date: 2012-06-26 now: 2012-08-10 15:50:05
archive-iterator: unable to open /home/jarif/Maildir/.Confirmed-HAM/new/1344571305.M195299P8903V000000000000FC00I00000000002A7E02_0.tempest,S=14892: No such file or directory
archive-iterator: unable to open /home/jarif/Maildir/.Confirmed-HAM/new/1344574945.M204016P12661V000000000000FC00I00000000002A7E10_0.tempest,S=52324: No such file or directory
status: completed run stage now: 2012-08-10 16:19:33
+ LOGLIST=' ham-jarif.log spam-jarif.log'
+ set +x
rsync -Pcz ham-jarif.log spam-jarif.log jarif [at] rsync::corpus/
This is the SpamAssassin Corpus rsync machine.

Modules that are available:

corpus
nightly mass-check result upload area. It is password protected.
If you would like a password, please send a request to
pmc [at] spamassassin and request a "nightly" username and password.

submit
Score generation mass-check result upload area. It is password
protected. If you would like a password, please send a request to
pmc [at] spamassassin and request a "score generation" username
and password. Generally these are only granted after a mass-check
announcement has been made on the spamassassin developer mailing list.

anoncorpus
mass-check result download area, available via anonymous access.

ham-jarif.log

32768 0% 0.00kB/s 0:00:00
2418729 14% 2.28MB/s 0:00:06
5040169 30% 2.36MB/s 0:00:04
7071785 42% 2.21MB/s 0:00:04
9005097 54% 2.12MB/s 0:00:03
11003945 66% 2.00MB/s 0:00:02
12904489 77% 1.79MB/s 0:00:01
15011458 90% 1.75MB/s 0:00:00
16553913 100% 1.90MB/s 0:00:08 (xfer#1, to-check=1/2)
spam-jarif.log

5233 0% 5.27kB/s 0:08:49
113669 4% 108.94kB/s 0:00:24
2098892 75% 1015.71kB/s 0:00:00
2793094 100% 1.10MB/s 0:00:02 (xfer#2, to-check=0/2)

sent 1138551 bytes received 34630 bytes 75689.10 bytes/sec
total size is 19347007 speedup is 16.49



--

You will meet an important person who will help you advance professionally.
Attachments: signature.asc (0.26 KB)


KMcGrail at PCCC

Aug 10, 2012, 11:00 AM

Post #6 of 6 (179 views)
Permalink
Re: Fwd: My corpus in ruleqa? [In reply to]

> I reveive a mail from it's output, no errors visible.

Looks good so far:
-rw-r--r-- 1 rsync rsync 16700666 Aug 10 16:07 ham-jarif.log
-rw-r--r-- 1 rsync rsync 17916771 Jul 21 11:45 ham-net-jarif.log
-rw-r--r-- 1 rsync rsync 2820873 Aug 10 16:07 spam-jarif.log
-rw-r--r-- 1 rsync rsync 3119937 Jul 21 11:45 spam-net-jarif.log

And you are matching http://rsync.spamassassin.org/nightly-versions.txt

# SVN revision: 1371608

We'll see how it looks after the weekend run, thanks.

Regards,
KAM

SpamAssassin devel RSS feed   Index | Next | Previous | View Threaded
 
 


Interested in having your list archived? Contact Gossamer Threads
 
  Web Applications & Managed Hosting Powered by Gossamer Threads Inc.