
Mailing List Archive: SpamAssassin: devel

[Bug 6788] URL detection sometimes does not work

 

 



bugzilla-daemon at bugzilla

Apr 10, 2012, 2:46 PM

Post #1 of 3

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6788

Lemat <lemat [at] lemat> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |lemat [at] lemat

--- Comment #1 from Lemat <lemat [at] lemat> 2012-04-10 21:46:38 UTC ---
The dot in URL is 2e hex.

--
Configure bugmail: https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


bugzilla-daemon at bugzilla

Apr 11, 2012, 2:48 PM

Post #2 of 3

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6788

D. Stussy <software+spamassassin [at] kd6lvw> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |software+spamassassin [at] kd6l
| |w.ampr.org

--- Comment #2 from D. Stussy <software+spamassassin [at] kd6lvw> 2012-04-11 21:48:32 UTC ---
No. The dot in the URL is 2E in hex. Hexadecimal always uses CAPITAL letters.



bugzilla-daemon at bugzilla

Apr 23, 2012, 3:45 PM

Post #3 of 3

https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6788

--- Comment #3 from Lemat <lemat [at] lemat> 2012-04-23 22:45:41 UTC ---
Those headers were misleading.
The problem is a missing space at the end of the URL.
The parser is (probably) removing HTML tags and trailing punctuation (dots,
commas, etc.), producing something like this:

toppolandjob.com</b>Odp...   -> toppolandjob.comOdp...   - URI not found
toppolandjob.com,</b>Odp...  -> toppolandjob.comOdp...   - URI not found
toppolandjob.com</b> Odp...  -> toppolandjob.com Odp...  - URI found
toppolandjob.com,</b> Odp... -> toppolandjob.com Odp...  - URI found
toppolandjob.com </b>Odp...  -> toppolandjob.com Odp...  - URI found

Fix, in HTML.pm:

sub parse {
...
$text =~ s/>/> /g; # insert a space after every '>', before $self->SUPER::parse(...
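
The effect described above can be reproduced outside SpamAssassin with a small standalone script. This is only a sketch under stated assumptions: strip_tags and find_uri are hypothetical stand-ins written for illustration, not SpamAssassin's actual parser or URI detection. It models only the first case (tag removed with no separator) and what happens once the proposed substitution is applied first.

```perl
use strict;
use warnings;

# Hypothetical stand-in for tag removal: tags are dropped and nothing is
# left in their place, so text on both sides of a tag gets glued together.
sub strip_tags {
    my ($html) = @_;
    $html =~ s/<[^>]*>//g;
    return $html;
}

# Hypothetical stand-in for URI detection: a bare ".com" host name that is
# NOT immediately followed by another letter or digit.
sub find_uri {
    my ($text) = @_;
    return $text =~ /\b((?:[a-z0-9-]+\.)+com)(?![A-Za-z0-9])/i ? $1 : undef;
}

for my $html ('toppolandjob.com</b>Odp...', 'toppolandjob.com</b> Odp...') {
    # The proposed fix: put a space after every '>' before parsing.
    (my $fixed = $html) =~ s/>/> /g;

    printf "%-28s without fix: %-17s with fix: %s\n",
        $html,
        find_uri(strip_tags($html))  // 'not found',
        find_uri(strip_tags($fixed)) // 'not found';
}
```

Note that the substitution touches every '>' in the input, so it also adds a space after a literal '>' in the text and between adjacent inline tags (for example, foo<b>bar</b> would no longer read as one word). Whether that matters for other rules would need checking.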

