Gossamer Forum
Home : General : Perl Programming :

Extract URL + its link title from HTML (Plz Help)

Quote Reply
Extract URL + its link title from HTML (Plz Help)
Hi! Does anyone have any idea on extract the URL's (of certain type of file) out of a webpage and put it in a flat text database?

For example:
file.html containing the following contents:
<a href="../files/num_1.zip">1st one</a>
<a href="http://www.fff.com/files/num_2.zip">2nd one</a>

process and put into spidered.db:
1st one|../files/num_1.zip
2nd one|http://www.fff.com/files/num_2.zip

I know this is kinda complicated. I tried but nothing works:(
THANKS IN ADVICE!!!

Subject Author Views Date
Thread Extract URL + its link title from HTML (Plz Help) Robbie 5532 Dec 25, 2000, 12:00 PM
Thread Re: Extract URL and its link title
jsu 5402 Dec 25, 2000, 6:03 PM
Thread Re: Extract URL and its link title
Robbie 5421 Dec 26, 2000, 10:46 PM
Thread Re: Extract URL and its link title
dan 5452 Dec 27, 2000, 12:17 AM
Thread Re: Extract URL and its link title
Robbie 5356 Dec 27, 2000, 1:39 AM
Thread Re: Extract URL and its link title
jsu 4267 Dec 29, 2000, 11:40 PM
Thread Re: Extract URL and its link title
Robbie 4265 Dec 30, 2000, 3:35 AM
Post Re: Extract URL and its link title
jsu 4190 Dec 30, 2000, 4:56 PM