hi,
Having a large file (20 Mb) consisting out of 6 fields separated by the pipe symbol.
Field 2 in this file is an URL
I am searching for a snippet of perl-code that would eliminate duplicates of this file, based on checking field 2 (the URL)
Something as: read the complete file1.db and if field-2 is not duplicate then write to a new file called file2.db
Maybe first the file has to be sorted on field 2 before to start eliminate the duplicates that occure in field 2 ???
Thanks
Having a large file (20 Mb) consisting out of 6 fields separated by the pipe symbol.
Field 2 in this file is an URL
I am searching for a snippet of perl-code that would eliminate duplicates of this file, based on checking field 2 (the URL)
Something as: read the complete file1.db and if field-2 is not duplicate then write to a new file called file2.db
Maybe first the file has to be sorted on field 2 before to start eliminate the duplicates that occure in field 2 ???
Thanks