Re: CHAT: Summary, web based mailinglist archives
From: Paul Bennett <paul.bennett@...>
Date: Monday, October 25, 1999, 11:08
tal writes:
>>>>>>
Having author, date, threading-information and subject in a database,
and grepping through the raw text would be a (quick and) workable
solution. As for sizes, I've guesstimated that the list nets in at
about 130 megs (unpacked) so far, growing with about 30 megs a year...
<<<<<<
If you're looking for "quick & dirty" interim fixes, I'm going to start a holy
war by suggesting that a set of DBM files (with indices) plonked into some Perl
hashes should do the trick, and Perl's RE engine outperforms both grep and awk
considerably. You could then squirt the text of the files into HTML as you go.
My apologies to any Python fans (You Know Who You Are); it's just that I'm
learning Perl at the moment and have never studied Python at length. I'd agree
that a free SQL backend might be better for a post-alpha project, however.
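
For the curious, here is a minimal sketch of the "DBM index plus regex search
plus HTML on the fly" idea. It is written in Python rather than Perl purely for
illustration (the same shape should map onto Perl hashes tied to DBM files),
and it is not a tested archive implementation. The function names, the
metadata fields (author, subject, path) and the one-file-per-message layout
are all assumptions made up for this example.

```python
import dbm
import html
import json
import re


def build_index(db_path, messages):
    """Write per-message metadata into a DBM file keyed by message id.

    `messages` maps a message id to a dict with hypothetical fields
    "author", "subject" and "path" (where the raw text lives).
    """
    with dbm.open(db_path, "c") as db:
        for msg_id, meta in messages.items():
            # DBM stores bytes, so encode both key and value explicitly.
            db[msg_id.encode("utf-8")] = json.dumps(meta).encode("utf-8")


def search(db_path, pattern):
    """Return (msg_id, meta) pairs whose raw text matches the regex."""
    rx = re.compile(pattern, re.IGNORECASE)
    hits = []
    with dbm.open(db_path, "r") as db:
        for key in db.keys():
            meta = json.loads(db[key])
            with open(meta["path"], encoding="utf-8", errors="replace") as fh:
                if rx.search(fh.read()):
                    hits.append((key.decode("utf-8"), meta))
    return hits


def render_html(meta):
    """Turn one message into a small HTML fragment, escaping the text."""
    with open(meta["path"], encoding="utf-8", errors="replace") as fh:
        body = fh.read()
    return "<h2>{}</h2>\n<pre>{}</pre>".format(
        html.escape(meta["subject"]), html.escape(body)
    )
```

This scans every message file on each search, which is the "quick & dirty"
part; the DBM file only saves re-parsing headers, and a proper full-text index
would be a later step.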
Just as soon as I can pick a Linux distribution and jump in with both feet,
I'll start experimenting, if you like...
Pb