NO-MILK Archives

Milk/Casein/Lactose-Free List

NO-MILK@LISTSERV.ICORS.ORG

Options: Use Forum View

Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Alex Oren <[log in to unmask]>
Reply To:
Milk/Casein/Lactose-Free List <[log in to unmask]>
Date:
Thu, 25 Mar 2004 09:39:42 -0500
Content-Type:
text/plain
Parts/Attachments:
text/plain (23 lines)
> At 01:48 AM 3/25/2004 -0500, you wrote:
> 
>  >A quick google check showed that the email addresses are available on the
>  >archives, ripe for harvesting.
>  >For example: http://maelstrom.stjohns.edu/CGI/wa.exe?A1=ind0205&L=no-milk
> 
> But it is not going to change. That is not a static web page that can be
> harvested with crawling software. It is a database and one can only get the
> posts by issuing a software command, which in the example above is what
> comes after the question mark. Of course if a person wants to by hand
> harvest the addresses they could.

Don,

I may misunderstand something but if it is a dynamic DB page, how come Google harvested it?
http://www.google.ca/search?q=%22alexoren.%2Bcom%22&num=100&hl=en&lr=&ie=UTF-8&oe=UTF-8&safe=off&filter=0

The best solution from my point of view would be removing/obfuscating the email addresses.
If you cannot do it, at least limit access using a robots.txt file.

Thanks,
Alex.

ATOM RSS1 RSS2