Okay, a pet peeve of mine (I have many) is email harvesters. These are spiders or really bots that go through web pages and collect email addresses and give them to the spammers and sometimes virus programs to send their nefarious messages out.
Not only do these bots take up the resources of the web server that could be much better used by serving pages to real people, but they also makes many webmasters almost not want to put their email address on their web pages for fear of getting 10 times more spam than before.
So I’m not sure exactly what you would want to do with the information, but I think I have found a way to track them.
So on random web pages you can place an email address, but you dynamically generate the page with the IP address of the visiting computer at the user name in the email address. so the email address might look like 10.1.1.1@somedomainnoonehas.com. Then if you ever get an email sent to that email address you know that it was collected by an email address harvesting bot.
If you wanted to get really fancy you could change the numbers to letters, like a is 1 and B is 2 and then assign the period to z so that the email address looks like random characters and nothing that the person running the harvester would recognize.
Also, you would want to put the email address in angle brackets so that it does not show up on the page for a real user.
I suppose that you could then block any and all access to your websites from the IP address, like 10.1.1.1 (Don’t block that one though) Then it could never harvest email addresses from your web server again … but it could have been a dynamically assigned IP address and a real user might be blocked instead.
In any case if I get some interest maybe I’ll create a domain to catch harvesters and publish a list of IP addresses that I see the email harvesters working on. Email me and tell me if you think this is a good idea.